The middle finger is a common symbol, a static formation of the fingers that the algorithm can easily learn from many common examples of the same formation of fingers.
There's a lot more diversity, nuance, and lack of structure that goes into how hands wrap around objects to hold them, what they do when idle, etc.