Sunday, April 29, 2012

Plot of the Week 1

Today is the first installment of Plot of the Week.  I am planning on posting a sports-related plot every week.  Some weeks I will interpret the plot; other weeks I hope that you will help post what you find interesting about the plot.


This is a scatterplot of athletes' win percentage in Majors vs. win percentage at other non-Major tournaments.  
  • I have plotted male golfers (Tiger Woods pre-scandal, current Tiger Woods, Phil Mickelson, Jack Nicklaus) and female golfers (Yani Tseng, Annika Sorenstam) in black and male tennis players (Pete Sampras, Roger Federer, Rafael Nadal, Novak Djokovic) and female tennis players (Venus and Serena Williams, Martina Navratilova, Chris Evert, Steffi Graf) in red.
  • Annika Sorenstam and Novak Djokovic are tough to read due to overplotting in the middle of the plot.
  • The solid line represents the 45-degree line (y = x) and the dashed line is the least squares regression line.
Here are the most interesting features in this graph (to me, at least):
  • Phil Michelson does not win tournaments (Majors and non-Majors) as frequently as the other athletes.
  • Graf won much more often than anyone else - over 50% of non-Major tournaments and 40% of Major tournaments.  From this view, it seems that Graf is clearly the best female tennis player, with Evert and Navratilova not too far behind.
  • Athletes close to the solid line win Majors at the same frequency as non-Majors.  This seems to represent the set of athletes who are not intimidated by the pressure at the Majors.
  • Yani Tseng is the only athlete much farther above the line - she seems to over-preform at the Majors.  It is still very early in her career, so I would not be surprised if she moves closer to the diagonal line as her career progresses.
  • The athletes below the line tend to under-perform at the Majors compared to other tournaments. But this set contains some of the best athletes ever, so this is not quite a fair assessment (Venus has 7 majors and Evert and Navratilova 18 each). Maybe it is more fair to say that they over-perform at non-Majors.
  • Federer and Nadal have very similar winning percentages.  Since Federer is 5 years older than  Nadal, it will be interesting to see if Nadal can keep this same pace for the next 5 years to catch up to Federer in terms of tournaments won.
Do you notice anything else interesting that I missed?

3 comments:

  1. I can make several generalizations here.

    1. Mickelson, like many others, has amassed a fortune with runner-up money. He would have left golf a long time ago if it was winner-take-all.
    2. It would seem it was more common to play tennis in non-majors, especially females, prior to 2000.
    3. I would expect a stack of tennis names between Rafa and Sampras, but for my point #2. Names like Laver, Borg and McEnroe. It would be interesting to see.
    4. I think it's far easier to stay above the line in tennis than it is in golf. The reason Grad and crew are so far to the right is that they played and won more NM tournaments. I really wouldn't expect to see a golfer above that line.

    ReplyDelete
  2. Great comment! I think there are 2 reasons for the main differences between the tennis players and golfers:
    1. Nowadays, tennis is only played on 3 surfaces: hardcourt (3/4 of the season), clay (1/4 season) and grass (1-2 tournaments). Golfers are playing on different courses each week with different layouts. This probably levels the playing field more for the golfers.
    2. Just recently (10-15 years?), the tennis tour starting making other non-Major tournaments mandatory for all top players. Back in the 70s and 80s, the top players rarely played each other outside of the Majors. This is why John McEnroe thought that Djokovic's 43-0 start to the 2011 season was much better than his own 43-0 season start in the 70s because of the talent Novak was beating in each tournament.

    ReplyDelete