No Sweat Statistics

St. Louis Rams $100K Giveaway

2014-04-15T21:47:00.000-05:00

It was announced today that the St. Louis Rams are offering $100,000 to anybody who correctly predicts their 2014 schedule. The 16 opponents are known, so you must guess which opponent the Rams play each week, along with their bye week. To make things more difficult, you must also predict if the game is played on Sun (which most are), Monday (typically 1 game each week), or Thursday (typically 1 game each week).

I did a quick back-of-the-envelope calculation to determine the approximate odds of winning such a contest and came up with 1 in 45,193,226,156,719,200 which is about "45 followed by 15 zeros". Here's how I got this number:

First, let's choose the bye week. Last year no teams had a bye in weeks 1-3 or 13-17. This leaves 9 choices for the bye week.

Next, we can place the 16 opponents among the remaining 16 weeks. This can be done 16! ways (reminder: 16! = 16 * 15 * 14 * ... * 2 * 1). That's a lot of choices. However, the Rams play the 3 teams in their division twice each, and we can assume that they will not play the same team in consecutive weeks. Accounting for the bye week, this removes about 15 * 14 * 3 = 630 possibilities (although there is some double counting that I'm ignoring).

Finally, we need to select the day of the game. On average, each team plays one Sunday night game and one Thursday night game. For simplicity, we will assume that the Rams will play exactly 1 Thurs. night and one Mon. night game. Ignoring the bye week, this leaves 16 * 15 = 240 possibilities. (In reality, games that are thought to be "better" are on Monday night and games that are thought to be less interesting are played on Thursday. Therefore, the odds are probably slightly different).

To get the final number of possibilities, we need to multiply the different possibilities together: 9 * (16! - 630) * 240 = 45,193,226,156,719,200. Although this is slim chances, I still took 5 minutes to fill out the schedule. So here's to hoping I'm $100,000 richer when I write my next post!

P.S. - I'm sure I made some silly mistake, so please comment if you notice one and I'll get it fixed.

Winter Olympics Medal Predictions Gone Wrong

2014-02-16T17:16:00.001-06:00

I recently came across an article about an analytics company, MicroStrategy, that used their dashboard to predict the number of medals that each country will win at the 2014 Winter Olympics. They predicted that Canada will win the most medals (35), followed by Germany (31), with the US in 7th place (16 medals). They explain the results in their blog post. I think the company was just trying to do a simple, light-hearted analysis to show off the types of analyses that their product can do. Unfortunately for the company, there are several flaws in their model/analysis. Two minor flaws have to do with the data itself:

In their post, they state "Keep in mind that the algorithm is based on historical data, and doesn’t necessarily reflect more current information such as emerging stars, recent funding boosts, and an unexpectedly large addition of new events to the program."
On a related note, they fail to explain how they handled the dissolution of old countries to form new countries. For example, are the medals from past Olympics won by the USSR now contributed to Russia? What about newer countries, such as Ukraine, that used to be a part of USSR?

The biggest problem is over-fitting of the model. Over-fitting occurs when you design a complex model using many variables that is more likely to describe random noise than the true signal. Because the model is describing the noise and not true signal of the historical data, it often results in inaccurate predictions (in this case, inaccurate medal predictions). If you are interested in performing predictions based on a model, one simple way to determine if a model is over-fit is by leaving out a subset of the data, fitting a model, then using that model to predict the data that you left out. Since you know the "truth" of these predicted data, you can assess the prediction accuracy. For example, MicroStrategy could have used all data prior to 2010, fit the model using that data, then predicted medal counts for 2010. We would be able to determine the accuracy by comparing to the true 2010 medal counts.

The 2014 Winter Olympics are only half way over, but let's compare their predictions to current medal counts for selected countries:

	Predicted Medal Count	Actual Medal Count (as of 2/16)
Canada	35	14
Germany	31	12
Russia	18	16
USA	16	16
Netherlands	7	17

While it is possible that Canada and Germany can still reach their predicted medal counts, it looks like Russia, USA, and the Netherlands will all greatly surpass their predicted medal count.

Again, I realize that the company was doing this for fun and to generate some press. I just hope that customers don't look at this analysis, see how bad these predictions are, and ultimately decide to not buy the product. This goes to show how a little statistical knowledge can go a long way!

Battle of the Sexes: Free Throw Edition

2013-12-29T16:50:00.001-06:00

As a UNC basketball fan, this season has definitely been a roller coaster ride. UNC is 3-0 against top 25 teams, beating #1 Michigan State on the road, #3 Louisville on a neutral court, and #11 Kentucky at home. UNC has also had some bad loses: UAB, Belmont, and Texas. Two of the losses were by 3 points, and UNC missed over 20 free throws in both of those games. Last weekend, I attended the Toledo-Dayton women's basketball game, and there were hardly any missed free throws. So this got me thinking: are women any better than men at free throw shooting?

When looking at all Division I free throw percentages for both men and women (as of December 27), men are slightly better at making free throws, as shown below. The median free throw percentages (thick black lines) are 69.1% for men and 68.4% for women. The variability is also much smaller for the men than women. Note that the UNC men make 61.3% of their free throws, ranking them 333 out of all 345 teams.

Next, I wanted to look the the free throw percentages of the top 25 ranked teams, which is shown below. For easier comparison, I have also included the distribution for all Division I teams.

For both men and women, teams ranked in the top 25 are on average better at making free throws than compared to all teams. When restricted to top 25 teams, women have a better free throw percentage than men.

So who is better at making free throws: men or women? Men are slightly better on average than women, but when restricted to the best 25 teams, women are on average better. Regardless, I'm hoping that UNC can increase their free throw percentage in the second half of this season.

Coming soon: More NBA data than you can imagine

2013-10-24T13:25:00.001-05:00

It was recently announced that the NBA is partnering with a company that will record and release data for all of the games (similar to all of the MLB data that is currently available, such as pitch placement). You can read the full article here.

The article mentions: "For the first time, all 29 of the NBA's arenas will have software-packed cameras that will record players' every move, mapping 25 images per second." This is a ton of data, so I am sure that they will not be releasing all of the raw image/mapping data, but some summarized version, such as where on the court each shot came from. Here are a few statistics that I would love to see made available:

Number of times a ball is dribbled per game.
Number of times a player travels but no call is made. I'm thinking about fast breaks where players take 4 steps without a dribble. It would also be cool to see this breakdown by player/team.
The arc of each free throw shot. For a given player (this will vary a lot between players), how consistent are their arcs, and how well can you predict if a free throw is made or missed based on its arc?
The distance that each player runs during a game.

Leave a comment if you have any other ideas.

Getting a shoutout from Sports Illustrated

2013-10-11T16:41:00.000-05:00

After writing my last post arguing that women should not blindly challenge calls more often in professional tennis, I sent a quick summary to Jon Wertheim at Sports Illustrated. It sounds like he appreciated my side of the argument, as he published part of my response in his weekly article. Pretty cool!

Why women should not challenge more in tennis

2013-09-15T18:41:00.000-05:00

In last week's Sports Illustrated, Jon Wertheim argues that women should use the challenge system more at Grand Slam tennis tournaments. Although he includes data in his article to make his case, I'm going to argue that he is misinterpreting the data and show that convincing women to challenge more often might actually hurt the game. Unfortunately, this article is not available online, so I will begin with a summary and include some direct quotes (*Update - the article is available here).

A little background: "In 2006 tennis instituted a replay challenge system not unlike the NFL's. Provided the court is equipped with the technology, players can appeal line calls for review." The rules are the same for men and women; however, men and women have different challenge behavior. "Men challenge their points 25% more often than women - though their success rates are virtually the same." For example, at Wimbledon this year, women challenged 2.6% of the points played, while men challenged 3.3% of the time. This trend holds over all Grand Slam tournaments over the past year. Men won their challenges 27.73% of the time while women won 27.37% of challenges.

Supplied with this data, the author concludes that "men are more prone to question their authority" and "women are more reluctant to challenge and be assertive or confrontational". Many assume that men are more likely to challenge because they hit the ball harder, so linespersons are more likely to make a mistake. The author's response: "Sounds logical. But if this were true - if it were harder for linespersons to trace 140-mph serves, as opposed to 120-mph serves - we would expect to see a disparity in accuracy of line calls". But because the challenge accuracy is equivalent between gender, the author concludes that women challenge less (and thus accept incorrect calls) because "women are uncomfortable with confrontation and negotiation". The article concludes with a quote from Martina Navratilova, "Women need to be more comfortable challenging. Here's one area where there's no reason we shouldn't be like the men."

The author's whole argument hinges on one point: there is no disparity in the accuracy of line calls for men's and women's matches. I agree that the author's conclusions would hold if the data could show this. However, we have no way of knowing the accuracy of all linepersons calls from the data - we only know that the accuracy of player challenges is roughly equivalent between genders (27%). What we cannot know is how many of the unchallenged points contained an incorrect call by a linesperson. The only way we could figure this out is if someone watching the match (presumably in the TV booth) "challenges" every point to determine how many points contained incorrect calls, then look at how many of these points were challenged by players. This would allow us to construct the following table for each gender:

	# Points containing all correct line calls	# Points containing an incorrect call
# Points Challenged	A	B
# Points With No Challenge	C	D

From the data presented, we know the values for A and B. With the Wimbledon 2013 men's data, A = 2.38% of all men's points (3.3% of challenged points x 72.27% of incorrect chalenges) and B = 0.92% of all men's points (3.3% of challenged points x 27.73% of overturned points). For women, A = 1.89% and B = 0.71% of women's points.

To determine whether or not linespersons make more incorrect calls in men's matches, we need to know D (the total number of incorrect calls is B + D). Without this knowledge, we cannot determine if more challenges from women would result in more overturned calls. However, we know that the value of B is larger for men than women by 0.21% of all points played. For the author's assertion that linespersons are equally as likely to be incorrect for women as for me, D would need to be larger for women than men by 0.21% of all points played.

Let's look at 3 examples using the data from Wimbledon 2013:

Assume women got every single wrong call overturned (D = 0). Therefore, using additional challenges will not get any call overturned. If women challenge every as frequently as men without any additional incorrect calls, then women's accuracy would drop to 21.56%. So urging women to challenge more would make them appear to have worse judgement than men (not a good thing when trying to argue for gender equality).
Assume the proportion of incorrect men's and women's calls are equivalent (D is equal for men and women). If this is the case, then you could argue that men and women should both be challenging more. However, because B+D is still smaller for women than men, their accuracy would decrease if they challenged the same number of times as men.
Assume the total number of points containing an incorrect call is equivalent between genders (D is larger by 0.21% of all points for women than men, resulting in B+D being equivalent between men and women). If this is true, as the author assumes, then his conclusion than women need to challenge more is correct. However, assuming that women challenge the same number of times as men, their accuracy on the "new" challenges would actually increase from 27.37% to 29.08% if they were to remain as successful as men at getting line calls overturned. This means that women are currently challenging calls that are less likely to be overturned than the calls that they are are missing (try explaining this to the women players!).

Therefore, we cannot make any assumptions as to whether linespersons miss the same number of calls for women as men. This also means that we cannot determine whether women can challenge more and still be as accurate as men in getting calls overturned. So what conclusions can be made from this data?

Men and women are equally successful at challenging. This means that one sex does not have better "eyes" than the other.
Men and women are equally bad at challenging. On average, less than 1 out of 3 challenges will result in a call being overturned. This number is likely due to "throw away" challenges when a player knows the call was correct but has challenges to waste at the end of a set (or needs a longer break to catch his/her breath after a long point).
Assuming that men and women are getting every incorrect call overturned (D=0), linespersons are 29.6% more likely to make a mistake on a men's point than a women's point (0.92% vs 0.71%).

Analyzing Pro Athletes' Physiological Dashboard

2013-08-04T18:18:00.001-05:00

I recently came across an article about the "sports science" changes that Chip Kelly has implemented since becoming head coach of the Philadelphia Eagles. Basically, the Eagles spent more than $1 million investing in new technology that measures physiological details (heart rate, amount of time spent running during practice, 3d views of how players are lifting weights, etc) in the hopes of creating a "physiological dashboard" for each player. They want to monitor the performance of each player during practice to increase training efficiency, such as ending practice early for players reaching their endurance limits or ensuring that players receive the correct amount of hydration based on what was lost during practice. A large portion of the article is dedicated to describing the Eagles sports-science coordinator, who has previously served as a strength coach and nutritionist for colleges and the Navy SEALs.

Here are some interesting quotes:

"The result is a data driven approach to training"
"Players can log into their personal computers to check their own fitness profiles"
"Last season Catapult helped on of its NFL clients compare practice data ... in weeks when the team won compared to those when it lost. A trend emerged: during Thursday practices before losses, offensive skill players were running a lot but not very quickly."

OK, so NFL teams are beginning to collect all of this data about their players. But who exactly is mining all of this data to find useful information? I can't believe that its the sports-science coordinator (he doesn't have a statistics degree). Plus, who can actually monitor and interpret all of this data in real-time (i.e. during practice)? It seems that Catapult, an IT consulting company focused on interpreting data, is doing some work after the season is over, but do any of these teams have the capacity to perform analysis in-house? Here are a few things to think about:

I'm sure most of the companies selling the equipment have guidelines or suggestions for how to interpret the data. So maybe a bell goes off when a player's heart rate gets too high. But how accurate are these baselines, especially when the same guidelines are applied to 180lb running backs and 350lb linemen?
What is the goal of collecting all of this data? Making real-time decisions about players' health during practice? Drawing team-wide conclusions about what does/doesn't work at the end of the season? These are 2 very different questions that could influence the most effective way to collect data.
How much are teams investing into analyzing this data (either in-house or through ouside companies)? For current genomic sequencing projects, more money is spent on the analysis than the sequencing experiment itself. So are the Eagles planning to spend an additional $1 million on interpreting all of this data? Or will this data just go to waste?

College Basketball Commitments

2013-08-01T21:06:00.000-05:00

I came across a very nice article describing the college commitment habits of 700 top basketball recruits. The author does a great job of delving through the data and concisely summarizing the main findings. A few of my favorite highlights:

First, just obtaining all of this data (all high schools and colleges attended for all 700 athletes) must have been a Herculean task.
I like that he also displays the data with several bar charts and a very colorful cumulative density plot showing how early in their high school career that recruits commit to a college.
Of the players who spent at least 2 seasons playing in college, over a third didn't end up where they started.
Think that these top recruits only bounce around universities to get the most exposure? 4 of the recruits attended 6 different high schools, including current NBA player Michael Beasley. Plus, over 50% of the 2013 recruiting class attended at least 2 high schools.

Statistics Playing Major Role in College Football Playoffs

2013-03-15T18:16:00.002-05:00

The potential impact of stats on the playoff selection committee

In the above article, Sports Illustrated sought the recommendations of 5 college football and basketball "stats gurus" to get a better feel for how the college football playoff committee should go about choosing the four teams to compete in the 2014 national championship playoffs. They discussed three primary themes:

1. The need for accountability and transparency. Although the BCS releases their rankings and scoring/point totals every week, the actual formula used in these calculations is proprietary. I am in agreement with the 5 experts in calling for full transparency in the system. However, this makes it difficult to include an "eye test" in the decision (whether this should be included is another debate). My favorite quote:

"I doubt this will happen, but I think they need to have a non-voting data person in the room as well. Someone to help the members interpret ratings and other data sources, answer questions that are posed and hold the group accountable to information that is shared."

2. Its about more than wins and losses. Should other factors like injuries and margin of victory/defeat play into account?

"Of course, the danger of using advanced stats or ignoring head-to-head results is the committee might wind up producing a bracket that the majority of the public -- accustomed to seeing rankings ordered largely by team records -- rejects."

3. Strength of schedule isn't what it seems.

"There are many ways to measure schedule strength, and many of them are valid. I like to use this example. Imagine two schedules. Schedule A consists of the six best teams in the country and the six worst. Schedule B consists of the 12 most average teams in the country. Which is tougher? Ask Alabama, and they'll obviously say Schedule A. Alabama would have a much easier time running the table against Schedule B. But ask the worst team in the country which one is easier, and they'll say the opposite. The worst team in the country would have a hell of a time winning a single game against Schedule B. ... So depending on who you are, you can perceive the exact same schedule of teams very differently."

Super Bowl Squares Strategy

2013-02-02T11:35:00.000-06:00

With the Super Bowl just a day away, I am hearing a lot of talk about Super Bowl Squares, the game of chance that only gets played one day out of the year. With most variations of the game, people sign up for squares, then once all squares have been taken, the numbers are randomly assigned to the rows and columns, thus making this purely a game of chance (I guess the football game also plays a role too).

But suppose that these numbers were not randomly assigned: you get to choose the numbers that you want. Which pair of numbers gives you the best chance of winning? I have seen a few articles online trying to answer this question, but all the ones that I have come across look at the score after each quarter of all previous Super Bowl games. While I see the point of only looking at Super Bowls, some of these games were played over 40 years ago and the game has clearly evolved since then. For example, I have to believe that field goals are much more common now then they were 40 years ago, as kickers are now able to routinely make 50+ yard field goals (I don't have data to back this up, so let me know if I'm wrong). Therefore, I have decided to look at all football games from this past season, including the playoffs. If my counting is correct, this covers 266 games. I should probably look at the score after each quarter of every game, but this would cover 1064 quarters, and I just don't have the time (or really care to) do this. So I have decided to only analyze the final scores of the 266 games. I also ignored whether the winning team was home or away, so to me, Team A winning by a score of 17-13 (making square 7,3 the winner) is equivalent to Team A losing 13-17. That is, I treated squares (7,3) and (3,7) as the same.

Let's first look at the most common point totals, with respect to the last digit. As expected the least likely point totals end in 5 (3.8% of all final scores) , 2 (4.3%) and 9 (5.1%). The most common point totals end in 3 (16.4%), 4 (16.0%), 7 (14.8%), and 0 (13.5%).

Now let's look at pairs of numbers. If you played over the full 2012 season, 3 squares would have never won (when only looking at final scores): (1,2), (2,9) and (5,6). This isn't too surprising because, as shown earlier, it is difficult to score total points ending in 2, 5 or 9. The most likely pairs this past season were (3,6) and (3,7)*, which each occurred 16 times this season. Combined, these 2 pairs would have won over 12% of the games. Additional pairs that would have won over 10 times this past season include (0,3), (0,4), (0,7), (0,8), (1,4) and (3,4).

In conclusion, if numbers were not randomly assigned in Super Bowl Squares, it would easily be possible to win in the long run.

* SI writer Peter King picked the Ravens to beat the 49ers 27-23, so he's playing the odds with his final score prediction.

UPDATE (2/4/2013). The score after each quarter (with the Ravens always leading) was 7-3, 21-6, 28-23 and 34-31. This means that the winning squares were (3,7), (1,6), (3,8) and (1,4). Did anyone follow my advice and bet on (3,7) or (1,4)?

MLB Hall of Fame Voting

2013-01-27T13:17:00.000-06:00

Here's a link to a really cool interactive graphic of MLB hall of fame voting. As with most datasets, there are many different variables that are useful to display visually. Including interactive graphics is a nice way to show multiple variables (or to select only a subset of variables) without making a million different 2-d plots. Below the graphics, the authors describe all of the features of this plot - I suggest that you read through it and try some of these things out.

MLB Hall of Fame Voting Graphic

The New York Times has begun to show similar visualizations for economic and political issues. As we move away from print articles and towards online reading, I think we will see a rise in popularity of these types of interactive graphics.

I'd love to learn how to make these type of graphics, but most of the heavy-lifting is done in Java (which I have no experience with), and I don't have enough free time :(

College Football 2012 Wrap-up

2013-01-21T19:59:00.001-06:00

This BCS bowl season, two teams, Wisconsin and Northern Illinois, were coached by interim coaches in their BCS bowl game after their head coach left the team to accept a new position. There seems to be an increasing trend of coaches leaving their teams before a bowl game to accept a new coaching position. I began wondering whether schools that are searching for a new head coach should try to scoop other coaches before the bowl games are completed, or if they should factor in the bowl game performance (maybe make the candidates feel some extra pressure to win)? Schools tend to want to fill their coaching vacancies ASAP because this provides the new coach an extra month to put together his coaching staff and recruit. But does this process of hiring coaches before the bowl game actually lead to better football success?

I chose to look at head coaches who lead their team to one of the BCS bowl games, then accepted a new college coaching position the following year. While this leaves a small sample size (n=9), it is easier to evaluate the performance of these coaches because it is assumed that the new school is expecting the new coach to lead his new team to BCS bowls. Here is a summary of the 9 coaches:

Coach	Previous Team	Old BCS Record*	Year	Last Bowl	New Team	Record	New BSC Record
Steve Spurrier	Florida	2-1	2001	W	South Carolina	66-37	0-0
Urban Meyer	Utah	1-0	2004	W	Florida	65-15**	3-0
Walt Harris	Pitt	0-1	2004	L	Stanford	6-17**	0-0
Rich Rodriguez	West Virginia	1-0	2007	W*	Michigan	15-22**	0-0
June Jones	Hawaii	0-1	2007	L	SMU	31-34	0-0
Brian Kelly	Cincinnati	0-1	2009	L*	Notre Dame	28-11	0-1
Randy Edsall	UConn	0-1	2010	L	Maryland	6-18	0-0
Bret Bielema	Wisconsin	0-2	2012	L*	Arkansas	-	-
Dave Doeren	Northern Ill.	0-0	2012	L*	NC State	-	-

* = Coach left team before BCS bowl game, so it was coached by interim coach. If the head coach left the school before the BCS bowl game, it is not reflected in his BCS record.

** = No longer with this team. Meyer retired and Harris and Rodriquez were fired.

A few interesting gems from looking at this table:

Only 3 of these 9 coaches won a BCS bowl game with their previous team (Spurrier, Meyer, Rodriguez).
Only 2 have taken their new teams to a BCS bowl game (Meyer, Kelly), with Meyer being the only coach to win a game (actually 3, including 2 national championships).
The only coach to win a BCS bowl game with their new team (Meyer) had won a BCS bowl game with his previous team.
3 of the 4 teams coached by intermin coaches lost their bowl game, with West Virginia being the only exception.

Yes, programs that are hiring coaches have probably suffered some losing seasons and need time to rebuild, so these results could change in another year or 2. Plus, this is a small sample size, so we would probably be better off by including all coaches who leave their teams, not just ones leaving after reaching a BCS bowl game. In my opinion, schools that are hiring college football coaches are placing too much emphasis on reaching BCS bowl games and not enough on winning these games. Even if it is all about the money of BCS bowl games and not actually about winning, most of these big-name hires are struggling to take their new teams to a BCS bowl game.

If I were in charge of hiring a new football coach to turn around a struggling program and win national championships, here would be my one major piece of advice:

If you are serious about winning national championships, hire a coach that has actually won a BCS bowl game. If none of these coaches are available/interested, then don't settle for a coach who has taken his team to a BCS bowl game but lost - what makes you think he can do better next time (ahem, Brian Kelly)? Save your money and take a chance by hiring a coach who hasn't been to a BCS bowl (but has preferably won other bowl games). You might just hire the next Les Miles (2-1 in BCS bowl games since 2005, including a national title).

Scheduleball

2013-01-03T20:59:00.000-06:00

Scheduleball: Colorado State, Pitt exploit weaknesses of RPI

By now, everyone is aware of the impact of Moneyball (statistics!) on the MLB. Due to the media firestorm surrounding Moneyball (aided by the catchy name), the perception seems to be that baseball GMs are the brainiest employees in professional sports. Occasionally, I'll hear a story about an NFL or NBA GM who breaks the mold by applying analytics to improve their team's performance, but this isn't very common (although I imagine all pro teams are now employing at least a few data analysts).

But I've NEVER heard a story about a college athletic director credited for improving/influencing his school's performance beyond the hiring of a high-profile coach ... until I read the above story. I imagine that the main reason for this is because it is extremely rare. You only hear about from the AD when its time to hire/fire a coach, respond to NCAA investigations, or build a new state-of-the-art athletic facility. This really doesn't make much sense, especially considering that all of the top universities have many great PhD-level statisticians on their payrolls. While not all statisticians do sports-related research, all a university needs to do is simply buy out one of a professor's courses to get his/her expertise on how to apply a Moneyball-style approach to give their athletic teams as many advantages as possible ("buying out a course" = allow a professor to dedicate the time he/she would normally spend teaching a course to do some other research activity).

I realize that this may ruffle some feathers, as many head coaches don't want to take advice from some academic wizard. One option is to hire an AD who can do these sorts of analyses himself, and only try to change "off the field" decisions, such as scheduling, which is what the linked story above explains. And as probably all coaches have bonuses built into their contracts for post-season appearances, what coach wouldn't want to do all he can to increase his likelihood that his team makes it into the playoffs (or bowl game)?

If any university wants to take a chance and hire a statistician as their next AD to get an advantage over their competition, I'll be more than happy to interview :)

NFL Pop Quiz

2013-01-02T21:35:00.001-06:00

As a new resident of St. Louis, I've enjoyed having a local NFL team to cheer for (although maybe not for much longer if they move to LA). Rookie punter Greg Zuerlein had some incredible special plays this year and completed 3 of 3 pass attempts for 42 yards and 1 touchdown. Can you guess which high-profile (and highly paid) quarterback threw for fewer yards? Find out here.

Moneyball: Grinnell College Style

2012-11-25T20:20:00.002-06:00

One of the big sports stories this past week was Jack Taylor of Grinnell College scoring 138 points in a single basketball game. When discussing this story with my family and friends, the conversation always seems to steer towards the "ethical-ness" (or sportmanship) of the coach's strategy: essentially going for steals using full-court defensive pressure and giving up easy 2's in order to take lots of open 3-pointers at the other end (Taylor was 27-for-71 from the 3-point line and 52-for-108 overall).

It turns out that the Grinnell coach's strategy is based on a student's project done in the early 1990's that pointed out statistical patterns that were keys to winning. These include:

Attempt at least 25 more shots than their opponent
Take at least 94 shots per game
At least half of all shots are 3-point attempts
Rebound at least 33% of its missed shots
Force the opponent into at least 32 turnovers

More recently, a pair of students at Grinnell re-examined these keys and came to a slightly different conclusion (namely, that turnover differential is more important than number of opponent turnovers). You can access the paper here, about half of which is statistical, but the other half should be easily understood by all. While the analysis is nice, by far the coolest thing about the paper is that the students interviewed the head coach, David M. Arseneault, got his feedback on their work, and included his comments in their paper. While Grinnell is only a DIII school, I think it's great that a head college basketball coach is not only willing to meet with and discuss a statistics project with undergrads, but has built his strategy based on previous students' work (no mention if he has incorporated these latest suggestions to his strategy).

Congrats to Jack Taylor and Coach Arseneault, and only time will tell if all of the publicity surrounding this game will prompt other coaches to adopt his strategy, or at least devise their own strategies based on statics (with bonus points for using undergrads to help)!

(Thanks to Simply Statistics for the link to the student paper described here.)

The Toughest Part ...

2012-10-25T21:28:00.001-05:00

...finding the data.

One reason that my posts have been getting sparser is the difficulty in finding the data that I'm looking for. Even in this age of "big data", finding the data of interest is not trivial (in fact, it may not exist). Here are 2 examples supporting my point.

First, even when the data does exist, it can take a long time to get it all collected. For example, in my post explaining why Roger Federer should never hit a second serve, I collected the data by going through match statistics for over 30 matches on the Wimbledon website and recording the numbers in Excel. As you can imagine, this gets very boring very quickly.

Second, sometimes it is unreasonably difficult to find the correct data, even when you are sure that it exists somewhere. For example, I was interested in writing a post about whether NFL players should attempt a kick-off return when catching the ball in the end zone, or if they should take a touch-back and start on the 20 yard line. After about an hour of searching on Google, I came up dry. The closest thing that I could find is the average kick-off return and number of touch-backs per game, but it doesn't tell us where the player caught the ball (in the end zone or not). Additionally, this would still require me to aggregate the data by hand across all games.

If you ever find a "ready" dataset that could help answer a interesting question, send me an email and I'll be happy to take a shot at analyzing the data and turning it into a post.

Why the NFL is supporting the wrong cancer research

2012-10-12T00:10:00.001-05:00

Unless you live at the bottom of the ocean, you have probably noticed that the NFL is showing support for breast cancer research by having the players and officials wear pink accessories (sounds a little girly when I say it like that). As a cancer researcher, I think its great that the league with the most exposure in the USA is joining the American Cancer Society in its fight to end cancer. However, the NFL is making a huge mistake by choosing to support breast cancer research over prostate cancer. Here are a few statistics from the American Cancer Society that may surprise most people:

1. Approximately 1 out of every 6 men will develop prostate cancer in his lifetime. In contrast, 1 out of every 8 women and less than 1 out of every 1,000 men will develop breast cancer in her/his lifetime.

2. There will be an estimated 241,740 patients diagnosed with prostate cancer in 2012, compred to 229,060 new cases of breast cancer (<1% of those cases ocuring in males).

3. Treatments are not as effective for breast cancer as prostate cancer, and this is reflected in the 5 year survival rates: 99% of prostate cancer patients will survive 5 years, compared to only 89% for breast cancer. However, until the 1990s, males with prostate cancer had a higher 5-year mortality rate than females with breast cancer.

4. It is estimated that 39,510 women and 410 men will die of breast cancer in 2012. An estimated 28,170 men will die of prostate cancer this year. That is, over 65 times more men will die of prostate cancer than breast cancer.

5. Prostate cancer is the second most deadly cancer type for males, behind only lung cancer.

6. African American males are 1.6 times more likely to develop prostate cancer and 2.5 times more likely to die from it than white males.

Do these numbers surprise you? While breast cancer is a more deadly disease than prostate cancer, all of the support for breast cancer month and "wearing pink" makes it seem like the disparity between the two diseases is much larger. Considering that there is not a single female player in the NFL, the league is going out of its way to promote research for a cancer that its players are 150 times less likely to develop than prostate cancer (supporting evidence: allowing players to wear pink shoes and towels but fining them $5,000 for wearing a red undershirt). Additionally, with the NFL consisting of a large proportion of African Americans males, you would think it would support research for a disease with significant racial disparities.

If you happen to meet Roger Goodell on the street and point these facts out to him, he will mention that the NFL is committed to promoting prostate health, and he is technically correct. However, I don't see the NFL encouraging players to wear blue during September. A bit hypocritical, don't you think?

How accurate are football preseason polls?

2012-09-08T15:21:00.001-05:00

I'm a few weeks late on this post, but I think its still worth blogging about. Every year, the media (especially ESPN) makes such a huge deal about college preseason football polls. Without any games yet to be played, these polls are little more than speculation. I wanted to look into how accurate these polls are at choosing that season's national champion. In this post, I will be using exclusively the AP poll preseason and final results, which can make a difference in years before the BCS when there could be multiple national champs based on the poll used.

This first plot shows the final ranking of preseason top 5 teams since 1990. The bar furthest to the right shows the teams that were in the top 5 preseason poll but finished the year unranked.

A few interesting notes:
1. 14 of the past 22 national champions were ranked in the preseason top 5 (I think the last winner outside the top 5 was Auburn led by then-unknown Cam Newton).
2. More national champs were ranked preseason #2 than preseason #1. This is good news for Alabama who started this season ranked #2 behind USC (but who jumped to #1 after their first win).
3. Looking only at the preseason #1 teams (blue bars), they are more likely to finish the season ranked 3rd than any other rank. Also, no preseason #1 has finished worse than #16 in the final polls.

Next, I wanted to look whether teams ranked higher in the preseason poll tended to be ranked higher at the end of the season. A simple way to do this is to look at the median finish of the top 5 preseason teams.

**Median Final Ranking since 1990**
Preseason Rank	Median Final Rank
1	3
2	5
3	6.5
4	9.5
5	8

So although the national champions are not always ranked preseason #1, the top preseason teams in general finish higher in the standings than the other preseason teams. The exception is that teams ranked #5 tend to finish the season ranked better than the #4 preseason team. There could be some bias causing this result, as there needs to be some tie between teams with the same final season record, and this may be influenced by the preseason rankings.

Finally, I wanted to see how the final rankings of the previous season influence the preseason polls of the next season. For example, do teams who finish the year #1 tend to be the top ranked preseason team the following year (even though 1/4 of the team likely graduated)?

This plot shows that the preseason top 5 teams tended to finish the previous season ranked highly. We see that, since 1990, 9 of the 23 teams finishing the previous season #1 were the top ranked preseason team. This is a somewhat questionable strategy, as only 2 teams have repeated as national champs since 1990: Nebraska in 1994-95 (who was not ranked preseason #1 in 1995) and USC in 2003-04.

In summary, while the top preseason team more often than not does not win the national championship, on average they finish the season ranked better than any other preseason team.

Swimming and the Fast Suit Aftermath

2012-08-16T23:48:00.000-05:00

At the end of 2009, the governing body of competitive swimming, FINA, banned the use of high-tech full-body fast suits (try saying that 5 times!). See here for a summary. As proof of the influence of fast suits, all but 2 world records (both men's and women's) were broken in either 2008 or 2009! The general consensus was that these world records would be untouchable for a long time, and for the most part, that has been true. However, 8 world records were broken at the 2012 Olympics. I am now going to answer the question, "Were the winning Olympic times significantly slower than the world records?"

First, let's look at a box plot of the difference between the world record and the winning Olympic time (a value less than zero denotes that the world record was broken). Note: 2 men's world records were set in 2011, so I am comparing to these current records rather than pre-2010 records.

While the times are generally above zero (slower than WR time), the boxplot whiskers do extend below zero. There is one clear outlier for the men, and this ocurred when the 1500m free WR was broken by 3 seconds. Most of the variation is due to events being different distances (50, 100, 200, 400, 800 and 1500m). To account for this difference, I have normalized all times to 100m (multiply 50m time differences by 2, divide 200m time differences by 2, etc.). The normalized times are reported in the following box plots. Now the times are much less variable and there are no clear outliers.

To officially answer our question of whether times were significantly slower without fast suits, I performed a t-test for mean difference. Our null hypothesis is:

Ho: no difference between average world record time and winning Olympic time.

Leaving out the details, we obtain p-values of 0.18 for the men and 0.26 for the women. Thus, since these p-values are large (> typical cutoff of 0.05), we fail to reject the null hypothesis. We can conclude that there is no significant evidence that the winning swimming times in the 2012 Olympics were significantly slower than the world records. I also repeated the calculations after removing the 3 relays from the analysis and arrived at the same conclusion.

We cannot tell from this analysis if the fast suits has a smaller influence on time decreases as originally thought, or if swimmers are just training harder and getting stronger (I tend to believe the latter). It's also too early to tell if any of these records will be thought of as unbreakable (example: Phelp's 2008 Olympic performance + fast suit = some really fast world records). But, I think we can safely conclude that, unlike the steroid era in baseball, world records set in the fast suit era will not require an asterisk.

How Usain Bolt can rival Michael Phelps

2012-08-13T00:17:00.006-05:00

OK, so this post isn't exactly statistics related, but watching Olympic coverage talk comparing Usain Bolt to Michael Phelps is ridiculous. First, let me say that without a doubt, Bolt is the fastest man alive. His performances are the highlight of track and field Olympics. But, winning back to back gold medals in the 100 and 200 is nowhere near Phelp's 22 medals (18 gold) over 3 Olympics. Here are 3 ways, in my opinion, for Bolt to end his career on the same page as Phelps.

1. Compete in at least 4 Olympic games. Phelps competed as a 15 year old at Sydney, swimming in the 200 fly. Combined with Athens, Beijing and London, Phelps swam in 4 Olympics. Bolt is only half way there with 2 Olympics.

2. Win both the 100 and 200 at Rio 2016. Phelps became the first swimmer to (twice) win gold in the same event in 3 consecutive Olympics (100 fly, 200 IM), while just missing out on three-peating with the 200 fly.

3. Add additional events. Every commentator who says that Bolt does not have as many opportunities to race as Phelps should be fired on the spot. Here are other reasonable events for him to race.

400 m: This is only running 2-200's in a row.
4 x 400 relay: This is the most reasonable race for him to add. Phelps swims the 4x100 free relay (turning in the 2nd fastest split this Olympics), yet he has never swum the 100 free as an individual event. Bolt doesn't need to be the fastest 400 runner to win a medal, but be part of the fastest team.
110 m hurdles: Yes, this involves hurdles, but he's tall enough to make it over the hurdles and would definitely be the fastest pure runner in the race.
Long jump: Jesse Owens won 4 gold medals in the 1936 Olympics including the 100, 200, 4x100 and long jump. Bolt has never had a single Olympics as successful as Owen's performance.
High jump and triple jump: see above.

I just listed 6 additional events for Bolt to possibly compete in. Yes, many of the events would take him out of his comfort zone, but winning these off events is what distinguishes legends from greats. If he added the 4x400 relay with another individual event and won golds in those events, I would then start to think of Bolt as competing on equal footing with Phelps.

And please don't argue that the schedule wouldn't work. Phelps (and Lochte, Franklin, etc.) won gold medals within an hour of swimming in another final or semi-final. I've yet to see a top Olympic track athlete push themselves and race multiple finals/semi races in the same day. So Bolt could be innovative in this manner too.

Finally, Ranomi Kromowidjojo is a female swimmer from the Netherlands won gold in both the 50 and 100 free and silver in the 4x100 free relay this Olympics. If she wins gold in all 3 events next Olympics and sets a few records in the process, will she be considered the greatest female swimmer ever? NO. But isn't her event schedule comparable to Bolt's (minus the relay)? YES!

Why Roger Federer (and the entire women's tour) should never hit a second serve

2012-07-28T15:54:00.001-05:00

In the 2012 Wimbledon Men's Championship match, Roger Federer beat Andy Murray to win his record tying 7th Wimbledon title and record 17th major overall. But it was a rough start for Federer, as he lost the first set, which included losing his first 7 second serve points (he won 3/11 overall in the first set). Federer was having a much easier time with his first serve (as do most players), winning 65% of the points when he made his first serve. This got me thinking ... does it ever make strategical sense for a player to never hit a slower second serve but always hit a first serve? The answer may surprise you (although I guess I give it away in the title).

I went through several matches for both men and women at this year's Wimbledon. Let's use the men's final as an example. Over the whole match, Federer made his first serve 68.7% of the time, winning 75.6% of those first serve points. Multiplying these two numbers together, we see that 51.9% of time time Federer made his first serve and won the point (conversely, 48.1% of the time he either made his first serve and lost the point or had to hit a second serve). Federer won 48.8% of the points when he hit a second serve. This shows that Federer would have been better off going for his first serve even when hitting a second serve, as he would be expected to win an additional 3.1% (51.9 - 48.8) of second serve points. Federer hit 41 second serves, so he would expect to win an additional (41)*(3.1%) = 1.3 points in the match had he not hit his regular second serve. This doesn't sound like much, especially since Federer won the match, but it could translate to winning an additional game.

One drawback of only hitting first serves is an increased number of double faults. Because Federer made 68.7% of his first serves, we would expect him to miss his first serve 31.3% of the time. If we consider two serves as independent events, the probability that he would miss two in a row and double fault is (31.3%)(31.3%) = 9.8%, resulting in an expected 12.8 double faults (he served 131 points in the match), much higher than the 3 double faults he actually served. But he would have also been expected to serve an additional 3.75 aces (work not shown, but trust me). So this turn out to a net decrease of (12.8 expected double faults) - (3 actual df's) - (3.75 additional aces) = 6.05 points. How can this make sense if I just said that we expect Federer to win a net of 1.3 additional points? When he does make his first serve, he wins a much higher percentage of those points compared to second serve points (75.6% to 48.8%) that it more than makes up for the double faults. In other words, the risk of only hitting first serves pays off for Roger Federer.

I decided to look at all men's matches starting with the 4th round of Wimbledon. Of the 30 matches (or 60 players, some counted more than once), 10 of the players would have expected to benefit by only hitting first serves. Roger Federer was the only player to show up twice on the list (Finals and QF matches). Of these 10, 7 lost the match (Federer won both matches and Djokovic won his QF). In 3 of Andy Murray's 4 matches that I looked at, his opponent would have been better off only hitting first serves (Federer, Tsonga and Cilic), showing how great of a returner Murray is (the other opponent was Ferrer, who doesn't have a big first serve). The largest expected gain was for Tsonga, who could have expected to win an additional 7.7 points (2 games) against Murray in the semis.

I wanted to look a little more closely at how this strategy may impact the match. My original thought was that a player probably loses a set when he is serving poorly, so only hitting first serves would result in a lot more double faults and a net loss of points. I was surprised to see that this was not the case. Looking at the QF through the Finals, players lost 26 sets. In 10 of these 26 sets, the player who lost the set would have expected to win more points by serving only first serves. This definitely supports players only hitting first serves when they lose the set, but I guess this doesn't help much retrospectively.

Looking at sets won, for 19 of 26 sets, the winner would have expected to lose more points by only hitting first serves. However, only 10 of those 19 would be expected to lose more than 2 points. Interestingly, Federer could have won more points in 3 of the 9 sets he won by only hitting first serves and never would have expected to lose an additional 2.5 points in the 9 sets that he won. This shows how great his first serve was working during the Championships.

The players are back at Wimbledon for the next 2 weeks, but this time trying to win an Olympic medal. My recommendations would be for Federer to only hit first serves, but he is probably the only male player I would recommend this to regardless of opponent. Plus, this may have the added benefit of screwing up the opponent's game plan. I would also suggest that all of Andy Murray's opponents to employ this strategy. But as I said before, no coach will tell their player to use this strategy. My guess is that this is due to aesthetic reasons - no coach wants to see his player double fault in the double digits even if he would win more points in the long run.

I also performed the same analysis for the women. To keep things short, every women player should only hit first serves. Of the final 7 matches (14 players) that I looked at, only 5 players would not have benefited by this strategy. In fact, 4 of those 5 players who would not have benefited lost the match anyway (the only exception being Serena Williams in the SF). On average, the women would expect to win an additional 1.1 points per match. Compare this to the men, who on average would expect to lose an additional 2.5 points.

Will any players employ this strategy at the Olympics? I'm guessing not.

Wimbledon 2012 wrap-up

2012-07-15T13:13:00.003-05:00

Sorry for the long delay in posting - I'm still alive, but have been extremely busy with the new job. I have also decided to spend more time on "novel" posts where I analyze the data myself to answer new questions, as these seem to get a much better response from readers. But these posts take more time to write, so I will not be posting as often. I am finishing up a new analysis with some data from Wimbledon, and I hope to blog about this soon. For now, I want to give a quick update on my Wimbledon picks (you can read about my picks here).

First the good news: I correctly picked Serena Williams. The bad news: I picked Novak Djokovic, but he lost to eventual champion Roger Federer. So I went 1/2, or 3/4 when combining my 2 correct French Open Picks. When looking at the ESPN expert picks over these past 2 grand slams, only Chris Evert has correctly picked all 4 champions.

Let's hope I can keep up the steam going into the US Open in August/September.

Are Free Agents Worth the Money? (Plot of the Week 9)

2012-07-01T14:58:00.000-05:00

This week I created a few plots to look at the effect of major MLB and NBA free agent players switching teams.

First, I wanted to see if players tended to move to teams with a better chance at winning a championship (think LeBron James), or if they moved to worse teams that would pay them more money (Albert Pujols who left after winning a championship with the Cardinals). The first plot shows the regular-season team winning percentage for the three years before the player switched teams (old team) and three years after switching to a new team.

Note: Since some players recently switched teams (LeBron, Bosh, Pujols, Fielder), we don't have a full 3 years of data to look at.

Teams that win championships are denoted by small squares. LeBron and Bosh are the only players to win a championship within 3 years of switching teams. Pujols was the only player to switch teams after winning a championship within the past 3 years.
For this small sample size, it looks like basketball players (dashed lines) that switched teams moved to better teams, where baseball players received a lot of money to move to teams that had roughly equivalent records.

Next, let's look at the winning percentage of teams before and after signing a big free agent (i.e, the Heat before and after LeBron). The thought is that spending all of this money to sign a marquee player will help the team win more.

Basketball players have an immediate effect on increasing the win percentage.
Baseball teams seem to have a worse record in the year immediately after signing a big free agent. None of the baseball players brought a championship to their new team within 3 years.
This shows that it is worth trying to sign big name basketball free agents. However, baseball free agents don't seem to be worth the big bucks in terms of immediately winning more games.
One could argue that it makes economical sense to sign big name free agents (increased ticket sales, marketing, etc.) even if they don't increase winning percentage.

Finally, I wanted to look at the winning percentage of the teams that lose a free agent (i.e., the Cavaliers before and after LeBron).

Again, we see that basketball stars have a huge effect on their team. Every team had a much lower winning percentage after losing a key free agent. But the teams seem to recover within a few years, presumably because they are able to rebuild quickly through the draft.
Baseball teams do not suffer the same losses after losing a key free agent. In fact, some teams have a huge increase in win percentage immediately after losing they player. This includes the 2003 Chicago White Sox who won the World Series the year after Magglio Ordonez left for the Tigers.
The effects of losing a key player may have a longer-lasting impact, with win percentage decreasing 2 and 3 years after losing a free agent.

Overall conclusion: Baseball players don't have as large an impact on win percentage as basketball players. Thus, they probably don't deserve the huge contracts they are earning.

One caveat about this analysis is that the sample size is very small. I have limited the analysis to players who sign huge free agent contracts with another team, and exclude players who were traded. I would love to add more players to this analysis, so leave a comment if you can think of a player who you would like to see included.

Wimbledon Picks 2012 (Explanation)

2012-06-30T18:47:00.002-05:00

With the first week of Wimbledon now a thing of the past, I figured I should actually give my reasoning for my Wimbledon picks.

First, I went with my gut (over data because I didn't have the time) when picking the men's and women's sleepers - players outside the top 10 to go the furthest. I picked John Isner and Venus Williams who both lost first round. Guess I shouldn't listen to my gut feeling when making picks anymore.

So let's get to the picks that I actually put some thought into. I only looked at one stat when choosing my women's champion. Here are the results of the Williams sisters (both singles and doubles) at Wimbledon since 2000.

**Williams Sisters Wimbledon Results**
Year	Doubles Result	Venus Singles	Serena Singles
2000	WON	WON	SF
2001	3RD	WON	QF
2002	WON	RUP	WON
2003	3RD	RUP	WON
2004	-	2RD	RUP
2005	-	WON	3RD
2006	-	3RD	-
2007	2RD	WON	QF
2008	WON	WON	RUP
2009	WON	RUP	WON
2010	QF	QF	WON
2011	-	4RD	4RD

Since 2000, every year that the Williams sisters have played doubles at Wimbledon (highlighted), one of them also wins the singles title (8/8, with both Williams reaching the Final 4 times). Notice that it doesn't matter how they perform in the doubles, as long as they are playing. The 4 years that the Williams did not play doubles, often due to one sister being injured, only once did one of them win the singles title (Venus in 2005).

Serena and Venus entered to play doubles this year (they are currently in the 2nd round). Looking at past results, the probability that a Williams wins the single title given they play doubles is 1. So all I had to do was choose between Serena or Venus winning. I went with Serena because she is the higher ranked and Venus is coming back from injuries. As of this post, Serena has won her first 3 singles matches and is still alive.

For the men's, I looked at the winner's seed and previous year's performance since 2003 (Federer's first Wimbledon win).

**Men's Wimbledon Champions**
Year	Winner	Seed	Previous Result
2003	Roger Federer	4	1st
2004	Roger Federer	1	WON
2005	Roger Federer	1	WON
2006	Roger Federer	1	WON
2007	Roger Federer	1	WON
2008	Rafael Nadal	2	RUP
2009	Roger Federer	2*	RUP
2010	Rafael Nadal	2	-
2011	Novak Djokovic	2	SF

* Federer was the 2 seed, but he was technically the top seed after Nadal withdrew as the number 1 seed with injury before the 2009 tournament began.

Because Federer, Nadal and Djokovic have won 28 of the last 29 major titles, I have eliminated the rest of the field. I eliminated Federer because it has been one of the top 2 seeds to win every year since 2004. I went with Djokovic over Nadal because the defending champion has won 6 of the last 9 years (if I consider Federer the defending champ in 2009 since Nadal withdrew and Nadal the defending champ in 2010 since he had not defended his title.**) Also, Federer and Nadal each defended their first Wimbledon title by winning the following year that they played. This gives me belief that Djokovic will continue this trend and win his second straight Wimbledon title.

My men's pick is looking even better after Nadal lost in the second round. (Still, I'll be cheering for Roger Federer to win the title as he is my favorite player.)

** Under this argument, both Federer and Nadal would have been considered defending champions in 2010, so you could change the fraction to 6/10 defending champs winning.

MIT Sloan Sports Analytics Conference

2012-06-24T12:10:00.001-05:00

I came across a great article today (via Simply Statistics) describing the largest sports analytics (i.e. statistics) conference in the world, held at MIT. A few points that I found most interesting:

The guy that they highlight (Kirk Goldsberry) created the plot of the week last week.
They mentioned that they had a panel on tennis analytics for the first time this year, including Pete Sampras and Roger Federer's coach Paul Annacone and former player Todd Martin. Two great quotes:

On why tennis analytics is lacking: "There's no shared service," says Martin. "This isn't a team sport with a $500,000 budget for analytics."

"[Analytics] should be a strength of our game. Tennis is a game of patterns." - Craig O'Shannessy

This conference could be worth a vacation next year. Might depend on whether I have one or two great blog posts to report on.