Category: Sports Stats


MLS Elo Ratings Update

May 6th, 2012 — 8:01pm

Usually I would post this as another weekly entry in the 2plus2 thread for the 2012 MLS season, but since the forum got hacked about 10 days ago, and we seem to have at least another week of downtime ahead of us I wanted to throw a quick update here.

It’s here because the current MLS Elo page is pretty awful. I’m meaning to do something about that in the near future when I get a couple days of free time.

Also, these ratings tend to not mean much until every team has played about 10-12 games (strictly anecdotal experience, no mathematical reason why) so yeah there’s some funny stuff. But just in case someone was wondering what was going on recently…

MLS Elo Ratings
1564.6888 – Seattle Sounders
1549.3225 – Sporting Kansas City
1545.2690 – Real Salt Lake
1537.6687 – San Jose Earthquakes
1530.5509 – New York Red Bulls
1525.1645 – Los Angeles Galaxy
1522.0902 – Houston Dynamo
1515.8164 – Colorado Rapids
1506.4407 – Chicago Fire
1504.5356 – DC United
1490.4999 – Montreal Impact
1487.3706 – FC Dallas
1485.6442 – Vancouver Whitecaps
1480.2150 – Columbus Crew
1477.4835 – Philadelphia Union
1466.3639 – Portland Timbers
1455.9092 – Chivas USA
1452.0821 – New England Revolution
1402.8849 – Toronto FC

 MLS Single Table

Team Games W-D-L Points Goal Diff
San Jose Earthquakes 10 7-1-2 22 +10
Real Salt Lake 11 7-1-3 22 +6
Sporting Kansas City 9 7-0-2 21 +7
Seattle Sounders FC 8 6-1-1 19 +8
DC United 11 5-3-3 18 +5
Vancouver Whitecaps 9 5-2-2 17 +2
New York Red Bulls 9 5-1-3 16 +5
Colorado Rapids 10 5-0-5 15 +3
FC Dallas 10 3-3-4 12 -4
Chicago Fire 7 3-2-2 11 +0
Montreal Impact 10 3-2-5 11 -4
Los Angeles Galaxy 9 3-1-5 10 -3
New England Revolution 9 3-0-6 9 -4
Chivas USA 9 3-0-6 9 -6
Houston Dynamo 6 2-2-2 8 -1
Columbus Crew 8 2-2-4 8 -4
Portland Timbers 9 2-2-5 8 -4
Philadelphia Union 8 2-1-5 7 -4
Toronto FC 8 0-0-8 0 -12

Comment » | Sports Stats

BBC Article I’ve Been Sitting on Forever

October 26th, 2011 — 3:25pm

A cool look at the perception that black people are just that much faster than white people. And not as “that’s racist!” as you’d expect.

Every winner of the 100m since the inaugural event in 1983 has been black, as has every finalist from the last 10 championships with the solitary exception of Matic Osovnikar of Slovenia, who finished seventh in 2007.

Assuming that this success is driven by genes rather than environment, there is a rather obvious inference to make – black people are naturally better sprinters than white people. Indeed, it is an inference that seems obligatory, barring considerations of political correctness.

But here’s the thing. This inference is not merely false – it is logically flawed. And it has big implications not merely for athletics, but for the entire issue of race relations in the 21st Century.

The same analysis applies to the sprints, where success is focused on Jamaicans and African-Americans. Africa, as a continent, has almost no success at all. Not even West Africans win much.

The combined forces of Mauritania, Guinea-Bissau, Sierra Leone, the Republic of Guinea, Liberia, Ivory Coast, Togo, Niger, Benin, Mali, the Gambia, Equatorial Guinea, Ghana, Gabon, Senegal, Congo and Angola have not won a single sprinting medal at the Olympics or World Championships.

The fallacy, then, is simple. Just because some black people are good at something does not imply that black people in general will be good at it.

Imagine a similar argument using the Central African Bambuti, a black tribe more commonly known as Pygmies. With an average height of 4ft we could assert that the Bambuti are naturally better at walking under low doors. Would it be legitimate to extrapolate that black people in general have a natural advantage at walking under low doors?

Our tendency to generalise rests on a deeper fallacy – the idea that “black” refers to a genetic type. We put people of dark skin in a box labelled black and assume that a trait shared by some is shared by all.

Is it wrong to note 100m winners are always black?

Comments Off | Sports, Sports Stats

Two Fun Projects Back Up and Running

June 9th, 2011 — 8:12pm

This has been a big week for me. Two of the things I really enjoy doing are back up and running for public consumption.

First is the 2nd 2p2 NBA Auction which you can read about here and watch in real time here. Basically I created a blind auction platform where people can spend up to $1,500 to fill out an NBA roster of 10 players. The players are released 15 per day with a mostly random selection (all the top 100 players are guaranteed to go in the first week). Last time was a ton of fun with plenty of people overbidding early or snatching great values late. I bet no one learns though, and history repeats itself.

The second is the return of the MLS Elo Ratings! I talked about the difference in the new way I do the ratings a few months ago if you’re interested, otherwise just go check ‘em out. They’re certainly not perfect this early in the season, but I like the way they’re trending a lot more than I did the old method.

Comments Off | Sports, Sports Stats

Differences in the Two Elo Systems

March 23rd, 2011 — 7:23pm

Assuming that you read my last post about updating the MLS Elo Rating system, you’ve probably been on the edge of your seat waiting for the revised end-of-season ratings. Well, here you go!

The big difference is that, in my opinion, game results (win/tie/loss) were being heavily overvalued against goal differential, especially for playoff games. 10-15% of the games were having way too much effect on the overall end-of-season ratings that should be based, at least somewhat, on all 30-something games.

I’m much happier with the balance between the two now.

Comments Off | Sports, Sports Stats

Serious Updates to the Elo Ratings

March 21st, 2011 — 9:53pm

For the first time in recorded history, I’ve changed my mind on something.

After some input from a few people (including one of the guys from http://www.smartfootballrankings.com/) and a bit of experimenting I’ve made some very significant changes to how the Elo Ratings are calculated. As of right now I haven’t updated the original pdf, so you can still check out the basics for the original calculations. I expect to have the new version up with all the changes spelled out by the end of the week.

Or you can just read on!

The first and biggest change is that the ratings are now iterative. What does that mean exactly? Well it means that I run through the whole season 15 times in a row, recalculating after each game, in order to get the final ratings. I got to 15 loops because that’s the number of iterations I found to be necessary for the ratings to change less than 1/100 of 1% on each additional go around.

The two big pros I see from this method:

1) It makes sure that teams are accurately rated for each match earlier in the season. If Real Salt Lake turns out to be terrible this year (they won’t) we don’t want to give opponents too much credit for beating them early in the season when we thought they were the best team in the league.

2) It gives games more meaning. Each game is calculated in there 15 times now rather than just once. In a league that will only play 34 games this year, that means I should be able to get reliable ratings a little earlier in the year than halfway through this time, and hopefully have a better sense of what the results mean rather than just where the team started the season.

The biggest con:
I could not find one single example of anyone else doing this, so I kinda just played around with stuff until I was happy with how the numbers were coming out.

So with that gigantic looming potential negative in mind, allow me to explain some of the other changes.

K-factors, the values that change based on the game’s setting, have been drastically reduced. Last time it was 15 for a regular season game, 30 for a playoff game, and 45 for the MLS Cup Final. Now those numbers are 1, 1.25, and 1.5 respectively. This was done mostly for cosmetic reasons. When I kept the numbers at 15/30/45 for 15 iterations the ratings tended to span from 1200-1700 rather than the 1350-1625 that we’re used to and seemed to drastically overvalue the MLS Cup Final as a rating point. 1/1.25/1.5 got the numbers in line with where they were previously.

What I really should do is go back through the ratings with a few different variations, and then see how the real results match up to the expected to results given each matchups rankings. For instance, a team with a 100 point advantage playing at a neutral site should win 64% of the time. That would give us the most accurate model, but it’s a lot of work I haven’t put in yet.

As for the calculations, I still set all beginning-of-season ratings by halving the difference between the previous year’s end-of-season rating and the 1500 base. So if Philadelphia Union ended at 1450, they would start the next season at 1475. I will re-run the entire season’s results off of those base values after each match day, incorporating all new results from that day at the same time. One other idea I played with was changing the home field advantage in the calculations by team, but couldn’t decide if that was a good idea so I held off.

I’ll do a follow up post either tomorrow or Wednesday with the end of year rankings under each method. Suffice to say that I really like the newer versions output a lot more. I think the previous method gave too much weight to results and not enough to goal differential.

And one final note, if you’ve actually made it this far, I caution you to remember what the MLS Elo Ratings are and what they are not. They do have a large proponent of saying which team is the best in MLS, but it is not strictly a power ranking. It looks at the sum of a team’s results on the season, and rates them by most to least impressive.

Phew, that was a lot of words for a lot of work.

Comments Off | Sports, Sports Stats

Updating the last post

February 8th, 2011 — 5:44pm

The good news: Mom is much better now.

The bad news: (not shockingly) I found absolutely no way of predicting fumbles with any measure of success. The most interesting thing to me was that having a higher percentage of running plays was actually slightly negatively correlated with the average number of tackles it took before a team fumbled.

The other news: Didn’t do anything on Elo because it’s going to be an incredibly time-intensive process until I get a little better at python (which I “know” now!) and can script it all.

Comments Off | Sports Stats

Time For Some Stress Relief

February 3rd, 2011 — 8:08pm

I’ve had a pretty stressful couple days involving lots of hospital visits and phone calls, so I need something to take my mind off everything. As usual, that means it’s time to play with sports statistics.

Two leading candidates right now:

1) Fumbles in football

How easy is it to predict who will fumble a lot? How random is the distribution of which team recovers the fumble? Under what scenarios are fumbles more likely to be recovered by the offense/defense? etc

2) MLS Elo as a predictive tool

This is the MLS Elo rating system I created. I’ve been talking with a student in Paris though (gotta love the Internet) and I think we may have made a couple improvements to it. I’m going to test these improvements against another piece of data I found recently, which is the gambling odds of every regular season MLS game from 1999 to 2009. Can Elo beat the bookies? We’ll know soon but, spoiler alert, without any adjustments it put up a 15% ROI over a season in the French league.

Comments Off | Sports Stats

What Makes It Worth It

July 24th, 2010 — 11:24am

You should know by now that I’m a huge nerd. One of my prouder nerdy accomplishments was developing the MLS Elo ratings. Since posting those ratings and the pdf file that explains how I formulated everything I’ve received a couple e-mails a month from people asking for advice on how to do something similar or just thanking me for the work I’ve done.

There’s nothing cooler than getting emails like this from people. So if you’re building something, put it out there for people to see. The response you get may surprise you.

Comments Off | Sports, Sports Stats

Book 1: The Book Of Basketball

January 17th, 2010 — 8:02pm

As I said, I plan to read 50 books this year and this was #1.  Technically I did start the book before January 1, but it’s 700 freakin pages and I still read 300-something this year, so I’m counting it.

Simmons does a handful of cool things in this book which centers around his quest to rank the 96 best players of all time in order to fill out his hypothetical Hall of Fame.  The writeups for each player are enjoyable and well researched, so if you can get past the fact that he’s still a Celtics homer you should enjoy it.  Other fun wrinkles: the top 10 “what-ifs” of all time (What if Dr. J had never gone to the ABA?), the top 10 teams of all time (’96 Bulls got jobbed), his “wine-cellar team” (12 best players’ single years to making an unbeatable basketball team), and a fascinating few pages covering an interview with Bill Walton that makes me wish he was my grandfather.

Yeah it’s long, but it’s one of the best books out there if you want to learn more about professional basketball in America, and want to do it with some humor.

If you like basketball and have a few months to kill, definitely check it out.

The Book of Basketball: The NBA According to The Sports Guy

Comments Off | Books, Sports, Sports Stats

Bias in the Coaches Poll: Part 3

December 22nd, 2009 — 3:05pm

And now here we are.  The final entry.  Part 3.

How coaches from one conference vote for various conferences

I only looked at the six BCS conferences here because I have to imagine you don’t care very much about the differences in voting between the Sun Belt and Conference-USA.

There’s not a whole lot to say here, it shows up pretty much how you’d expect.  Conferences rank their members higher, rank conferences close to them hire (ACC and SEC, Big 12 and Big Ten) and rank teams who are toss-ups with their own much lower (Big East ranking Florida 6th).  I think that the Pac-10 wins “Worst voter” award, although the ACC is pretty close thanks to the breadth of bad votes (mostly due to Bobby Bowden).

I hope you’ve learned this by now, but click the pictures to get a full sized version.

ACC Teams

Big 12 Teams

Big East Teams

Big Ten Teams

Pac 10 Teams

SEC Teams

And one final little nugget, a breakdown of how the top 25 would shake out under each conferences voting patterns.  The numbers to the right of the team are the number of points they earned.

Comments Off | Sports Stats

Back to top

Bad Behavior has blocked 308 access attempts in the last 7 days.