Pro Quality. Fan Perspective.
Login-facebook
Around SBN: Tottenham's Plans for Northumberland Stadium Approved

Premier League 2010-11 Preview: Using SCIENCE To Project The Final Standings

With the English Premier League's start just days away, SB Nation Soccer editor Richard Farley decides to move away from a traditional predictions piece. All hail the power of the multi-core processor.

Aug 10, 2010 - With just days to go before the English Premier League (EPL) starts its 19th season, punditry is going on record with their predictions, so it's only fair that I also give you ammunition which, come next May, you can use to shoot me down from whatever pedestal onto which I've moved.  And I can guarantee you, I will have moved on, as preseason predictions are something I usually loathe.  After all, they're just glorified (if often highly-informed) lists.

This year, the SB Nation Soccer desk is taking a different approach, employing a method I've been using over the last couple of years.  The basic process goes something like this:  Look at each player's performance from the preceding season and try to determine his contributions to the team's goals for and goals allowed. Regress that performance as needed, and if a player is no longer with the same club, try to make the evaluation as team-neutral as possible.  Add in considerations for improvement or decline. Then take the player and plug him into his team's depth chart for the 2010-11 season and make an assessment regarding the playing time. With that, you can estimate how much a player will effect his team's goals for and allowed for this season.

If you read that with a combination of "Huh" and "Okay, but how, exactly," don't worry. That's half-intended because this isn't even half a science, but there are some huge benefits to this approach. With it, you can do things like this:

RkClubAvgWDLGFGA1stTop 4Top 7RelegatedBestWorstRange*
1 Manchester United 1.7 24.6 7.4 6.0 79.0 23.5 57.7% 98.1% 100.0% 0.0% 1 7 1-4
2 Chelsea 2.8 23.1 6.0 8.9 95.0 41.9 19.0% 88.4% 99.6% 0.0% 1 10 1-6
3 Arsenal 2.8 22.9 6.5 8.6 88.1 38.0 18.5% 88.2% 99.5% 0.0% 1 10 1-6
4 Tottenham Hotspur 5.3 19.5 7.4 11.1 68.4 41.9 1.4% 36.0% 89.5% 0.0% 1 17 2-9
5 Liverpool 5.4 19.2 8.0 10.8 62.2 37.9 1.3% 33.6% 88.0% 0.0% 1 17 2-9
6 Manchester City 5.5 18.7 9.1 10.2 54.2 31.7 1.4% 31.8% 87.2% 0.0% 1 18 2-9
7 Everton 6.1 18.5 7.6 11.9 63.7 43.1 0.7% 21.1% 79.6% 0.0% 1 17 2-10
8 Stoke City 10.0 14.6 8.7 14.7 46.1 46.2 0.0% 1.1% 18.1% 1.7% 2 20 5-16
9 Aston Villa 10.1 15.1 7.1 15.8 57.3 59.8 0.0% 0.9% 16.5% 1.8% 1 20 6-16
10 Fulham 12.5 13.2 7.6 17.2 48.2 60.2 0.0% 0.2% 4.6% 8.2% 3 20 7-19
11 Sunderland 12.6 13.1 7.9 17.0 45.7 57.0 0.0% 0.2% 4.5% 8.6% 3 20 7-19
12 Birmingham City 13.0 13.3 6.6 18.1 54.9 71.8 0.0% 0.1% 3.2% 10.5% 3 20 7-19
13 Bolton Wanderers 13.5 12.7 7.2 18.0 48.8 65.7 0.0% 0.1% 2.5% 13.4% 3 20 8-19
14 Newcastle United 13.7 12.1 8.6 17.3 39.4 53.5 0.0% 0.0% 2.2% 15.3% 4 20 8-19
15 West Ham United 14.2 12.4 6.7 18.9 51.3 73.1 0.0% 0.1% 1.8% 20.0% 2 20 8-19
16 Blackburn Rovers 14.6 11.9 7.5 18.6 44.4 65.0 0.0% 0.0% 1.2% 22.3% 4 20 8-19
17 West Bromwich Albion 14.9 11.6 7.8 18.6 41.5 62.2 0.0% 0.1% 1.0% 26.0% 4 20 9-20
18 Wigan Athletic 15.7 11.5 6.5 20.0 49.2 78.8 0.0% 0.0% 0.5% 35.0% 5 20 9-20
19 Wolverhampton Wanderers 16.0 11.0 7.1 19.8 43.2 71.6 0.0% 0.0% 0.6% 40.3% 5 20 10-20
20 Blackpool 19.7 6.7 6.7 24.6 28.9 86.5 0.0% 0.0% 0.0% 96.9% 8 20 18-20

* -  the Range represents two standard deviations in either direction of the clubs' mean finish.

Those are the projections you get when you take the player evaluations, aggregate them into team performance, and then write software to simulate 10,000 seasons. Every match is played adjusting for home-and-road conditions, the goal-scoring level of the league, and quality of opposition. And just as each match in real life has an inherent variability to it, so do the simulated matches, which is why we've run 10,000 seasons - to tease out anomalous results.

Thanks to this approach, we can provide more than just an ordered list. Scoring tons of goals or too many allowed? We get an idea of why a team may finish in a certain position. How likely is a team to make Europe, Champions League or win the league? We can make an informed guess. Could they get relegated? If so, how often?  And even considering the anomalous result, what's a team's best and worst possible finish? Extremes aside, what's a realistic range of performance? This method allows us to assess all those things.

But let's not get too carried away. At its heart, the tool is just people making qualitative assessments about players.  Those assessments are solidified (if you will), aggregated and then processed by a number of scripts, but the core is still looking at a player and asking "how is he going to do this year?"

I would like to think this process forces us to take a more detailed look at each player - incorporate more data, more rigorously than others might - but all the matters is the end product. You get to be the judge of that.

Over the next three days, we'll incorporate these end results into our club profiles, taking those questions marks in the table, filling them in, and giving you our outlook on the 2010-11 Premier League season.

Do you like this story?

Richard_medium

Richard Farley

Soccer Editor

Richard Farley covers The Beautiful Game for SBNation.com.

A resident of San Diego, Richard projects as a one-footed right back with a poor first touch. His "likes" include the royal we and... Read full bio


Comments

Display:

Interesting table

Some of the wide ranges on a few teams are quite interesting. Nice job and write up.

by JoshuaR on Aug 10, 2010 3:39 PM EDT reply actions  

I really should ...

… have published the inter-quartile range. That’s the z +/- 2 range, so yeah – it’s big. Deceivingly so.

-rf

by Richard Farley on Aug 10, 2010 3:50 PM EDT up reply actions  

Box plots perhaps?- graph these with the table as the Y and the nominal as the x?

I’d like to know the ‘jerk’ on these rankings.
Possession as a factor in addition to goals (there’s a good deal of correlation between possession and wins) would make this more accurate… but I think this is surprisingly good as it is.

"Voetbal is pas totaal als je wint"- Coach Adun
"The greatest sin is to spurn the gift"- Coach Alistair

by Londonjoe on Aug 10, 2010 5:47 PM EDT up reply actions  

Possession ...

.. is taken into account at a lower level, but I admit that might not be the best approach.

-rf

by Richard Farley on Aug 10, 2010 6:05 PM EDT up reply actions  

I think it might be pretty difficult to weigh posession, and to quantify how individual players affect the possession in a game- for example, I feel that Cole will be good for Liverpool’s overall possession, but I’m not sure to what degree.
Will you post the plots of the simulations or a curve of best fit? I’m really curious about that.

"Voetbal is pas totaal als je wint"- Coach Adun
"The greatest sin is to spurn the gift"- Coach Alistair

by Londonjoe on Aug 10, 2010 6:13 PM EDT up reply actions  

If not here ...

… I’ll try to put them on rffootball.com at some point. need to but this week’s previews done, but feel free to stay on me about it. the plots aren’t that hard to get done.

-rf

by Richard Farley on Aug 11, 2010 7:07 PM EDT up reply actions  

and thanks ...

… the results do pass the smell test, and previous times i’ve used these have …

… well, for the big leagues they seem to work well. to lower leagues, issues.

-rf

by Richard Farley on Aug 10, 2010 6:06 PM EDT up reply actions  

Comments For This Post Are Closed

119639_knicks_raptors_basketball_small

Jeremy Lin's Game-Winner Lifts Knicks Over Raptors, 90-87

DETROIT, MI - FEBRUARY 14: The Detroit Red Wings celebrate their NHL record breaking 21st consecutive home victory by beating the Dallas Stars 3-1 at Joe Louis Arena on February 14, 2012 in Detroit, Michigan. (Photo by Gregory Shamus/Getty Images)

Red Wings Set NHL Record With 21 Straight Wins At Home

Courtesy of Sports Illustrated

Kate Upton's Sports Illustrated Swimsuit Issue Cover