Sep 15, 2017
Sep 15, 2017

# Using Premier League odds to predict the league table

## Predictions for the 2017/18 Premier League season

The start of the Premier League season is awash with league position predictions. Instead of the subjective opinions of so-called “experts”, bettors can use more efficient methods to get a better idea of what the final league table will look like. This article explains how to use odds to predict the Premier League table.

Betting Resources has a number of valuable articles explaining how and why Pinnacle’s match betting odds for a soccer game offer the most value and are amongst the most efficient in the online sports betting market.

Teams priced at 1.50 win about 67% of the time - knowing this, we can say that if a team is priced at 1.50 it should have roughly a 67% probability of winning.

Armed with this knowledge we can use these match odds to determine whether a team has been lucky or unlucky, and where it ultimately might finish at the end of a season.

I have previously considered this methodology investigating the unlikelihood of Leicester City winning the 2015/16 Premier League. In this article I’m going to take a more general look at last season’s performances and try to make a few predictions based on those for the new 2017/18 season.

### Using Pinnacle’s closing line to predict the probability of results

Pinnacle’s closing line in a 1X2 soccer betting market is arguably an excellent measure of the ‘true’ probability of a particular result occurring. Of course, we can’t ever know what a true result probability is before a match is played, but based on hundreds or thousands of similarly priced matches, the percentage of them that finish with the predicted results closely matches the probability implied by the odds.

What this means in practice is that teams priced at 1.50 win about 67% of the time. Knowing this, we can say that if a team is priced at 1.50 it should have roughly a 67% probability of winning; if priced at 5.00, about a 20% chance.

### Removing the margin to find ‘true’ odds

Of course, to get a genuinely reliable measure of the result probability from the match odds, it is first necessary to remove Pinnacle’s margin. Naturally, exactly how Pinnacle will apply their margin to a set of ‘true’ or fair odds in the first place will remain a secret.

There are, however, a number of possible candidate methods that yield intuitively reasonable solutions that take into account the favourite–longshot bias, where prices on longshots are shortened proportionally more than they are on favourites.

Potentially, the outright Premier League betting market still overrates Manchester United as a dominant force after the Ferguson era.

One of these uses what is known as a logarithmic function. A logarithm is simply defined as the power to which a number must be raised in order to get some other number. For example, the base 10 logarithm of 100 is 2, because 10 to the power 2 = 100 and hence log10100 = 2.

The logarithm function for applying a margin to a set of ‘true’ odds assumes that the same power number is applied to each of the 1X2 odds. This is perhaps easiest to understand by means of an example.

Consider the fair book 2.00, 3.00 and 6.00 for 1X2 odds. Let’s now apply a margin by raising these numbers to the power 0.95. The resultant book will have 1X2 odds of 1.93, 2.84 and 5.49, with a margin (or overround) of 5.2%, because the implied result probabilities add up to 105.2%. You can see that the longer away odds have been shortened considerably more (by 9.4%) than those of the shorter home odds (by 3.5%).

Finding a set of ‘true’ odds from a published book of 1X2 odds simply reverses this process. The ‘logarithmic function’ model is intuitive in that it accords with the majority standard economic theory of risk and utility.

Assuming a logarithmic sensitivity to risks, a bettor would regard the difference between 2.00 and 1.93 to be equivalent to the difference between 6.00 and 5.49. The latter is greater than the former, but bettors are arguably less sensitive to lower probability events, so won’t make so much of a fuss over a proportionally less valuable price.

### Simulating soccer results and league points with Monte Carlo

Once the true outcome probabilities have been calculated, we need to decide the result of a match. Obviously in the real world, we know what that is once the game has finished, but here we are interested in expected outcomes.

Using the match odds to model, no team finished more than four places away from expectation, whilst six matched it exactly.

Like expected value and expected goals, this is really just a method for determining what we can expect to happen based on the forecasting model parameters, in this case the expected result probabilities as implied by Pinnacle’s closing odds.

One way to simulate expected results is to randomise outcomes over many thousands of matches by means of a Monte Carlo simulation. If the random number generated is less than the probability of the result, then assign that result to the game (i.e. it happened); if it is greater than the result probability, then assign the null result (i.e. it didn’t happen).

By repeating this process over 38 games for an entire season for each team we can simulate the expected number of points won. The Monte Carlo method gives us a normal distribution of expected points totals for each team. Using Pinnacle’s closing odds for the 2016/17 season these are shown below for the winners Chelsea, previous winners Leicester City and relegated Sunderland.

The average expected points total for Chelsea according to this model was 77.7. In fact we know they finished with 93 and this can imply two things.

Firstly, this model may not be a wholly accurate method for predicting the Premier League table. Indeed, according to the actual match odds, Manchester City should have won the title last season with an expected points total of 81.3. Of course, no model is wholly accurate, but some are better than others.

Assuming a logarithmic sensitivity to risks, a bettor would regard the difference between 2.00 and 1.93 to be equivalent to the difference between 6.00 and 5.49.

Secondly, it is known that model predictions are typically narrower than real life, with a range of expected points totals across all teams that is smaller than what actually occurs. Sunderland, who were relegated with 24 points had an expected total based on their season’s match odds of 32.1. This implies that whilst Chelsea were luckier than expected, Sunderland were unluckier.

True Score Theory tells us that the observed outcome is equal to (the variance in) skill plus (the variance in) luck. The forecasting model only considers the skill part, not the luck.

Despite these caveats, using the match odds to model the finishing league table did prove fairly accurate; at least in terms of positions. No team finished more than four places away from expectation, whilst six matched it exactly and another five were within one place.

## Premier League expected points

 Team Actual points Expected points Actual position Expected position Difference Chelsea 93 77.7 1 2 1 Tottenham 86 72.8 2 5 3 Man City 78 81.3 3 1 -2 Liverpool 76 74.2 4 4 0 Arsenal 75 74.3 5 3 -2 Man United 69 72.7 6 6 0 Everton 61 55.7 7 8 1 Southampton 46 57.5 8 7 -1 Bournemouth 46 43.5 9 12 3 West Ham 45 43.6 10 11 1 West Brom 45 40.8 11 14 3 Leicester 44 50.0 12 9 -3 Stoke 44 43.1 13 13 0 Crystal Palace 41 44.8 14 10 -4 Swansea 41 40.0 15 15 0 Watford 40 38.0 16 17 1 Burnley 40 33.2 17 19 2 Hull 34 34.7 18 18 0 Middlesbrough 28 38.2 19 16 -3 Sunderland 24 32.1 20 20 0

### Premier League predictions: Using the odds directly

Another way to simulate results and points is to skip the Monte Carlo process altogether. If a home team has fair odds of 2.0, 3.0 and 6.0 for home, draw and away respectively, this implies its expected points total for the match is (50% x 3 points) + (33.33% x 1 point) + (16.67% x 0 points) = 1.833 points. By contrast the away team would have an expected match points total of 0.833.

The simplest model for predicting the 2017/18 Premier League table is to assume that the past is the key to the future.

A Monte Carlo method with a large enough number of repetitions would arrive at the same answers, but this method is an easier shortcut. To demonstrate equivalence, the expected league points totals for the two methods are compared below. However, its shortcoming compared to Monte Carlo is that it provides us with no measure of error and variation. As a consequence it is unable to estimate the probability of finishing in each league position.

## Premier League expected points from odds

 Team Actual points Expected points (Monte Carlo) Expected points (direct calculation) Chelsea 93 77.7 77.7 Tottenham 86 72.8 72.8 Man City 78 81.3 81.3 Liverpool 76 74.2 74.1 Arsenal 75 74.3 74.4 Man United 69 72.7 72.8 Everton 61 55.7 55.7 Southampton 46 57.5 57.5 Bournemouth 46 43.5 43.5 West Ham 45 43.6 43.6 West Brom 45 40.8 40.8 Leicester 44 50.0 50.0 Stoke 44 43.1 43.1 Crystal Palace 41 44.8 44.8 Swansea 41 40.0 40.0 Watford 40 38.0 37.9 Burnley 40 33.2 33.3 Hull 34 34.7 34.7 Middlesbrough 28 38.2 38.2 Sunderland 24 32.1 32.1

### Predictions for the 2017/18 Premier League season

The simplest model for predicting the 2017/18 Premier League table is to assume that the past is the key to the future. Aside from other factors like new player transfers, the next table shows the number of times in the 100,000 Monte Carlo repetitions that each team (not relegated) finished first in the 2016/17 table, and by extension the probability of winning the 2017/18 teams and their implied fair odds. These are then contrasted with Pinnacle’s pre-season title winning odds to identify potential value.

## Predictions for the 2017/18 Premier League season

 Team Number of title wins in 100,000 Expected probability of being 2017/18 champions Implied fair betting odds Pinnacle’s betting odds (as of 10th August 2017) Man City 44,096 44.10% 2.27 2.65 Chelsea 23,406 23.41% 4.27 4.70 Arsenal 11,889 11.89% 8.41 12.00 Liverpool 11,812 11.81% 8.47 12.00 Tottenham 8,552 8.55% 11.69 9.15 Man United 8,298 8.30% 12.05 4.80 Southampton 99 0.10% 1,010 N/A Everton 37 0.037% 2,703 N/A Leicester 4 0.004% 25,000 N/A Bournemouth 1 0.001% 100,000 N/A West Ham 0 0% N/A N/A West Brom 0 0% N/A N/A Stoke 0 0% N/A N/A Crystal Palace 0 0% N/A N/A Swansea 0 0% N/A N/A Watford 0 0% N/A N/A Burnley 0 0% N/A N/A

According to our model, Manchester City, Chelsea, Arsenal and Liverpool all offer some value. This is at the expense of Manchester United and to a lesser extent Tottenham. Potentially, the outright Premier League betting market still overrates Manchester United as a dominant force after the Ferguson era and Mourinho as a manager. Alternatively, the model has understandably not considered Manchester United’s high-profile summer signings of Romelu Lukaku and Nemanja Matic.

### A Bayesian Process

A significant drawback of this model is that it requires a completed season of results to make inferences about the next one. However, there is no reason why we should restrict ourselves to this limitation. We could, instead, choose to run the model based on the games played to date, or on a rolling basis over the last 38 matches, updating our modelled probabilities and implied title winning betting odds as we go along.

Of course, we would also need to take into account real points already won as the season progresses. Such a Bayesian approach mimics a method of statistical inference where the probability for a hypothesis is updated as more evidence or information becomes available. We could also repeat this methodology for other markets like Premier League relegation betting and a top-four finish.

Although this model is simple, it evidently makes predictions that are not wildly dissimilar to those made by Pinnacle’s traders.

Betting Resources - Empowering your betting

Pinnacle’s Betting Resources is one of the most comprehensive collections of expert betting advice anywhere online. Catering to all experience levels our aim is simply to empower bettors to become more knowledgeable.