
NBA Finals Simulation Predicts "Lakers in 7" (and other insights)


The Experiment

With sports analytics on the rise and this year’s NBA Finals under way, I thought it would be interesting to construct a rudimentary model projecting the outcome of the 2010 NBA Finals.

To gather data, I referenced the NBA Encyclopedia – Playoff Edition from NBA.com. I aggregated data points containing the year, game number, home/away teams, and home/away scores from all Finals games since 1946 (363 games).

These data points were fed into a model which ran 10,000 Monte Carlo simulations of the NBA Finals.  We can use the results of these simulations to draw insights about the outcome of the series.

Some highlights:

  • In 64% of the simulations, the Lakers won the series
  • The most likely outcome was a Lakers sweep, representing 11% of the simulations
  • The probability of a 7-game series was 29%
  • Given the current state of the series (Lakers up 2-1), the chance of the Celtics winning the championship is now just 25%

The Results

After collecting the necessary data, I calculated three key statistics: home court winning percentage by game number, “streaking” momentum probability, and a weighted expected winning percentage based on the regular season stats.

These stats are obviously not fully independent, nor do they represent perfectly clean input data (for example, which teams played “home” in which games has changed during the NBA’s history).  However, they each provided some interesting insights into the historical outcome of NBA Finals match-ups.

The historical home team winning percentage by game is shown below.

[Chart: Home Team Winning Percentage by Game]

Throughout the life of the NBA, games 3 and 4 have been played at the home of the team with the worse regular-season record.  The impact of regular season records is evident in the dip seen during those games in the chart above.

The propensity to have a winning streak is also interesting:

[Chart: "Streaking" Momentum Probability]

As you might expect, with each consecutive win your chances of winning the next game go up.  However, what I found very interesting is how low the historical chances are of winning a second game in a row. I have a few theories on what might be influencing that statistic:

  • The Finals feature the two most competitive teams, so neither is likely to dominate the other.  This makes a multi-game winning streak in Finals play less likely than some back-and-forth between the teams.
  • Factors such as home court advantage could be influencing the numbers, as the “home” team switches often throughout the series.

My final statistic was a simple weighted average of the two teams’ regular season winning records.  I used this as a baseline probability that the Lakers would win any given game in the series (53.3%).

To simulate the games in the series, I took each of these three inputs and weighted them equally in a 10,000-iteration Monte Carlo simulation of the series.  There are obviously countless other ways I could have approached the problem or chosen to weight these statistics.  I chose this rudimentary “equal weighting” methodology to provide some basic insights into how my inputs would combine to create simulated outcomes.
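
For readers who want to experiment, here is a minimal Python sketch of an equal-weighted series simulation along these lines. It is not the model used for this post. Only the 53.3% baseline comes from the text; the home-court and momentum tables are made-up placeholders, and the names (HOME_WIN_PCT, STREAK_CONTINUE, lakers_win_prob, simulate_series, run) as well as the way the three inputs are turned into a Lakers win probability are assumptions made for illustration.

```python
import random

# Only BASELINE_LAKERS (53.3%) comes from the post. Every other number below
# is a made-up placeholder, not a value read from the charts.
BASELINE_LAKERS = 0.533
HOME_WIN_PCT = {1: 0.70, 2: 0.70, 3: 0.55, 4: 0.55, 5: 0.60, 6: 0.60, 7: 0.80}  # hypothetical
STREAK_CONTINUE = {1: 0.45, 2: 0.55, 3: 0.60}  # hypothetical: P(streak of k becomes k+1)
LAKERS_HOME_GAMES = {1, 2, 6, 7}  # 2-3-2 format with the Lakers holding home court


def lakers_win_prob(game_no, history):
    """Equal-weight average of the three inputs, each expressed as P(Lakers win)."""
    home = HOME_WIN_PCT[game_no]
    p_home = home if game_no in LAKERS_HOME_GAMES else 1.0 - home
    if not history:
        # No streak exists before game 1, so average the two remaining inputs.
        return (p_home + BASELINE_LAKERS) / 2.0
    last = history[-1]
    streak = 1
    while streak < len(history) and history[-1 - streak] == last:
        streak += 1
    cont = STREAK_CONTINUE[streak]
    p_momentum = cont if last == "L" else 1.0 - cont
    return (p_home + p_momentum + BASELINE_LAKERS) / 3.0


def simulate_series(rng, history=""):
    """Play out a best-of-seven from a starting history such as "LCL"."""
    while history.count("L") < 4 and history.count("C") < 4:
        p = lakers_win_prob(len(history) + 1, history)
        history += "L" if rng.random() < p else "C"
    return history


def run(n=10_000, history="", seed=0):
    """Return (share of simulations won by the Lakers, average series length)."""
    rng = random.Random(seed)
    results = [simulate_series(rng, history) for _ in range(n)]
    lakers = sum(r.count("L") == 4 for r in results) / n
    avg_len = sum(len(r) for r in results) / n
    return lakers, avg_len


if __name__ == "__main__":
    print(run())               # from the start of the series
    print(run(history="LCL"))  # Lakers up 2-1 after three games
```

Because the placeholder tables are invented, the printed numbers will not match the results reported here. Swapping in the historical values from the two charts would be the first step toward a closer comparison.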

After running the simulations, the following statistics surfaced:

From beginning of NBA Finals:

  • Probability Lakers win series: 64%
  • Probability Celtics win series: 36%
  • Most likely game-by-game series outcome: A Lakers sweep (11% chance)
  • The next four most likely series outcomes were the four permutations of a Lakers Championship in 5 games (these, combined with the previous stat, meant a 26% chance of the Lakers winning in 5 games or fewer).
  • Least likely series outcome: C-C-C-L-L-L-C
  • Expected length of series: 5.8 games

Given what has occurred through 3 games of the NBA Finals (with the Lakers up 2-1 as of Tuesday night):

  • Probability Lakers win series: 75%
  • Probability Celtics win series: 25%
  • Most likely series outcome: L-C-L-L-C-L (Lakers in 6)
  • Least likely series outcome: L-C-L-L-C-C-C (Celtics in 7)
  • Expected length of series: 6.2 games

Here are the chances of each possible remaining outcome:

  • Lakers in 7: 31%
  • Lakers in 5: 22%
  • Lakers in 6: 21%
  • Celtics in 7: 15%
  • Celtics in 6: 10%
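
As a quick sanity check, the listed outcomes can be tied back to the summary numbers above. The short Python snippet below is only an arithmetic check on the rounded percentages, not part of the original model. The listed values add up to 99% rather than 100% because of rounding, so the expected length is normalized by their total.

```python
# Remaining-outcome probabilities as listed above (rounded, so they sum to 0.99).
outcomes = {
    ("Lakers", 7): 0.31,
    ("Lakers", 5): 0.22,
    ("Lakers", 6): 0.21,
    ("Celtics", 7): 0.15,
    ("Celtics", 6): 0.10,
}

total = sum(outcomes.values())
lakers = sum(p for (team, _), p in outcomes.items() if team == "Lakers")
# Probability-weighted series length, normalized by the listed total.
expected_len = sum(games * p for (_, games), p in outcomes.items()) / total

print(f"total listed probability: {total:.2f}")
print(f"Lakers win series: {lakers:.2f}")
print(f"expected length: {expected_len:.1f} games")
```

The Lakers' outcomes sum to 74%, consistent with the 75% figure once rounding is allowed for, and the weighted length comes out to about 6.2 games.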

Interestingly, even if the Celtics win game 4 (tying the series 2-2), the Lakers are still more favored than they were before the series started (60% vs 53%).

The Methodology

I found the process of extracting and analyzing the data to be quite educational.  If you’re curious about how I arrived at these numbers, read on.

Game-by-Game Home Court Advantage Explained

To create the game-by-game home court advantage for the Finals, I used all data points since the NBA Finals began in 1946.  For each game number, the statistic is simply the percentage of games won by the home team across the entire data set.

For those of you familiar with the NBA Finals format, you may know that the format changed from 2-2-1-1-1 to 2-3-2 after the 1984 Finals. This means that games 1, 2, 6 and 7 have been played at the superior team’s home court for the past 25 years. Prior to 1985, the 2-2-1-1-1 Finals format held steady with a few exceptions.

The older series format held games 1, 2, 5 and 7 at the superior team’s home court from 1946-1984 (39 years). I initially wanted to use only the newer playoff format to avoid inflating the game 5 home winning percentage (game 5 was held at the superior team’s home court for 39 years) and deflating the game 6 home winning percentage (game 6 was held at the inferior team’s home court for 39 years).  However, to have enough data points for every game number, I aggregated the two playoff formats and used all 64 years of Finals data.
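
The calculation itself is a simple grouped percentage. Here is a small Python sketch of it, assuming each game is stored as a (year, game number, home score, away score) tuple. That layout and the function name home_win_pct_by_game are assumptions for illustration, and the sample rows are invented rather than real Finals scores.

```python
from collections import defaultdict


def home_win_pct_by_game(games):
    """games: iterable of (year, game_no, home_score, away_score) tuples."""
    wins = defaultdict(int)
    played = defaultdict(int)
    for _year, game_no, home_score, away_score in games:
        played[game_no] += 1
        if home_score > away_score:
            wins[game_no] += 1
    # Share of games won by the home team, keyed by game number.
    return {g: wins[g] / played[g] for g in sorted(played)}


# Invented rows purely for illustration; these are not real Finals results.
sample = [
    (1900, 1, 101, 95),
    (1900, 2, 88, 92),
    (1901, 1, 99, 90),
    (1901, 2, 110, 100),
]
print(home_win_pct_by_game(sample))  # {1: 1.0, 2: 0.5}
```

Restricting the input to series from 1985 onward would give the "newer format only" version discussed above, at the cost of far fewer games per game number.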

Momentum Analysis Explained

Every game that is played (except for the first in the series) is an opportunity to continue a streak.  Streaks can be as short as two games and as long as four (since, after winning four games, the series has ended).  Streaks also end organically when the series ends, so we have to be careful to not count “end of series” games as missed opportunities to continue streaks.

A 4-game sweep (as LA accomplished in this year’s Utah series) contributes exactly 3 data points to our momentum analysis:

  • Given one win, what was the outcome of the second game?
    • In this case, the result is a successful conversion of a 1 game streak into a 2 game streak
  • Given two consecutive wins, what was the outcome of the third game?
    • In this case, the result is a successful conversion of a 2 game streak into a 3 game streak
  • Given three consecutive wins, what was the outcome of the fourth game?
    • In this case, the result is a successful conversion of a 3 game streak into a 4 game streak

Note that we do not count mini-streaks within a streak as their own streaks (for example, the third win in a 3-game streak doesn’t also count as the second win in a two-game streak).
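
One way to read these counting rules is that every game after the first is a single opportunity, recorded at the length of the streak that was alive going into it. The Python sketch below follows that reading. It is an interpretation rather than the code behind the chart, and the function name streak_conversions and the "L"/"C" string encoding are assumptions.

```python
from collections import defaultdict


def streak_conversions(winners):
    """winners: game-by-game winners of one series, e.g. "LLLL" or "LCLLCL".

    Returns {streak_length: [conversions, opportunities]}.
    """
    tally = defaultdict(lambda: [0, 0])
    streak = 1  # the winner of game 1 holds a one-game streak
    for prev, cur in zip(winners, winners[1:]):
        tally[streak][1] += 1      # one opportunity at the current streak length
        if cur == prev:
            tally[streak][0] += 1  # streak extended by one game
            streak += 1
        else:
            streak = 1             # the other team starts a new one-game streak
    return dict(tally)


print(streak_conversions("LLLL"))    # {1: [1, 1], 2: [1, 1], 3: [1, 1]}
print(streak_conversions("LCLLCL"))  # {1: [1, 4], 2: [0, 1]}
```

Each game is counted exactly once, so a sweep yields three data points and no mini-streaks are double counted. Nothing is recorded after the final game, so the end of a series never shows up as a missed opportunity.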

We chose to exclusively use historical Finals data for “streakiness.” We considered using 2009-2010 regular season data for Lakers and Boston streakiness but decided against it for two reasons.

  • There were not enough data points from the regular season to provide a good basis for analysis
  • The characteristics of a “streak” in the regular season are quite different, as they can span different teams and stretch far beyond the “4 game” limit of a playoff series.

Head to Head Winning Percentage Explained

To create the head-to-head winning percentages, I simply looked at each team’s regular season record (50-32 for the Celtics and 57-25 for the Lakers) and determined that the Lakers’ winning percentage was 14% larger than the Celtics’.

I then constructed a head-to-head winning probability for the Lakers that was 14% better than the complementary winning probability of the Celtics.
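
The arithmetic can be reproduced in a few lines of Python. This sketch assumes "14% better" means the Lakers' probability is 1.14 times the Celtics' and that the two probabilities sum to 1. That reading is an assumption, but it does reproduce the 53.3% baseline quoted earlier.

```python
celtics_wins, celtics_losses = 50, 32
lakers_wins, lakers_losses = 57, 25

celtics_pct = celtics_wins / (celtics_wins + celtics_losses)
lakers_pct = lakers_wins / (lakers_wins + lakers_losses)

ratio = lakers_pct / celtics_pct   # 1.14, i.e. 14% larger
p_lakers = ratio / (1 + ratio)     # solves p_L = ratio * p_C with p_L + p_C = 1
p_celtics = 1 - p_lakers

print(f"ratio: {ratio:.2f}")
print(f"Lakers per-game probability: {p_lakers:.3f}")
print(f"Celtics per-game probability: {p_celtics:.3f}")
```

Since both teams played 82 games, the ratio reduces to 57/50, and the Lakers' probability to 57/107, or about 53.3%.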

Conclusion

I hope you enjoyed learning about my experience simulating the NBA Finals using statistics.  There are obviously a number of areas where this model could be expanded and improved, and I hope to explore them in the future.

Thanks to RJMetrics for supporting this small project as part of my summer internship.  If your web-based business needs better insight into its backend data, RJMetrics can help you measure, manage, and monetize better.  Give it a try!

Comments

Great Job

I really liked the graphics and all the details you presented. Thank you for all the statistical analysis and explanation of your methodology. I hope you enjoy the rest of your summer internship too.

by ClipperTheorist on Jun 10, 2010 1:55 PM PDT

woah

Queensbridge. Littlerock.

"Derek Fisher shouldn't be allowed to shoot unless theres fewer than one second on the shot clock" - Kelly Dwyer

by bluexfalcon on Jun 10, 2010 2:11 PM PDT

wow

this is some nice stuff.

by suzie-q on Jun 10, 2010 4:04 PM PDT

damn bro, you could've just said, "Lakers win bitchez" and I still would've believed you...

Today's sports media excels at over-reaction to a single event and specializes in hyperboles. But hey, it's that or my biochem textbook...

by Mike1204 on Jun 10, 2010 7:23 PM PDT

stop trollin

Queensbridge. Littlerock.

"Derek Fisher shouldn't be allowed to shoot unless theres fewer than one second on the shot clock" - Kelly Dwyer

by bluexfalcon on Jun 10, 2010 10:19 PM PDT

yep

they call a blocking foul when a player is driving out of control into the lane. They call ticky tack fouls on one end and leave it up to the other big men on the other side to fight through every bit of contact. They allow players to continuously be hit on the arms instead of calling the slaps. They allow players to throw someone in the air on a jumpball even though they were calling fouls for that sort of thing before. They sure let the better team play ball.

by Marty Mart on Jun 11, 2010 8:08 AM PDT

Listen if you want people to agree with you...

THEN GO AND COMMENT ON A CELTIC’S BLOG!

You’re not going to find anybody who’s going to agree with you here.

If you have a debate with a scholar, you can win. If you have a debate with an ignorant person, you will definitely lose.

Just take the ball inside- LakersForDeuce and just about everybody else on SSR

by akb24b on Jun 11, 2010 11:00 AM PDT

He doesn't want people to agree with him.

He just wants human interaction.

/troll is lonely

"Our deepest fear is not that we are inadequate. Our deepest fear is that we are powerful beyond measure."

http://www.silverscreenandroll.com/ - Visit, and be loved. Troll, and die a painful death. =]]

by Saurav A. Das on Jun 11, 2010 11:08 AM PDT

Lakers win

For me its the consistent inconsistency that concerns me - PAGFL
It's always AMMO Time, in spirit- DexterFishmore

by 99bc99 on Jun 11, 2010 11:03 AM PDT

This is what I've aspired to

Great job, love it, nice work.

For me its the consistent inconsistency that concerns me - PAGFL
It's always AMMO Time, in spirit- DexterFishmore

by 99bc99 on Jun 11, 2010 10:56 AM PDT

I just put down $500 million dollars on the Lakers' money line based on this post

You’d better be right, Brent Linksy

"This is not a game for boys. This is a game for men." - Phil Jackson

by Gil Meriken on Jun 11, 2010 1:37 PM PDT

Hollinger was wrong the whole playoffs about Boston winning each series.

These predictions are going to suffer the same fate.

But it’s been a good series so far, you guys have a good site.

by angryguy77 on Jun 14, 2010 3:35 PM PDT

It's nice being up 3-2, got to say that.

Two more games at our house though. Let’s see what happens tomorrow my man, Celtics aren’t at home anymore.

For me its the consistent inconsistency that concerns me - PAGFL
It's always AMMO Time, in spirit- DexterFishmore

by 99bc99 on Jun 14, 2010 8:06 PM PDT

lol

Queensbridge. Littlerock.

"Derek Fisher shouldn't be allowed to shoot unless theres fewer than one second on the shot clock" - Kelly Dwyer

by bluexfalcon on Jun 15, 2010 9:56 PM PDT

better find a ball team to do it with...

what pulled into the Staples center today won’t get it done in game 7 ;)

Seriously though, here’s to an epic game 7 and may the best team win.

by poorwebguy on Jun 16, 2010 1:42 AM PDT

Wow
Here are the chances of each possible remaining outcome:

Lakers in 7: 31%

Queensbridge. Littlerock.

"Derek Fisher shouldn't be allowed to shoot unless theres fewer than one second on the shot clock" - Kelly Dwyer

by bluexfalcon on Jun 16, 2010 10:44 AM PDT

This guy will look crazy good if Lakers win in 7

"This is not a game for boys. This is a game for men." - Phil Jackson

by Gil Meriken on Jun 16, 2010 12:00 PM PDT

When

For me its the consistent inconsistency that concerns me - PAGFL
It's always AMMO Time, in spirit- DexterFishmore

by 99bc99 on Jun 16, 2010 10:03 PM PDT

Tomorrow night

"This is not a game for boys. This is a game for men." - Phil Jackson

by Gil Meriken on Jun 16, 2010 11:54 PM PDT

Haha

Mean “when” instead of “if” … was the power of positive thinking! :P

For me its the consistent inconsistency that concerns me - PAGFL
It's always AMMO Time, in spirit- DexterFishmore

by 99bc99 on Jun 18, 2010 9:40 AM PDT

Holy shit.

Holy shit. That’s all I can say. Beautiful model, a decent, non-unique set of assumptions, and an almost textbook series turn makes this guy a genius if the Lakers win tomorrow.

Vikas Srinath

by vikas_s24 on Jun 16, 2010 5:09 PM PDT

When

For me its the consistent inconsistency that concerns me - PAGFL
It's always AMMO Time, in spirit- DexterFishmore

by 99bc99 on Jun 16, 2010 10:03 PM PDT

Hope you get a job offer
Thanks to RJMetrics for supporting this small project as part of my summer internship.

Good job, and fun!

For me its the consistent inconsistency that concerns me - PAGFL
It's always AMMO Time, in spirit- DexterFishmore

by 99bc99 on Jun 18, 2010 9:40 AM PDT

nice work

And of course, vindicated by the outcome.

I do have one comment about incorporating background information about streaks/momentum. Some part of streakiness is undoubtedly due to one team simply being better than the other. Another potential contributor is the home court advantage. Both of these should be factored out when adding a momentum component and—I could be mistaken—it seems that you don’t entirely account for this.

One way to do that might be to condition the probabilities of streaks in the background data on the overall series winning percentage of the team experiencing the streak. There are a number of different paths you could go from there, but by making the methodology clearer about that, others could see what would happen if they chose a different path to account for the different factors.

Also, just as a matter of wording, I wouldn’t say that a four-game sweep was the most likely outcome (“outcome” could refer to the overall series score); rather, I’d say it was the most likely sequence.

Again, though, very nice work.

by Brian Tung on Jun 18, 2010 9:50 AM PDT
