tree-of-thought OP t1_j8171sh wrote on February 10, 2023 at 9:34 PM

#1,774,306

Source: nflfastr. They have rich play-by-play data for every NFL game of the last ~25 years. I was able to get a time series of win probabiltiy for every Super Bowl since the 2000 season.

Tools: R. nflfastr to get the data, data.table to clean it and develop excitement scoring metrics, and ggplot2 to visualize.My collaborator built interactive visualizations for this project in Flourish. Those visualizations are linked lower in this comment.

Explanation: I've seen win probabilty graphs used as a shorthand for the excitement of a game. I wanted to develop a metric which takes in a win probability time series and outputs an "excitement score."Ultimately, I decided on three different factors that should contribute to the excitement score...

How close is the average win probability to zero? This is intended to capture how surprising was the eventual outcome.
What is the average absolute distance between the win probability and 50%? This is intended to capture how closely contested the game was.
What is the root mean square of all the changes in win probability from one play to the next? This is intended to capture how "back and forth" the game was.

I took each of these scores, scaled (but did not center) them, and then used their euclidean norm as the composite score.

Visit this webpage for more information on this topic!

The plots above are ordered by the composite score descending from left to right, top to bottom.It seems to work pretty well! Especially at the lower end of the scale--those are all pretty clearly games that were lopsided and foregone conclusions early on.

I've gotten the feedback that Super Bowl LIII between the Patriots and Rams is evidence of a "bug" in the metric. That game was very tight--which accounts for its high score (in the metric sense), but it was tight because it was low scoring (in the FOOTBALL sense!) with neither team performing very well.

Stuff I might tinker with to make it better:

Assign different weights to the three constituent metrics
Weight the constituent metrics differently at different points in the game.
Factor in how much scoring is happening

Lower-Tackle3600 t1_j81g1o0 wrote on February 10, 2023 at 10:37 PM

#1,774,731

Great share! Now can you assign a quantitative way to rank Super Bowl snacks?

CheeseTheGood t1_j81imdt wrote on February 10, 2023 at 10:55 PM

#1,774,875

Having Super Bowl 53 ranked that high as an exciting game is criminal, you need to go back to formula.

tree-of-thought OP t1_j81k88j wrote on February 10, 2023 at 11:06 PM

#1,774,938

Replying to CheeseTheGood (#1,774,875)

I know, it's a real problem!

I think simply factoring in the sum of scores (or maybe the sum of scores averaged over every play of the game) would go a long way towards pushing that one down the rankings.

It's also been pointed out to me that SB 38 is remembered as especially exciting, but it's right in the middle of my rankings.

¯\_(ツ)_/¯ I'm sad the first attempt has such glaringly "wrong" results, but this is why we get feedback and iterate!

tree-of-thought OP t1_j81kfsl wrote on February 10, 2023 at 11:07 PM

#1,774,948

Replying to Lower-Tackle3600 (#1,774,731)

Thank you!

Great idea. Maybe a composite score of saltiness, cheesiness, and ranch-iness.

CheeseTheGood t1_j81koe4 wrote on February 10, 2023 at 11:09 PM

#1,774,961

Replying to tree-of-thought (#1,774,938)

Until you can factor in the "Eye Test" you're gonna have a hard time. It's why scouts still have jobs.

pantaloonsofJUSTICE t1_j81yi45 wrote on February 11, 2023 at 12:51 AM

#1,775,614

You could consider increasing the weight of Q4/OT. Pats panthers was especially exciting toward the end, and great games in general have great fourth quarters.

tree-of-thought OP t1_j825cz0 wrote on February 11, 2023 at 1:45 AM

#1,775,951

Replying to pantaloonsofJUSTICE (#1,775,614)

Thank you! I think this is a really good suggestion.

MathThatChecksOut t1_j825rzx wrote on February 11, 2023 at 1:48 AM

#1,775,976

Everyone talking about games which should be higher/lower but all I'm seeing is "New England being in the Superbowl has a dramatic increase in the excitement of the game"

Ac01001101 t1_j825uf0 wrote on February 11, 2023 at 1:49 AM

#1,775,981

Have you tried this data with Chat GPT to see what answer you would get?

throwaway1736484 t1_j82crd9 wrote on February 11, 2023 at 2:44 AM

#1,776,314

Replying to Ac01001101 (#1,775,981)

Chatgpt might be able to tell an exciting game but that’s bc of volume of articles in it’s training data saying “so exciting, what a game” after the fact and only up until 2021. Perhaps it developed an understanding somewhere in those billions of parameters, or perhaps it is only regurgitated human opinion.

throwaway1736484 t1_j82cv0j wrote on February 11, 2023 at 2:45 AM

#1,776,320

Replying to CheeseTheGood (#1,774,961)

Scouts have jobs bc sports orgs don’t understand stats

tsme-esr t1_j833e4i wrote on February 11, 2023 at 7:11 AM

#1,777,603

Of course the team that reached the lowest win probability is a certain New England team. One would think that if it were that low then they should have lost...

st4n13l t1_j83cxhd wrote on February 11, 2023 at 9:18 AM

#1,777,930

Replying to tsme-esr (#1,777,603)

Leave it alone

Volcic-tentacles t1_j83dhdy wrote on February 11, 2023 at 9:26 AM

#1,777,951

I only started watching NFL this year (in week 11 of the 2022 season). It took me a while to decode the labels, which require quite a bit of specialist knowledge of the NFL and there is no key. Marks off for that.

Once I tuned in though, I did find this interesting. How were the probabilities worked out though? Where is the background on this?

tree-of-thought OP t1_j83nhkt wrote on February 11, 2023 at 11:49 AM

#1,778,308

Replying to Volcic-tentacles (#1,777,951)

You’re right the labels are not very accessible to people without quite a bit of NFL domain knowledge. I had a hard time getting relevant info into them without them getting too long or taking too much real estate in the overall viz…but of course there should be a key. Thanks for the feedback!

nflfastr provides the win probabilities. the calculate win probability function docs have more info!

Thank you again for the comment!

tree-of-thought OP t1_j83o1yz wrote on February 11, 2023 at 11:56 AM

#1,778,336

Replying to Ac01001101 (#1,775,981)

I asked chat gpt for brief descriptions of various super bowls to help me remember how they played out so i could assess the ranking as I was building it. I also experimented with asking it broadly and non quantitatively “was this super bowl exciting?” “was that one?” to see if it was directionally agreeing with my scoring.

I did not go so far as to try and involve chat gpt directly in computing the score. How might that work? Simply ask it “please give a 1-10 score to the excitement level of recent super bowls”?

Ac01001101 t1_j83om0h wrote on February 11, 2023 at 12:03 PM

#1,778,368

Replying to tree-of-thought (#1,778,336)

Wow dude. That's fascinating.

I guess... I've never really thought about it until now. Your post gave the idea. I wonder if it can give you predictions based on information?

clamraccoon t1_j84kin8 wrote on February 11, 2023 at 4:06 PM

#1,780,177

Replying to tree-of-thought (#1,774,938)

Is it deemed more exciting if the winner’s win probability is low throughout the game?

If a team is heavily favored, even if the score is tied, their win probability is higher, meaning a back and forth game where the favorite has a narrow lead can skew this graph.

I think the Rams were favored in that dreadful SB LIII, and the biggest lead of the game was 10, meaning win probability stayed pretty level.

Cool data points. Like all data, it can be misleading

tree-of-thought OP t1_j84qk1s wrote on February 11, 2023 at 4:48 PM

#1,780,477

Replying to clamraccoon (#1,780,177)

“winners win probability is low throughout the game” is one of the three factors influencing the composite score. (if it were the sole factor, the patriots falcons bowl would be the most exciting game by far)

The other two factors are “win probability closeness to 0.5” and “win probability changes”

You’re right, that awful rams super bowl won the “closeness to 0.5” category, which is why it’s ranked top 5

And yeah, it’s a fun cool project but not perfect. The best way to assess super bowl excitement is still to sit down and watch!

Roscoes_Rashie t1_j85tpf9 wrote on February 11, 2023 at 9:18 PM

#1,782,769

538 did this exact study last week with a better methodology and presented better.

tree-of-thought OP t1_j85yfy4 wrote on February 11, 2023 at 9:53 PM

#1,783,066

Replying to Roscoes_Rashie (#1,782,769)

Well at least you let me down gently!

Recent Superbowl Win Probabilities, Ordered by "Excitement" [OC]

Comments