Submitted by ikashnitsky t3_ziojgj in dataisbeautiful
Comments
Not_Legal_Advice_Pod t1_izrwq9y wrote
So... good job by the bookmakers? Looks like over 18 or so years of doing this you'd be down fifty or so "points", which seems about right for any gambling.
ikashnitsky OP t1_izrwvm4 wrote
Exactly. The big question is whether the surprising results at World Cups are random or systematic.
Not_Legal_Advice_Pod t1_izrz4zo wrote
What that 2019 outlier? I don't think you can get to statistical significance on that one year. This looks like what I'd expect it to look like. You're probably better getting into the big data game by game statistics if you're looking for sings of tampering.
[deleted] t1_izsif2b wrote
[deleted]
Mooks79 t1_izsp7u1 wrote
The latter is because of the former.
Grenadefisherman t1_izspavr wrote
I really like the idea of what you have been doing but the presentation of the date hurts my eyes. Too many lines!!
jaytee158 t1_izsq8ih wrote
Yeah, Covid destroyed home advantage and the bookies couldn't adjust quickly enough.
I know you know this but pointing out for others
jjwoodworking t1_izsrkg3 wrote
What is the curve of end of season winnings. Is it a bell curve centered at 0? Looks like it might be
FrankDrakman t1_izsrlcu wrote
Wasn't that Leicester's year? No one was picking them for anything, so every game would have been an upset, at least at the beginning.
Eric1969 t1_izsuqse wrote
I actually was debating this with a friend so vigorously that he coded a program to compare yhe wins of betting randomly or on lagging possible outcomes of a randomly generated number. Betting on lagging outcomes brought more wins.
Professional sports is not quite random though.
Mooks79 t1_izsvfub wrote
>I actually was debating this with a friend so vigorously that he coded a program to compare yhe wins of betting randomly or on lagging possible outcomes of a randomly generated number. Betting on lagging outcomes brought more wins.
Can you elaborate more on what you mean? I think I misunderstand you somehow as it seems to me betting on lagging outcomes cannot outperform betting randomly in the long run or the outcome is not random (or there’s a coding / concept error).
>Professional sports is not quite random though.
Absolutely.
ThankGodSecondChance t1_izsxkif wrote
Nope, that was much longer ago
ThankGodSecondChance t1_izsxrep wrote
Disagree. At the halfway point, long before covid really was hitting, the damage was already done. That year pretty much stayed level after January
Eric1969 t1_izt3tqu wrote
My friend’s theory was that lagging outcomes (ex: tails having come up 20 times for 40 heads) would result in these outcomes having a higher probability than the theoretical probability (50% for heads and tails) of showing up, as they would surely eventually catch up. I tought the probability remained 50% regardless.
Mooks79 t1_izt3zcr wrote
I’m with you. I think if your friend’s code showed otherwise then there’s a flaw in the code and it’s not doing quite what it should be. Would they be willing to share it?
Lochbriar t1_izt4hea wrote
Yeah I was being snarky, the math suggests that COVID stopped games at the 300 mark. It's effect would be limited to the final spike.
Eric1969 t1_izt69qy wrote
It was 30 years ago in visual basic. So I don’t have access to it.
Mooks79 t1_izt6qzt wrote
Ah! If I were you I would stick with your original thinking as I’m reasonably confident there was a coding issue or the set-up of the simulation wasn’t quite what it should be. If I have time I might try and knock something together myself, but that’s a big if as I’ve got a mental couple of weeks.
Eric1969 t1_izt83c8 wrote
I agree. Also, the random numbers produced by a computer are “pseudo-random”.
Mooks79 t1_izt8d0z wrote
They should be random enough, albeit improperly generated random numbers could be a cause. My money is on them setting up the simulation incorrectly, though. When I say incorrectly, I mean it could be that they haven’t set the simulation up quite as they think they have.
ikashnitsky OP t1_izwfa4l wrote
Thanks, I can't see how the medium option may be beneficial, I guess this would be betting on the draw most of the times. I can check, of course.The question is: how do I report back the results? I feel I have already exhausted the limit of attention people are willing to spare here =)
Mooks79 t1_j1dtis7 wrote
Finally got round to knocking up a simulation, can confirm that the two methods (randomly choosing whether to bet H or T, or using the lagged counts) both give the same (0.5) win rate, as expected. Indeed, any strategy (e.g. always choosing H or T) will yield the same win rate for a truly random coin. I suspect there was a slight glitch in your friend’s simulation. Happy to share (R) code and plot(s) if you want.
ikashnitsky OP t1_izrvo53 wrote
See the previous similar posts::
Data: https://www.oddsportal.com/soccer/england/premier-league/results
Tools: R
R package with the data: https://github.com/ikashnitsky/oddor