A 3.8-Sigma Anomaly

Every professional football game begins with the flip of a coin, to determine who gets the ball first. In the case of the Super Bowl, the teams represent the National Football Conference (NFC) or American Football Conference (AFC). Interestingly, the last 14 coin flips have been won by the NFC.

Working out the numbers, the chance of 14 coin flips in a row coming out the same is 1 in 8,192. (The linked article says 1 in 16,000, which comes from 2^14 = 16,384; but that first coin flip has to be something, so the chances of 14 in a row are really 1 in 2^13. The anomaly would be just as strange if the AFC had won every time.) That’s a better than 3.8-sigma effect! Enough to call a press conference, if this were particle physics.
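
For anyone who wants to check the conversion, here is a minimal sketch. It assumes the usual two-sided Gaussian convention for turning a probability into a number of sigmas (the post does not say which convention it uses), and the function name `two_sided_sigma` is made up for illustration.

```python
from statistics import NormalDist

def two_sided_sigma(p_value):
    """Convert a two-sided p-value into an equivalent number of Gaussian sigmas.

    Assumption: the tail probability is split evenly between both sides.
    """
    return NormalDist().inv_cdf(1 - p_value / 2)

# The first flip can be anything; the next 13 must match it.
p_streak = 1 / 2**13

if __name__ == "__main__":
    print(f"1 in {2**13}: p = {p_streak:.6f}")
    print(f"equivalent significance: {two_sided_sigma(p_streak):.2f} sigma")
```

Under that assumption the result lands between 3.8 and 3.9 sigma, consistent with the headline figure. A one-sided convention would give a somewhat smaller number.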

The question is … is this really a signal, or did we just get lucky? Is it a fair coin and the NFC has just been the happy recipient of a statistical fluctuation, or is there something fishy about the coin? Remember Barry Greenstein’s parable about how different people compute probabilities.

And let it be a lesson the next time you’re excited about 3-sigma anomalies.


44 thoughts on “A 3.8-Sigma Anomaly”

  1. Let me tell y’all how to make a fair toss even with a biased coin 🙂

    Toss it twice. Possible outcomes are HH,TT,TH and HT.
    Toss twice again if HH or TT come up. Repeat as necessary.

    One party chooses HT and the other gets TH before tossing starts.
    HT and TH have the same probability, even if the coin is biased.
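
The trick in this comment is usually credited to von Neumann. A minimal sketch of it follows, assuming independent tosses with a fixed heads probability strictly between 0 and 1; the names `biased_flip` and `fair_flip` and the 90% bias in the usage note are illustrative, not from the comment.

```python
import random

def biased_flip(rng, p_heads):
    """One toss of a coin that lands heads with probability p_heads."""
    return "H" if rng.random() < p_heads else "T"

def fair_flip(rng, p_heads):
    """Toss twice; discard HH and TT; otherwise report the first toss.

    HT and TH each occur with probability p_heads * (1 - p_heads), so the
    result is unbiased. Assumes 0 < p_heads < 1, otherwise this never ends.
    """
    while True:
        first = biased_flip(rng, p_heads)
        second = biased_flip(rng, p_heads)
        if first != second:
            return first  # "H" means HT came up, "T" means TH came up

if __name__ == "__main__":
    rng = random.Random(0)
    results = [fair_flip(rng, 0.9) for _ in range(20_000)]
    print("fraction HT:", results.count("H") / len(results))
```

Even with a coin that comes up heads 90% of the time, the fraction of HT outcomes should hover around one half. The price is extra tosses: the more biased the coin, the more HH and TT pairs get thrown away.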

  2. Replace the coin toss with which league has sold the most beer or TV advertising. Tie the kick-off to something relevant.

    I’ve found that SuperSunday is a perfect time to visit the state parks – you have the place to yourself.

  3. This is really interesting to me — ’cause I can win a Papa John’s pizza if I get it right LOL

    Anyway, the question we should be asking isn’t about the AFC/NFC, but about how often heads/tails comes up. I notice they didn’t even touch that point. So the real question isn’t “Will the NFC win the flip again?” but “Will the coin flip result in tails?”. (especially since the Giants captain already told the media he’ll pick tails)

    If the NFC was picking the result every year, then maybe that would be interesting, but they don’t. Also the pick is usually called while the coin is IN THE AIR — which means there’s really no way to make it biased (as the article asks) by the ref or the player. If heads/tails came up significantly more often, then I could see an argument for biasedness. Streaks happen in probability though, so it still wouldn’t be evidence (unless maybe 12 out of 14 were heads AND the AFC chose tails every time they picked)

    So it really looks like an anomaly to me that the NFC has won 14 straight coin flips (including 7 times the AFC picked the result!). So… heads or tails is the question.

  4. What about starting parameters, is the coin always heads up or tails up to start? Is there any correlation with the results?

  5. Actually, the first coin flip in the run has to be different from the one before, otherwise it’s a run of more than 14. Unless we’re talking about a run of at least 14…

  6. Devon, I bet you I can make the jack of spades rise out of this deck of cards and squirt water in your ear.

  7. Pingback: ZEUGS: Einige Links zum Superbowl 2012 « USA Erklärt

  8. I don’t think the probability calculation is quite right. There are two possible questions:
    1) What’s the probability of one team winning 14 tosses out of 14 tosses.
    2) What’s the probability of one team winning a string of 14 tosses out of 45 tosses (one toss for each Super Bowl).

    The first is (1/2)^13 as you’ve stated in the article. The second is different and much harder to calculate. Rather than finding an exact solution, I scripted a quick simulation and ran it 100,000 times. I came up with the number 0.2%.
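
The commenter's script is not shown, so the following is only a sketch of how such a simulation might look, assuming a fair coin, 45 independent tosses, and a "hit" defined as at least one run of 14 or more identical results. The function names and the seed are made up for illustration.

```python
import random

def longest_run(flips):
    """Length of the longest run of identical consecutive values."""
    if not flips:
        return 0
    longest = current = 1
    for prev, cur in zip(flips, flips[1:]):
        current = current + 1 if cur == prev else 1
        longest = max(longest, current)
    return longest

def estimate_run_probability(n_flips=45, run_length=14, trials=100_000, seed=2012):
    """Monte Carlo estimate of P(some run of >= run_length in n_flips fair tosses)."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        flips = [rng.random() < 0.5 for _ in range(n_flips)]
        if longest_run(flips) >= run_length:
            hits += 1
    return hits / trials

if __name__ == "__main__":
    print(estimate_run_probability())
```

With 100,000 trials only a couple of hundred hits are expected, so the estimate is noisy in its second significant figure; it should still be in the neighborhood of the 0.2% quoted above.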

  9. Alex @ 12.

    Total number of possibilities: 2^45.

    The sequence of 14 coin flips can occur a) at the beginning; b) at the end; or c) in the middle. If at the beginning, we can have either 14 heads followed by a tail, or 14 tails followed by a head, and the remaining 30 can be anything we like, so that’s 2*2^30 combinations. Likewise, at the end we have 2*2^30 combinations.

    If the sequence occurs in the middle, we can have a string of 14 tails sandwiched by two heads, or a string of 14 heads sandwiched by two tails, with the other 29 flips being anything we like. This sequence of 16 flips can start anywhere from position 1 to position 30, so that’s 2*30*2^29 combinations.

    The total number of combinations is about 2*2*2^30 + 2*30*2^29 = 3.651 × 10^10. I say “about” because sequences with more than one string of 14 coin flips will have been double counted, and I’m too lazy to allow for that. So I make the final probability of a run of 14 identical coin flips occurring somewhere in a sequence of 45 to be about 3.651 × 10^10 / 2^45 = 0.001038. Which is about half what you get.

  10. Kevin @ 13

    Very nice! Thanks for that.

    I’ve thought of one thing that might account for part of the discrepancy. If I toss a coin 45 times, and get a string of 20 heads, it’s also true that that sequence contained a string of 14 heads. The difference is between getting *exactly* 14 heads and getting *at least* 14 heads. My simulation looks for at least 1 string of at least 14 heads, and I think you’ve calculated the probability of getting at least 1 string of exactly 14 heads. The former is more likely. Is it twice as likely? Maybe. Or maybe my simulation is broken, but it pretty closely agrees with some reference values I’ve pulled out of a textbook.

  11. seems pretty obvious that the NFC employs a psychic to sit in the stadium and will the coin to fall in their favor… anomaly solved! (anyone spied Uri Geller in Indianapolis this weekend?)

  12. Alex @14

    Yes, that would be the reason. It’s double because one of the terminating coin flips would have to be fixed to make it exactly 14, but could be either for at least 14, doubling the number of combinations.
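
The "at least 14" version can also be counted exactly rather than simulated. The sketch below is one way to do it, assuming a fair coin: it counts the sequences that contain no run of 14 or more and subtracts from 2^45, which avoids the double-counting problem mentioned earlier in the thread. The function name is hypothetical.

```python
from fractions import Fraction

def count_sequences_with_run(n_flips, run_length):
    """Number of heads/tails sequences of length n_flips that contain
    at least one run of run_length or more identical flips.

    Assumes n_flips >= 1 and run_length >= 2.
    """
    # counts[r] = sequences so far with no long run, whose current run has length r
    counts = [0] * run_length
    counts[1] = 2  # the first flip is H or T
    for _ in range(n_flips - 1):
        new_counts = [0] * run_length
        new_counts[1] = sum(counts)          # next flip differs: run resets to 1
        for r in range(1, run_length - 1):
            new_counts[r + 1] = counts[r]    # next flip matches: run grows by 1
        counts = new_counts
    return 2**n_flips - sum(counts)

if __name__ == "__main__":
    hits = count_sequences_with_run(45, 14)
    print(float(Fraction(hits, 2**45)))
```

The result should sit close to the 0.2% from the simulation and roughly double the 0.001038 from the "exactly 14" count, in line with the factor-of-two argument above.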

  13. Pingback: The only thing I know about the Super Bowl « Ted Bunn’s Blog

  14. Actually this is somewhat misleading.
    Winning a coin toss is not the same thing as whether the coin comes up heads or tails.
    I am sure there is info out there on whether the coin came up heads or tails, but being able to call a coin toss is an art. In most cases it’s a question of how many rotations you would expect the coin to make from its initial state to its peak height and then back to the ground. If you take into account the fact that most people have fairly decent muscle memory, an analysis of coin tosses by officials should reveal a bias if the official is tossing the coin under identical conditions. One would expect the coin to rotate the same number of times when the same official tosses it under identical conditions. It’s the same reason why the world record for free throws is so high.

  15. Although there is nothing in this run of coin flips that violates the concept of randomness, I suspect that randomness is really more complicated than we currently understand it to be, especially when we are dealing with scientific truth.

    Jonah Lehrer, in a New Yorker article, “The Truth Wears Off”, talks about how, across a host of scientific research, results meeting standard statistical tests seem to become less and less significant over time, to the point that some of the results appear to vanish. Although we might immediately think that the initial study that generated the results was poorly designed and the results were bogus to start with, that doesn’t always appear to be the case. There was a study of a memory phenomenon called “verbal overshadowing” cited over 400 times and extended to a variety of other tasks. The original discoverer, Jonathan Schooler, himself concluded that, although there was nothing wrong with his design, he had been unlucky in his choice of original subjects and the phenomenon he himself discovered apparently didn’t exist or was vanishing with each new round of testing. A similar behavior can be seen in the J.B. Rhine ESP tests, where some subjects appeared to pull off amazing runs in guessing cards. One subject did a run of nine correct guesses (2 million to 1 odds) three times. The Amazing Randi would probably suspect cheating but, if so, the subject apparently got more and more honest over time and eventually could do no better than chance.

    Of course, there are poorly designed studies, selective reporting of results, a bias in publishing towards studies with positive results, plain dishonesty, and other explanations. Regression to mean doesn’t seem particularly satisfying as an explanation when the results meet the standard statistical tests.

    Lehrer talks about the Crabbe study involving cocaine and mice in which “same strains of mice were used in each lab, shipped on the same day from the same supplier. The animals were raised in the same kind of enclosure, with the same brand of sawdust bedding. They had been exposed to the same amount of incandescent light, were living with the same number of litter mates, and were fed the exact same type of chow pellets. When the mice were handled, it was with the same kind of surgical glove, and when they were tested it was on the same equipment, at the same time in the morning.” The end result was that three separate labs reported three different results on the behavior of the mice and the differences were not trivial. Lehrer adds: “The disturbing implication of the Crabbe study is that a lot of extraordinary scientific data are nothing but noise.”

    Randomness itself is something of an abstraction from reality. It is a concept and I suspect that it does not capture the whole reality of the coin flip. Randomness, of course, is usually counter-posed against causality/predictability, but there may be some twilight ground between the two when the conscious observer becomes involved.

  16. Just as a note, when I was a kid I could hit heads or tails with about a 90% accuracy when I was asked to toss a coin, much to the consternation of my siblings.

  17. Pingback: Super Bowl Stories You May Have Missed « NotionsCapital

Comments are closed.
