5 Nerd-Its - +

Pattern Analysis of the PowerBall Lottery

Cup an opinion article by Matthew Vea (VnutZ), published on 17 March 2009
tagged as powerball, lottery, mathematics, and statistics
other nerds have left 19 comments below

Awhile back, the article Pattern Analysis of MegaMillions Lottery Numbers became quite popular and in recent months has increased its traffic yet again as the recession has set in harder. Since only a fraction of the United States can participate in that lottery, I have gone ahead and provided a new analysis for the remaining states and territories that play PowerBall. Unlike the previous article, the new Pattern Analysis of the PowerBall Lottery website is dynamic and will automatically generate a new analysis after each PowerBall drawing to provide the latest tips for you to think you’re beating the 1 in 23,429,886,480 mathematical odds of winning. Grab those dollars and head off to your local corner store! And in case you happen to win, don’t forget a cough tip to OmniNerd cough.

EDIT: As an added bonus, a new Pattern Analysis of the MegaMillions Lottery has also been launched with the same dynamic, automatic number loading system to provide historical trends for the current version going back to June of 2005.

30_article_2449_thumb_pb

Similarly tagged OmniNerd content:

Information This article was edited after publication by the author on 20 Mar 2009. View changes.
Thread parent sort order:
Thread verbosity:

Since you’ve only got data for about 20 drawings in there, your actual error bars are pretty bloody huge — way bigger than your formulae are computing for them.

Also, I think there’s a theoretical problem with using a fixed 90% confidence interval the way you appear to be using it.

There’s a 10% chance (using a 90% confidence interval) that a particular measurement likes outside the interval entirely by chance — which, of course, means that if you make many measurements, you expect that about 10% of them will lie outside the confidence interval, even if the effect being studied is entirely nonexistent. But, in your case, you’re presumably making measurements of the relative frequency of occurrence for each possible triple of numbers. That’s over 32,000 separate measurements.

Currently, the measurements are of very low precision — each drawing generates only 20 triples, so you’ve only got about 400 events. The actual expected value for the number of occurrences of each triple is around 0.012 — but events have to be counted with whole numbers, that alone forces the margin of error to be big.

Since you’ve got 20 drawings, and the one triple you’re currently showing says it’s occurred 10% of the time, that means it’s occurred twice. This is pretty close to being the birthday problem played on a really long calendar. A back-of-the-envelope calculation puts your odds at something a bit over 90% in favor of having at least one triples appearing twice in the dataset you have.

The “hit” you have isn’t at all as unlikely as “14.0712 standard deviations away from the mean” — it’s a near certainty.

3 Nerd-Its - +
Mathematical Expectation by scottb :: NR8

I’m still trying to figure out how to actually compute the probability distributions related to some of the analyses done on the site, but while I’m at it, I worked out one relevant formula I thought I’d share.

It’s not especially hard to answer this question: if I played the game on every drawing, and the drawings were completely random (that is, each ball has an equal chance of being drawn on each drawing independent of everything else), on the average, how much would I make (or lose)?

For those who don’t remember the details from their probability class, let’s work out a similar problem for a simpler game. Let’s say you paid a dollar to play the game, in which you picked just one number. When they held the drawing, they drew just one number from the hopper, and if you matched it, they paid you $50.

Each number has an equal chance of being drawn, with 59 balls in the hopper, so you’ve got a 1/59 chance of winning. If you played the game a million times, you’d win about 1/59th of them, gaining $50 each time. So, afterward, you’d have paid $1,000,000 to buy tickets, and received (1,000,000 / 59) * $50, or $847,457.62. In the net, you lost $152,542.37, or about $0.15 per game. Another way of saying it is that a ticket in this game is worth about $0.85 cents, even though it costs a dollar to buy.

The payoff and probabilities in the Powerball game are more complicated. See the Wikipedia page for the details. The biggest complexity comes from the Jackpot prize. The size of the Jackpot depends on how many tickets were sold, and if nobody wins the Jackpot in a given drawing, the money for it gets rolled over into the next drawing. If more than one ticket meets the Jackpot conditions, the prize is split equally among them.

All of that means we can’t work out an exact amount for the expected payoff. But let’s just use “J” to represent the amount you actually get (including rollovers and accounting for multi-way splits) for hitting the Jackpot, and work out the formula anyway.

For a ticket that doesn’t have the “PowerPlay” option, the average ticket value is (40,243,435 / 292,837,581 + J / 4,685,977,296). That first fraction works out to a bit under fourteen cents, and the denominator in the second fraction is pretty huge. The largest payout ever was a $365 million Jackpot, so the second fraction contributes only a little less than eight cents to the ticket value, even under the best circumstances.

The picture’s a little more rosy if you choose the “PowerPlay” option — the ticket now costs $2, instead of $1, but they pick a random multiplier, from two to five, that’s applied to all the non-Jackpot prizes. If we include that in the mix, the first term in the exact ticket value becomes 40,243,435 / 83,678,166 (3.5 times bigger), or a smidgen over $0.48, with the Jackpot accounting for the same amount as before. On average, you lose fifty-two cents per game.

What’s interesting to consider, then, is just how much the kind of data in the “Pattern Analysis” site might influence the payoff. Suppose we could use the data to always pick one number correctly, but it gave us no edge on predicting the other four, or the PowerBall. How would that change the expected payoff? How about if we could always get two balls right?

Both of these questions are fairly straightforward to answer, and I’ll give the actual number tomorrow or the next day — it’s late enough right now that I’m bound to screw it up. Clearly, if we could always get at least three right, we’d be well in the black, gaining at least $6 per play, or averaging $22.50 per play by taking the PowerPlay.

Meanwhile, though, realize that being able to always predict even one of the five balls, much less two, would be an enormous amount of information to extract out of the data. In reality, knowing what we do about how the machines that draw the balls are constructed, and the physical processes involved, it seems unlikely that even the most detailed analysis, using years of data, could give us more than a tiny fraction of the amount represented by being able to always pick one ball right, so working out how much knowledge of one ball improves the payoff would give us an upper bound (though one with a whole lot of clearance) on how much we could ever get out of this kind of analysis.

OmniNerd Article Propagation

The Showcase

Nerd-Its   Nerd Trends   Last Ten  

  1. RE: Just Because It's Unexplainable Doesn't Mean Divine :) in Catholic Exorcist Points Finger at Vatican
  2. Just Because It's Unexplainable Doesn't Mean Divine :) in Catholic Exorcist Points Finger at Vatican
  3. Catholic Exorcist in Catholic Exorcist Points Finger at Vatican
  4. RE: Rich People Olympics and Snowboarding Under Arms in Winter Olympic sports?
  5. RE: Where to start learning, and government support of religion in Scientology: We've had it with you
  6. No problem with ABP in Ad-Blocking: Dueling Views
  7. RE: Curling and Ice Dancing in Winter Olympic sports?
  8. RE: Where to start learning, and government support of religion in Scientology: We've had it with you
  9. The true solution in Scientology: We've had it with you
  10. RE: Where to start learning, and government support of religion in Scientology: We've had it with you

What is OmniNerd?

Omninerd_icon Welcome! OmniNerd's content is generated by nerds like you. Learn more.

Voting Booth

Winter Olympic sports?

31 votes, 12 comments