Loading 13 Nerd-Its - +

Pattern Analysis of MegaMillions Lottery Numbers

Page_white_text an academic article by Matthew Vea (VnutZ), published on 31 October 2007
tagged as mathematics, statistics, lottery, and megamillions
other nerds have left 136 comments below

[Author’s Note] This article is based on data from November 2007, for up to date results, please see the new analysis at Pattern Analysis of MegaMillions Lottery Numbers where the data is updated automatically and dynamically after each drawing. A similar analysis is also available for PowerBall players. Please continue reading the original article below for the concepts behind the analysis.

Improved Pattern Analysis of the MegaMillions Lottery!!

This website is an update to the article you're reading now that features a dynamic, historical pattern analysis engine that refreshes itself with the latest trends following every MegaMillions drawing.
Sponsored Advertisement

Everyone wants to get rich, especially if it only costs them $1. Fortunately, many local state governments host lotteries, allowing their constituents to donate cash into the budget in hopes of winning a multi-million dollar prize. In practice, most lottery drawings consist of a series of balls drawn randomly from a chamber which should guarantee a fair opportunity for everyone to win. Theories of rigged lotteries and fraud, however, run rampant across the Internet.1 The system is accused of not holding live drawings, publishing winning numbers prior to drawing them, permitting the tweaking of data archives to avoid payouts, intentionally modifying balls, or using balls with painted numbers whose natural weight affects their likelihood of appearance.

Rather than debunk any of these theories of lottery fraud or rigging, this article reveals the trends and patterns of winning lottery numbers for public scrutiny using basic data analysis. It uses the results of the MegaMillions lottery and consists of the following analyses:

  • distribution of winning numbers over time
  • behavioral stratification of numbers based on numerical position
  • relationship between mutually winning numbers
  • common differences between winning numbers
  • winning number frequency
    While such scrutiny has the potential to yield useful results, such as identifying the existence or lack of "better numbers" to play, it is presented so as to appeal to those interested in number patterns.

MegaMillions History

Recently, MegaMillions drew the largest jackpot ever recorded at $370 million, exceeding the previous record held by PowerBall.2 The prizes were not always so large, nor did the participants span the United States. Beginning life in 1996, MegaMillions originally existed under a different nomer: "The Big Game." For two years this lottery was drawn weekly on Fridays until 1998, when a Tuesday drawing was added. Over the past eleven years, the number of participating states has doubled from only six to twelve. Although there are minor interstate variations regarding how jackpots are paid to winners, the basic game play remains the same.3

A single dollar in MegaMillions purchases a 1 in 175,711,536 chance of landing the jackpot. A player may opt for a "QuickPick" set of numbers generated automatically by a computer or they may choose to select their own numbers. Since 2005, MegaMillions allows players to choose five numbers between 1 and 56 plus a sixth number, the MegaBall, between 1 and 46. This, however, was not always the selection pool. When the "Big Game" was conceived, players were given a pool of numbers 1 through 50 to choose for their first five balls and numbers 1 through 25 for their sixth. Beginning in 1999, players were offered the numbers 1 through 50 for the five regular balls and 1 through 36 for the sixth. When the game became MegaMillions in 2002, players selected numbers between 1 and 52 for both the five regular balls and the MegaBall.4

Gathering Data

As a first step, it was necessary to obtain a collection of MegaMillions’ lottery numbers. Fortunately, the New Jersey Lottery website has an archive of all winning numbers since September 6, 1996.5 As an added bonus, the archive of numbers exists in both HTML format for a pretty web presentation and as a delimited file which is conducive for importing into a database. For the purposes of this analysis, the winning lottery numbers were imported into Microsoft SQL Server Express for processing queries. Subsequent graphs were then created with Microsoft Excel to visualize the trends and behavior.6

The delimited file of winning lottery numbers contained the results for 1078 drawings and provides the following fields:

  • Year – formatted as YYYY
  • Month – formatted as MM
  • Day – formatted as DD
  • Day of Week – formatted as Tuesday and Friday
  • Ball 1, 2, 3, 4, 5 – as an integer
  • MegaBall – as an integer
  • Prize Payout – when present, formatted as a decimal value
  • Date – formatted as YYYYMMDD

Distribution of Winning Numbers Over Time

The first trend analyzed was whether or not the numbers occur with an even distribution. Balls 1, 2, 3, 4 and 5 were consolidated into a single list to analyze their overall frequency of occurrence. Each separate version of the lottery – two editions of BigGame and two editions of MegaMillions – were analyzed independently to identify any outlying activity. Subsequently, a similar grouping was performed to determine the distribution of the MegaBall number. The following charts detail the number of times each number was selected over the course of the the entire span of MegaMillion’s drawings.

30_article_110_thumb_distributionv1

When the BigGame debuted, players were allowed to choose numbers between 1 and 50 with a MegaBall ranging from 1 to 25. Each number has been drawn at least eight times as compared with the most frequent winners that occurred twenty-seven times. Overall, the numbers have been drawn approximately seventeen times each. The most frequent number, 35, exceeds the mean by 2.76 deviations as compared with the least occurring number, 49, which lagged the mean by -2.46 deviations. The first version of the BigGame is the only drawing with a descending trend line that when combined with the deviations identify the distribution as slightly abnormal. The MegaBall number 17 appeared fifteen times while 23 occurred only once. The average MegaBall was drawn seven times.


30_article_110_thumb_distributionv2

The BigGame was changed on January 13, 1999 to increase the number of available MegaBalls from pool of 25 to 36. The primary drawing pool remained the same, ranging between 1 and 50. Each number has been drawn at least twenty-one times while the most frequently drawn number has appeared forty-six times. Overall, all numbers have been drawn roughly thirty-four times. Only one number, 4, exceeds two deviations from the mean at 2.11 deviations. 19 was -2.29 deviations from the mean, the only number to appear less than two deviations from average. The second version of the BigGame has the flattest trend line that when combined with the deviations identify the distribution as normal. The numbers 2 and 3 were drawn the most as MegaBalls at fifteen times apiece while 11 was drawn the fewest at four times. On average, each MegaBall appeared ten times.


30_article_110_thumb_distributionv3

On May 15, 2002 the MegaMillions drawing expanded the pool of numbers available to players to range from 1 through 52. Each number has been drawn at least twenty times while the most frequently drawn number has appeared forty-five times. Overall, all numbers have been drawn roughly thirty-one times. The two most drawn numbers, 32 and 10, are the only two numbers to deviate from the mean by a factor larger than two; 2.59 and 2.41 respectively. The least drawn number, 33, is only -2.04 deviations from the mean. Although the linear trend line rises modestly, the primary numbers have a relatively even distribution over time. The MegaBall ranges from 1 through 52. The number 34 was drawn the most on 14 occasions while 51 appeared only once. Overall, each MegaBall has appeared an average of six times.


30_article_110_thumb_distributionv4

Players have had the option of drawing numbers between 1 and 56 since June 22, 2005. Every number has been drawn at least ten times while the most frequently drawn number has appeared thirty times. Overall, each number has appeared an average of twenty times. Despite being drawn the most, both 7 and 53 are only 2.17 deviations from the mean. Even the least drawn number, 47, is -2.17 deviations from the mean. Overall, the primary numbers have a relatively even distribution over time. The MegaBall ranges from 1 through 46. 4 and 42 share the most drawn position with nine wins while many numbers round up the low end with only two wins. Overall, each MegaBall has appeared an average of five times.

Behavioral Stratification of Numbers Based on Numerical Position

After looking at the behavior of the numbers in aggregate, the occurrence of numbers respective to their position was analyzed. Unfortunately, the lottery does not store the numbers in the order they were drawn. Rather, the data file saves the winning lottery numbers in ascending order.7 As such, positional analysis focused on how the numbers are stratified within their given position.

It is important to recognize the four variations of the lottery’s number pool has an impact on the ratio of occurrence for each number. As such, the data was broken into four sets titled (uncreatively) version 1, version 2, version 3 and version 4. Winning numbers per position were counted to determine the numbers that win most frequently within each set. Then, an aggregate winning percentage was assigned by combining the win ratio of each set multiplied by a time factor to obtain the overall likelihood of a number to win. The time factor represents the percentage share of drawings per version, which equates to 15.95%, 32.37%, 30.05% and 21.61%, respective to MegaMillions versions one (original) through four (current).

Each of the six graphs represent the top fifteen numbers per position:

  • Green bars represent the current version of MegaMillions where players choose from numbers 1 through 56 and a MegaBall number of 1 through 46.
  • Blue bars represent the weighted aggregation of a number’s winning percentage from all MegaMillions drawing variations since 1996.
  • The red line represents a five variable polynomial trend line to the winning percentage of the current MegaMillions drawing pool.
30_article_110_thumb_ball1analysis

Ignoring the version differences in number pooling across the lottery’s lifetime, the first ball has ranged between 1 and 37 with an average winning number of 8. Looking at the current version of MegaMillions, the first ball shows a steeper trend curve with nearly twice the drawings on winning numbers as any other ball. MegaMillions winning numbers ranged from 1 through 37 with an average winning ball of 17 since 2002. The numbers 7, 5, 1, 2 and 3 represent 39.91% of the winning numbers on the first ball.


30_article_110_thumb_ball2analysis

Ignoring the version differences in number pooling across the lottery’s lifetime, the second ball has ranged between 2 and 46 with an average winning number of 17. Looking at the current version of MegaMillions, the second ball shows a shallower trend curve. MegaMillions winning numbers ranged from 2 through 43 with an average winning ball of 22 since 2002. The numbers 13, 12, 17, 25, 10, 18, 20, 14 and 21 represent 40.77% of the winning numbers on the second ball.


30_article_110_thumb_ball3analysis

Ignoring the version differences in number pooling across the lottery’s lifetime, the third ball has ranged between 5 and 54 with an average winning number of 29. Looking at the current version of MegaMillions, the third ball shows the shallowest trend curve. MegaMillions winning numbers ranged from 3 through 54 with an average winning ball of 26 since 2002. The numbers 20, 35, 31, 25, 37, 26, 32, 24, 23 and 38 represent 39.06% of the winning numbers on the third ball.


30_article_110_thumb_ball4analysis

Ignoring the version differences in number pooling across the lottery’s lifetime, the fourth ball has ranged between 5 and 55 with an average winning number of 35. Looking at the current version of MegaMillions, the fourth ball’s trend curve begins to steepen. MegaMillions winning numbers ranged from 7 through 55 with an average winning ball of 35 since 2002. The numbers 51, 42, 46, 36, 48, 40, 38, 39 and 49 represent 40.77% of the winning numbers on the fourth ball.


30_article_110_thumb_ball5analysis

Ignoring the version differences in number pooling across the lottery’s lifetime, the fifth ball has ranged between 13 and 56 with an average winning number of 44. Looking at the current version of MegaMillions, the fifth ball’s trend curve steepens sharply. MegaMillions winning numbers ranged from 26 through 56 with an average winning ball of 41 since 2002. The numbers 53, 54, 56, 52 and 55 represent 42.92% of the winning numbers on the fifth ball.


30_article_110_thumb_megaballanalysis

Ignoring the version differences in number pooling across the lottery’s lifetime, the MegaBall has ranged between 1 and 52 with an average winning number of 20. Looking at the current version of MegaMillions, the MegaBall’s trend curve is relatively flat. MegaBall winning numbers ranged from 1 through 46 with an average winning ball of 23 since 2002. The numbers on the graph represent 44.64% of the winning numbers on the MegaBall.

Relationship Between Mutually Winning Numbers

Additionally, an analysis was performed to determine which numbers "win together." After all, a player does not need to pick all six numbers in order to win money from MegaMillions. Therefore, all possible combinations of balls 1, 2, 3, 4 and 5 were formed to analyze the occurrence of ball relationships.

30_article_110_thumb_pairs2

In the current version of MegaMillions, pairs of numbers win repeatedly quite often. In the lottery’s lifetime, particular pairs have won regularly.

Doubles

There are ten combinations of ball pairs: [1 2], [1 3], [1 4], [1 5], [2 3], [2 4], [2 5], [3 4], [3 5], & [4 5]. Using the MegaMillions data, there are 10,780 possible pairs of which 1503 are unique over the lottery’s lifetime. In the current version of MegaMillions, there are 2330 possible pairs of which 1202 are unique. Pairs of numbers occur quite frequently; 1426 pairs have occurred 10,703 times throughout the lottery’s lifetime compared with 692 pairs that have appeared 1820 times in since the fourth version of MegaMillions began.

The graph at right shows that in the current version of MegaMillions, pairs of numbers win repeatedly quite often. Over the lottery’s lifetime, particular pairs have won very regularly.

Triples

There are ten combinations of ball triples: [1 2 3], [1 2 4], [1 2 5], [1 3 4], [1 3 5], [1 4 5], [2 3 4], [2 3 5], [2 4 5], & [3 4 5]. There are 10,780 possible triples of which 8675 are unique over the lottery’s lifetime. Within the past version of MegaMillions, there are 2330 possible triples of which 2245 are unique. 1789 sets of triplets have repeated 3894 times in the lifetime of the lottery. Five sets of triplets have occurred five times and two sets of triplets have occurred six times. In the past year, however, only twenty-three sets of three balls have repeated twice.

30_article_110_thumb_pairs3

There have been many repeat winning combinations of three numbers in the lifetime of the lottery, although it is a less frequent phenomenon in the current version.


In the graph to the right, there have been many repeat winning combinations of three numbers in the lifetime of the lottery, although it is a less frequent phenomenon in the current version.

Quadruples

There are five combinations of ball quadruples: [1 2 3 4], [1 2 3 5], [1 2 4 5], [1 3 4 5], & [2 3 4 5]. There are 5390 possible quadruplets of which 5328 are unique over the lottery’s lifetime. Since June 22, 2005 there are 1165 unique combinations of possible quadruplets. Sixty-two sets of four numbers have repeated twice since MegaMillions began. There have been zero sets of quadruplets winning more than once in the current version of MegaMillions.

Quintuples

There is only one combination of ball quintuples: [1 2 3 4 5]. Only one set of five numbers has ever repeated twice in the history of MegaMillions: (11, 14, 18, 33, 48).

Common Differences Between Winning Numbers

A natural extension of analyzing number groups was to identify the trends by which numbers differ from one another. For example, while probability gives the numbers 20, 21, 22, 23 and 24 the same chance of appearing as any other combination, is it likely? The numbers for the most recent version of MegaMillions were scrutinized to determine if there is a common difference between each ball.

30_article_110_thumb_difference_analysis

The graph depicts the difference between the 1st/2nd position, 2nd/3rd position, 3rd/4th position and 4th/5th position. Overall, each split differs by an average of nine (indicated by intersecting zero). 70% of the difference distribution lie between 1 and 20. While it is unlikely for the entire series to have peculiar (tight or very wide) differences, it is not necessarily an unusual situation. Since 2005, there have been twenty-six occasions where two subsequent numbers have differed by more than 30.

Winning Number Frequency

Analyzing the distribution of numbers over time only provided half the picture in terms of any given number’s propensity towards winning. Another aspect to consider was the temporal frequency by which a number wins. For example, a number may have won on thirty occasions, but maybe they were all two years ago. To study this behavior, the time delta between each number’s appearance was cataloged to establish statistics for all numbers and for each number across the lifetime of the fourth version of MegaMillions. Then, the analysis was repeated using only the most recent six months of data to identify the cross section of numbers that win frequently consistently and which numbers are just a current flash in the pan.

30_article_110_thumb_freqanalysislifetime

This graph depicts the numbers with the shortest, average time between appearance since version four of MegaMillions began in 2005. Simply having a low average, however, is not a good indicator that a number occurs frequently. The blue line depicts the ratio of wins that occurred with a time delta below average. Numbers with a ratio above 50% are indicative of winning often in “clumps” whereas numbers below 50% represent a wider spread of win frequency. It should be noted, however, that low frequency “clumping” also comes with a corresponding dearth of appearance.


30_article_110_thumb_freqanalysis6mo

The six month graph replicates the logic found above on the lifetime graph, only this time restricted to winning numbers over the past six months. The intersecting cross section of frequently winning numbers represent those that not only win over time but those that have demonstrated a recent propensity for appearing often. Over six months, there is a span of numbers that have both a low delta of days between appearance ’’and’’ a high ratio of occurrences on a more frequent basis than average. Equally, a handful of the recent numbers have low ratios, indicating their average is skewed by bursts of low delta wins offset by many frequent gaps of significant length.

Conclusions

Interesting as these trends may be, they will not assist in making the odds of winning the MegaMillions lottery any better if the system is truly fair and random. However, in the event there is some peculiar factor skewing the ball selection such that any of these trends continue, a player stands a mildly better chance of winning a partial prize through the selection of weighted numbers.

1 "The Lottery is Rigged." Uncoverer. Accessed October 2007 from http://www.uncoveror.com/lottery.htm.

2 Roland, Neil. "Mega Millions Lottery Jackpot Now Record $370 Million." Bloomberg. Accessed October 2007 from http://www.bloomberg.com/apps/news?pid=20601103&sid=afUgc0t0u3hg&refer=us.

3 "How to Play: Play the game." MegaMillions.com_. Accessed October 2007 from http://megamillions.com/howtoplay/play_game.aspgame.asp.

4 "About Us: Game History." MegaMillions.com_. Accessed October 2007 from http://megamillions.com/aboutus/game_history.asphistory.asp.

5 New Jersey Lottery. Accessed September 2007 from http://www.state.nj.us/lottery/data/big.dat.

6 Microsoft SQL Server Express. Accessed September 2007 from http://www.microsoft.com/sql/editions/express/default.mspx.

7 Ultimately, order does not matter with lottery numbers.

Similarly tagged OmniNerd content:

Information This article was edited after publication by the author on 31 Jan 2011. View changes.
Thread parent sort order:
Thread verbosity:
2 Nerd-Its - +
Where's the analysis? by Anonymous :: NR0

You’ve collected and summarized a bunch of data here, but it would be much more useful (and straightforward) to run some inferential procedures (Monte Carlo procedures would be particularly easy to implement in this case) to see if these results were compatible with the hypothesis of a fair game. Give it a try!

Hi, first of all congratulations on your deep analysis.

I made something similar on the Spanish Lotto may years ago but using a different approach.

I started asumming the game was fair and the lottery wasn’t rigged. Then I compared every winning combination (and sub-combinations of 3, 4 and 5 winning numbers) with the expected number of winners for the total bets on each game each week.

Using this method I confirmed a theory many people (myself included) has: that not all the players play their numbers "randomly" but that they have some "favourite numbers" and others that people just don’t like. Combos like 1-2-3-4-5 are played a lot, also date combinations — while numbers from 31 seem to be played less — leading to too-many-winners or no-winner scenarios in each case.

You can detect this by checking how many winners are in each prize category on each game, comparing with the expected statistical results (not only for the big prize but for the smaller ones also).

On the Spanish Lotto (a basic 6/49 lotto, with 1 in 14 million odds) sometimes you have no big prize winners even when there are 30 or 40 million bets (and there "should be" at least 1 or 2); sometimes you may have 10 or 20 winners with only 10-15 million bets (for an «easy» combo like 1-2-4-8-9-24 or 7-14-21-28-41-42).

People can select their own numbers or let the «machine» select it at the shop, those numbers are supposed to be random and doesn’t seem to influence on this (even if they are large).

If you read Spanish or can find a good translator you can find my (totally amateur) work here

http://www.microsiervos.com/archivo/azar/loto-un-sistema.html

— Alvy

1 Nerd-It - +
Any predictions? by Anonymous :: NR0

I have read the article and would like to ask if you think that the results can be used to predict any trends in future outcomes of the lottery?

If not, what test would be necessary to prove that the balls are not evenly distributed?

-7 Nerd-Its - +
Um... by Anonymous :: NR0

Assuming a completely fair game could a player still benefit from playing numbers that were played less frequently by other players? For example, you could play more numbers higher than 31 which are not dates, so there are fewer other players to share a prize with, assuming other players play 1 to 31 more frequently. Not only would the winning numbers be looked at but also the number of winners for a given combination or number.

Turns out I did this exact same analysis (but i didnt think to write it up, nor that anyone would be interested in reading it).

At the time I did it (well over a year ago), 32 was the most common winning number and 3 was the most common mega ball.

The problem than, and the problem now, is that there havent been enough drawings to see that the histogram is truly flat or not (truly random).

I would suggest that, the drawing is in fact random, and choosing least common numbers will be better than choosing more common numbers. the reason for this is that eventually we would expect the histogram to be flat, in time the occurrence of infrequently drawn numbers should increase so as to flatten out the histogram.

However, each drawing is random. Before we see another 35 (the least common number when I did it), we could see 20 more 32s.

I read the article and the first thing I noticed is that he didn’t account for the number of times that there wasn’t any mega millions winner…you know, how many weeks went by without a jackpot winner…where are those "no winner" numbers in his analysis?

1 Nerd-It - +
Mixing errors by Anonymous :: NR0

Has anybody seen this – it’s about using mixing errors in lotto picks:

http://use4.com/Prove-it.html

1 Nerd-It - +
Thumbs52 by Anonymous :: NR0

In what area did you include the "luck" factor? The statistical analysis appears complete; except… in all areas of gathered factual information there tends to be a human factor skewing the results in one fashion or another. Might the human factor in your analysis be "luck"? After all, it appears that some people are simply luckier than others.

1 Nerd-It - +
Location of winners by Anonymous :: NR0

Does anyone have data showing the distribution of tickets vs winners? I remember, a while back, that it was skewed towards Georgia for some reason…

Some lottery corporations are changing the balls set over time.

1 Nerd-It - +
Great article! by Anonymous :: NR0

This is great work, thanks for posting this. I’ve done some similar ongoing analysis, and created a website http://www.lottoroller.com/megamillions for both MegaMillions and PowerBall.

1 Nerd-It - +
Vegasgamer by Anonymous :: NR0

I can claim myself as a Powerball winner and it was all luck. My winnings amounted to 3500 dollars and it was a bonus drawing ticket and had nothing to do with picking the right numbers. That being said, I lived in Las Vegas for seven years and played a lot of video keno which is just like the lottery. I won some jackpots but lost more than I won. I won on single screen keno and I won on multiple screen keno which is like playing up to twenty lottery number combinations on one game. I probably played well over 100,000 games of keno or more and I tried every trick in the book. What I found is that there are two ways to win without a lot of luck and that is 1.) to overwhelm the game with number combinations. Or 2.) Do like the 80 year old man who recently won powerball and play the same numbers every week for 17 years (which is just what he did). The first option is expensive and the second option relies on the possibility (not probability) that if you stay on the numbers long enough, they will hit (as is the case in Keno). Take your pick.

0 Nerd-Its - +
Additional information by Anonymous :: NR0

I recently performed some simple statistical testing on the data set corresponding to the current rules with 56 draw and 46 bonus balls. The data appears to exhibit acceptable variance, randomness and normality based on Chi Squared, runs test for randomness, and Anderson-Darling test for normality. The upshot, is that no matter how you pick your numbers, you have 1 in 175 million chance of winning. Best of luck.

Interesting line of thought regarding your data, matthew. I’ve also been down this road, but have since abandoned such trend/frequency analyses. However, i found a theory online specifically addressing the true randomness of lotteries when a mechanical means is used to generate the results, e.g. balls bouncing about in a glass box via tumblers, air, etc. What this theory, which i call “the mechanical selection of randomization” (since i don’t remember what the author called it) did was confirm i was seeing patterns in something allegedly random. simply stated, the theory hypothesizes that if a mechanical process is used to select something randomly, and there are no variables introduced to alter the natural mechanical process, unintended PATTERNS inherently occur over time due to the repetitive nature of the mechanical process. i believe the theory is sound and lotto commissioners took note of it, also. if you’re on this thread, i assume it’s because you’re also fascinated by the alleged randomness of a lottery, or you see peculiarities in lottery results just by reading the actual winning numbers in chronological order. nevertheless, as i’ve experienced through the type of research mr. vea shares, seeing a pattern and predicting one in a 6/42-based lottery is an entirely different matter. for those live lotto draws, wise commissioners could easily thwart folks like us who love to crunch and analyze data simply by changing the order balls are dropped, rearranging the ball setup, increasing the air pressure for applicable systems, letting the balls tumble about longer, etc. “weighted balls”, imo, raises the specter of impropriety and corruption. then there are those backroom lotto draws that mys tate either does or used to. for example…if folks like us knew every lotto combination purchased for a particular play date, we could then determine which combinations weren’t purchased. if the draw isn’t live, a lottery commissioner, with the aforementioned knowledge could conceivably control WHEN and WHAT number combinations won…assuming all possible winning combinations weren’t actually purchased. anyway, if someone can predict the next lottery number “pattern” by examining trends and frequencies, that person is a g’damn genius! (or one lucky sob.)

“unintended patterns due to the repetitive nature of a mechanical process undermines true randomness”. it is this thought that motivated me to no longer consider looking at how often a number appears nor even when, but to focus more on GROUPS. it is my theory, piggy-backing on that mechanical selection of randomness theory, that if true randomness is undermined by a mechanical selection process, over time those patterns that develop can be identified through GROUPS. hint: in my theory you must think beyond how many balls are actually drawn to comprise a winning combination. forgive me folks, but i love opining about lotto!

Repeated numbers (( Only one set of five numbers has ever repeated twice in the history of MegaMillions: (11, 14, 18, 33, 48)).Please tell me im wrong But i dont think that any numbers have repeated themself like that AT LEAST NOT AS OF THIS COMMENT. PLEASE INFORM ME IF IM WRONG. lottery pirate

0 Nerd-Its - +
No Sequential Order by Anonymous :: NR0

Too bad the lottery does not publish the numbers in the order they were drawn. That would make a measure of central tendencies more acurate to analyze. New York has a FOIL (Freedom of Information Law) which allows someone to request lottery records at $0.25 per page, but then you would have to enter the data in by hand.

Sucks huh?

Maybe the reason they don’t publish it is so it’s harder to make forecasts.

0 Nerd-Its - +
mega millions by mr-brent :: NR0

have you studied roulette numbers into lottery it comes alot to win 4 numbers 5 numbers in grandseries and tiers if you look at the numbers are standed tier is 33 16 24 5 10 23 8 30 11 36 13 27

-1 Nerd-It - +
Analysis... by Anonymous :: NR0

Great work! So with all this info, how much have you actually won? Must be a lot, because only someone with A LOT of time on their hands (like a lottery winner) would come up with something like this.

0 Nerd-Its - +
Plinko Rule by Anonymous :: NR0

Great work here on the data analysis. One thought to bear in mind, all the lotto balls are not dropped, thus you may not have a full frequency analysis. This may sound a little strange, but to really get the answers you are looking for you need a test. No self respecting engineer would do this on data alone, you need the physical test to accelerate and refine the data. I saw an exhibit recently that comes to mind, it was at a children’s science fair. It was a giant plinko board about 10 feet tall by 15 feet wide. It released balls at the center point at the top from a tumbler. It was designed to explain statistics and distribution theory. It worked wonderfully as you would get a nice distribution curve, almost perfect, visually impressive. My thought would be to build such a device and drop the lotto balls “numbered” and run the test 10,000 times. Log the data and then give me your comparative results. Two tests would even be better. try dropping all 50 balls and test the repeatable distribution of number sequences. The problem with most folks is we’re lazy and won’t take the time to engage such a project. Plywood, nails, ping-pong balls….away ya go…. Take your own data…..

Sincerely,
Test Engineer

0 Nerd-Its - +
Great Data by Anonymous :: NR0

Hey I have to say this is some great data, but I am not sure I understand what we are concluding here. The fact that the lottery is random, does this data indicate that over time the curve becomes normal for particular numbers, or is this saying just the opposite?

0 Nerd-Its - +
An Alternate take by Anonymous :: NR0

The biggest problem you face here is that the sample set versus the possible combinations is very small. To spot an honest trend here would require millions of drawings versus the thousands that have occurred. At least your data doesn’t point to any glaring non-random trends that would cause one to be suspicious. For those who have conspiracy theories about the game just remember there is no centralized data processing of the numbers sold – each state holds their own sold number data base and they all scan after the drawing to find winners and then report that upstream to their collective central control group.

For those of you looking for some edge in this game, there is no mathematical one as the data shows (half the proof being the author published the data!) but there is a way to play this game from a gamblers perspective that is superior to the usual approach.

An example: your gang from work buy 100 tickets each week, everyone throws in 5 bucks or whatever. You could get a “bullseye” win but when your buddy buys the tickets he is going to get 100 quick picks and thats it – in 100 tickets there is a fairly good chance (not high but real) that you dont even get the megaball (or powerball) once.

The lottery slips let you choose a powerball or megaball and have the rest chosen at random. The smart thing to do is this: instead of buying 100 tickets buy just 2×46 tickets so you have 92 tickets (or do 3x at 138). You now have 2 or 3 of every megaball. This guarantees a minimum of 2 or 3 ticket wins AND it reduces the chance of getting 5 numbers WITHOUT getting the powerball to some degree.

Looked at from another angle if you buy 46 tickets at random, they all have 3 matching/drawn/winng regular numbers for some insane reason BUT none of them have the megaball you win like 10 bucks for each. Buying every megaball for sure however turns one of the tickets into a $150 winner guaranteed.

If you intend to buy large amounts of tickets its insane to not buy every megaball because its easy – there are two tanks, one you need to match 4 of 56 numbers from – thats hard, the other you just need to match 1 of 46 – thats so easy you can just buy them all and reduce the game odds slightly by avoiding the “no megaball” outcome.

0 Nerd-Its - +
On The Right Track by Anonymous :: NR0

Your analysis is one of the more thoughtful I have seen online.

You are on the right track using SQL. That could cutdown on programming a bespoke algorithm, but may lack flexibilty in the long run.

I noticed you analysed data from a lottery site. Such data is skewed since no lottery draws are ever sequential. One of the first rules of statistics is boring old observation. Observing each draw is tedious but necessary.

One of the commentors mentions air pressure, ball drop order and mixing period. Mixing periods do vary per draw, but not significantly (in the lottery I study). This commentor has discovered the non-statistical aspect of lotteries which appears to be the inescapable power of geometry and physics, not to mention gravity and trajectories of bouncing balls.

Any real attempt at scientific prediction of lottery results would have to consider all non-statistical elements, as well as statistics based on well-observed data.

The fascinating aspect of lottery predictions are the extensive scientific disciplines that can be deployed to aid accuracy. Some of these are not apparent in prototype system development. Its likely a prediction system prototype for lotteries would take years to develop… even decades.

So, I find myself asking this question:

If I had to search through millions of combinations to find one that had a financial value way beyond what I could earn in a lifetime, would I still continue this search?

Of course most of us would. In a sense, we are biologically programmed to search for value. So the commentor who mentioned more valuable groups (of combinations) is certainly heading in the right direction. Better to search in rich fertile lands, than barren wastes, right?

0 Nerd-Its - +
Too many unknowns by Anonymous :: NR0

As in depth as your analysis is, without being able to recreate the drawing procedure exactly to minimize variables, I don’t believe statistical analysis of past results can prove rigged theories or help predictions.

I compared 2010 results and there are certainly odd frequencies that would leave one to think it’s absolutely rigged, but without being able to perform the procedure myself, I’ll just have to consume the data, pray the game isn’t rigged, and pick my numbers.

“The rigging works like this: super computers keep track of each combination sold, and then the ping-pong balls are weighted to assure that a losing combination comes up. On rare occasions, all possible combinations are sold, and they must let someone win. Only then is the game honest” as per the link supplied in the footnotes http://www.uncoveror.com/lottery.htm

This would be more believable if ticket sales didn’t close a mere 20 minutes prior to the drawing. 20 minutes isn’t much time to analyze non-purchased combinations and then weight the balls with a gas.
Aside from that, should the scandal be leaked, (as the recent Manning case) it would be political suicide and the lottery industry would collapse.

Terrific article and great work. I think the information here will lessen the odds of losing as you (we) know what not to do. Now I have to get back to work and study the previous years.

By the way, how long did it take you to gather this information and form your stats?

Good luck,
Joe

0 Nerd-Its - +
hmmm.... by Anonymous :: NR0

Using Excel for statistics is a big no-no. A proper statistical package, like R or SPSS should be used for this sort of things.

Also I don’t even see a proper statistical test here, I don’t know a chi square test against an uniform distribution of values would be a good start. Looking at the SD deviation from the linear regression of the counts is not very informative.

Does anyone know if Mega and/or Power Ball use more than one set of balls for the drawings. When I lived in AZ years ago, their state lotto had 3 or 4 (can’t remember exact number) different sets of balls. The sets were selected randomly before each draw.

0 Nerd-Its - +
Does this make sense? by Anonymous :: NR0

How about the 19 y.o. who had a dream about 6 numbers and never played the lottery in his life? He had no idea why he kept hearing these numbers and his Mother told him to play the lottery with these. He won $35million. Or the two women(separate winners) who visualized winning or praying about it? Or the people who use birthdates/anniversaries? Some say it’s the time of day that you buy your ticket.

It would be nice to know the answer and even better to win!!

Drawing balls are retired after a specific number of months, or earlier if the balls display any type of wear. Off camera, in front of an auditor two random envelope drawings are completed to choose both the drawing machine(s) and ball set(s). If there are any sort of patterns appearing in the envelope draw the auditor can ask for a redraw. Typically 2-3 on camera test drawings occur before each official draw (all in front of the same auditor). This is done to verify that the random machines are working correctly and that random balls are producing random results; if they are not the auditor can request additional testing, inspection, or even another random envelope selection. Machines are cleaned daily in front of auditor after official draw. Quarterly a physical inspection of each ball is performed, if one ball in the case is damaged the entire case is replaced with new balls. All those things to consider, the important factor is that balls get replaced – long term analysis on ball draws may be less accurate/effective than short term. This assumes that you know when a ball set has been replaced.
Oh, one last thought for anyone who thinks it might be rigged no one gets near locked machine/ball storage room alone without auditors; and if there is a fire alarm before drawing you have to protect the machines/balls first and bring them with you outside in front of auditor.

0 Nerd-Its - +
My approach by Anonymous :: NR0

I’ve played PowerBall for years now and use a very simple approach. I purchase a $5.00 ticket and select all 2’s. Odds are in my favor that the 2’s will eventually be drawn.

OmniNerd Article Propagation

The Showcase

Nerd-Its   Nerd Trends   Last Ten  

  1. Mon dieu! in Should we care about Syria et al?
  2. roach power in The Matrix Is Coming
  3. RE: Encrypted Data CAN Be Like Burned Data in Judges Rules In Favor of Forced Decryption
  4. RE: It's OK in Judges Rules In Favor of Forced Decryption
  5. RE: It's OK in Judges Rules In Favor of Forced Decryption
  6. RE: It's OK in Judges Rules In Favor of Forced Decryption
  7. RE: It's OK in Judges Rules In Favor of Forced Decryption
  8. Encrypted Data CAN Be Like Burned Data in Judges Rules In Favor of Forced Decryption
  9. RE: It's OK in Judges Rules In Favor of Forced Decryption
  10. RE: It's OK in Judges Rules In Favor of Forced Decryption

What is OmniNerd?

Omninerd_icon Welcome! OmniNerd's content is generated by nerds like you. Learn more.

Voting Booth

Should we care about Syria et al?

10 votes, 2 comments