What is OmniNerd?

Welcome! OmniNerd's content is generated by you, the reader. Through voting and moderation we strive to highlight the nerdiest of what's around and provide content that's a little more thought provoking than other sites.

Submit New Content

Voting Booth

Reaction to Michelle Obama saying, "For the first time, I am proud of my country"?

41 votes, 7 comments
2
Nerd-Its
+ -

Where's the analysis?

Comment comment by Anonymous on 01 November 2007

You've collected and summarized a bunch of data here, but it would be much more useful (and straightforward) to run some inferential procedures (Monte Carlo procedures would be particularly easy to implement in this case) to see if these results were compatible with the hypothesis of a fair game. Give it a try!

Star This to Save in Your Profile Favorite
Thread parent sort order:
Highest Voted : Lowest Voted : Oldest : Newest
Thread verbosity:
Expand All : Minimize Replies to Comments

You've collected and summarized a bunch of data here, but it would be much more useful (and straightforward) to run some inferential procedures (Monte Carlo procedures would be particularly easy to implement in this case) to see if these results were compatible with the hypothesis of a fair game. Give it a try!

Actually - using some optimized queries in MySQL against the tables that were created in the analysis allowed to rank score every conceivable lottery number. Optimization was needed to get the run time down from one year to about 36 hours. But, you don't think I'm going to give away MY number and share that -cough- winning -cough- with everybody?

5 Nerd-Its - +
RE: Where's the analysis? by Anonymous :: NR0

I did do some analysis. Compensating for the four (ouch) different types, I get an overall p-value of 0.792. This means there is absolutely NO statistical significance...

You can test it yourself if you have R:

#Get big.dat at http://www.state.nj.us/lottery/data/big.dat
big=read.table("big.dat",sep="%",fill=T)
big$date=as.Date(apply(big[,1:3],1,paste,collapse="-"))
big$type=ifelse(big$date>"1999-1-13",ifelse(big$date>"2002-3-15",ifelse(big$date>"2005-06-22",4,3),2),1)
big$maxnorm=c(50,50,52,56)[big$type]
big$maxspecial=c(25,35,52,46)[big$type]
maxnorms=table(big$maxnorm)
p=rep(0,56)
for(i in 1:nrow(maxnorms))
p[1:as.numeric(names(maxnorms)[i])]=p[1:as.numeric(names(maxnorms)[i])]+maxnorms[i]*5
maxspecial=table(big$maxspecial)
for(i in 1:nrow(maxspecial))
p[1:as.numeric(names(maxspecial)[i])]=p[1:as.numeric(names(maxspecial)[i])]+maxspecial[i]
p=prop.table(p)
allnum=unlist(big[,5:10])
t=table(allnum)
chisq.test(t,p=p)
plot(t/p)

P

1 Nerd-It - +
RE: Where's the analysis? by Occams :: NR6

I can't get over the fact that every ball has an equal chance of coming out. Therefore the sequence 1,2,3,4,5,6 is just as likely to come up as any other sequence. But it never does, so that demonstrates how likely you are to choose a winning sequence.

No amount of ex-post pattern analysis can change that.

1 Nerd-It - +
RE: Where's the analysis? by Anonymous :: NR0

I may have missed this in your graphs and charts, (my understanding of math is dubious at best) but I was wondering if you noticed what I noticed. In 2001 I noodled around with the numbers for 2 years in an excel sheet to see what numbers came up the most, and after mapping out the numbers by 3 month seasons I saw an ebb and tide pattern to the grouping of numbers.

For example, the first 5 numbers would have low sequences (mostly from 1 to 20) for couple of weeks, then it would crest to high sequences (mostly 21 and up) for a month and a half or so, and it kept repeating this over the previous couple of years. The Megaball did the same thing, but in an reverse pattern, staying mostly low, and cycling up for a couple of weeks.

At that point I was stuck on how to come up with playable numbers within the two different groupings and never went further.

By the way good thread! And did it ever work out for you?

TMX

themysteriousx@sbcglobal.net