I did do some analysis. Compensating for the four (ouch) different types, I get an overall p-value of 0.792. This means there is absolutely NO statistical significance…
You can test it yourself if you have R:
bc..
Get big.dat at http://www.state.nj.us/lottery/data/big.dat
big=read.table("big.dat",sep="%",fill=T)
big$date=as.Date(apply(big[,1:3],1,paste,collapse="-"))
big$type=ifelse(big$date>"1999-1-13",ifelse(big$date>"2002-3-15",ifelse(big$date>"2005-06-22",4,3),2),1)
big$maxnorm=c(50,50,52,56)[big$type]
big$maxspecial=c(25,35,52,46)[big$type]
maxnorms=table(big$maxnorm)
p=rep(0,56)
for(i in 1:nrow(maxnorms))
p[1:as.numeric(names(maxnorms)[i])]=p[1:as.numeric(names(maxnorms)[i])]maxnorms[i]*5
maxspecial=table(big$maxspecial)
for(i in 1:nrow(maxspecial))
p[1:as.numeric(names(maxspecial)[i])]=p[1:as.numeric(names(maxspecial)[i])]maxspecial[i]
p=prop.table(p)
allnum=unlist(big[,5:10])
t=table(allnum)
chisq.test(t,p=p)
plot(t/p)
I did do some analysis. Compensating for the four (ouch) different types, I get an overall p-value of 0.792. This means there is absolutely NO statistical significance…
You can test it yourself if you have R:
bc..
big=read.table("big.dat",sep="%",fill=T)
big$date=as.Date(apply(big[,1:3],1,paste,collapse="-"))
big$type=ifelse(big$date>"1999-1-13",ifelse(big$date>"2002-3-15",ifelse(big$date>"2005-06-22",4,3),2),1)
big$maxnorm=c(50,50,52,56)[big$type]
big$maxspecial=c(25,35,52,46)[big$type]
maxnorms=table(big$maxnorm)
p=rep(0,56)
for(i in 1:nrow(maxnorms))
p[1:as.numeric(names(maxnorms)[i])]=p[1:as.numeric(names(maxnorms)[i])]maxnorms[i]*5
maxspecial=table(big$maxspecial)
for(i in 1:nrow(maxspecial))
p[1:as.numeric(names(maxspecial)[i])]=p[1:as.numeric(names(maxspecial)[i])]maxspecial[i]
p=prop.table(p)
allnum=unlist(big[,5:10])
t=table(allnum)
chisq.test(t,p=p)
plot(t/p)
P