| |
 |
|
|
Science Forum Index » Cryptography Forum » issues with statistical test suite from http://csrc.nist.gov
Page 1 of 3 Goto page 1, 2, 3 Next
|
| Author |
Message |
| Luke Kenneth Casson Leigh |
Posted: Fri Jan 16, 2004 8:22 am |
|
|
|
Guest
|
many people, including myself, will be using the csrc's
statistical test suite, sts, which is available from the above URL.
i have been using this program for long enough now to have
become suspicious of some of its foibles and quirks.
eventually i downloaded the diehard cdrom data this is purported
to be a "true" random source, as a baseline comparison.
if anyone is using the sts test suite, please be suspicious of the
following tests, and avoid them. the list of tests which i am
suspicious of is incomplete: there could be others, but they could
equally well be perfectly okay.
i strongly urge anyone performing statistical analysis of their PRNGs
and encryption algorithms to DOUBLE CHECK the statistical test suite
tools that they are using against a known "true" random baseline.
the alternative explanation is of course that the diehard cdrom data,
http://stat.fsu.edu/pub/diehard/cdrom/ is not "true" random data.
- FFT test in anything less than 1e6 sample sizes
a test on the minimum recommended length of 1000 or a
anything up to around 1,000,000 bits, shows non-uniform
distribution of p-values on what is purported to be
"true" random data.
on a test of length 1,000, the C8 histogram bucket,
representing p-values of between 0.7 and 0.8, was empty!
- Rank test on anything less than 1e6 sample sizes
a test of 100,000 sample sizes on 8,000 runs gives a
skewed histogram of p-values - sufficient for the data
to be determinined as non-random.
- Lempel-Ziv test on anything.
this one is particularly weird: on sample sizes of 1e6,
you should not be looking for a uniform distribution of
p-values but instead should be looking for a small percentage
of extra p-values in the C4 (0.3 - 0.4) histogram bucket!
the exact reason why this test gives a non-uniform distribution
of p-values isn't clear, but modifying src/results.c HIST_SIZE
from 10 to 100 gives some clues as to what is going on.
in short, stay well clear of using this test.
ehbedehbethat's'all'folks.
l. |
|
|
| Back to top |
|
| Cristiano |
Posted: Fri Jan 16, 2004 3:03 pm |
|
|
|
Guest
|
Luke Kenneth Casson Leighton wrote:
Or that your implementation is bad.
Quote: - FFT test in anything less than 1e6 sample sizes
a test on the minimum recommended length of 1000 or a
anything up to around 1,000,000 bits, shows non-uniform
distribution of p-values on what is purported to be
"true" random data.
on a test of length 1,000, the C8 histogram bucket,
representing p-values of between 0.7 and 0.8, was empty!
Really!? I suggest to double check your implementation!
Running that test 100 times I get a very good distribution; all the bins
have about 10 p-values.
Quote: - Rank test on anything less than 1e6 sample sizes
a test of 100,000 sample sizes on 8,000 runs gives a
skewed histogram of p-values - sufficient for the data
to be determinined as non-random.
Also in this case I get the p-values: perfectly skewed (-0,031), correctly
binned and evenly distributed.
Quote: - Lempel-Ziv test on anything.
this one is particularly weird: on sample sizes of 1e6,
you should not be looking for a uniform distribution of
p-values but instead should be looking for a small percentage
of extra p-values in the C4 (0.3 - 0.4) histogram bucket!
I don't know what kind of code you are using. You should get p-values from 0
to 0.5 because they said "The test is preferably one-sided" so they use 1/2
erfc(...). But we need a p-values from 0 to 1, so you must throw away 1/2.
This way I get a good distribution (with good generators).
Quote: in short, stay well clear of using this test.
In short: you have not read the NIST paper, you have not checked your
implementation against the test vectors and you don't know what you're
saying, so you should avoid this kind of claims.
Cristiano |
|
|
| Back to top |
|
| Mack |
Posted: Fri Jan 16, 2004 5:47 pm |
|
|
|
Guest
|
On 16 Jan 2004 05:22:14 -0800, lkcl@lkcl.net (Luke Kenneth Casson
Leighton) wrote:
Quote: many people, including myself, will be using the csrc's
statistical test suite, sts, which is available from the above URL.
i have been using this program for long enough now to have
become suspicious of some of its foibles and quirks.
eventually i downloaded the diehard cdrom data this is purported
to be a "true" random source, as a baseline comparison.
if anyone is using the sts test suite, please be suspicious of the
following tests, and avoid them. the list of tests which i am
suspicious of is incomplete: there could be others, but they could
equally well be perfectly okay.
i strongly urge anyone performing statistical analysis of their PRNGs
and encryption algorithms to DOUBLE CHECK the statistical test suite
tools that they are using against a known "true" random baseline.
the alternative explanation is of course that the diehard cdrom data,
http://stat.fsu.edu/pub/diehard/cdrom/ is not "true" random data.
- FFT test in anything less than 1e6 sample sizes
a test on the minimum recommended length of 1000 or a
anything up to around 1,000,000 bits, shows non-uniform
distribution of p-values on what is purported to be
"true" random data.
on a test of length 1,000, the C8 histogram bucket,
representing p-values of between 0.7 and 0.8, was empty!
- Rank test on anything less than 1e6 sample sizes
a test of 100,000 sample sizes on 8,000 runs gives a
skewed histogram of p-values - sufficient for the data
to be determinined as non-random.
- Lempel-Ziv test on anything.
this one is particularly weird: on sample sizes of 1e6,
you should not be looking for a uniform distribution of
p-values but instead should be looking for a small percentage
of extra p-values in the C4 (0.3 - 0.4) histogram bucket!
the exact reason why this test gives a non-uniform distribution
of p-values isn't clear, but modifying src/results.c HIST_SIZE
from 10 to 100 gives some clues as to what is going on.
in short, stay well clear of using this test.
ehbedehbethat's'all'folks.
l.
The tests give passes but the distribution of the p values are
skewed. There appears to be a problem with the way the
p values are calculated beyond the tails.
Leslie 'Mack' McBride
remove text between _ marks to respond via e-mail |
|
|
| Back to top |
|
| Scott Nelson |
Posted: Fri Jan 16, 2004 11:54 pm |
|
|
|
Guest
|
On 16 Jan 2004 05:22:14 -0800, lkcl@lkcl.net (Luke Kenneth Casson
Leighton) wrote:
[snip happens]
Quote:
eventually i downloaded the diehard cdrom data this is purported
to be a "true" random source, as a baseline comparison.
DANGER WILL ROBINSON!
There is a known problem with early versions of the "random numbers"
on some diehard CDs. Newlines (0x0a) were appended to the
Carriage Returns (0x0d). (You can thank Microsoft for that...)
Check the data you have and see if there are any CRs
followed by something other NL.
Scott Nelson <scott@helsbreth.org> |
|
|
| Back to top |
|
| Luke Kenneth Casson Leigh |
Posted: Sat Jan 17, 2004 6:13 am |
|
|
|
Guest
|
lkcl@lkcl.net (Luke Kenneth Casson Leighton) wrote in message news:<29416bef.0401160522.676e27a9@posting.google.com>...
clarification [additional information]:
sts compiled under debian linux, with gcc 3.3, sts version 1.5.
numbers from diehard site downloaded (not sent cd) in approx nov 2003.
numbers don't appear to consist of carriage-return-newline (0xa is not
always followed alway by 0xd, or vice-versa), as demonstrated by a
quick hexdump -C | grep "0d " | more
l. |
|
|
| Back to top |
|
| Luke Kenneth Casson Leigh |
Posted: Sat Jan 17, 2004 6:52 am |
|
|
|
Guest
|
"Cristiano" <cristiano.pi@NSquipo.it> wrote in message news:<MsXNb.100814$VW.4053715@news3.tin.it>...
Quote: Luke Kenneth Casson Leighton wrote:
the alternative explanation is of course that the diehard cdrom data,
http://stat.fsu.edu/pub/diehard/cdrom/ is not "true" random data.
Or that your implementation is bad.
implementation of what? please be more specific.
i have compiled sts [with gcc 3.3, under linux].
i ran tests.
they failed against what is purported to be a "true" random data source.
therefore there is something wrong.
no "implementations" are involved.
Quote: - FFT test in anything less than 1e6 sample sizes
a test on the minimum recommended length of 1000 or a
anything up to around 1,000,000 bits, shows non-uniform
distribution of p-values on what is purported to be
"true" random data.
on a test of length 1,000, the C8 histogram bucket,
representing p-values of between 0.7 and 0.8, was empty!
Really!? I suggest to double check your implementation!
no "implementation" is involved.
Quote: Running that test 100 times I get a very good distribution; all the bins
have about 10 p-values.
a run of 100 times [on an unspecified or in fact _any_ sample size]
is not sufficient to show statistical abnormalities in what is expected
to be a uniform distribution of the p-values.
in order to demonstrate that the p-values are CLEARLY not uniformly
distributed, as expected, you will need to run AT LEAST 800 samples.
it is better to run 8,000.
Quote: - Rank test on anything less than 1e6 sample sizes
a test of 100,000 sample sizes on 8,000 runs gives a
skewed histogram of p-values - sufficient for the data
to be determinined as non-random.
Also in this case I get the p-values: perfectly skewed (-0,031), correctly
binned and evenly distributed.
how many runs, what sample size, and what data did you use?
Quote: - Lempel-Ziv test on anything.
this one is particularly weird: on sample sizes of 1e6,
you should not be looking for a uniform distribution of
p-values but instead should be looking for a small percentage
of extra p-values in the C4 (0.3 - 0.4) histogram bucket!
I don't know what kind of code you are using.
sts 1.5.
Quote: You should get p-values from 0
to 0.5 because they said "The test is preferably one-sided" so they use 1/2
erfc(...). But we need a p-values from 0 to 1, so you must throw away 1/2.
sorry?
Quote: This way I get a good distribution (with good generators).
i am not talking about using generators, i am talking about input from
what is purported to be a "truly" random data source.
Quote: in short, stay well clear of using this test.
In short: you have not read the NIST paper,
you have been spying on me for over a year, and have evidence that
supports your claim???
:)
Quote: you have not checked your
implementation against the test vectors and you don't know what you're
saying, so you should avoid this kind of claims.
this is not about any particular implementation: no "implementations",
whatever they are, are involved.
i have spoken to the people at csrc.nist.gov: they too are aware of a
problem with the lempel ziv test, and are considering removing it
altogether.
if you are perceiving "success" where i am not, perhaps with cooperation
we may both understand what is going on, and thereby improve the quality
of the tests.
so, reading between the lines, and filtering the "i am right, you are wrong"
attitude of your message, you are saying that you have run some
tests and find the results do not match with mine.
could you please provide background information: e.g. what version of sts
are you using, where did you obtain it from, did you compile it yourself
or run the pre-compiled version, have you made any modifications to the
sts code, what data are you using?
are you happy to download the same data set that i have been using in order
to eliminate one or more possibilities of something being wrong?
i.e. are you willing to be helpful and establish the facts, rather than
be dismissive?
l. |
|
|
| Back to top |
|
| Luke Kenneth Casson Leigh |
Posted: Sat Jan 17, 2004 6:55 am |
|
|
|
Guest
|
Scott Nelson <scott@helsbreth.org> wrote in message news:<dmfh00l7hq0ur3p2h07tl9j6sm2fmarplu@4ax.com>...
Quote: On 16 Jan 2004 05:22:14 -0800, lkcl@lkcl.net (Luke Kenneth Casson
Leighton) wrote:
[snip happens]
eventually i downloaded the diehard cdrom data this is purported
to be a "true" random source, as a baseline comparison.
DANGER WILL ROBINSON!
There is a known problem with early versions of the "random numbers"
on some diehard CDs. Newlines (0x0a) were appended to the
Carriage Returns (0x0d). (You can thank Microsoft for that...)
Check the data you have and see if there are any CRs
followed by something other NL.
ta scott. double-checked: there shouldn't have been any: i ran
hexdump -C | grep "0d " | more and there don't appear to be any
frequent occurrences of 0a 0d or 0d 0a even.
i should imagine it would depend on whether you were using MS and
a MS browser to perform the download: i would expect IE to do something
stupid like "oh, it's a text file, oh it looks like it's a unix text
file, let's convert it without _telling_ anyone".
l. |
|
|
| Back to top |
|
| Luke Kenneth Casson Leigh |
Posted: Sat Jan 17, 2004 7:02 am |
|
|
|
Guest
|
Mack <macckone@a_nospamjunk123_ol.com> wrote in message news:<l5qg00diqd1190jlqb18hus1isgldpvs45@4ax.com>...
Quote: On 16 Jan 2004 05:22:14 -0800, lkcl@lkcl.net (Luke Kenneth Casson
Leighton) wrote:
The tests give passes but the distribution of the p values are
skewed. There appears to be a problem with the way the
p values are calculated beyond the tails.
the people at csrc.nist.gov inform me that they used blum-blum-shub
as the "baseline" for the lempel-ziv test (i haven't asked them about
the other tests) and that they then took EMPIRICALLY OBSERVED values
for the mean and standard deviation of the information that generates
the p-values.
if they did the same on one or two other tests, it's possible that
they either didn't take a large enough pseudo-random sample from which
to derive the empirical mean and s.d., or that there is a problem with
the pseudo-random generator that they used.
either way, a skew of the p-values is, as you say, introduced.
basically i wanted to alert people that i've come across quite specific
problems with sts (1.5) and if you are developing an algorithm using
sts, which is incredibly good, you _really_ need to have a "true" source
of random data against which you should double-check the same sts test.
l. |
|
|
| Back to top |
|
| Cristiano |
Posted: Sat Jan 17, 2004 12:09 pm |
|
|
|
Guest
|
Luke Kenneth Casson Leighton wrote:
Quote: perhaps with cooperation we may both understand what is going on, and
thereby
improve the quality of the tests.
I'm not able to cooperate with one who says: "stay well clear of using this
test" without trying to understand where's the problem.
Cristiano |
|
|
| Back to top |
|
| Cristiano |
Posted: Sat Jan 17, 2004 12:09 pm |
|
|
|
Guest
|
Luke Kenneth Casson Leighton wrote:
Quote:
the people at csrc.nist.gov inform me that they used blum-blum-shub
as the "baseline" for the lempel-ziv test (i haven't asked them about
the other tests) and that they then took EMPIRICALLY OBSERVED values
for the mean and standard deviation of the information that generates
the p-values.
They have also used sha-1 based generator to get the mean and the variance.
They updated those values with:
mean = 69588.20190000
variance = 73.23726011
which are good enough.
Quote: if they did the same on one or two other tests, it's possible that
they either didn't take a large enough pseudo-random sample from
which to derive the empirical mean and s.d., or that there is a
problem with the pseudo-random generator that they used.
either way, a skew of the p-values is, as you say, introduced.
No skewed p-values introduced.
Are you able to calculate the mean and the variance by yourself without
whimper?
I done that for n = 10e6 bit and I use this sample size instead of 1e6.
Cristiano |
|
|
| Back to top |
|
| Douglas A. Gwyn |
Posted: Sun Jan 18, 2004 1:50 am |
|
|
|
Guest
|
Luke Kenneth Casson Leighton wrote:
Quote: no "implementations" are involved.
Obviously they must have meant implementation of the tests. |
|
|
| Back to top |
|
| Luke Kenneth Casson Leigh |
Posted: Sun Jan 18, 2004 9:35 am |
|
|
|
Guest
|
"Cristiano" <cristiano.pi@NSquipo.it> wrote in message news:<a%dOb.134287$_P.4856973@news4.tin.it>...
Quote: Luke Kenneth Casson Leighton wrote:
the people at csrc.nist.gov inform me that they used blum-blum-shub
as the "baseline" for the lempel-ziv test (i haven't asked them about
the other tests) and that they then took EMPIRICALLY OBSERVED values
for the mean and standard deviation of the information that generates
the p-values.
They have also used sha-1 based generator to get the mean and the variance.
They updated those values with:
mean = 69588.20190000
variance = 73.23726011
which are good enough.
try one of the two following things, which i will repeat for a second time:
1) increasing HIST_SIZE in src/results.c from 10 to 100: you will note
that the p-values fall into specific buckets and are NOT uniformly
distributed.
if that's not obvious enough, increase the HIST_SIZE to 1,000.
2) run at least 4,000 runs of 1e6 samples.
could someone else help by running a test and send [me/newsgroup] the
results?
i get histograms like this on 800 runs of 1e6 sample-length, using
the diehard cdrom data from november.
c1 c2 c3 c4 c5 c6 c7 c8 c9 c10 p-value
80 90 95 117 65 70 93 85 79 78 0.00011
the 117 is _very_ close to being unacceptable: running a 200mb or
500 mbyte test, the skew is the same and the p-value test of the
histogram comes out at 0.00000 - i.e. failure [non-uniform].
l. |
|
|
| Back to top |
|
| Luke Kenneth Casson Leigh |
Posted: Sun Jan 18, 2004 3:01 pm |
|
|
|
Guest
|
"Douglas A. Gwyn" <DAGwyn@null.net> wrote in message news:<dZqdnfuDK8Y4sZfd4p2dnA@comcast.com>...
Quote: Luke Kenneth Casson Leighton wrote:
no "implementations" are involved.
Obviously they must have meant implementation of the tests.
i'm not sure what he means: implementation of the tests, or
implementation of an algorithm that generates data that is tested
by sts.
or something else.
if he means implementation of the tests, then that's a non-issue:
it's the sts 1.5 source code, compiled under linux with gcc 3.3,
so i assumed that he meant the other possible interpretation.
if he means that other possible interpretation - implementation
of an algorithm that generates the data being tests - then i have
not _used_ any "implementation" to generate the data: i downloaded
it from the diehard site.
which is why it would be good to have independent reproducible
confirmation of what i have observed, and of what mr cristiano
has observed (disregarding the way he delivers his comments, and
coaxing him to provide sufficient repro information for his
findings).
aside from anything else, an independent and verifiable refutation
of what i have found would be extremely useful: i'm trying to
develop an algorithm and make sure it works. having a test suite
that doesn't work doesn't help!
l. |
|
|
| Back to top |
|
| Mack |
Posted: Mon Jan 19, 2004 7:10 am |
|
|
|
Guest
|
On 18 Jan 2004 06:35:13 -0800, lkcl@lkcl.net (Luke Kenneth Casson
Leighton) wrote:
Quote: "Cristiano" <cristiano.pi@NSquipo.it> wrote in message news:<a%dOb.134287$_P.4856973@news4.tin.it>...
Luke Kenneth Casson Leighton wrote:
the people at csrc.nist.gov inform me that they used blum-blum-shub
as the "baseline" for the lempel-ziv test (i haven't asked them about
the other tests) and that they then took EMPIRICALLY OBSERVED values
for the mean and standard deviation of the information that generates
the p-values.
They have also used sha-1 based generator to get the mean and the variance.
They updated those values with:
mean = 69588.20190000
variance = 73.23726011
which are good enough.
try one of the two following things, which i will repeat for a second time:
1) increasing HIST_SIZE in src/results.c from 10 to 100: you will note
that the p-values fall into specific buckets and are NOT uniformly
distributed.
if that's not obvious enough, increase the HIST_SIZE to 1,000.
2) run at least 4,000 runs of 1e6 samples.
could someone else help by running a test and send [me/newsgroup] the
results?
i get histograms like this on 800 runs of 1e6 sample-length, using
the diehard cdrom data from november.
c1 c2 c3 c4 c5 c6 c7 c8 c9 c10 p-value
80 90 95 117 65 70 93 85 79 78 0.00011
the 117 is _very_ close to being unacceptable: running a 200mb or
500 mbyte test, the skew is the same and the p-value test of the
histogram comes out at 0.00000 - i.e. failure [non-uniform].
l.
The following were generated by the BBS generator built in to the
STS suite. Each run was conducted independently. The data has
been condensed for easier reading. This was downloaded
from the NIST web site and compiled using MS VC 6.0 with
the latest updates. The percentage of passes on each
test is acceptable. The distribution of the p-values is not.
There is a problem with the distribution of the p-values for these
tests. I will be running additional test. I did not test 1e7 due to
time constraints (16+ hours of run time).
The igamc error encountered is not adequately documented.
It is an error in the incomplete gamma function but no further
description is given.
------------------------------------------------------------------------------
RESULTS FOR THE UNIFORMITY OF P-VALUES AND THE PROPORTION OF PASSING
SEQUENCES
------------------------------------------------------------------------------
1e6 x 1000
------------------------------------------------------------------------------
C1 C2 C3 C4 C5 C6 C7 C8 C9 C10 P-VALUE PROPORTION
STATISTICAL TEST
------------------------------------------------------------------------------
107 92 92 88 109 83 112 95 108 114 0.262249 0.9890 Rank
93 99 108 124 105 107 94 101 78 91 0.134944 0.9970 FFT
109 79 108 136 90 85 87 120 101 85 0.000435 0.9900
Lempel-Ziv
- - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - - -
- - - - -
1e6 x 100
------------------------------------------------------------------------------
C1 C2 C3 C4 C5 C6 C7 C8 C9 C10 P-VALUE PROPORTION
STATISTICAL TEST
------------------------------------------------------------------------------
7 11 8 6 9 12 10 15 15 7 0.401199 1.0000 Rank
8 10 9 13 13 3 15 7 11 11 0.289667 1.0000 FFT
20 14 14 9 5 7 3 9 12 7 0.006196 0.9700
Lempel-Ziv
- - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - - -
- - - - -
1e5 x 100
------------------------------------------------------------------------------
C1 C2 C3 C4 C5 C6 C7 C8 C9 C10 P-VALUE PROPORTION
STATISTICAL TEST
------------------------------------------------------------------------------
9 6 8 7 7 13 12 14 14 10 0.494392 1.0000 Rank
4 8 6 9 6 14 16 13 10 14 0.090936 1.0000 FFT
10 5 12 7 14 17 5 5 11 14 0.048716 0.9900
Lempel-Ziv
- - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - - -
- - - - -
1e5 x 1000
------------------------------------------------------------------------------
C1 C2 C3 C4 C5 C6 C7 C8 C9 C10 P-VALUE PROPORTION
STATISTICAL TEST
------------------------------------------------------------------------------
90 129 82 99 102 91 108 98 106 95 0.096578 0.9940 Rank
23 71 74 76 129 131 131 120 121 124 0.000000 * 1.0000 FFT
134 69 87 84 106 118 87 87 76 152 0.000000 * 0.9850
Lempel-Ziv
- - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - - -
- - - - -
1e4 x 1000
------------------------------------------------------------------------------
C1 C2 C3 C4 C5 C6 C7 C8 C9 C10 P-VALUE PROPORTION
STATISTICAL TEST
------------------------------------------------------------------------------
86 133 44 146 197 55 50 73 107 109 0.000000 * 0.9920 Rank
20 70 97 84 77 91 205 128 123 105 0.000000 * 0.9990 FFT
0 0 0 0 0 0 0 0 0 1000 0.000000 * 1.0000
Lempel-Ziv
igamc underflow error occurs for Lempel-Ziv
- - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - - -
- - - - -
The minimum pass rate for each statistical test with the exception of
the random
excursion (variant) test is approximately = 0.980561 for a sample size
= 1000
binary sequences.
The minimum pass rate for each statistical test with the exception of
the random
excursion (variant) test is approximately = 0.960150 for a sample size
= 100
binary sequences.
For further guidelines construct a probability table using the MAPLE
program
provided in the addendum section of the documentation.
- - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - - -
- - - - -
Leslie 'Mack' McBride
remove text between _ marks to respond via e-mail |
|
|
| Back to top |
|
| Mack |
Posted: Mon Jan 19, 2004 7:27 am |
|
|
|
Guest
|
On Sat, 17 Jan 2004 17:09:26 GMT, "Cristiano"
<cristiano.pi@NSquipo.it> wrote:
Quote: Luke Kenneth Casson Leighton wrote:
the people at csrc.nist.gov inform me that they used blum-blum-shub
as the "baseline" for the lempel-ziv test (i haven't asked them about
the other tests) and that they then took EMPIRICALLY OBSERVED values
for the mean and standard deviation of the information that generates
the p-values.
They have also used sha-1 based generator to get the mean and the variance.
They updated those values with:
mean = 69588.20190000
variance = 73.23726011
which are good enough.
if they did the same on one or two other tests, it's possible that
they either didn't take a large enough pseudo-random sample from
which to derive the empirical mean and s.d., or that there is a
problem with the pseudo-random generator that they used.
either way, a skew of the p-values is, as you say, introduced.
No skewed p-values introduced.
This is easily testable. The p-values are not uniformly distributed.
There is skew. I have posted examples in a seperate message.
Quote: Are you able to calculate the mean and the variance by yourself without
whimper?
I done that for n = 10e6 bit and I use this sample size instead of 1e6.
The sizes in question are 1e6 and below. I have not yet tested at
larger sample sizes. Lempel-Ziv performs badly at or below 1e6,
the rank test performs badly at or below 1e4, the FFT performs
badly at or below 1e5. There is a problem. The correct
percentages pass the tests but the p-values are not uniform.
Until the exact nature of the problem is identified these tests should
be viewed with extreme suspicion. The Lempel-Ziv test in particular
may not be correct.
Further testing is needed.
Quote:
Cristiano
Leslie 'Mack' McBride
remove text between _ marks to respond via e-mail |
|
|
| Back to top |
|
| |
Page 1 of 3 Goto page 1, 2, 3 Next
All times are GMT - 5 Hours
The time now is Mon Sep 08, 2008 11:23 am
|
|