| |
 |
|
|
Science Forum Index » Fractals Science Forum » Box-counting dimension of random and Gaussian data points?
Page 1 of 1
|
| Author |
Message |
| Ross Evans |
Posted: Tue Dec 07, 2004 8:45 pm |
|
|
|
Guest
|
I need help understanding something.
The box-counting dimension of a set of random data points -- random in the
sense that they are white noise, equally likely to occur anywhere within an
interval -- is theoretically 2. That is computed by adding 1 to the slope
of the log/log plot of the single-variable data points, counted at various
sizes of box measurent..
That works out empirically for me. My measured slope of such a set of data
points is very close to 1.
But what about data points that are distributed not as such white noise, but
as Gaussian-distributed data centered around a mean. These data points
visually do not fill the space as completely, because by definition they
tend to be clustered around the mean. And when I measure the slope of such
a set of points, it is only about .92.
But the literature generally seems to expect a box-counting dimension of 2
for Gaussian data, not just white noise. I don't get it. |
|
|
| Back to top |
|
| Roger Bagula |
Posted: Wed Dec 08, 2004 11:19 am |
|
|
|
Guest
|
Dear Ross Evans,
Are you measuring a time series as:
s[n]->{s[1],s[2], s[3],...,s[n]}
Or
{r[n],s[n]},-> {{r[1],s[1]},{r[2],s[2]},{r[3],s[3]},...,{r[n],s[n]}}
The first will be one dimensional and the second should be two dimensional.
How are you producing your white noise/ normal variates?
Different randomization method of producing your distribution will
produce different
results.
Ross Evans wrote:
Quote: I need help understanding something.
The box-counting dimension of a set of random data points -- random in the
sense that they are white noise, equally likely to occur anywhere within an
interval -- is theoretically 2. That is computed by adding 1 to the slope
of the log/log plot of the single-variable data points, counted at various
sizes of box measurent..
That works out empirically for me. My measured slope of such a set of data
points is very close to 1.
But what about data points that are distributed not as such white noise, but
as Gaussian-distributed data centered around a mean. These data points
visually do not fill the space as completely, because by definition they
tend to be clustered around the mean. And when I measure the slope of such
a set of points, it is only about .92.
But the literature generally seems to expect a box-counting dimension of 2
for Gaussian data, not just white noise. I don't get it.
--
Respectfully, Roger L. Bagula
tftn@earthlink.net, 11759Waterhill Road, Lakeside,Ca 92040-2905,tel: 619-5610814 :
alternative email: rlbtftn@netscape.net
URL : http://home.earthlink.net/~tftn |
|
|
| Back to top |
|
| Ross Evans |
Posted: Wed Dec 08, 2004 9:45 pm |
|
|
|
Guest
|
My white-noise data points were actually one-dimensional, but not a time
series. The white noise points were downloaded from random.org, not created
with a PRNG. For the Gaussian set, I tried several methods: One method
simply added n of these white-noise points and divided by n for each
Gaussian point.. Another method used two of these uniformly random points
at a time and transformed them into Gaussian values using the Box-Muller
algorithm.
Either way, the resulting set of Gaussian values produced slopes in the
range of .92 when run through a box-counting program.
I really do not think my difficulty lies in any artifact of my Gaussian
sets. Rather, there is a fundamental difference between uniform white noise
and Gaussian brown noise. I understand how the former produces a
box-counting slope of 1. The latter does not, because the space is
inherently more sparse in some areas.
With the white noise, as the box size gets smaller, the slope starts at
exactly unity until the boxes are small and numerous enough to produce some
empty boxes. With the Gaussian noise with the same number of data points,
the empty boxes start to occur sooner, with larger box intervals. So the
box-counting slope is lower.
I have not yet tried a cloud of x,y points, but it seems obvious that I
would get a similar result as for a set of points in one dimension.
"Roger Bagula" <tftn@earthlink.net> wrote in message
news:41B72988.4060200@earthlink.net...
Quote: Dear Ross Evans,
Are you measuring a time series as:
s[n]->{s[1],s[2], s[3],...,s[n]}
Or
{r[n],s[n]},-> {{r[1],s[1]},{r[2],s[2]},{r[3],s[3]},...,{r[n],s[n]}}
The first will be one dimensional and the second should be two
dimensional.
How are you producing your white noise/ normal variates?
Different randomization method of producing your distribution will
produce different
results.
Ross Evans wrote:
I need help understanding something.
The box-counting dimension of a set of random data points -- random in
the
sense that they are white noise, equally likely to occur anywhere within
an
interval -- is theoretically 2. That is computed by adding 1 to the
slope
of the log/log plot of the single-variable data points, counted at
various
sizes of box measurent..
That works out empirically for me. My measured slope of such a set of
data
points is very close to 1.
But what about data points that are distributed not as such white noise,
but
as Gaussian-distributed data centered around a mean. These data points
visually do not fill the space as completely, because by definition they
tend to be clustered around the mean. And when I measure the slope of
such
a set of points, it is only about .92.
But the literature generally seems to expect a box-counting dimension of
2
for Gaussian data, not just white noise. I don't get it.
--
Respectfully, Roger L. Bagula
tftn@earthlink.net, 11759Waterhill Road, Lakeside,Ca 92040-2905,tel:
619-5610814 :
alternative email: rlbtftn@netscape.net
URL : http://home.earthlink.net/~tftn
|
|
|
| Back to top |
|
| Roger Bagula |
Posted: Thu Dec 09, 2004 9:58 am |
|
|
|
Guest
|
Dear Ross Evans,
I have posted another way that is used and abused
for finding dimension in sets like this.
The correlation dimension can be applied to such cases as well.
The polar normal noise plotted in two dimensions is much sparser
as you get away from the origin at (0,0) , but is still considered to be
a space filling
curve. In IFS we have space fills that are nonlinear that behave this
way as well.
A finite time snap shot like a picture made by a computer
will give a different answer than the asymptotic limiting value.
Here what you do to determine that. You make a number of pictures
at intervals of time/ numbers of points ( maybe powers of ten).
You do box counting dimension on them all.
You plot them as number of points against the resulting box counting
dimension
and see where the curve tends to for "infinite"
number of points.
It is very difficult to get an IFS for a disk space filling curve that
is "uniform".
Ross Evans wrote:
Quote: My white-noise data points were actually one-dimensional, but not a time
series. The white noise points were downloaded from random.org, not created
with a PRNG. For the Gaussian set, I tried several methods: One method
simply added n of these white-noise points and divided by n for each
Gaussian point.. Another method used two of these uniformly random points
at a time and transformed them into Gaussian values using the Box-Muller
algorithm.
Either way, the resulting set of Gaussian values produced slopes in the
range of .92 when run through a box-counting program.
I really do not think my difficulty lies in any artifact of my Gaussian
sets. Rather, there is a fundamental difference between uniform white noise
and Gaussian brown noise. I understand how the former produces a
box-counting slope of 1. The latter does not, because the space is
inherently more sparse in some areas.
With the white noise, as the box size gets smaller, the slope starts at
exactly unity until the boxes are small and numerous enough to produce some
empty boxes. With the Gaussian noise with the same number of data points,
the empty boxes start to occur sooner, with larger box intervals. So the
box-counting slope is lower.
I have not yet tried a cloud of x,y points, but it seems obvious that I
would get a similar result as for a set of points in one dimension.
"Roger Bagula" <tftn@earthlink.net> wrote in message
news:41B72988.4060200@earthlink.net...
Dear Ross Evans,
Are you measuring a time series as:
s[n]->{s[1],s[2], s[3],...,s[n]}
Or
{r[n],s[n]},-> {{r[1],s[1]},{r[2],s[2]},{r[3],s[3]},...,{r[n],s[n]}}
The first will be one dimensional and the second should be two
dimensional.
How are you producing your white noise/ normal variates?
Different randomization method of producing your distribution will
produce different
results.
Ross Evans wrote:
I need help understanding something.
The box-counting dimension of a set of random data points -- random in
the
sense that they are white noise, equally likely to occur anywhere within
an
interval -- is theoretically 2. That is computed by adding 1 to the
slope
of the log/log plot of the single-variable data points, counted at
various
sizes of box measurent..
That works out empirically for me. My measured slope of such a set of
data
points is very close to 1.
But what about data points that are distributed not as such white noise,
but
as Gaussian-distributed data centered around a mean. These data points
visually do not fill the space as completely, because by definition they
tend to be clustered around the mean. And when I measure the slope of
such
a set of points, it is only about .92.
But the literature generally seems to expect a box-counting dimension of
2
for Gaussian data, not just white noise. I don't get it.
--
Respectfully, Roger L. Bagula
tftn@earthlink.net, 11759Waterhill Road, Lakeside,Ca 92040-2905,tel:
619-5610814 :
alternative email: rlbtftn@netscape.net
URL : http://home.earthlink.net/~tftn
--
Respectfully, Roger L. Bagula
tftn@earthlink.net, 11759Waterhill Road, Lakeside,Ca 92040-2905,tel: 619-5610814 :
alternative email: rlbtftn@netscape.net
URL : http://home.earthlink.net/~tftn |
|
|
| Back to top |
|
| Roger Bagula |
Posted: Thu Dec 09, 2004 12:05 pm |
|
|
|
Guest
|
It took me 4 years after getting my first
disk space fill to get a "uniform" space filling of a disk.
The polar normal type of Gaussian normal 2d is
very like this problem , because the "diffusion" algorithm
is not a uniform probability algorithm.
Diffusion in circularly curved space goes as the square of the distance
( how light dims
as the square of the distance). So you have another measure probability
entering in.
To get it uniform you have to alter how the radial variable is handled.
All Gaussian normal type of noise has this curvature built in.
It is still "space filling" or d=2 in two dimensions,
but only as the limiting case of infinite time
when maximum entropy is achieved.
True basic:
SET MODE "color"
SET WINDOW 0,1920,0,1024
SET BACKGROUND COLOR "white"
LET x=.2
LET y=.3
LET a=0
LET b =0
LET s1=1600/2
LET s2 =s1*1024/1920
RANDOMIZE
LET z=rnd
PRINT"Circle Cantor set I.F.S. space filling disk 2 transforms"
Print" square root radial probabulity added"
PRINT"BY R.L.BAGULA 11 June 2004 ©"
let r=0.1
FOR n= 1 TO 1000000
LET a =RND
LET b=RND
IF a<= 1/2 AND a>0 THEN
LET z1=z/2
LET r=SQR(2*a)
SET COLOR "black"
END IF
IF a<= 1 AND a>1/2 THEN
LET z1=z/2-1/2
LET r=SQR(2*a-1)
SET COLOR "red"
END IF
LET z=z1
LET x=r*cos(z*2*Pi)
LET y=r*sin(z*2*Pi)
IF n>10 THEN PLOT 1920/2+s1*x+50,1024/2+s2*y
NEXT n
END
Roger Bagula wrote:
Quote: Dear Ross Evans,
I have posted another way that is used and abused
for finding dimension in sets like this.
The correlation dimension can be applied to such cases as well.
The polar normal noise plotted in two dimensions is much sparser
as you get away from the origin at (0,0) , but is still considered to
be a space filling
curve. In IFS we have space fills that are nonlinear that behave this
way as well.
A finite time snap shot like a picture made by a computer
will give a different answer than the asymptotic limiting value.
Here what you do to determine that. You make a number of pictures
at intervals of time/ numbers of points ( maybe powers of ten).
You do box counting dimension on them all.
You plot them as number of points against the resulting box counting
dimension
and see where the curve tends to for "infinite"
number of points.
It is very difficult to get an IFS for a disk space filling curve that
is "uniform".
Ross Evans wrote:
My white-noise data points were actually one-dimensional, but not a time
series. The white noise points were downloaded from random.org, not
created
with a PRNG. For the Gaussian set, I tried several methods: One method
simply added n of these white-noise points and divided by n for each
Gaussian point.. Another method used two of these uniformly random
points
at a time and transformed them into Gaussian values using the Box-Muller
algorithm.
Either way, the resulting set of Gaussian values produced slopes in the
range of .92 when run through a box-counting program.
I really do not think my difficulty lies in any artifact of my Gaussian
sets. Rather, there is a fundamental difference between uniform
white noise
and Gaussian brown noise. I understand how the former produces a
box-counting slope of 1. The latter does not, because the space is
inherently more sparse in some areas.
With the white noise, as the box size gets smaller, the slope starts at
exactly unity until the boxes are small and numerous enough to
produce some
empty boxes. With the Gaussian noise with the same number of data
points,
the empty boxes start to occur sooner, with larger box intervals. So the
box-counting slope is lower.
I have not yet tried a cloud of x,y points, but it seems obvious that I
would get a similar result as for a set of points in one dimension.
"Roger Bagula" <tftn@earthlink.net> wrote in message
news:41B72988.4060200@earthlink.net...
Dear Ross Evans,
Are you measuring a time series as:
s[n]->{s[1],s[2], s[3],...,s[n]}
Or
{r[n],s[n]},-> {{r[1],s[1]},{r[2],s[2]},{r[3],s[3]},...,{r[n],s[n]}}
The first will be one dimensional and the second should be two
dimensional.
How are you producing your white noise/ normal variates?
Different randomization method of producing your distribution will
produce different
results.
Ross Evans wrote:
I need help understanding something.
The box-counting dimension of a set of random data points -- random in
the
sense that they are white noise, equally likely to occur anywhere
within
an
interval -- is theoretically 2. That is computed by adding 1 to the
slope
of the log/log plot of the single-variable data points, counted at
various
sizes of box measurent..
That works out empirically for me. My measured slope of such a set of
data
points is very close to 1.
But what about data points that are distributed not as such white
noise,
but
as Gaussian-distributed data centered around a mean. These data points
visually do not fill the space as completely, because by definition
they
tend to be clustered around the mean. And when I measure the slope of
such
a set of points, it is only about .92.
But the literature generally seems to expect a box-counting
dimension of
2
for Gaussian data, not just white noise. I don't get it.
--
Respectfully, Roger L. Bagula
tftn@earthlink.net, 11759Waterhill Road, Lakeside,Ca 92040-2905,tel:
619-5610814 :
alternative email: rlbtftn@netscape.net
URL : http://home.earthlink.net/~tftn
--
Respectfully, Roger L. Bagula
tftn@earthlink.net, 11759Waterhill Road, Lakeside,Ca 92040-2905,tel: 619-5610814 :
alternative email: rlbtftn@netscape.net
URL : http://home.earthlink.net/~tftn |
|
|
| Back to top |
|
| Roger Bagula |
Posted: Thu Dec 09, 2004 12:23 pm |
|
|
|
Guest
|
Let me say that this is a good quest
and the answer in not obvious
and may well be not all that certain, but
it is what the Einstein theory of Brownian motion tells us.
A French economist actually discovered this diffusion law before
Einstein, but
he is the one who applied it best to physics.
It all ties in with his ideas about curvature of space time as well.
To give you a counter idea: suppose that light got dimmer as the cube of
the distance
instead of as the square; that is the "measure" for space was:
m3(x,y,z)=(x^3+y^3+z^3)^(1/3)
instead of
m2(x,y,z)=(x^2+y^2+z^2)^(1/2)
The result that the diffusion of Brownian motion would
be much slower and be governed by a different set of differential
equations altogether.
The limiting Femat type nth measure is the Hausdorff space measure:
Mhd=Max[x]+Max[y]+Max[z]
Which is a 3d space filling measure.
If we add in an non-Euclidean time in a negative hyperbolic sense:
Mhd4d=Max[x]+Max[y]+Max[z]-c*Max[t]
You see the time probability and the curvature of space is "intrinsic"
in the diffusion measures that Brownian motion is based on.
Ross Evans wrote:
Quote: I need help understanding something.
The box-counting dimension of a set of random data points -- random in the
sense that they are white noise, equally likely to occur anywhere within an
interval -- is theoretically 2. That is computed by adding 1 to the slope
of the log/log plot of the single-variable data points, counted at various
sizes of box measurent..
That works out empirically for me. My measured slope of such a set of data
points is very close to 1.
But what about data points that are distributed not as such white noise, but
as Gaussian-distributed data centered around a mean. These data points
visually do not fill the space as completely, because by definition they
tend to be clustered around the mean. And when I measure the slope of such
a set of points, it is only about .92.
But the literature generally seems to expect a box-counting dimension of 2
for Gaussian data, not just white noise. I don't get it.
--
Respectfully, Roger L. Bagula
tftn@earthlink.net, 11759Waterhill Road, Lakeside,Ca 92040-2905,tel: 619-5610814 :
alternative email: rlbtftn@netscape.net
URL : http://home.earthlink.net/~tftn |
|
|
| Back to top |
|
| Roger Bagula |
Posted: Thu Dec 09, 2004 12:41 pm |
|
|
|
Guest
|
Looking it up in a very old noise theory book the
diffusion of a particle in a uniform field ( gravitation) is:
dPhi/dt=Sum[beta(i)*dPhi/dx(i)+D(i)*d^2Phi/dx(i)^2,{i,1,3}]
You notice that the derivative level is the second derivative.
Page 299 of Selected Papers Noise and Stochastic Processes, edited by
Nelson Wax, Dover books, 1954
Norbert Weiner and Benoit Mandelbrot and a professor name Ito and his
calculus have
modified the general theory some since then, but not
the basis of curvature, diffusion and time.
Roger Bagula wrote:
Quote: Let me say that this is a good quest
and the answer in not obvious
and may well be not all that certain, but
it is what the Einstein theory of Brownian motion tells us.
A French economist actually discovered this diffusion law before
Einstein, but
he is the one who applied it best to physics.
It all ties in with his ideas about curvature of space time as well.
To give you a counter idea: suppose that light got dimmer as the cube
of the distance
instead of as the square; that is the "measure" for space was:
m3(x,y,z)=(x^3+y^3+z^3)^(1/3)
instead of
m2(x,y,z)=(x^2+y^2+z^2)^(1/2)
The result that the diffusion of Brownian motion would
be much slower and be governed by a different set of differential
equations altogether.
The limiting Femat type nth measure is the Hausdorff space measure:
Mhd=Max[x]+Max[y]+Max[z]
Which is a 3d space filling measure.
If we add in an non-Euclidean time in a negative hyperbolic sense:
Mhd4d=Max[x]+Max[y]+Max[z]-c*Max[t]
You see the time probability and the curvature of space is "intrinsic"
in the diffusion measures that Brownian motion is based on.
Ross Evans wrote:
I need help understanding something.
The box-counting dimension of a set of random data points -- random
in the
sense that they are white noise, equally likely to occur anywhere
within an
interval -- is theoretically 2. That is computed by adding 1 to the
slope
of the log/log plot of the single-variable data points, counted at
various
sizes of box measurent..
That works out empirically for me. My measured slope of such a set
of data
points is very close to 1.
But what about data points that are distributed not as such white
noise, but
as Gaussian-distributed data centered around a mean. These data points
visually do not fill the space as completely, because by definition they
tend to be clustered around the mean. And when I measure the slope
of such
a set of points, it is only about .92.
But the literature generally seems to expect a box-counting dimension
of 2
for Gaussian data, not just white noise. I don't get it.
--
Respectfully, Roger L. Bagula
tftn@earthlink.net, 11759Waterhill Road, Lakeside,Ca 92040-2905,tel: 619-5610814 :
alternative email: rlbtftn@netscape.net
URL : http://home.earthlink.net/~tftn |
|
|
| Back to top |
|
| Tony Roberts |
Posted: Thu Dec 09, 2004 6:38 pm |
|
|
|
Guest
|
You have not commented on my reply,
Tony
Tony Roberts wrote:
Quote: Firstly, I presume you are distributing the points in the plane.
Generically, uniformly random points in n dimensions should have
dimension n.
Secondly, yes indeed a Gaussian cloud of points in n dimensions should
have dimension n.
Thirdly, well the previous statement only holds for some scales. You
must understand that a fractal dimension ONLY applies over a finite
(usually) range of length scales. Suppose your Gaussian cloud of k
points is distrubuted with standard deviation 1:
* on large lengths scales (say >10), the entire cloud will look like
just one point as they are all located at near ly the same pplace, thus
the cloud looks like it has dimension 1
* on medium length scales (say <2 and > k^(1/n)), the cloud of points
looks as if it fills up the space in a fuzz of points and som has
dimension n
* on really small length scales (say <k^(/n)) you will perceive the
points as individuals and the set of points will again have dimension 0.
The right dimension depends upon the length scale at which you want to
interact with the object.
Fourthly, the box counting dimension is a poor methodology which mixes
up the dimension of the middle regime with that of the surrounding
regimes. Thus the box counting dimension (in this example) will be a
little low.
Tony
Ross Evans wrote:
I need help understanding something.
The box-counting dimension of a set of random data points -- random in
the
sense that they are white noise, equally likely to occur anywhere
within an
interval -- is theoretically 2. That is computed by adding 1 to the
slope
of the log/log plot of the single-variable data points, counted at
various
sizes of box measurent..
That works out empirically for me. My measured slope of such a set of
data
points is very close to 1.
But what about data points that are distributed not as such white
noise, but
as Gaussian-distributed data centered around a mean. These data points
visually do not fill the space as completely, because by definition they
tend to be clustered around the mean. And when I measure the slope of
such
a set of points, it is only about .92.
But the literature generally seems to expect a box-counting dimension
of 2
for Gaussian data, not just white noise. I don't get it.
|
|
|
| Back to top |
|
| Ross Evans |
Posted: Thu Dec 09, 2004 10:38 pm |
|
|
|
Guest
|
I don't understand what you mean by choosing the appropriate scale for the
dimension. I thought the way the box-counting method works is to measure
the slope of a near-linear plot of measurements across a wide range of
scales.
So far I have not been measuring a cloud of points in a 2-dimensional space,
but a dust of points in a 1-dimensional space. But similar results would
occur across two dimensions. I don't see how the method could measure a
dimension of 2 for a Gaussian cloud just as I can't measure 1 for a Gaussian
dust.
With a uniformly distributed dust, the box-counting method yields a perfect
slope of 1 until the boxes get small enough to force some boxes to be empty
between points out of pure sparseness. Then the slope rapidly approaches 0
as the size of the boxes is halved, which value it is guaranteed to reach
once the number of filled boxes reaches n. One of the intermediate ranges
does not seem to be the "correct" range for the distribution, because the
dimension measured at the intermediate scale -- even for the uniformly
random set -- is less than 1.
Please understand I am not arguing. I am just stating what my understanding
is. But I don't think my undertanding is complete because I can't seem to
make the method work on empirical Gaussian data.
"Tony Roberts" <aroberts@usq.edu.au> wrote in message
news:41B8E203.3020708@usq.edu.au...
Quote: You have not commented on my reply,
Tony
Tony Roberts wrote:
Firstly, I presume you are distributing the points in the plane.
Generically, uniformly random points in n dimensions should have
dimension n.
Secondly, yes indeed a Gaussian cloud of points in n dimensions should
have dimension n.
Thirdly, well the previous statement only holds for some scales. You
must understand that a fractal dimension ONLY applies over a finite
(usually) range of length scales. Suppose your Gaussian cloud of k
points is distrubuted with standard deviation 1:
* on large lengths scales (say >10), the entire cloud will look like
just one point as they are all located at near ly the same pplace, thus
the cloud looks like it has dimension 1
* on medium length scales (say <2 and > k^(1/n)), the cloud of points
looks as if it fills up the space in a fuzz of points and som has
dimension n
* on really small length scales (say <k^(/n)) you will perceive the
points as individuals and the set of points will again have dimension 0.
The right dimension depends upon the length scale at which you want to
interact with the object.
Fourthly, the box counting dimension is a poor methodology which mixes
up the dimension of the middle regime with that of the surrounding
regimes. Thus the box counting dimension (in this example) will be a
little low.
Tony
Ross Evans wrote:
I need help understanding something.
The box-counting dimension of a set of random data points -- random in
the
sense that they are white noise, equally likely to occur anywhere
within an
interval -- is theoretically 2. That is computed by adding 1 to the
slope
of the log/log plot of the single-variable data points, counted at
various
sizes of box measurent..
That works out empirically for me. My measured slope of such a set of
data
points is very close to 1.
But what about data points that are distributed not as such white
noise, but
as Gaussian-distributed data centered around a mean. These data points
visually do not fill the space as completely, because by definition
they
tend to be clustered around the mean. And when I measure the slope of
such
a set of points, it is only about .92.
But the literature generally seems to expect a box-counting dimension
of 2
for Gaussian data, not just white noise. I don't get it.
|
|
|
| Back to top |
|
| Tony Roberts |
Posted: Mon Dec 13, 2004 5:43 pm |
|
|
|
Guest
|
Ross Evans wrote:
Quote: I don't understand what you mean by choosing the appropriate scale for the
dimension. I thought the way the box-counting method works is to measure
the slope of a near-linear plot of measurements across a wide range of
scales.
Yes indeed, that is why I said "appropriate RANGE of scales".
I npractise "wide range of scales" usually means only a decade, more if
you are lucky.
Quote: With a uniformly distributed dust, the box-counting method yields a perfect
slope of 1 until the boxes get small enough to force some boxes to be empty
between points out of pure sparseness. Then the slope rapidly approaches 0
as the size of the boxes is halved, which value it is guaranteed to reach
once the number of filled boxes reaches n.
Exactly my contention: on these small lengths scales the box cunting
algorithm detects that your dataset is just a finite set of points and
hence has zero dimension.
Quote: One of the intermediate ranges
does not seem to be the "correct" range for the distribution, because the
dimension measured at the intermediate scale -- even for the uniformly
random set -- is less than 1.
Now realise that the Gaussian has tails of density: there is a high
density core and the low density on either side. In the low density
areas, your data poits are further apart on average than they are in the
high denisity core. Thus your box counting algorithm will detect the
zero-dimensinal pointness of your data in the low density tails at a
significantly larger length scale than it detects it in the core. Thus
the curve at these larger length scales is "contaminated" by the zero
dimension being detected in the low denisty regions of the Gaussian.
A uniform distribution does not have low density tails and so does not
suffer from this defect.
Hmmm. I wonder if measuring fractal dimension could be improved with
some techniques from Importance Sampling?
Tony |
|
|
| Back to top |
|
| |
|
Page 1 of 1
All times are GMT - 5 Hours
The time now is Fri Dec 05, 2008 5:56 am
|
|