Main Page | Report this Page
 
   
Science Forum Index  »  Statistics - Math Forum  »  Quick question...
Page 1 of 1    
Author Message
...
Posted: Mon Jul 14, 2008 7:43 am
Guest
Can someone give me a rule that tells me when to use n trials in the
denominator when determining variance and when to use n-1?

Openoffice calc (spreadsheet program) appears to use n-1, and yet some
of the references I've been studying use n. Obviously, the smaller
denominator will give a larger variance and standard deviation, so
clarification on this would be greatly appreciated.

Thanks in advance.

Henry Sun
Ray Koopman...
Posted: Mon Jul 14, 2008 8:17 am
Guest
On Jul 14, 10:43 am, henrysun... at (no spam) yahoo.com wrote:
Quote:
Can someone give me a rule that tells me when to use n trials in the
denominator when determining variance and when to use n-1?

Openoffice calc (spreadsheet program) appears to use n-1, and yet some
of the references I've been studying use n. Obviously, the smaller
denominator will give a larger variance and standard deviation, so
clarification on this would be greatly appreciated.

Thanks in advance.

Henry Sun

It depends on what you want to talk about. If you divide by n you get
the variance of the data. If you divide by n-1 you get an estimate of
the variance of the population from which the data were drawn. So n
gets you a description of the data in hand, whereas n-1 gets you an
inference about the population that the data came from.
Jack Tomsky...
Posted: Mon Jul 14, 2008 8:40 am
Guest
Quote:
On Jul 14, 10:43 am, henrysun... at (no spam) yahoo.com wrote:
Can someone give me a rule that tells me when to
use n trials in the
denominator when determining variance and when to
use n-1?

Openoffice calc (spreadsheet program) appears to
use n-1, and yet some
of the references I've been studying use n.
Obviously, the smaller
denominator will give a larger variance and
standard deviation, so
clarification on this would be greatly appreciated.

Thanks in advance.

Henry Sun

It depends on what you want to talk about. If you
divide by n you get
the variance of the data. If you divide by n-1 you
get an estimate of
the variance of the population from which the data
were drawn. So n
gets you a description of the data in hand, whereas
n-1 gets you an
inference about the population that the data came
from.


And in another twist, you divide by n+1 if you want the minimum mean-squared error estimate of the population variance.

Jack
...
Posted: Mon Jul 14, 2008 8:42 am
Guest
On Jul 14, 11:17 am, Ray Koopman <koop... at (no spam) sfu.ca> wrote:
Quote:
On Jul 14, 10:43 am, henrysun... at (no spam) yahoo.com wrote:

Can someone give me a rule that tells me when to use n trials in the
denominator when determining variance and when to use n-1?

Openoffice calc (spreadsheet program) appears to use n-1, and yet some
of the references I've been studying use n.  Obviously, the smaller
denominator will give a larger variance and standard deviation, so
clarification on this would be greatly appreciated.

Thanks in advance.

Henry Sun

It depends on what you want to talk about. If you divide by n you get
the variance of the data. If you divide by n-1 you get an estimate of
the variance of the population from which the data were drawn. So n
gets you a description of the data in hand, whereas n-1 gets you an
inference about the population that the data came from.

Thanks for the clarification.
Ray Koopman...
Posted: Mon Jul 14, 2008 2:22 pm
Guest
On Jul 14, 11:40 am, Jack Tomsky <jtom... at (no spam) ix.netcom.com> wrote:
Quote:
[...] And in another twist, you divide by n+1 if you want the
minimum mean-squared error estimate of the population variance.

That's for a mesokurtic distribution, with mu4/m2^2 = 3.
In general, the divisor that minimizes the expected squared error
is n + k-2 - (k-3)/n, where k = mu4/m2^2.
Jack Tomsky...
Posted: Mon Jul 14, 2008 2:37 pm
Guest
Quote:
On Jul 14, 11:40 am, Jack Tomsky
jtom... at (no spam) ix.netcom.com> wrote:
[...] And in another twist, you divide by n+1 if
you want the
minimum mean-squared error estimate of the
population variance.

That's for a mesokurtic distribution, with mu4/m2^2 =
3.
In general, the divisor that minimizes the expected
squared error
is n + k-2 - (k-3)/n, where k = mu4/m2^2.



Ray, that's an interesting result, generalizing it beyond the normal distribution where the kurtosis is 3.

Jack
Ray Koopman...
Posted: Mon Jul 14, 2008 6:10 pm
Guest
On Jul 14, 5:37 pm, Jack Tomsky <jtom... at (no spam) ix.netcom.com> wrote:
Quote:
On Jul 14, 11:40 am, Jack Tomsky
jtom... at (no spam) ix.netcom.com> wrote:
[...] And in another twist, you divide by n+1 if you
want the minimum mean-squared error estimate of the
population variance.

That's for a mesokurtic distribution, with mu4/m2^2 = 3.
In general, the divisor that minimizes the expected
squared error is n + k-2 - (k-3)/n, where k = mu4/m2^2.

Ray, that's an interesting result, generalizing it
beyond the normal distribution where the kurtosis is 3.

Jack

Thanks, Jack. I'm glad you could see through the typos and
incompleteness. For the record, "m2" should have been "mu2",
with mu2 & mu4 being the 2nd & 4th central moments.
 
Page 1 of 1       All times are GMT - 5 Hours
The time now is Sat Nov 22, 2008 3:52 pm