Main Page | Report this Page
 
   
Science Forum Index  »  Statistics - Math Forum  »  formulae of confidence interval function of sample size
Page 1 of 1    
Author Message
Beorne
Posted: Fri Apr 11, 2008 4:55 am
Guest
Having a total population of n we check for abnormalities ('yes' or
'no') on a sample of size s (< n).
We found that x of the s samples are abnormal ('yes')
What is the formulae that connects the precision of the x/s estimate
of probability p of abnormalities to the sample/population ratio s/n ?

Thank you very much.
Beorne
Posted: Fri Apr 11, 2008 5:00 am
Guest
On Apr 11, 4:55 pm, Beorne <matteo...@gmail.com> wrote:
Quote:
Having a total population of n we check for abnormalities ('yes' or
'no') on a sample of size s (< n).
We found that x of the s samples are abnormal ('yes')
What is the formulae that connects the precision of the x/s estimate
of probability p of abnormalities to the sample/population ratio s/n ?

Thank you very much.

I forget to add that n*p is very small so the confidence interval
p +- 1.96 * Sqrt( p*(1-p)/s ) is not very appropriate.
Richard Ulrich
Posted: Sun Apr 13, 2008 8:05 pm
Guest
On Fri, 11 Apr 2008 07:55:31 -0700 (PDT), Beorne <matteo.dt@gmail.com>
wrote:

Quote:
Having a total population of n we check for abnormalities ('yes' or
'no') on a sample of size s (< n).
We found that x of the s samples are abnormal ('yes')
What is the formulae that connects the precision of the x/s estimate
of probability p of abnormalities to the sample/population ratio s/n ?

Are you looking for the formula for the "Finite Population"
correction for the standard deviation? You can Google
for that, I'm pretty sure. It assumes that the measured
fraction is exact (zero variance), and that the residual has
the mean and variance of what has been measured so far.
That is how I would figure it, since it works out in general,
for various models.

The safer thing is somewhat cruder -- Take the standard
deviation of the sample, and the confidence interval,
and inflate it to the full N. This is also the proper measure
when general inference is being implied, to other populations
or to the future. The FPC is used on election eve when
predicting the rest of the vote (also, taking into account
which districts are not-in).

--
Rich Ulrich
http://www.pitt.edu/~wpilib/index.html
Beorne
Posted: Sun Apr 13, 2008 10:05 pm
Guest
On Apr 14, 3:05 am, Richard Ulrich <Rich.Ulr...@comcast.net> wrote:
Quote:
On Fri, 11 Apr 2008 07:55:31 -0700 (PDT), Beorne <matteo...@gmail.com
wrote:

Having a total population of n we check for abnormalities ('yes' or
'no') on a sample of size s (< n).
We found that x of the s samples are abnormal ('yes')
What is the formulae that connects the precision of the x/s estimate
of probability p of abnormalities to the sample/population ratio s/n ?

Are you looking for the formula for the "Finite Population"
correction for the standard deviation? You can Google
for that, I'm pretty sure. It assumes that the measured
fraction is exact (zero variance), and that the residual has
the mean and variance of what has been measured so far.
That is how I would figure it, since it works out in general,
for various models.

The safer thing is somewhat cruder -- Take the standard
deviation of the sample, and the confidence interval,
and inflate it to the full N. This is also the proper measure
when general inference is being implied, to other populations
or to the future. The FPC is used on election eve when
predicting the rest of the vote (also, taking into account
which districts are not-in).

--
Rich Ulrichhttp://www.pitt.edu/~wpilib/index.html

Yes, i assume it is a finite population correction. The other thing I
noted is that the p +- 1.96 * Sqrt( p*(1-p)/s ) formulae assumes
binomuial distribution can be simplified as normal distribution, ut in
my case that n*p is almost zero so this approximation is untrue.

I tought this formulae would be very easy to find, since is the basic
formulae to justify sampling size in defect control process, but I
have some problem finding it and its theoretic framework.
Thanks
Richard Ulrich
Posted: Mon Apr 14, 2008 9:07 pm
Guest
On Mon, 14 Apr 2008 01:05:53 -0700 (PDT), Beorne <matteo.dt@gmail.com>
wrote:

Quote:
On Apr 14, 3:05 am, Richard Ulrich <Rich.Ulr...@comcast.net> wrote:
On Fri, 11 Apr 2008 07:55:31 -0700 (PDT), Beorne <matteo...@gmail.com
wrote:

Having a total population of n we check for abnormalities ('yes' or
'no') on a sample of size s (< n).
We found that x of the s samples are abnormal ('yes')
What is the formulae that connects the precision of the x/s estimate
of probability p of abnormalities to the sample/population ratio s/n ?

Are you looking for the formula for the "Finite Population"
correction for the standard deviation? You can Google
for that, I'm pretty sure. It assumes that the measured
fraction is exact (zero variance), and that the residual has
the mean and variance of what has been measured so far.
That is how I would figure it, since it works out in general,
for various models.

The safer thing is somewhat cruder -- Take the standard
deviation of the sample, and the confidence interval,
and inflate it to the full N. This is also the proper measure
when general inference is being implied, to other populations
or to the future. The FPC is used on election eve when
predicting the rest of the vote (also, taking into account
which districts are not-in).

--
Rich Ulrichhttp://www.pitt.edu/~wpilib/index.html

Yes, i assume it is a finite population correction. The other thing I
noted is that the p +- 1.96 * Sqrt( p*(1-p)/s ) formulae assumes
binomuial distribution can be simplified as normal distribution, ut in
my case that n*p is almost zero so this approximation is untrue.

I tought this formulae would be very easy to find, since is the basic
formulae to justify sampling size in defect control process, but I
have some problem finding it and its theoretic framework.
Thanks

Hmm. "Defect control" sounds somewhat like finite
correction.

However, "tolerance intervals" rather than "confidence
intervals" comprise a tougher set of theory that may be used,
when there is a process that is ongoing indefinitely.

It considers (I think) the variability in the variance estimate,
instead of accepting it wholly, and it makes a higher "level"
of probability statement.

--
Rich Ulrich

http://www.pitt.edu/~wpilib/index.html
 
Page 1 of 1       All times are GMT - 5 Hours
The time now is Sat Jul 26, 2008 11:00 pm