Main Page | Report this Page
 
   
Science Forum Index  »  Agriculture Forum  »  Bt pesticide resistance
Page 5 of 6    Goto page Previous  1, 2, 3, 4, 5, 6  Next
Author Message
Gordon Couger
Posted: Fri Aug 29, 2003 12:33 pm
Guest
"Torsten Brinch" <iaotb@inet.uni2.dk> wrote in message
news:fu8ukvsoiqoobpqh8ihh132mhdbgvg4n97@4ax.com...
Quote:
On Fri, 29 Aug 2003 09:50:07 GMT, "Gordon Couger"
gcouger@NOSPAMprovalue.net> wrote:


"Torsten Brinch" <iaotb@inet.uni2.dk> wrote in message
news:6rrskvgp0ca2h5ro939d6hlcc4adopfar6@4ax.com...

..> Lest you forget, you made this claim about her paper:
"data that didn't agree with the findings was discarded"
That's a pretty serious accusation.

I am sure not alone in my claims about Ms.Ingham.
Quote:

I am only quoting the paper that says outliers were discarded without
listing them or specifying them number. <snip hurricane of handwaving

Gordon, you said that "data that didn't agree with the findings was
discarded". This is a serious allegation against the authors of the
paper. What basis do you have for making it?

Every paper that I have been associated with or read in my field that

discarded outliers either plotted the data and marked the discards or
provided a place you could get the data and judge for yourself the relevance
of discarded data. Since the data on this paper was plotted in the paper the
marked discarded data must have been very inconvenient indeed. The bins the
data were broken into had no marked grouping it was all one almost uniform
spread of data that they found a confidence level of 99% form 90 trials in.

The statisticians I gave the paper to laughed. One of them was an engineer
and not aware of the food fight going on at the time. I did not tell them
anything about the factors surrounding the paper I just questioned the high
confide level from less than 100 trials. Every one mentioned the discarded
data as well.

Discarding outliers is not bad practice. Discarding them with out
documentation is. If they don't tell what they discarded they could have
thrown out anything.

The green lobby standing behind this kind of science is the reason that
serious scientist don't take you seriously. Concocted papers, misrepresented
affiliations, faked studies do a great deal more harm to your cause than the
temporary headlines they make. In the US it has put the Forestry back in
the management of local managers, scientist in charge of EPA and the Kyoto
Treaty flushed down the toilet over here. In the USGS and USDA we have well
over 100 years sound movement scientific research. The Royal Academy of
Science of London is the only thing that approaches it for longevity and
they don't fund research near to the degree that the US does. It is much
more difficult to sway US politicians and the public with sensational press.
For one thing some of our positions have been in pubic service longer than
some of the EU government have been in existence. We also have a many more
checks and balances preventing public opinion from hastily changing things
with out time to gather more data on the subject and prevent the tyranny of
the few by the many. unfortunately we are loosing that faster and a faster.

If you want to be taken seriously use good science. Not emotional appeals
and faked studies.

Gordon
Gordon Couger
Posted: Fri Aug 29, 2003 3:42 pm
Guest
"Torsten Brinch" <iaotb@inet.uni2.dk> wrote in message
news:vh5ukv0vd52hje519omfop4se1vivsa57b@4ax.com...
Quote:
On Fri, 29 Aug 2003 06:34:46 GMT, "Gordon Couger"
gcouger@NOSPAMprovalue.net> wrote:


"Jim Webster" <Jim@feeswerve.spam.co.uk> wrote in message
news:bilsve$94l$1@news8.svr.pol.co.uk...
yes Gordon, I'm afraid you are flogging a dead horse. The woman could
be
tried and convicted by every court in Denmark and Torsten will still
never
admit that she might have been less than 100% truthful

what a maroon


Jim,

Torsten has the paper and I suppose as a chemist he can do statistics. He
could try to duplicate the statistical conclusions of the paper form data
in
the paper. Showing his work of course.

Gordon, that is a misunderstanding of what a paper is, of what
you can expect to be able to do on the basis of the information
given in it. A scientific paper is generally not supposed to put
the reader in a position in relation to the raw data, such as to
make it possible for him to duplicate the statistical analysis
of it. One could say, a paper is meant to be read on the trust
that the authors and peer reviewers of the paper have done a
proper, sound job. This is not to say that the trust in this
cannot be called in question, only that it must be there
a priori and until it may be proved unwarranted.

So, lest you forget, you've made a serious allegation in relation
to this paper, namely that:

"data that didn't agree with the findings was discarded"

All the data points used in the study except the discarded ones are in the
paper. That should be enough to duplicate the statistics.

Gordon
Torsten Brinch
Posted: Fri Aug 29, 2003 5:49 pm
Guest
On Fri, 29 Aug 2003 21:42:10 GMT, "Gordon Couger"
<gcouger@NOSPAMprovalue.net> wrote:

Quote:
All the data points used in the study except the discarded ones are in the
paper. That should be enough to duplicate the statistics.

Um. The plotted points you see in the graphics, and the numbers in
tables are not raw values, they are means (n=3), three replicates
per treatment and sampling date.

However, apparently it is your hypothesis while looking at these data
points, that they are not all there, that some data points have been
discarded. But hey, that should be easily verifiable. Check, and you
should find for some sampling dates in plots and tables, that data
points are missing. :-)

Unfortunately for your hypothesis, they are all there.
Gordon Couger
Posted: Fri Aug 29, 2003 8:01 pm
Guest
"Torsten Brinch" <iaotb@inet.uni2.dk> wrote in message
news:hakvkvg5ka9oam4rdiin0g3llhnkih9kts@4ax.com...
Quote:
On Fri, 29 Aug 2003 21:42:10 GMT, "Gordon Couger"
gcouger@NOSPAMprovalue.net> wrote:

All the data points used in the study except the discarded ones are in
the
paper. That should be enough to duplicate the statistics.

Um. The plotted points you see in the graphics, and the numbers in
tables are not raw values, they are means (n=3), three replicates
per treatment and sampling date.

However, apparently it is your hypothesis while looking at these data
points, that they are not all there, that some data points have been
discarded. But hey, that should be easily verifiable. Check, and you
should find for some sampling dates in plots and tables, that data
points are missing. :-)

Unfortunately for your hypothesis, they are all there.

There is no hypothesis the paper clearly states outliers are discarded. If

they are all there it is even worse as they don't show which data the paper
is based on.

Gordon
Torsten Brinch
Posted: Sat Aug 30, 2003 7:01 am
Guest
On Sat, 30 Aug 2003 02:01:27 GMT, "Gordon Couger"
<gcouger@NOSPAMprovalue.net> wrote:

Quote:

"Torsten Brinch" <iaotb@inet.uni2.dk> wrote in message
news:hakvkvg5ka9oam4rdiin0g3llhnkih9kts@4ax.com...
On Fri, 29 Aug 2003 21:42:10 GMT, "Gordon Couger"
gcouger@NOSPAMprovalue.net> wrote:

All the data points used in the study except the discarded
ones are in the paper. That should be enough to duplicate
the statistics.

Um. The plotted points you see in the graphics, and the numbers in
tables are not raw values, they are means (n=3), three replicates
per treatment and sampling date.

However, apparently it is your hypothesis while looking at these data
points, that they are not all there, that some data points have been
discarded. But hey, that should be easily verifiable. Check, and you
should find for some sampling dates in plots and tables, that data
points are missing. :-)

Unfortunately for your hypothesis, they are all there.

There is no hypothesis the paper clearly states outliers are discarded.

I am talking about your hypothesis that "data that didn't agree with
the findings was discarded".

God help you, if you have nothing else to base this on, than what is
clearly stated in the paper, that outliers in raw data were removed
from datasets before variance homogenity of data was evaluated
in residual plots. Whatever you may think of removal of outliers at
this particular stage in the statistical analysis, it obviously does
not and cannot constitute the authors discarding of data that doesn't
agree with the findings. There are no findings at this stage, just a
mass of raw values, unfitted to any model, untested for any
significant differences between them.
Gordon Couger
Posted: Sun Aug 31, 2003 3:00 am
Guest
"Torsten Brinch" <iaotb@inet.uni2.dk> wrote in message
news:1q61lvkpd7ijsshe4rv1irf2pn8d36lmag@4ax.com...
Quote:
On Sat, 30 Aug 2003 02:01:27 GMT, "Gordon Couger"
gcouger@NOSPAMprovalue.net> wrote:


"Torsten Brinch" <iaotb@inet.uni2.dk> wrote in message
news:hakvkvg5ka9oam4rdiin0g3llhnkih9kts@4ax.com...
On Fri, 29 Aug 2003 21:42:10 GMT, "Gordon Couger"
gcouger@NOSPAMprovalue.net> wrote:

All the data points used in the study except the discarded
ones are in the paper. That should be enough to duplicate
the statistics.

Um. The plotted points you see in the graphics, and the numbers in
tables are not raw values, they are means (n=3), three replicates
per treatment and sampling date.

However, apparently it is your hypothesis while looking at these data
points, that they are not all there, that some data points have been
discarded. But hey, that should be easily verifiable. Check, and you
should find for some sampling dates in plots and tables, that data
points are missing. :-)

Unfortunately for your hypothesis, they are all there.

There is no hypothesis the paper clearly states outliers are discarded.

I am talking about your hypothesis that "data that didn't agree with
the findings was discarded".

God help you, if you have nothing else to base this on, than what is
clearly stated in the paper, that outliers in raw data were removed
from datasets before variance homogenity of data was evaluated
in residual plots. Whatever you may think of removal of outliers at
this particular stage in the statistical analysis, it obviously does
not and cannot constitute the authors discarding of data that doesn't
agree with the findings. There are no findings at this stage, just a
mass of raw values, unfitted to any model, untested for any
significant differences between them.


You have no idea when the data was discarded. It may have been when it was
found to be inconvenient in the statistical calculations. The results had
been quoted many times before any work was done. Who knows in what order the
work were done.

Gordon
Torsten Brinch
Posted: Sun Aug 31, 2003 8:58 am
Guest
On Sun, 31 Aug 2003 09:00:17 GMT, "Gordon Couger"
<gcouger@NOSPAMprovalue.net> wrote:

Quote:

"Torsten Brinch" <iaotb@inet.uni2.dk> wrote in message
news:1q61lvkpd7ijsshe4rv1irf2pn8d36lmag@4ax.com...
On Sat, 30 Aug 2003 02:01:27 GMT, "Gordon Couger"
gcouger@NOSPAMprovalue.net> wrote:

There is no hypothesis the paper clearly states outliers are discarded.

I am talking about your hypothesis that "data that didn't agree with
the findings was discarded".

God help you, if you have nothing else to base this on, than what is
clearly stated in the paper, that outliers in raw data were removed
from datasets before variance homogenity of data was evaluated
in residual plots. Whatever you may think of removal of outliers at
this particular stage in the statistical analysis, it obviously does
not and cannot constitute the authors discarding of data that doesn't
agree with the findings. There are no findings at this stage, just a
mass of raw values, unfitted to any model, untested for any
significant differences between them.


You have no idea when the data was discarded.

Come, the statistical analysis section in the paper clearly
describes the series of steps taken in the analysis, in the
order they were taken. How can anyone read that section with
comprehension and escape with no idea when outliers were
removed from raw data?

Quote:
It may have been when it was found to be inconvenient in
the statistical calculations. <snip

And, what is the relation, if any, between this hypothesis,
and your original hypothesis that "data that didn't
agree with the findings was discarded"?

I mean, are you just re-expressing that original hypothesis
in a fuzzy low-key manner, or are you referring to some
inconvenience of having a gross outlier in a residual plot?
Gordon Couger
Posted: Sun Aug 31, 2003 2:45 pm
Guest
"Torsten Brinch" <iaotb@inet.uni2.dk> wrote in message
news:s134lv0hm8ndnfgobn1iduma8c8db4u144@4ax.com...
Quote:
On Sun, 31 Aug 2003 09:00:17 GMT, "Gordon Couger"
gcouger@NOSPAMprovalue.net> wrote:


"Torsten Brinch" <iaotb@inet.uni2.dk> wrote in message
news:1q61lvkpd7ijsshe4rv1irf2pn8d36lmag@4ax.com...
On Sat, 30 Aug 2003 02:01:27 GMT, "Gordon Couger"
gcouger@NOSPAMprovalue.net> wrote:

There is no hypothesis the paper clearly states outliers are
discarded.

I am talking about your hypothesis that "data that didn't agree with
the findings was discarded".

God help you, if you have nothing else to base this on, than what is
clearly stated in the paper, that outliers in raw data were removed
from datasets before variance homogenity of data was evaluated
in residual plots. Whatever you may think of removal of outliers at
this particular stage in the statistical analysis, it obviously does
not and cannot constitute the authors discarding of data that doesn't
agree with the findings. There are no findings at this stage, just a
mass of raw values, unfitted to any model, untested for any
significant differences between them.


You have no idea when the data was discarded.

Come, the statistical analysis section in the paper clearly
describes the series of steps taken in the analysis, in the
order they were taken. How can anyone read that section with
comprehension and escape with no idea when outliers were
removed from raw data?

It may have been when it was found to be inconvenient in
the statistical calculations. <snip

And, what is the relation, if any, between this hypothesis,
and your original hypothesis that "data that didn't
agree with the findings was discarded"?

I mean, are you just re-expressing that original hypothesis
in a fuzzy low-key manner, or are you referring to some
inconvenience of having a gross outlier in a residual plot?

When the findings are used in a fraudulent manner before the work that the

paper is written from is preformed am strongly suspicious of the paper and
all connect to it. When the statistical claims they make don't agree with
the data they publish I am more cynical about it.

I went to the effort ot have experts look at the paper and they came to the
same conclusions and you can call OSU like I did and find out what Ms.
Ingram's relationship with them was when she claimed affiliation with them
when she had none. Or you can look up her CV and it verifies her employment
dates at OSU and shows she was not working for them and only had courtesy
privileges. You can get courtesy privileges at any land grant university by
just asking.

If you can make that pig of a paper sing by making the statistics work I
will continue the discussion.

Gordon
Torsten Brinch
Posted: Sun Aug 31, 2003 8:51 pm
Guest
On Sun, 31 Aug 2003 20:45:07 GMT, "Gordon Couger"
<gcouger@NOSPAMprovalue.net> wrote:

Quote:
When the findings are used in a fraudulent manner before the work that the
paper is written from is preformed am strongly suspicious of the paper and
all connect to it.

But, that's highly circumstantial, is it not, if you want to prove
abominable discarding of data? :-)

However, let's see your evidence for the claim that findings of
the paper were used, before the work that the paper is written
from was performed.

Quote:
When the statistical claims they make don't agree with
the data they publish I am more cynical about it.

You must be more specific, or noone will know what you are
critisising. Which statistical claims don't agree with
which data? If you can't answer that, you do not
have a critique of substance worth relating to.

Quote:
I went to the effort ot have experts look at the paper and they came to the
same conclusions

Understand that unidentified experts making unidentified conclusions
that happens to agree with whatever you say just doesn't cut it.

Quote:
and you can call OSU like I did and find out what Ms.
Ingram's relationship with them <snip

Ms. Inghams affiliation is irrelevant to the question, if data
points was discarded that did not agree with the findings in
that paper.

Quote:
If you can make that pig of a paper sing by making the statistics work
I will continue the discussion.

That's very kind of you. However don't you think it is about time
you coughed up some evidence for your claim that data that did not
agree with findings was discarded? How many times have you been
asked for that now. Five, seven times?
Gordon Couger
Posted: Sun Aug 31, 2003 9:30 pm
Guest
"Torsten Brinch" <iaotb@inet.uni2.dk> wrote in message
news:k0d5lvc0o58mlfafq13uopo6p7r8csq10s@4ax.com...
Quote:
On Sun, 31 Aug 2003 20:45:07 GMT, "Gordon Couger"
gcouger@NOSPAMprovalue.net> wrote:

When the findings are used in a fraudulent manner before the work that
the
paper is written from is preformed am strongly suspicious of the paper
and
all connect to it.

But, that's highly circumstantial, is it not, if you want to prove
abominable discarding of data? :-)

However, let's see your evidence for the claim that findings of
the paper were used, before the work that the paper is written
from was performed.

When the statistical claims they make don't agree with
the data they publish I am more cynical about it.

You must be more specific, or noone will know what you are
critisising. Which statistical claims don't agree with
which data? If you can't answer that, you do not
have a critique of substance worth relating to.

I went to the effort ot have experts look at the paper and they came to
the
same conclusions

Understand that unidentified experts making unidentified conclusions
that happens to agree with whatever you say just doesn't cut it.

and you can call OSU like I did and find out what Ms.
Ingram's relationship with them <snip

Ms. Inghams affiliation is irrelevant to the question, if data
points was discarded that did not agree with the findings in
that paper.

If you can make that pig of a paper sing by making the statistics work
I will continue the discussion.

That's very kind of you. However don't you think it is about time
you coughed up some evidence for your claim that data that did not
agree with findings was discarded? How many times have you been
asked for that now. Five, seven times?

I have no idea what the effect of the discarded data did to the study did.

That's the point.

Gordon
Torsten Brinch
Posted: Mon Sep 01, 2003 5:48 am
Guest
On Mon, 01 Sep 2003 03:30:41 GMT, "Gordon Couger"
<gcouger@NOSPAMprovalue.net> wrote:

Quote:
I have no idea what the effect of the discarded data did to the study did.
That's the point.

Well, ask yourself what happens to a data set if you add an outlier
to it: The overall variance in data increases, and the ANOVA may
come out with either too low F-values to allow you to calculate a
figure for least significant difference (LSD) or you may end up with
an inflated figure for LSD. The effect of this may be that you will
reject otherwise significant differences.

Now turn that around: Without the outlier, finding significant
differences will be more likely. In other words, you can reasonably
suspect that those significant difference that are found without the
outlier, might turn into insignificant differences with the outlier
added back in.

However, this study actually found very few significant differences,
although some of those that were found were striking.

The main findings, in short, was that plants in the GE bacteria setup
turned chlorotic and wilted, apparently concomitant with a flush
period of nematode growth during which nematodes reached higher
numbers, and, with fungal feeding nematodes dominating.
Compared to this plants in the non-GE bacteria setups,plants grew
well, nematode numbers did not flush just as much, and bacterial
feeding nematodes remained dominating.

And here's the crunch, there is no way any discarding of outliers can
have 'produced' the finding that plants in the GE bacteria setup
wilted and died, while the plants in the nonGE bacteria setup grew.

So, from a cool minded perspective, you may well have concerns
as a matter of principle as regards handling of outliers,
but you cannot have concerns that this handling has affected
the main findings of the study.

Quite generally, a criticism of a study, on counts that do not
affect its main findings, is insubstantial. Perhaps some would
call such criticism mere nitpicking, I would go that far, because
it may well be educating. However, the point is, you can't 'kill'
a study by flawing it of something that does not change its
conclusions.
Brian Sandle
Posted: Mon Sep 01, 2003 5:52 am
Guest
In sci.agriculture Gordon Couger <gcouger@nospamprovalue.net> wrote:

Quote:
"Torsten Brinch" <iaotb@inet.uni2.dk> wrote in message
news:k0d5lvc0o58mlfafq13uopo6p7r8csq10s@4ax.com...

On Sun, 31 Aug 2003 20:45:07 GMT, "Gordon Couger"
gcouger@NOSPAMprovalue.net> wrote:

If you can make that pig of a paper sing by making the statistics work
I will continue the discussion.

That's very kind of you. However don't you think it is about time
you coughed up some evidence for your claim that data that did not
agree with findings was discarded? How many times have you been
asked for that now. Five, seven times?

I have no idea what the effect of the discarded data did to the study did.
That's the point.

No you need to be quite good at the subject to deal properly with
outliers. I am thinking that sometimes people do not eliminate them when
they should be, and others do but don't acknowledge it.

Imagine you represent a conservative govt applying as little as possible
health funding to a village of 100 people of mainly low income, based on
whether they can pay for it themselves or not. When calculating the
average will you include the income of the one multi-millionaire in the
village? That would make the average income rather higher, so you can fund
less. But the other 99 people would have no ability to pay, consequently.
The place would become a real eyesore.

Then if you were looking for how much the village could potentially donate
to a cause would the high earner still be an outlier?

If you were recording times for a cross country race would you include
ones where runners had obviously taken a short cut or joined the race some
way through it because they were rather briefer than really possible?
(Memories of school cross-countries). Well you might if you were trying to
catch cheats, or runners mistaken about the route.
Torsten Brinch
Posted: Mon Sep 01, 2003 5:57 am
Guest
On Mon, 01 Sep 2003 03:30:41 GMT, "Gordon Couger"
<gcouger@NOSPAMprovalue.net> wrote:

Quote:
"Torsten Brinch" <iaotb@inet.uni2.dk> wrote in message
news:k0d5lvc0o58mlfafq13uopo6p7r8csq10s@4ax.com...
On Sun, 31 Aug 2003 20:45:07 GMT, "Gordon Couger"
gcouger@NOSPAMprovalue.net> wrote:

When the findings are used in a fraudulent manner before
the work that the paper is written from is preformed am
strongly suspicious of the paper and all connect to it.

..>let's see your evidence for the claim that findings of
the paper were used, before the work that the paper is written
from was performed.

Let's see your evidence, Gordon.
Torsten Brinch
Posted: Mon Sep 01, 2003 8:02 am
Guest
On 1 Sep 2003 11:52:50 GMT, Brian Sandle
<bsandle@shell.caverock.net.nz> wrote:

Quote:
In sci.agriculture Gordon Couger <gcouger@nospamprovalue.net> wrote:

"Torsten Brinch" <iaotb@inet.uni2.dk> wrote in message
news:k0d5lvc0o58mlfafq13uopo6p7r8csq10s@4ax.com...

On Sun, 31 Aug 2003 20:45:07 GMT, "Gordon Couger"
gcouger@NOSPAMprovalue.net> wrote:

If you can make that pig of a paper sing by making the statistics work
I will continue the discussion.

That's very kind of you. However don't you think it is about time
you coughed up some evidence for your claim that data that did not
agree with findings was discarded? How many times have you been
asked for that now. Five, seven times?

I have no idea what the effect of the discarded data did to the study did.
That's the point.

No you need to be quite good at the subject to deal properly with
outliers. I am thinking that sometimes people do not eliminate them when
they should be, and others do but don't acknowledge it.

Imagine you represent a conservative govt applying as little as possible
health funding to a village of 100 people of mainly low income, based on
whether they can pay for it themselves or not. When calculating the
average will you include the income of the one multi-millionaire in the
village? That would make the average income rather higher, so you can fund
less. But the other 99 people would have no ability to pay, consequently.
The place would become a real eyesore.

But that's not an outlier problem, Brian. Indeed, there's not much of
a statistical problem in it Smile You have sampled the whole population,
you know the income of each and every individual in it, you know their
average income. The average is the average. It is just not a very good
descriptor for what the politicians want to describe.

Quote:
Then if you were looking for how much the village could potentially donate
to a cause would the high earner still be an outlier?

Yes. In that situation average income might be a more suitable
descriptor. But again, this is not an outlier problem. The high earner
is known to be part of the population studied, so the data point
representing his income can never be considered an outlier.

Quote:
If you were recording times for a cross country race would you include
ones where runners had obviously taken a short cut or joined the race some
way through it because they were rather briefer than really possible?
(Memories of school cross-countries). Well you might if you were trying to
catch cheats, or runners mistaken about the route.

That's more like it. If you observe from the recorded racing times
that all racers completed the race within a time range of 3-7 hours,
except one racer -- who has been recorded as completing it in 7
minutes -- that does raise the question if the racing time of this
runner should not be consider an outlier.
Brian Sandle
Posted: Mon Sep 01, 2003 5:50 pm
Guest
In sci.agriculture Torsten Brinch <iaotb@inet.uni2.dk> wrote:
Quote:
On 1 Sep 2003 11:52:50 GMT, Brian Sandle
bsandle@shell.caverock.net.nz> wrote:
No you need to be quite good at the subject to deal properly with
outliers. I am thinking that sometimes people do not eliminate them when
they should be, and others do but don't acknowledge it.

Imagine you represent a conservative govt applying as little as possible
health funding to a village of 100 people of mainly low income, based on
whether they can pay for it themselves or not. When calculating the
average will you include the income of the one multi-millionaire in the
village? That would make the average income rather higher, so you can fund
less. But the other 99 people would have no ability to pay, consequently.
The place would become a real eyesore.

But that's not an outlier problem, Brian. Indeed, there's not much of
a statistical problem in it Smile

Except in that part of statistics is deciding what measures to use and
what to measure.

You have sampled the whole population,
Quote:
you know the income of each and every individual in it, you know their
average income. The average is the average. It is just not a very good
descriptor for what the politicians want to describe.

Then you might change to the middle income. That might not work either if
there is a big tail of very low incomes.

Quote:
Then if you were looking for how much the village could potentially donate
to a cause would the high earner still be an outlier?

Yes. In that situation average income might be a more suitable
descriptor. But again, this is not an outlier problem. The high earner
is known to be part of the population studied, so the data point
representing his income can never be considered an outlier.

So you might change from a purely latitude and longitude basis for the
sample to some other. Perhaps it is the subset of employees in the region.

Say you wanted to persuade people that potatoes in general are not high on
solanine. How many sweet potatoes are you allowed in the sample?

Given the figures the sweet potatoes might appear as outliers. This might
lead back to calling into question whether a sweet potato is a potato.

I think Gordon has a little point, that he needed to be told a bit more
about the outlier categorizing. But when you search the web for how
frequently `Monsanto' occurs in studies mentioning outliers, how much do
you get?

Quote:
If you were recording times for a cross country race would you include
ones where runners had obviously taken a short cut or joined the race some
way through it because they were rather briefer than really possible?
(Memories of school cross-countries). Well you might if you were trying to
catch cheats, or runners mistaken about the route.

That's more like it. If you observe from the recorded racing times
that all racers completed the race within a time range of 3-7 hours,
except one racer -- who has been recorded as completing it in 7
minutes -- that does raise the question if the racing time of this
runner should not be consider an outlier.

I think they used to be picked up in a car and dropped off again near the
finish, to come in not suspiciously too early. In this case the finishing
time is not an `adequate measure' of running ability.
 
Page 5 of 6    Goto page Previous  1, 2, 3, 4, 5, 6  Next   All times are GMT - 5 Hours
The time now is Thu Jul 24, 2008 2:27 pm