| |
 |
|
|
Science Forum Index » Statistics - Education Forum » Doesn't a t-test work here?
Page 1 of 1
|
| Author |
Message |
| Guest |
Posted: Thu Feb 21, 2008 2:24 am |
|
|
|
|
Hello,
I am a biologist attempting to publish in a Scientific Journal. I
used t-tests to analyze some of my data and one of the reviewers made
a comment about this being inappropriate. This same reviewer made
other comments that led me to question whether or not he really
understood what was going on, but I wanted to get some input on
whether or not a t-test would be appropriate in this situation. My
statistics background is limited, I did take a class about 4 years ago
in statistics, but since I didn't need the information right away, the
majority of it left my brain almost immediately.
Reviewer's exact quote:
"A T-test or Anova are really for pairwise comparisons and cannot be
used for comparisons of multiple samples such as this. "
My experimental set up:
Leaves of a plant were treated with 1
of 9 different compounds, or one compound that served as a control.
Each of these compounds produced a certain amount of Green fluorescent
light when ultraviolet light was shown onto the leaves. This level of
light could be quantitatively measured as photons of light emitted per
second per square centimeter. Each compound was placed onto 10
different leaves and measurements from each of these treatments were
taken. The 10 measurements for each compound was compared to the 10
control measurements to determine if the compound increased
fluorescent light emitted at a statistically significant level as
compared to the control. I thought that to do this a T-test or
possibly an Anova was an appropriate way to make this analysis. Again
I wanted to compare Compound X with Control, Compound Y with control,
Compound Z with control, and so on. I do not care how Compound X
compares with Compound Y.
Am I correct here? If so how would you respond to the reviewer?
Thank you in advance
Jason |
|
|
| Back to top |
|
| Guest |
Posted: Thu Feb 21, 2008 4:33 am |
|
|
|
|
On Feb 21, 11:20 pm, Allen McIntosh <nos...@mouse-potato.com> wrote:
Quote: jgpow...@gmail.com wrote:
Hello,
I am a biologist attempting to publish in a Scientific Journal. I
used t-tests to analyze some of my data and one of the reviewers made
a comment about this being inappropriate. This same reviewer made
other comments that led me to question whether or not he really
understood what was going on, but I wanted to get some input on
whether or not a t-test would be appropriate in this situation. My
statistics background is limited, I did take a class about 4 years ago
in statistics, but since I didn't need the information right away, the
majority of it left my brain almost immediately.
Reviewer's exact quote:
"A T-test or Anova are really for pairwise comparisons and cannot be
used for comparisons of multiple samples such as this. "
My experimental set up:
Leaves of a plant were treated with 1
of 9 different compounds, or one compound that served as a control.
Each of these compounds produced a certain amount of Green fluorescent
light when ultraviolet light was shown onto the leaves. This level of
light could be quantitatively measured as photons of light emitted per
second per square centimeter. Each compound was placed onto 10
different leaves and measurements from each of these treatments were
taken. The 10 measurements for each compound was compared to the 10
control measurements to determine if the compound increased
fluorescent light emitted at a statistically significant level as
compared to the control. I thought that to do this a T-test or
possibly an Anova was an appropriate way to make this analysis. Again
I wanted to compare Compound X with Control, Compound Y with control,
Compound Z with control, and so on. I do not care how Compound X
compares with Compound Y.
Am I correct here?
In a word, no, especially if you set this up as a pairwise comparison.
Neither is the reviewer, but I'll give them the benefit of the doubt
because they're being quoted out of context. On the face of it, the
first half of the sentence (ANOVA really for pairwise comparisons) is
incorrect, though I can see what they are trying to get at. There is
some truth to the second half (naive use of ANOVA is not appropriate for
multiple comparisons).
If so how would you respond to the reviewer?
Maybe start by reading about multiple comparisons. (Perhaps someone
else more familiar with literature in your field could suggest a
reference. Otherwise, Google should be your friend.) It's hard to say
exactly how you should be doing your analysis, since some details of
your experimental setup are lacking. Were the control measurements on
the same 10 leaves in each case? Were the leaves all selected from the
same plant? More generally, what scheme was used to assign treatments
to leaves?
Ouch...not what I was hoping for...regarding the experimental setup.
10 measurements in total were made. The details for each measurement
are as follows: Each treatment was done on two leaves of one plant.
So in total 10 leaves from 5 different plants were analyzed. No
leaves were doubly treated, so each leaf was a "virgin." While
control leaves and treated leaves were not derived from the same
plant, they were derived from the same batch of plants, i.e. planted
at the same time with the same seeds, watered at the same time,
fertilized at the same time, etc. etc. As far as "what scheme was
used to assign treatments to leaves?" I don't know exactly what you
mean, but essentially I just choose similar looking plants to do all
the treatments with, there was no preferential choices going on in
terms of "oh, this plant is healthy looking, lets treat it with
compound y." Thanks for your help |
|
|
| Back to top |
|
| Bruce Weaver |
Posted: Thu Feb 21, 2008 4:33 am |
|
|
|
Guest
|
On Feb 21, 7:24 am, jgpow...@gmail.com wrote:
--- snip ---
Quote: I wanted to compare Compound X with Control, Compound Y with control, Compound Z with control, and so on. I do not care how Compound X compares with Compound Y.
Look up Dunnett's test.
--
Bruce Weaver
bweaver@lakeheadu.ca
www.angelfire.com/wv/bwhomedir
"When all else fails, RTFM." |
|
|
| Back to top |
|
| Allen McIntosh |
Posted: Thu Feb 21, 2008 10:20 am |
|
|
|
Guest
|
jgpowers@gmail.com wrote:
Quote: Hello,
I am a biologist attempting to publish in a Scientific Journal. I
used t-tests to analyze some of my data and one of the reviewers made
a comment about this being inappropriate. This same reviewer made
other comments that led me to question whether or not he really
understood what was going on, but I wanted to get some input on
whether or not a t-test would be appropriate in this situation. My
statistics background is limited, I did take a class about 4 years ago
in statistics, but since I didn't need the information right away, the
majority of it left my brain almost immediately.
Reviewer's exact quote:
"A T-test or Anova are really for pairwise comparisons and cannot be
used for comparisons of multiple samples such as this. "
My experimental set up:
Leaves of a plant were treated with 1
of 9 different compounds, or one compound that served as a control.
Each of these compounds produced a certain amount of Green fluorescent
light when ultraviolet light was shown onto the leaves. This level of
light could be quantitatively measured as photons of light emitted per
second per square centimeter. Each compound was placed onto 10
different leaves and measurements from each of these treatments were
taken. The 10 measurements for each compound was compared to the 10
control measurements to determine if the compound increased
fluorescent light emitted at a statistically significant level as
compared to the control. I thought that to do this a T-test or
possibly an Anova was an appropriate way to make this analysis. Again
I wanted to compare Compound X with Control, Compound Y with control,
Compound Z with control, and so on. I do not care how Compound X
compares with Compound Y.
Am I correct here?
In a word, no, especially if you set this up as a pairwise comparison.
Neither is the reviewer, but I'll give them the benefit of the doubt
because they're being quoted out of context. On the face of it, the
first half of the sentence (ANOVA really for pairwise comparisons) is
incorrect, though I can see what they are trying to get at. There is
some truth to the second half (naive use of ANOVA is not appropriate for
multiple comparisons).
Quote: If so how would you respond to the reviewer?
Maybe start by reading about multiple comparisons. (Perhaps someone
else more familiar with literature in your field could suggest a
reference. Otherwise, Google should be your friend.) It's hard to say
exactly how you should be doing your analysis, since some details of
your experimental setup are lacking. Were the control measurements on
the same 10 leaves in each case? Were the leaves all selected from the
same plant? More generally, what scheme was used to assign treatments
to leaves? |
|
|
| Back to top |
|
| z |
Posted: Thu Feb 21, 2008 11:15 am |
|
|
|
Guest
|
On Feb 21, 9:33 am, jgpow...@gmail.com wrote:
Quote: On Feb 21, 11:20 pm, Allen McIntosh <nos...@mouse-potato.com> wrote:
jgpow...@gmail.com wrote:
Hello,
I am a biologist attempting to publish in a Scientific Journal. I
used t-tests to analyze some of my data and one of the reviewers made
a comment about this being inappropriate. This same reviewer made
other comments that led me to question whether or not he really
understood what was going on, but I wanted to get some input on
whether or not a t-test would be appropriate in this situation. My
statistics background is limited, I did take a class about 4 years ago
in statistics, but since I didn't need the information right away, the
majority of it left my brain almost immediately.
Reviewer's exact quote:
"A T-test or Anova are really for pairwise comparisons and cannot be
used for comparisons of multiple samples such as this. "
My experimental set up:
Leaves of a plant were treated with 1
of 9 different compounds, or one compound that served as a control.
Each of these compounds produced a certain amount of Green fluorescent
light when ultraviolet light was shown onto the leaves. This level of
light could be quantitatively measured as photons of light emitted per
second per square centimeter. Each compound was placed onto 10
different leaves and measurements from each of these treatments were
taken. The 10 measurements for each compound was compared to the 10
control measurements to determine if the compound increased
fluorescent light emitted at a statistically significant level as
compared to the control. I thought that to do this a T-test or
possibly an Anova was an appropriate way to make this analysis. Again
I wanted to compare Compound X with Control, Compound Y with control,
Compound Z with control, and so on. I do not care how Compound X
compares with Compound Y.
Am I correct here?
In a word, no, especially if you set this up as a pairwise comparison.
Neither is the reviewer, but I'll give them the benefit of the doubt
because they're being quoted out of context. On the face of it, the
first half of the sentence (ANOVA really for pairwise comparisons) is
incorrect, though I can see what they are trying to get at. There is
some truth to the second half (naive use of ANOVA is not appropriate for
multiple comparisons).
If so how would you respond to the reviewer?
Maybe start by reading about multiple comparisons. (Perhaps someone
else more familiar with literature in your field could suggest a
reference. Otherwise, Google should be your friend.) It's hard to say
exactly how you should be doing your analysis, since some details of
your experimental setup are lacking. Were the control measurements on
the same 10 leaves in each case? Were the leaves all selected from the
same plant? More generally, what scheme was used to assign treatments
to leaves?
Ouch...not what I was hoping for...regarding the experimental setup.
10 measurements in total were made. The details for each measurement
are as follows: Each treatment was done on two leaves of one plant.
So in total 10 leaves from 5 different plants were analyzed. No
leaves were doubly treated, so each leaf was a "virgin." While
control leaves and treated leaves were not derived from the same
plant, they were derived from the same batch of plants, i.e. planted
at the same time with the same seeds, watered at the same time,
fertilized at the same time, etc. etc. As far as "what scheme was
used to assign treatments to leaves?" I don't know exactly what you
mean, but essentially I just choose similar looking plants to do all
the treatments with, there was no preferential choices going on in
terms of "oh, this plant is healthy looking, lets treat it with
compound y." Thanks for your help- Hide quoted text -
- Show quoted text -
what they're getting act, i think, is that if a t-test is designed to
be significant or not at the .95 level, that means that 1 time out of
20 it will show a significant difference when there is really no
difference between the treatments, out of random "noise". this is the
generally accepted level of error. however, what you are doing is
basically 10 t-tests simultaneously, so your chance of finding a
significant difference somewhere when in fact there are none at all is
1-(.95)^10, or about .4, if you treat each t-test as though it were
alone and require a .95 confidence limit. this kind of error rate is
obviously a bit high.
this what is meant by multiple comparisons. there are various ways to
adjust the individual limits on the t-tests to keep the overall limit
at .95. |
|
|
| Back to top |
|
| Guest |
Posted: Thu Feb 21, 2008 4:51 pm |
|
|
|
|
On Feb 22, 11:14 am, Allen McIntosh <nos...@mouse-potato.com> wrote:
Quote: z wrote:
what they're getting act, i think, is that if a t-test is designed to
be significant or not at the .95 level, that means that 1 time out of
20 it will show a significant difference when there is really no
difference between the treatments, out of random "noise". this is the
generally accepted level of error. however, what you are doing is
basically 10 t-tests simultaneously, so your chance of finding a
significant difference somewhere when in fact there are none at all is
1-(.95)^10, or about .4, if you treat each t-test as though it were
alone and require a .95 confidence limit. this kind of error rate is
obviously a bit high.
The 10 tests aren't independent, so you can't calculate the chance of
finding a significant difference that way.
Okay I have done a bit of reading. Based on what I have read I still
feel like T-tests would be alright. Perhaps it would be more powerful
to run an ANOVA followed by a Dunnett's but I think for my purposes a
t-test would be okay. Here are a couple websites that I think support
my thoughts:
http://www.anselm.edu/homepage/jpitocch/biostats/keysmeans.html
Based on this webpage I think I need a 2 independent sample t-test 1 -
direction
http://www.aiaccess.net/e_t.htm
"3) The third form of the t-test, called the "Two Independent Samples
t-test", looks very similar to the previous one. We still have two
sets of measurements, and we are trying to figure out if the averages
of these two sets of measurements are significantly different. But
this time, we assume that there is no relationship whatsoever between
the two sets of measurements, because they were conducted on two, non
intersecting sets of individuals."
I think where everyone gets tripped up is the fact that I have like 9
different treatments all being compared to a control treatment. But
those 9 treatments are all completely different and essentially
unrelated. For all intents and purposes I am really only looking at
two sets of data at any given time, one set of treated leaves vs. the
set of control treatment leaves.
I know you guys are all statistics people so you will probably hate me
when I say that based on my reading an ANOVA/Dunnett would be
statistically more powerful, but being so late in the game with this
paper I am hesitant to change my statistical analysis now. |
|
|
| Back to top |
|
| Guest |
Posted: Thu Feb 21, 2008 6:17 pm |
|
|
|
|
On Feb 22, 11:51 am, jgpow...@gmail.com wrote:
Quote: On Feb 22, 11:14 am, Allen McIntosh <nos...@mouse-potato.com> wrote:
z wrote:
what they're getting act, i think, is that if a t-test is designed to
be significant or not at the .95 level, that means that 1 time out of
20 it will show a significant difference when there is really no
difference between the treatments, out of random "noise". this is the
generally accepted level of error. however, what you are doing is
basically 10 t-tests simultaneously, so your chance of finding a
significant difference somewhere when in fact there are none at all is
1-(.95)^10, or about .4, if you treat each t-test as though it were
alone and require a .95 confidence limit. this kind of error rate is
obviously a bit high.
The 10 tests aren't independent, so you can't calculate the chance of
finding a significant difference that way.
Okay I have done a bit of reading. Based on what I have read I still
feel like T-tests would be alright. Perhaps it would be more powerful
to run an ANOVA followed by a Dunnett's but I think for my purposes a
t-test would be okay. Here are a couple websites that I think support
my thoughts:
http://www.anselm.edu/homepage/jpitocch/biostats/keysmeans.html
Based on this webpage I think I need a 2 independent sample t-test 1 -
direction
http://www.aiaccess.net/e_t.htm
"3) The third form of the t-test, called the "Two Independent Samples
t-test", looks very similar to the previous one. We still have two
sets of measurements, and we are trying to figure out if the averages
of these two sets of measurements are significantly different. But
this time, we assume that there is no relationship whatsoever between
the two sets of measurements, because they were conducted on two, non
intersecting sets of individuals."
I think where everyone gets tripped up is the fact that I have like 9
different treatments all being compared to a control treatment. But
those 9 treatments are all completely different and essentially
unrelated. For all intents and purposes I am really only looking at
two sets of data at any given time, one set of treated leaves vs. the
set of control treatment leaves.
I know you guys are all statistics people so you will probably hate me
when I say that based on my reading an ANOVA/Dunnett would be
statistically more powerful, but being so late in the game with this
paper I am hesitant to change my statistical analysis now.
Allen and others,
Thanks for your help. I did indeed talk to a real live statistician
(sort of) when I was first doing the experiment. He was a doctoral
student who just happened to have to work in the Statistics department
help center when I was analyzing my data. While he didn't help in
experimental design, he did help me do the data analysis. He
essentially said t-tests were fine but that it would be more
statistically powerful to do the ANOVA. I don't recall him saying
ANOVA followed by Dunnett's, but he very well may have, this was
awhile back (for all I know maybe this was understood). Unfortunately
now that I have reviewers comments, he is not responding to my emails
for guidance. Beyond this (and this is where I am really screwed) I
am actually in Korea for the next 4 months doing a research-based
study abroad program. Argh! So going back to a Statistics help
center is also not possible. With that said, don't worry I am not
relying solely on this messageboard for guidance, I found it by
accident, while doing some Googling on my own, but the advice thus far
has been very useful and I greatly appreciate it. Perhaps I will
wander over to one of the Biology groups after this to try to help
some other poor lost soul.
Allen: I made my 11:51 post before I read your 11:41 post. To answer
your question there are 10 data points for each set of treatments.
Regarding plant selection and randomizing these things I do realize
that there is a level of inherent uncertainty when doing these sorts
of statistical comparisons, and I also realize that it is not ideal to
ignore plant variations, but I think for my purposes it should be
okay. These plants are not like humans, they are all clones of each
other. All grown from the same batch of seeds. All planted in the
same soil, watered at the same time, exposed to the same amount of
light, etc. etc. I realize there will always some differences, even
in clones, but these should be minor.
Perhaps one final rephrasing of the problem:
10 different leaves were treated with a Sample X. 10 other leaves
were treated with a Control. After a period of time I measured GFP
fluorescence from the leaves. I want to compare whether the
measurements of GFP fluorescence are statistically different between
Sample X and the Control. |
|
|
| Back to top |
|
| Allen McIntosh |
Posted: Thu Feb 21, 2008 10:14 pm |
|
|
|
Guest
|
z wrote:
Quote: what they're getting act, i think, is that if a t-test is designed to
be significant or not at the .95 level, that means that 1 time out of
20 it will show a significant difference when there is really no
difference between the treatments, out of random "noise". this is the
generally accepted level of error. however, what you are doing is
basically 10 t-tests simultaneously, so your chance of finding a
significant difference somewhere when in fact there are none at all is
1-(.95)^10, or about .4, if you treat each t-test as though it were
alone and require a .95 confidence limit. this kind of error rate is
obviously a bit high.
The 10 tests aren't independent, so you can't calculate the chance of
finding a significant difference that way. |
|
|
| Back to top |
|
| Allen McIntosh |
Posted: Thu Feb 21, 2008 10:41 pm |
|
|
|
Guest
|
jgpowers@gmail.com wrote:
Quote: Leaves of a plant were treated with 1
of 9 different compounds, or one compound that served as a control.
... Each compound was placed onto 10
different leaves and measurements from each of these treatments were
taken.
regarding the experimental setup.
10 measurements in total were made. The details for each measurement
are as follows: Each treatment was done on two leaves of one plant.
So in total 10 leaves from 5 different plants were analyzed. No
leaves were doubly treated, so each leaf was a "virgin." While
control leaves and treated leaves were not derived from the same
plant, they were derived from the same batch of plants, i.e. planted
at the same time with the same seeds, watered at the same time,
fertilized at the same time, etc. etc.
So was each treatment done 10 times or twice? This isn't completely
clear. In any event, my concern here is that you can't treat your 10
measurements as independent since you know nothing about within-plant
correlation.
You really should have talked to a statistician before you did the
experiment. It's good that you posted here, but now you really should
try to find a live statistician to talk to, preferably (I'm assuming you
are in academia) the one who teaches the course in experimental design.
[Don't feel too bad. No one is saying "start over".]
Quote: I just chose similar looking plants to do all
the treatments with, there was no preferential choices going on in
terms of "oh, this plant is healthy looking, lets treat it with
compound y."
You really should have randomized the treatment assignment. There is no
reason not to, and it gives you that little bit of extra insurance that
you're not going to be misled by something systematic. |
|
|
| Back to top |
|
| Allen McIntosh |
Posted: Sat Feb 23, 2008 10:14 am |
|
|
|
Guest
|
jgpowers@gmail.com wrote:
Quote: On Feb 22, 11:51 am, jgpow...@gmail.com wrote:
Okay I have done a bit of reading. Based on what I have read I still
feel like T-tests would be alright. Perhaps it would be more powerful
to run an ANOVA followed by a Dunnett's but I think for my purposes a
t-test would be okay.
It's not a question of power (in the statistical sense). It's a
question of not finding differences when there are none.
Number of means: 3 or more
Comparisons: 3 or more means being compared after a significant ANOVA
Type of Comparisons: Any differences between each experimental group
mean and the control group mean
This describes your situation exactly, and recommends Dunnett's test.
Quote:
http://www.aiaccess.net/e_t.htm
"3) The third form of the t-test, called the "Two Independent Samples
t-test"
This web page covers material that would be taught in a first course to
undergraduates with no mathematical background. It isn't designed to
cover messier situations.
Quote:
I think where everyone gets tripped up is the fact that I have like 9
different treatments all being compared to a control treatment. But
those 9 treatments are all completely different and essentially
unrelated.
THey're not. You are analyzing them in the same paper.
For all intents and purposes I am really only looking at
two sets of data at any given time, one set of treated leaves vs. the
set of control treatment leaves.
I know you guys are all statistics people so you will probably hate me
when I say that based on my reading an ANOVA/Dunnett would be
statistically more powerful, but being so late in the game with this
paper I am hesitant to change my statistical analysis now.
If you are going to do that, you should be careful to state that your
significance level is per-treatment. Even then, the referee would be
within their rights to insist on Dunnett's or something similar.
Quote: Regarding plant selection and randomizing these things I do realize
that there is a level of inherent uncertainty when doing these sorts
of statistical comparisons, and I also realize that it is not ideal to
ignore plant variations, but I think for my purposes it should be
okay. These plants are not like humans, they are all clones of each
other. All grown from the same batch of seeds. All planted in the
same soil, watered at the same time, exposed to the same amount of
light, etc. etc. I realize there will always some differences, even
in clones, but these should be minor.
You haven't convinced me that the plants are identical genetically, but
I feel a little better. You can check this - do an ANOVA on the
pairwise sums and the pairwise differences, and compare the error terms. |
|
|
| Back to top |
|
| Robert |
Posted: Mon Feb 25, 2008 1:40 am |
|
|
|
Guest
|
message snipped
ideal to
Quote: ignore plant variations, but I think for my purposes it should be
okay. These plants are not like humans, they are all clones of each
other. All grown from the same batch of seeds. All planted in the
same soil, watered at the same time, exposed to the same amount of
light, etc. etc. I realize there will always some differences, even
in clones, but these should be minor.
As a biologist, I can say that with few exceptions, plants raised from
different seeds are by definition, not clones. Plants raised by seed
even from the result of cross fertilisation from the same two parents
can vary considerably, depending upon how pure bred the parents are.
Even self-fertilisation results in some variation between seeds due to
the meiotic process. A clone is an organism with identical genetic
makeup to another and is normally produced by what is called
"vegetative" reproduction, where for example cuttings are struck from
the same plant. |
|
|
| Back to top |
|
| |
|
Page 1 of 1
All times are GMT - 5 Hours
The time now is Tue Oct 07, 2008 3:49 am
|
|