Main Page | Report this Page
 
   
Science Forum Index  »  Statistics - Education Forum  »  interaction terms in regression model
Page 2 of 2    Goto page Previous  1, 2
Author Message
Old Mac User
Posted: Tue Jan 16, 2007 11:22 am
Guest
David...

You wrote...

"If the context is regression with continuous explanatory variables,
then the OP may gain some understanding by drawing contours of the
model-prediction as a function of the explanatory variables. That is,
look at the shapes of the functions being allowed into the prediction."

This is an interesting comment. If we do this for a "standalone
interaction" (meaning, large t-ratio associated with the 2-factor
interaction; corresponding linear terms nonexistent or very weak) then
the contour map will be a classic "saddle-point". Always. Every
time. Herein lies an interesting thought.

If we simply set up a graphic in terms of the independent variables X1,
X2 and write down the observed values of the response at the corners of
the operating space (I'm sort of assuming the data spawned from a
2-level factorial design here... but what I'm about to say applies in
other circumstances)... and if we just examine that graphic for a
bit... we will realize that alternative "lines of constant response"
are not only possible but that those alternatives make much more sense
than a saddle-point. In fact, on examination, we will see that contours
representing a ridge OR contours representing a valley are perfectly
feasible... and that those are a lot simpler and more appealing than a
saddle-point. In summary, when a standalone interaction is present
this indicates there are three rival models each of which will explain
the data... one model being just as good as the others.

Now for the fun part. If we "build a model" from the data, then with a
standalone interaction the model will be of the form Y = bo +
b12*(X1*X2) (the variables X1 and X2 should be properly centered, as
noted elsewhere) and if we move that model into software for plotting
contour maps, we'll get a saddle-point. Everytime. No exceptions.
This can blind us to the possibility of alternative models that explain
the data with the same "goodness" as a saddle-point.

I see this sort of nonsense frequently. Designed Experiment ---> data
---> models (with at least one standalone interaction) ---> contour
maps (behold: a saddle-point!!!). The folks who market and sell
commercial software usually put one or more saddle-points in their
multi-color advertisements to tout their software with "see... the
world is so complex and dangerous that you can't live without our
software which reveals things like this saddle-point" That's pure
hogwash. A simple graphic... an "eyeball examination of the data"...
will show that simpler (and more likely) models explain the data just
as well. Occam's rule... "go with the simple"... should be considered
before jumping in bed with a saddle-point.

OK, so I'm the nasty old guy who "trusts nothing... asks questions...
breaks icons... and afflicts the comfortable." Sadly, we now have a
generation of folks who ask few questions... trust that "the software
is right"... and, quite frankly, seldom just look at the numbers. I
love them dearly, but they need they aren't getting enough bad
experience. "People don't learn from experience... they learn from BAD
experience."

Courses in statistics, etc. are still rooted in concepts from 50+ years
ago combined with "learn to use the SmartStats software so you can get
a job where you'll blow them away by using exotic methods like
principal components, factor analysis, etc."

Sometimes it's a good idea to just look at the data before turning on
the computer. OMU


David Jones wrote:
Quote:
Jerry Dallal wrote:
mcam54@hotmail.com wrote:
pbrewster@hotmail.com wrote:
If I have an interaction term in my regression model that is
statistically significant, but one or both of the original
attributes are not statistically significant, should I remove
original attributes that are not statistically significant from
the
model?

For example, I create an interaction C=A*B. In my regression
model, C is statistically significant but A and B are not. Should
I leave them in the model or remove them?

Everyone is giving very statistical answers (which I guess makes
sense for a stats newsgroup Smile.

My question is can you explain this interaction clinically? Is it
plausible? Can you avoid overfit? If not, you may have grounds
for droping the individual predictor terms and the interaction
term.

Marc


Some would argue that interpretation is the ONLY thing that matters!

But interpretation demands knowing the context. I haven't seen
anything that says that the problem is not one where all factors are
of the simple 2-level type, where a significant interaction term with
non-significant individual factors (given the inclusion of the
interaction term) has the interpretation that there is no effect
unless both "treatments" are present.

If the context is regression with continuous explanatory variables,
then the OP may gain some understanding by drawing contours of the
model-prediction as a function of the explanatory variables. That is,
look at the shapes of the functions being allowed into the prediction.

An interesting question looking in the opposite direction to the one
discussed on other threads is: if an interaction term X*Y is included
in the model, should all the second-order terms (ie. X^2 and Y^2) be
included also.

To try to answer the other parts of the later question: " Can you
avoid overfit? If not, you may have grounds for droping the
individual predictor terms and the interaction term. " This depends a
lot on what the OP is trying to use the regression for. There are (at
least) two possibilities ....

(a) the final model is to be treated as deciding as to whether certain
effects really should be treated are present in comparison to a
simpler model where the effects are absent. In this case "avoiding
overfit" is broadly equivalent to the significance tests already being
done.

(b) the final model is to be treated as mechanism for creating
"predicted values" for future instances. For this, the presence or
absence of certain terms in the regression is irrelavant except in so
far as this affects the likely error in the prediction. In this case
"avoiding overfit" might be undertaken by using a criterion such as
FPE (Final Prediction Error), or AIC etc. These criteria don't
necessarily avoid the question of whether to force in individual terms
if higher-order terms seem necessary.

David Jones
Bruce Weaver
Posted: Tue Jan 16, 2007 11:53 am
Guest
Old Mac User wrote:
Quote:
David...

You wrote...

"If the context is regression with continuous explanatory variables,
then the OP may gain some understanding by drawing contours of the
model-prediction as a function of the explanatory variables. That is,
look at the shapes of the functions being allowed into the prediction."

This is an interesting comment. If we do this for a "standalone
interaction" (meaning, large t-ratio associated with the 2-factor
interaction; corresponding linear terms nonexistent or very weak) then
the contour map will be a classic "saddle-point". Always. Every
time. Herein lies an interesting thought.

If we simply set up a graphic in terms of the independent variables X1,
X2 and write down the observed values of the response at the corners of
the operating space (I'm sort of assuming the data spawned from a
2-level factorial design here... but what I'm about to say applies in
other circumstances)... and if we just examine that graphic for a

---- snip ----

OMU, I just want to make sure I understand your terminology. Where you
say 2-level factorial design, I think you mean that even though X1 and
X2 are continuous variables, only two fixed levels of each are used in
the study. Is that correct?

--
Bruce Weaver
bweaver@lakeheadu.ca
www.angelfire.com/wv/bwhomedir
Old Mac User
Posted: Tue Jan 16, 2007 3:20 pm
Guest
Bruce W.:

You wrote...

Quote:
OMU, I just want to make sure I understand your terminology. Where you
say 2-level factorial design, I think you mean that even though X1 and
X2 are continuous variables, only two fixed levels of each are used in
the study. Is that correct?

That's the way I described the situation because that's an easy version
to describe. If X1 and X2 can only be set to discrete choices such as
"yes vs. no" or "Supplier 1 vs. Supplier 2) then there are no values
between these
(no interpolation) hence contour maps would not be meaningful.

But even if the continuous variables are not set to precisely "low" and
"high" values... if there's a standalone interaction... then a contour
map derived from the model will be saddle-points. OMU


Bruce Weaver wrote:
Quote:
Old Mac User wrote:
David...

You wrote...

"If the context is regression with continuous explanatory variables,
then the OP may gain some understanding by drawing contours of the
model-prediction as a function of the explanatory variables. That is,
look at the shapes of the functions being allowed into the prediction."

This is an interesting comment. If we do this for a "standalone
interaction" (meaning, large t-ratio associated with the 2-factor
interaction; corresponding linear terms nonexistent or very weak) then
the contour map will be a classic "saddle-point". Always. Every
time. Herein lies an interesting thought.

If we simply set up a graphic in terms of the independent variables X1,
X2 and write down the observed values of the response at the corners of
the operating space (I'm sort of assuming the data spawned from a
2-level factorial design here... but what I'm about to say applies in
other circumstances)... and if we just examine that graphic for a

---- snip ----

OMU, I just want to make sure I understand your terminology. Where you
say 2-level factorial design, I think you mean that even though X1 and
X2 are continuous variables, only two fixed levels of each are used in
the study. Is that correct?

--
Bruce Weaver
bweaver@lakeheadu.ca
www.angelfire.com/wv/bwhomedir
Bruce Weaver
Posted: Tue Jan 16, 2007 4:10 pm
Guest
Old Mac User wrote:
Quote:
Bruce W.:

You wrote...

OMU, I just want to make sure I understand your terminology. Where you
say 2-level factorial design, I think you mean that even though X1 and
X2 are continuous variables, only two fixed levels of each are used in
the study. Is that correct?

That's the way I described the situation because that's an easy version
to describe. If X1 and X2 can only be set to discrete choices such as
"yes vs. no" or "Supplier 1 vs. Supplier 2) then there are no values
between these
(no interpolation) hence contour maps would not be meaningful.

But even if the continuous variables are not set to precisely "low" and
"high" values... if there's a standalone interaction... then a contour
map derived from the model will be saddle-points. OMU


I was with you until that last paragraph. Let me try again.

Imagine an experiment in which for each trial, the values of X1 and X2
are random numbers within some ranges I choose--so there are
observations of X1 and X2 across the whole range, and combinations of X1
and X2 across the whole range. Would you still be concerned about a
standalone interaction? Or is your concern limited to the case where
there are only 4 possible combinations of X1 and X2?

--
Bruce Weaver
bweaver@lakeheadu.ca
www.angelfire.com/wv/bwhomedir
Old Mac User
Posted: Tue Jan 16, 2007 4:48 pm
Guest
Bruce W.:

The concern/problem stems not from the arrangement of experimental
combinations in the X1, X2 space, but from the presence of a strong
standalone interaction. The further concern is that if we build a
model from such data (whether from four combinations or from the
situation you described) then interpretation of that model is
ambiguous. That is, there are three different interpretations and the
one most commonly cited is also the most complex... hence unlikely...
of those three.

It is true that standalone interactions tend to appear most frequently
when analyzing data from factorial designed experiments simply because
the machinery (software) and teachings/training is set up to emphasize
looking for interactions. But this does not preclude finding them when
analyzing data that comes from other arrangements of X1, X2. OMU



Bruce Weaver wrote:
Quote:
Old Mac User wrote:
Bruce W.:

You wrote...

OMU, I just want to make sure I understand your terminology. Where you
say 2-level factorial design, I think you mean that even though X1 and
X2 are continuous variables, only two fixed levels of each are used in
the study. Is that correct?

That's the way I described the situation because that's an easy version
to describe. If X1 and X2 can only be set to discrete choices such as
"yes vs. no" or "Supplier 1 vs. Supplier 2) then there are no values
between these
(no interpolation) hence contour maps would not be meaningful.

But even if the continuous variables are not set to precisely "low" and
"high" values... if there's a standalone interaction... then a contour
map derived from the model will be saddle-points. OMU


I was with you until that last paragraph. Let me try again.

Imagine an experiment in which for each trial, the values of X1 and X2
are random numbers within some ranges I choose--so there are
observations of X1 and X2 across the whole range, and combinations of X1
and X2 across the whole range. Would you still be concerned about a
standalone interaction? Or is your concern limited to the case where
there are only 4 possible combinations of X1 and X2?

--
Bruce Weaver
bweaver@lakeheadu.ca
www.angelfire.com/wv/bwhomedir
 
Page 2 of 2    Goto page Previous  1, 2   All times are GMT - 5 Hours
The time now is Thu Jan 08, 2009 5:38 pm