| |
 |
|
|
Science Forum Index » Statistics - Education Forum » Simple Question on R-Squared
Page 1 of 1
|
| Author |
Message |
| Guest |
Posted: Fri Feb 09, 2007 2:40 am |
|
|
|
|
Say, R-Squared=89%, this is interpreted as : 89% of the variation in Y
is explained by the Regression.
My question is: what "variation" in Y are we talking about here ? Is
it the variance(Yi) that is reduced or ........? |
|
|
| Back to top |
|
| Karl Ove Hufthammer |
Posted: Fri Feb 09, 2007 5:56 am |
|
|
|
Guest
|
TonyTudor@gmail.com:
Quote: Say, R-Squared=89%, this is interpreted as : 89% of the variation in Y
is explained by the Regression.
My question is: what "variation" in Y are we talking about here ? Is
it the variance(Yi) that is reduced or ........?
Y has a certain variance. The question is: How much of this variance is due
to the linear association with X, and how much is not (the residual
variance). You want much of the variance to be due to the linear
association (otherwise X wouldn't be of must use in predicting Y).
You assume a linear model Y = alpha + beta × X + eps, where X and eps are
independent. R² is simpliy an estimate of Var(E(Y|X))/Var(Y) =
Var(a + beta × X)
--------------------------- (*)
Var(alpha + beta × X + eps)
which you can write beta² × Var(X) / Var(Y). (This is also equal to the
squared correlation.)
In the regression, you estimate beta with b, and Var(X) and Var(Y) by Sxx
and Syy, respectively, and get the estimate R² = b² × Sxx / Syy.
When you have more than one regressor (explantory variable) in a linear
model, R² is still an estimate of Var(E(Y|X))/Var(Y); (*) will just have
more terms. I just showed the simple case here.
Also note that the interepretation of R² as the square of a correlation is
*only* meaningful when both X and Y are random. If X is fixed/chosen, you
can get whatever value of R² you want just by chosing your x values
carefully. (If their empirical variance is large, you get R² values close
to 1, if its small, you get values close to 0.)
--
Karl Ove Hufthammer |
|
|
| Back to top |
|
| Jerry Dallal |
Posted: Fri Feb 09, 2007 11:19 am |
|
|
|
Guest
|
TonyTudor@gmail.com wrote:
Quote: Say, R-Squared=89%, this is interpreted as : 89% of the variation in Y
is explained by the Regression.
My question is: what "variation" in Y are we talking about here ? Is
it the variance(Yi) that is reduced or ........?
In fact, nothing is "explained", only "fitted".
On the one hand, ignoring the predictors, there is the sum of squared
deviations from the sample mean (observed-mean), TSS.
With the predictors, there is the sum of squared deviations from the
regression hyperplane (observed-predicted) (ResSS)
R squared is 1 - ResSS/TSS |
|
|
| Back to top |
|
| |
|
Page 1 of 1
All times are GMT - 5 Hours
The time now is Wed Dec 03, 2008 10:58 pm
|
|