Simple Question on R-Squared
Guest
Posted: Fri Feb 09, 2007 2:40 am
Say R-squared = 89%. This is interpreted as: 89% of the variation in Y
is explained by the regression.

My question is: what "variation" in Y are we talking about here? Is
it the variance of Yi that is reduced, or ...?
Karl Ove Hufthammer
Posted: Fri Feb 09, 2007 5:56 am
TonyTudor@gmail.com:

Quote:
Say R-squared = 89%. This is interpreted as: 89% of the variation in Y
is explained by the regression.

My question is: what "variation" in Y are we talking about here? Is
it the variance of Yi that is reduced, or ...?

Y has a certain variance. The question is: How much of this variance is due
to the linear association with X, and how much is not (the residual
variance). You want much of the variance to be due to the linear
association (otherwise X wouldn't be of much use in predicting Y).

You assume a linear model Y = alpha + beta × X + eps, where X and eps are
independent. R² is simply an estimate of Var(E(Y|X))/Var(Y) =

Var(alpha + beta × X)
--------------------------- (*)
Var(alpha + beta × X + eps)

which you can write as beta² × Var(X) / Var(Y). (This is also equal to the
squared correlation between X and Y.)

In the regression, you estimate beta with b, and Var(X) and Var(Y) by Sxx
and Syy, respectively, and get the estimate R² = b² × Sxx / Syy.
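
To make the estimate R² = b² × Sxx / Syy concrete, here is a minimal Python sketch. The function name simple_r_squared, the simulated data, and the parameter values (alpha = 1, beta = 2, unit variances) are illustrative assumptions, not anything from the thread. Sxx and Syy are computed as sums of squares about the mean; any common 1/(n-1) factor cancels in the ratio.

```python
import random

def simple_r_squared(x, y):
    """Return (b, R^2 = b^2 * Sxx / Syy, squared sample correlation)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of squares and cross-products about the sample means
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    b = sxy / sxx                    # least-squares slope
    r2 = b * b * sxx / syy           # the estimate from the post
    corr2 = sxy * sxy / (sxx * syy)  # squared sample correlation
    return b, r2, corr2

# Simulate Y = alpha + beta * X + eps with X and eps independent (assumed values)
rng = random.Random(0)
x = [rng.gauss(0, 1) for _ in range(200)]
y = [1.0 + 2.0 * xi + rng.gauss(0, 1) for xi in x]

b, r2, corr2 = simple_r_squared(x, y)
print(f"b = {b:.3f}, R^2 = {r2:.3f}, squared correlation = {corr2:.3f}")
```

With these assumed parameters the population value is beta² × Var(X) / Var(Y) = 4 / (4 + 1) = 0.8, so the printed R² should land near 0.8, and it matches the squared correlation up to rounding error.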

When you have more than one regressor (explanatory variable) in a linear
model, R² is still an estimate of Var(E(Y|X))/Var(Y); (*) will just have
more terms. I just showed the simple case here.

Also note that the interpretation of R² as the square of a correlation is
*only* meaningful when both X and Y are random. If X is fixed/chosen, you
can get whatever value of R² you want just by choosing your x values
carefully. (If their empirical variance is large, you get R² values close
to 1; if it's small, you get values close to 0.)
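
The point about fixed x values can be checked with a small simulation. The sketch below is only an illustration under assumed settings: the same model (alpha = 1, beta = 2, error standard deviation 1) is fitted twice, once with x values packed into [-0.1, 0.1] and once spread over [-10, 10]. The helper names r_squared and simulate are hypothetical.

```python
import random

def r_squared(x, y):
    """R^2 of a simple linear regression of y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    return sxy * sxy / (sxx * syy)

def simulate(lo, hi, n, rng):
    """Fixed, equally spaced x values on [lo, hi]; same error model each time."""
    x = [lo + (hi - lo) * i / (n - 1) for i in range(n)]
    y = [1.0 + 2.0 * xi + rng.gauss(0, 1) for xi in x]
    return x, y

rng = random.Random(1)
x_narrow, y_narrow = simulate(-0.1, 0.1, 200, rng)
x_wide, y_wide = simulate(-10.0, 10.0, 200, rng)

r2_narrow = r_squared(x_narrow, y_narrow)
r2_wide = r_squared(x_wide, y_wide)
print(f"narrow design: R^2 = {r2_narrow:.3f}")
print(f"wide design:   R^2 = {r2_wide:.3f}")
```

The slope and the error variance are identical in both runs; only the spread of the chosen x values differs, and that alone moves R² from near 0 to near 1.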

--
Karl Ove Hufthammer
Jerry Dallal
Posted: Fri Feb 09, 2007 11:19 am
TonyTudor@gmail.com wrote:
Quote:
Say R-squared = 89%. This is interpreted as: 89% of the variation in Y
is explained by the regression.

My question is: what "variation" in Y are we talking about here? Is
it the variance of Yi that is reduced, or ...?



In fact, nothing is "explained", only "fitted".

On the one hand, ignoring the predictors, there is the sum of squared
deviations from the sample mean (observed - mean), TSS.

On the other hand, with the predictors, there is the sum of squared
deviations from the regression hyperplane (observed - predicted), ResSS.

R squared is 1 - ResSS/TSS.
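
As a rough illustration of this definition, the Python sketch below computes TSS and ResSS for a single predictor and checks that 1 - ResSS/TSS agrees with b² × Sxx / Syy. The five data points and the function name sums_of_squares are made up for the example, and the single-predictor case is used only to keep the code short; the definition itself covers any number of predictors.

```python
def sums_of_squares(x, y):
    """Fit y = a + b*x by least squares and return (TSS, ResSS, b, Sxx)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    b = sxy / sxx
    a = mean_y - b * mean_x
    # Squared deviations from the sample mean (predictor ignored)
    tss = sum((yi - mean_y) ** 2 for yi in y)
    # Squared deviations from the fitted line (observed - predicted)
    ress = sum((yi - (a + b * xi)) ** 2 for xi, yi in zip(x, y))
    return tss, ress, b, sxx

# Made-up data, for illustration only
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

tss, ress, b, sxx = sums_of_squares(x, y)
r2_fit = 1 - ress / tss        # R^2 from the sums of squares
r2_slope = b * b * sxx / tss   # b^2 * Sxx / Syy, where Syy is the same as TSS
print(f"TSS = {tss:.4f}, ResSS = {ress:.4f}")
print(f"1 - ResSS/TSS = {r2_fit:.4f}, b^2 * Sxx / Syy = {r2_slope:.4f}")
```

The two expressions agree because, for a least-squares fit with an intercept, TSS splits exactly into the fitted sum of squares b² × Sxx plus ResSS.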
 