| |
 |
|
|
Science Forum Index » Statistics - Education Forum » Wage regression - use sample weights?
Page 1 of 1
|
| Author |
Message |
| Bob |
Posted: Thu Mar 27, 2008 12:18 am |
|
|
|
Guest
|
Hi,
I am trying to do a wage regression (ln(wage) dependent on several
individual characteristics like age, education, region, etc.) based on
household survey data and now I don't know if and why sample weights
(here: sample inflation factors, multipliers to inflate the sample to
the total population) should be used in the regression and if so, how
this is done.
I saw some references where they discussed the issue but I didn't
really understand why the one way or the other is preferred.
Theoretically, I think one could clone the individual observations
(single household) to equal the respective sample inflation factor and
adding an error term from the distribution of the subgroup sample to
each clone. But practically, the size of the data would not be
manageable.
Can anyone point me to a gentle introduction reference regarding this
issues or give me some clues?
Many thanks,
Bob |
|
|
| Back to top |
|
| Bob |
Posted: Tue Apr 01, 2008 11:39 am |
|
|
|
Guest
|
On Mar 27, 11:18 am, Bob <frott...@yahoo.com> wrote:
Quote: Hi,
I am trying to do a wage regression (ln(wage) dependent on several
individual characteristics like age, education, region, etc.) based on
household survey data and now I don't know if and why sample weights
(here: sample inflation factors, multipliers to inflate the sample to
the total population) should be used in the regression and if so, how
this is done.
I saw some references where they discussed the issue but I didn't
really understand why the one way or the other is preferred.
Theoretically, I think one could clone the individual observations
(single household) to equal the respective sample inflation factor and
adding an error term from the distribution of the subgroup sample to
each clone. But practically, the size of the data would not be
manageable.
Can anyone point me to a gentle introduction reference regarding this
issues or give me some clues?
Many thanks,
Bob
No one doing regression analysis on household survey data and knowing
about the sample weight issue?! |
|
|
| Back to top |
|
| Richard Ulrich |
Posted: Tue Apr 01, 2008 8:06 pm |
|
|
|
Guest
|
On Tue, 1 Apr 2008 14:39:32 -0700 (PDT), Bob <frotty22@yahoo.com>
wrote:
Quote: On Mar 27, 11:18 am, Bob <frott...@yahoo.com> wrote:
Hi,
I am trying to do a wage regression (ln(wage) dependent on several
individual characteristics like age, education, region, etc.) based on
household survey data and now I don't know if and why sample weights
(here: sample inflation factors, multipliers to inflate the sample to
the total population) should be used in the regression and if so, how
this is done.
SPSS, for instance, allows you to specify case-weights
in general, which are then used for (almost) every procedure.
Computer programs for surveys, I think, allow weighting
within the regression program. I don't remember if SPSS
does.
If you want the tests to be useful, at all, then the total
N after weighting is about the same as the total N
before weighting. If the weighting does very much to
distort the actual cell sizes, then the tests will be
screwed up to a corresponding degree.
Quote: I saw some references where they discussed the issue but I didn't
really understand why the one way or the other is preferred.
Theoretically, I think one could clone the individual observations
(single household) to equal the respective sample inflation factor and
adding an error term from the distribution of the subgroup sample to
each clone. But practically, the size of the data would not be
manageable.
Can anyone point me to a gentle introduction reference regarding this
issues or give me some clues?
Many thanks,
Bob
No one doing regression analysis on household survey data and knowing
about the sample weight issue?!
--
Rich Ulrich
http://www.pitt.edu/~wpilib/index.html |
|
|
| Back to top |
|
| Aniko |
Posted: Fri Apr 04, 2008 2:54 am |
|
|
|
Guest
|
On Mar 27, 5:18 am, Bob <frott...@yahoo.com> wrote:
Quote: Hi,
I am trying to do a wage regression (ln(wage) dependent on several
individual characteristics like age, education, region, etc.) based on
household survey data and now I don't know if and why sample weights
(here: sample inflation factors, multipliers to inflate the sample to
the total population) should be used in the regression and if so, how
this is done.
I saw some references where they discussed the issue but I didn't
really understand why the one way or the other is preferred.
Theoretically, I think one could clone the individual observations
(single household) to equal the respective sample inflation factor and
adding an error term from the distribution of the subgroup sample to
each clone. But practically, the size of the data would not be
manageable.
Can anyone point me to a gentle introduction reference regarding this
issues or give me some clues?
Many thanks,
Bob
The design of the survey has to be taken into account beyond the
weights, since most household surveys use quite complex designs with
multiple levels of stratification, sampling, etc. If the data come
from one of the big national surveys, they usually have some document
describing the recommended method of analysis. Usually you have to use
regression methods designed specifically for survey data to get
correct standard errors of the estimates (weighted regression will
give the right estimates, but the standard errors will be too low). I
know that Stata, SAS, R do have such facilities. In those programs you
just specify the strata, primary sampling units and weights and they
will adjust for those in the regression models.
Aniko |
|
|
| Back to top |
|
| Bob |
Posted: Tue Apr 08, 2008 11:26 pm |
|
|
|
Guest
|
Quote:
The design of the survey has to be taken into account beyond the
weights, since most household surveys use quite complex designs with
multiple levels of stratification, sampling, etc. If the data come
from one of the big national surveys, they usually have some document
describing the recommended method of analysis. Usually you have to use
regression methods designed specifically for survey data to get
correct standard errors of the estimates (weighted regression will
give the right estimates, but the standard errors will be too low). I
know that Stata, SAS, R do have such facilities. In those programs you
just specify the strata, primary sampling units and weights and they
will adjust for those in the regression models.
Aniko
Many thanks for the answers, Rich and Aniko!
I indeed know the survey design but they did not recommend any
specific regression method for this.
Apart from using these methods in a statistics package, I also would
like to understand why and how the data / weights / errors etc. are
adjusted but I found very little about this and nothing detailed
enough so that I could understand it. Of course, the main reason for
this is my insufficient knowledge of econometrics. But maybe there is
something out there describing the problems and solutions of wage
regressions on household data in detail?
Thanks,
Bob |
|
|
| Back to top |
|
| |
|
Page 1 of 1
All times are GMT - 5 Hours
The time now is Sat Oct 11, 2008 10:26 pm
|
|