Main Page | Report this Page
 
   
Science Forum Index  »  Statistics - Math Forum  »  Large Standard Errors in Multinomial Logit Estimation
Page 1 of 1    
Author Message
Thanaset
Posted: Wed Jan 03, 2007 10:21 am
Guest
Dear group,

I use multinomial logit model to predict directional change in interest
rate. The output from LIMDEP shows that the standard errors of all the
coefficients for one of the categories are extremely large, resulting
in virtually zero t-ratios. Does anyone know why this might be the
case? Any thought is much appreciated. Thank you.

Thanaset
Old Mac User
Posted: Wed Jan 03, 2007 11:18 am
Guest
This means that your data has little or no useful information regrading
that model coefficient.
OMU


Thanaset wrote:
Quote:
Dear group,

I use multinomial logit model to predict directional change in interest
rate. The output from LIMDEP shows that the standard errors of all the
coefficients for one of the categories are extremely large, resulting
in virtually zero t-ratios. Does anyone know why this might be the
case? Any thought is much appreciated. Thank you.

Thanaset
Bob O'Hara
Posted: Wed Jan 03, 2007 12:30 pm
Guest
Thanaset wrote:
Quote:
Dear group,

I use multinomial logit model to predict directional change in interest
rate. The output from LIMDEP shows that the standard errors of all the
coefficients for one of the categories are extremely large, resulting
in virtually zero t-ratios. Does anyone know why this might be the
case? Any thought is much appreciated. Thank you.

Look at the data for those categories: you might simply have few cases.

Another alternative is that the cases are all either successes or
failures, so the point estimates of the coefficients are either plus or
minus infinity. Software packages will get as close as they can, and
then give up. You should have got a warning about this:I don't know
LIMDEP, so I don't know if it does this. The estimates of the standard
errors are then awful, because they depend on the likelihood to be well
enough behaved.

Bob
Old Mac User
Posted: Wed Jan 03, 2007 3:15 pm
Guest
Bob O'Hara wrote...

"The estimates of the standard
errors are then awful, because they depend on the likelihood to be well

enough behaved."

That's a strange statement. OMU

Bob O'Hara wrote:
Quote:
Thanaset wrote:
Dear group,

I use multinomial logit model to predict directional change in interest
rate. The output from LIMDEP shows that the standard errors of all the
coefficients for one of the categories are extremely large, resulting
in virtually zero t-ratios. Does anyone know why this might be the
case? Any thought is much appreciated. Thank you.

Look at the data for those categories: you might simply have few cases.
Another alternative is that the cases are all either successes or
failures, so the point estimates of the coefficients are either plus or
minus infinity. Software packages will get as close as they can, and
then give up. You should have got a warning about this:I don't know
LIMDEP, so I don't know if it does this. The estimates of the standard
errors are then awful, because they depend on the likelihood to be well
enough behaved.

Bob
Old Mac User
Posted: Wed Jan 03, 2007 3:19 pm
Guest
Herman Rubin wrote...

"In that case, the coefficients should be 0, except for the
constant term; but the estimated variance should also be zero.
Something close to that might occur."

Why, pray tell, should the estimates be "0"? If there's very little
information regarding the model coefficient(s), hey can be "whatever"
but with extremely large confidence intervals.

In addition... what "variance should also be zero?" The variance of
what... the estimates of the model coefficients? The variance of the
residuals (I hope not!!)?

Your post is confusing, and surely not helpful to the OP. OMU


Herman Rubin wrote:
Quote:
In article <engloc$266$1@oravannahka.helsinki.fi>,
Bob O'Hara <bob.ohara@helsinki.fi> wrote:
Thanaset wrote:
Dear group,

I use multinomial logit model to predict directional change in interest
rate. The output from LIMDEP shows that the standard errors of all the
coefficients for one of the categories are extremely large, resulting
in virtually zero t-ratios. Does anyone know why this might be the
case? Any thought is much appreciated. Thank you.

Look at the data for those categories: you might simply have few cases.
Another alternative is that the cases are all either successes or
failures, so the point estimates of the coefficients are either plus or
minus infinity. Software packages will get as close as they can, and
then give up.

In that case, the coefficients should be 0, except for the
constant term; but the estimated variance should also be zero.
Something close to that might occur.

You should have got a warning about this; I don't know
LIMDEP, so I don't know if it does this. The estimates of the standard
errors are then awful, because they depend on the likelihood to be well
enough behaved.

This is something not realized; the estimates of standard
error, and the t-tests, etc, are based on asymptotic theory;
how good this is for samples of reasonable size is not clear.
One can look at the behavior of the likelihood function,
which gives a better idea than merely its second derivative
matrix at the maximum.

Bob


--
This address is for information only. I do not claim that these views
are those of the Statistics Department or of Purdue University.
Herman Rubin, Department of Statistics, Purdue University
hrubin@stat.purdue.edu Phone: (765)494-6054 FAX: (765)494-0558
Bob O'Hara
Posted: Thu Jan 04, 2007 2:01 am
Guest
Herman Rubin wrote:
Quote:
In article <engloc$266$1@oravannahka.helsinki.fi>,
Bob O'Hara <bob.ohara@helsinki.fi> wrote:
Thanaset wrote:
Dear group,

I use multinomial logit model to predict directional change in interest
rate. The output from LIMDEP shows that the standard errors of all the
coefficients for one of the categories are extremely large, resulting
in virtually zero t-ratios. Does anyone know why this might be the
case? Any thought is much appreciated. Thank you.

Look at the data for those categories: you might simply have few cases.
Another alternative is that the cases are all either successes or
failures, so the point estimates of the coefficients are either plus or
minus infinity. Software packages will get as close as they can, and
then give up.

In that case, the coefficients should be 0, except for the
constant term; but the estimated variance should also be zero.
Something close to that might occur.

I don't follow this: in effect you're suggesting that the fitted

probability of successes would be the same as at the intercept. But if
the intercept gives a fitted probability that is non-zero, then for the
classes where there are no successes, then the fitted probability should
be zero, i.e. the effect size (on the logit scale) would be minus a
large number, not the intercept probability.

Quote:
You should have got a warning about this; I don't know
LIMDEP, so I don't know if it does this. The estimates of the standard
errors are then awful, because they depend on the likelihood to be well
enough behaved.

This is something not realized; the estimates of standard
error, and the t-tests, etc, are based on asymptotic theory;
how good this is for samples of reasonable size is not clear.
One can look at the behavior of the likelihood function,
which gives a better idea than merely its second derivative
matrix at the maximum.

True: I think the likelihood surface flattens out when the fitted

probabilities approach zero, and the recommendation is to use a profile
likelihood approach to get confidence intervals. It may be that the OP
doesn't need to go this far, though.

Bob
Bob O'Hara
Posted: Thu Jan 04, 2007 2:05 am
Guest
Bob O'Hara wrote:
Quote:
Herman Rubin wrote:

<snip>
Quote:
You should have got a warning about this; I don't know
LIMDEP, so I don't know if it does this. The estimates of the
standard errors are then awful, because they depend on the likelihood
to be well enough behaved.

This is something not realized; the estimates of standard error, and
the t-tests, etc, are based on asymptotic theory;
how good this is for samples of reasonable size is not clear.
One can look at the behavior of the likelihood function,
which gives a better idea than merely its second derivative
matrix at the maximum.

True: I think the likelihood surface flattens out when the fitted
probabilities approach zero, and the recommendation is to use a profile
likelihood approach to get confidence intervals. It may be that the OP
doesn't need to go this far, though.

Just to correct myself: the quadratic approximation to the likelihood is

flatter than the true likelihood, so the asymptotic approximation from
the information matrix is poor.

Bob
Bob O'Hara
Posted: Thu Jan 04, 2007 2:06 am
Guest
Old Mac User wrote:
Quote:
Bob O'Hara wrote...

"The estimates of the standard
errors are then awful, because they depend on the likelihood to be well

enough behaved."

That's a strange statement. OMU

I was trying to avoid explaining the technicalities: see my reply to

Herman (including the correction!).

Bob
Richard Ulrich
Posted: Thu Jan 04, 2007 2:07 am
Guest
On 3 Jan 2007 06:21:04 -0800, "Thanaset" <thanaset@gmail.com> wrote:

Quote:
Dear group,

I use multinomial logit model to predict directional change in interest
rate. The output from LIMDEP shows that the standard errors of all the
coefficients for one of the categories are extremely large, resulting
in virtually zero t-ratios. Does anyone know why this might be the
case? Any thought is much appreciated. Thank you.

Is it possible that there is "no effect"?

Be reminded that logistic regression is a "large sample procedure."
Is your sample large enough, with enough cases in each category?
- 10 or 20 cases in each category, multiplied by the number of
predictors....

--
Rich Ulrich, wpilib@pitt.edu
http://www.pitt.edu/~wpilib/index.html
Old Mac User
Posted: Thu Jan 04, 2007 11:58 am
Guest
Some who posted to this OP are rambling on about the symptoms, not the
cure. The fundamental fact underlying this seems to be "there's very
little informatiokn about that model coefficient(s) in the data."
Cure: If you really want a valid estimate of that model coefficient,
then you need more data and/or different data. Period. End of story.

If the OP wants more help or advice with this, I stand ready to help.
Contact me via e-mail and I'll try to help. No jabbering. No
discussions about how flat the liklihood curve is (yes, I do know what
you are talking about)... just some straightforward answers. No
charge, of course. OMU



Bob O'Hara wrote:
Quote:
Herman Rubin wrote:
In article <engloc$266$1@oravannahka.helsinki.fi>,
Bob O'Hara <bob.ohara@helsinki.fi> wrote:
Thanaset wrote:
Dear group,

I use multinomial logit model to predict directional change in interest
rate. The output from LIMDEP shows that the standard errors of all the
coefficients for one of the categories are extremely large, resulting
in virtually zero t-ratios. Does anyone know why this might be the
case? Any thought is much appreciated. Thank you.

Look at the data for those categories: you might simply have few cases.
Another alternative is that the cases are all either successes or
failures, so the point estimates of the coefficients are either plus or
minus infinity. Software packages will get as close as they can, and
then give up.

In that case, the coefficients should be 0, except for the
constant term; but the estimated variance should also be zero.
Something close to that might occur.

I don't follow this: in effect you're suggesting that the fitted
probability of successes would be the same as at the intercept. But if
the intercept gives a fitted probability that is non-zero, then for the
classes where there are no successes, then the fitted probability should
be zero, i.e. the effect size (on the logit scale) would be minus a
large number, not the intercept probability.

You should have got a warning about this; I don't know
LIMDEP, so I don't know if it does this. The estimates of the standard
errors are then awful, because they depend on the likelihood to be well
enough behaved.

This is something not realized; the estimates of standard
error, and the t-tests, etc, are based on asymptotic theory;
how good this is for samples of reasonable size is not clear.
One can look at the behavior of the likelihood function,
which gives a better idea than merely its second derivative
matrix at the maximum.

True: I think the likelihood surface flattens out when the fitted
probabilities approach zero, and the recommendation is to use a profile
likelihood approach to get confidence intervals. It may be that the OP
doesn't need to go this far, though.

Bob
Bob O'Hara
Posted: Fri Jan 05, 2007 5:08 am
Guest
Old Mac User wrote:
Quote:
Some who posted to this OP are rambling on about the symptoms, not the
cure. The fundamental fact underlying this seems to be "there's very
little informatiokn about that model coefficient(s) in the data."

Until we know what causes the symptoms, we can't prescribe a cure.

Your fundamental fact might be wrong. As a counterexample, consider a
binomial with 10 000 trials, and no failures. There is a lot of
information about the model coefficients, but this is what R gives:

Quote:
summary(glm(cbind(10000,0)~1, family=binomial()))

Coefficients:
Estimate Std. Error z value Pr(>|z|)
(Intercept) 31.9 671088.6 4.75e-05 1


(it also warns that the fitted probabilities are numerically 0 or 1).

It may be that the OP doesn't have this problem, but we can't tell until
he comes back.

Bob
Bellinda
Posted: Fri Jan 05, 2007 1:10 pm
Guest
Old Mac User,

I'd be very interested in what you had to say about the "cure" for
this. I've encountered such problems before due to the way categories
had been defined. If you do write something, please post it as it gives
others who are interested a chance to get some insight into such
problems.

Regards,
Bellinda
Bellinda
Posted: Fri Jan 05, 2007 1:48 pm
Guest
Old Mac User,

I'd be very interested in what you had to say about the "cure" for
this. I've encountered such problems before due to the way categories
had been defined. If you do write something, please post it as it gives
others who are interested a chance to get some insight into such
problems.

Regards,
Bellinda
 
Page 1 of 1       All times are GMT - 5 Hours
The time now is Tue Oct 07, 2008 9:04 am