Main Page | Report this Page
 
   
Science Forum Index  »  Statistics - Math Forum  »  Why Score is multiplied by say 20/ln(2)?
Page 1 of 1    
Author Message
Panchu
Posted: Mon Feb 12, 2007 11:18 am
Guest
In "Credit Scoring for Risk Managers: The Handbook for Lenders" by
Elizabeth Mays, chapter 4 page 68, it says that the coeffients from
the logistic regression are multiplied by a factor equal to the number
of points for which we want a doubling of the odds to occur, and then
divided by ln(2). Is the multiplication a standard practice? Why it is
multiplied by say 20/ln(2)? How do we determine what factor to
multiply? Is 20 a standard factor?

Hope anyone can help me!

Thanks in advance!

Regards

Panchanana M.R.L.N.
David Marcus
Posted: Mon Feb 12, 2007 4:44 pm
Guest
Panchu wrote:
Quote:
In "Credit Scoring for Risk Managers: The Handbook for Lenders" by
Elizabeth Mays, chapter 4 page 68, it says that the coeffients from
the logistic regression are multiplied by a factor equal to the number
of points for which we want a doubling of the odds to occur, and then
divided by ln(2). Is the multiplication a standard practice? Why it is
multiplied by say 20/ln(2)? How do we determine what factor to
multiply? Is 20 a standard factor?

It might help if you explain what is being regressed on what.

--
David Marcus
David Winsemius
Posted: Mon Feb 12, 2007 11:24 pm
Guest
"Panchu" <panchanana@gmail.com> wrote in news:1171293538.248577.211090
@q2g2000cwa.googlegroups.com:

Quote:
In "Credit Scoring for Risk Managers: The Handbook for Lenders" by
Elizabeth Mays, chapter 4 page 68, it says that the coeffients from
the logistic regression are multiplied by a factor equal to the number
of points for which we want a doubling of the odds to occur, and then
divided by ln(2). Is the multiplication a standard practice?

That does sound fairly standard. A simplistic starting point would be :
ln(OR)=beta*(difference in variables).
If you want doubling, then set OR=2 and solve. Remember: you have a linear
model in the log-odds scale.

Quote:
Why it is multiplied by say 20/ln(2)? How do we determine what factor
to multiply? Is 20 a standard factor?

Tell us where you got 20 and perhaps someone can answer.

--
David Winsemius
Panchanana
Posted: Tue Feb 13, 2007 4:09 am
Guest
On Feb 13, 1:44 am, David Marcus <DavidMar...@alumdotmit.edu> wrote:
Quote:
Panchu wrote:
In "Credit Scoring for Risk Managers: The Handbook for Lenders" by
Elizabeth Mays, chapter 4 page 68, it says that the coeffients from
the logistic regression are multiplied by a factor equal to the number
of points for which we want a doubling of the odds to occur, and then
divided by ln(2). Is the multiplication a standard practice? Why it is
multiplied by say 20/ln(2)? How do we determine what factor to
multiply? Is 20 a standard factor?

It might help if you explain what is being regressed on what.

--
David Marcus

I will be involved in Credit Scoring project. Prior to that I am going
through thos book. It is written as follows in the book:

Once the logistic regression model has been estimated, we can put in
the X values for a given loan and calculate a scoring using the
equation
SCORE = B1*X1 + ... + Bn*Xn.
The resulting score, however, is on the hard-to-interpret
natural log scale. A standard practice in the scoring world is to
transform the score to a linear scale where a given number of points
will result in a doubling of the odds that the event will happen.
To do the transformation, we multiply the score by a factor
equal to the number of points for which we want a doubling of the odds
to occur and then divided by ln(2). Say we seek a score where the odds
of going bad are expected to double each time the score increases 20
points. To obtain the scaled score, we would multiply the score that
comes directly from the regression model by 20 and divide by the
natural log of 2. That is we would calculate
(B1*X1 + ... + Bn*Xn)*(20/ln(2))

My question is "How do we determine what factor to multiply?". I tried
through calculations, but unable to find!

--- Panchanana
David Marcus
Posted: Tue Feb 13, 2007 7:58 pm
Guest
Panchanana wrote:
Quote:
On Feb 13, 1:44 am, David Marcus <DavidMar...@alumdotmit.edu> wrote:
Panchu wrote:
In "Credit Scoring for Risk Managers: The Handbook for Lenders" by
Elizabeth Mays, chapter 4 page 68, it says that the coeffients from
the logistic regression are multiplied by a factor equal to the number
of points for which we want a doubling of the odds to occur, and then
divided by ln(2). Is the multiplication a standard practice? Why it is
multiplied by say 20/ln(2)? How do we determine what factor to
multiply? Is 20 a standard factor?

It might help if you explain what is being regressed on what.

I will be involved in Credit Scoring project. Prior to that I am going
through thos book. It is written as follows in the book:

Once the logistic regression model has been estimated,

Does the book define/describe this model?

Quote:
we can put in
the X values for a given loan and calculate a scoring using the
equation
SCORE = B1*X1 + ... + Bn*Xn.
The resulting score, however, is on the hard-to-interpret
natural log scale. A standard practice in the scoring world is to
transform the score to a linear scale where a given number of points
will result in a doubling of the odds that the event will happen.
To do the transformation, we multiply the score by a factor
equal to the number of points for which we want a doubling of the odds
to occur and then divided by ln(2). Say we seek a score where the odds
of going bad are expected to double each time the score increases 20
points. To obtain the scaled score, we would multiply the score that
comes directly from the regression model by 20 and divide by the
natural log of 2. That is we would calculate
(B1*X1 + ... + Bn*Xn)*(20/ln(2))

My question is "How do we determine what factor to multiply?". I tried
through calculations, but unable to find!

--
David Marcus
Panchanana
Posted: Wed Feb 14, 2007 5:47 am
Guest
On Feb 14, 4:58 am, David Marcus <DavidMar...@alumdotmit.edu> wrote:
Quote:
Panchananawrote:
On Feb 13, 1:44 am, David Marcus <DavidMar...@alumdotmit.edu> wrote:
Panchu wrote:
In "Credit Scoring for Risk Managers: The Handbook for Lenders" by
Elizabeth Mays, chapter 4 page 68, it says that the coeffients from
the logistic regression are multiplied by a factor equal to the number
of points for which we want a doubling of the odds to occur, and then
divided by ln(2). Is the multiplication a standard practice? Why it is
multiplied by say 20/ln(2)? How do we determine what factor to
multiply? Is 20 a standard factor?

It might help if you explain what is being regressed on what.

I will be involved in Credit Scoring project. Prior to that I am going
through thos book. It is written as follows in the book:

Once the logistic regression model has been estimated,

Does the book define/describe this model?





we can put in
the X values for a given loan and calculate a scoring using the
equation
SCORE = B1*X1 + ... + Bn*Xn.
The resulting score, however, is on the hard-to-interpret
natural log scale. A standard practice in the scoring world is to
transform the score to a linear scale where a given number of points
will result in a doubling of the odds that the event will happen.
To do the transformation, we multiply the score by a factor
equal to the number of points for which we want a doubling of the odds
to occur and then divided by ln(2). Say we seek a score where the odds
of going bad are expected to double each time the score increases 20
points. To obtain the scaled score, we would multiply the score that
comes directly from the regression model by 20 and divide by the
natural log of 2. That is we would calculate
(B1*X1 + ... + Bn*Xn)*(20/ln(2))

My question is "How do we determine what factor to multiply?". I tried
through calculations, but unable to find!

--
David Marcus- Hide quoted text -

- Show quoted text -- Hide quoted text -

- Show quoted text -


Yes, the book defined this model and in Note it is sopecified that
they have not used the regression intercept, B0, to calculate the
score!
Panchanana
Posted: Wed Feb 14, 2007 5:49 am
Guest
On Feb 13, 8:24 am, David Winsemius <doe_s...@comcast.n0T> wrote:
Quote:
"Panchu" <panchan...@gmail.com> wrote in news:1171293538.248577.211090
@q2g2000cwa.googlegroups.com:

In "Credit Scoring for Risk Managers: The Handbook for Lenders" by
Elizabeth Mays, chapter 4 page 68, it says that the coeffients from
the logistic regression are multiplied by a factor equal to the number
of points for which we want a doubling of the odds to occur, and then
divided by ln(2). Is the multiplication a standard practice?

That does sound fairly standard. A simplistic starting point would be :
ln(OR)=beta*(difference in variables).
If you want doubling, then set OR=2 and solve. Remember: you have a linear
model in the log-odds scale.

Why it is multiplied by say 20/ln(2)? How do we determine what factor
to multiply? Is 20 a standard factor?

Tell us where you got 20 and perhaps someone can answer.

--
David Winsemius

To do the calculation, intercept is not present in the model!
David Winsemius
Posted: Wed Feb 14, 2007 9:20 am
Guest
"Panchanana" <panchanana@gmail.com> wrote in
news:1171446560.161487.81860@s48g2000cws.googlegroups.com:

Quote:
On Feb 13, 8:24 am, David Winsemius <doe_s...@comcast.n0T> wrote:
"Panchu" <panchan...@gmail.com> wrote in
news:1171293538.248577.211090 @q2g2000cwa.googlegroups.com:

In "Credit Scoring for Risk Managers: The Handbook for Lenders" by
Elizabeth Mays, chapter 4 page 68, it says that the coeffients from
the logistic regression are multiplied by a factor equal to the
number of points for which we want a doubling of the odds to occur,
and then divided by ln(2). Is the multiplication a standard
practice?

That does sound fairly standard. A simplistic starting point would be
:
ln(OR)=beta*(difference in variables).
If you want doubling, then set OR=2 and solve. Remember: you have a
linear model in the log-odds scale.

Why it is multiplied by say 20/ln(2)? How do we determine what
factor to multiply? Is 20 a standard factor?

Tell us where you got 20 and perhaps someone can answer.

--
David Winsemius

To do the calculation, intercept is not present in the model!


If that was a question, then let me see if I understand it. You want to
know if I left out or forgot to consider the intercept term in error?
Then no, I did not.

The full linear model on the log odds scale would be:

logit(Y) = ln(odds(Y)) = ln(Y/1-Y)=
beta0 +beta1*X1 + beta2*X2 + beta3*X3... (1)

After the model is estimated you can ask the question: "What is the
estimated odds ratio for two cases who differ only in variable X1. Let's
say that the two values are X1_a and X1_b. The answer to that question
is:
pred(Y/(1-Y)|X1=X1a)
OR = ----------------------- = exp(beta_1*(X1_a-X1_b)) (2)
pred(Y/(1-Y)|X1=X1b)

Why no intercept? The intercept term would appear in the initial
calculations starting from the model for eq 1 in both the numerator and
the denominator, but they start out inside ln(.) and when you convert to
the odds scale, you would get beta0/beta0 = 1 as one of the terms. Any of
the other variables with identical values assumed would also have ratios
of unity and dissappear in a cloud of magical logistic smoke. Taking logs
of (2), the expression offered in my initial response then appears
immediately.

If you had two variables under question, and the differences are now
delta(.) then:
OR=exp(beta1*(delta(X1) +beta2*(delta(X2))

I did leave out a few details, but they can be supplied if further proof
is needed.

--
David Winsemius
Panchanana
Posted: Thu Mar 15, 2007 12:39 pm
Guest
On Feb 14, 6:20 pm, David Winsemius <doe_s...@comcast.n0T> wrote:
Quote:
"Panchanana" <panchan...@gmail.com> wrote innews:1171446560.161487.81860@s48g2000cws.googlegroups.com:





On Feb 13, 8:24 am, David Winsemius <doe_s...@comcast.n0T> wrote:
"Panchu" <panchan...@gmail.com> wrote in
news:1171293538.248577.211090 @q2g2000cwa.googlegroups.com:

In "Credit Scoring for Risk Managers: The Handbook for Lenders" by
Elizabeth Mays, chapter 4 page 68, it says that the coeffients from
the logistic regression are multiplied by a factor equal to the
number of points for which we want a doubling of the odds to occur,
and then divided by ln(2). Is the multiplication a standard
practice?

That does sound fairly standard. A simplistic starting point would be
:
ln(OR)=beta*(difference in variables).
If you want doubling, then set OR=2 and solve. Remember: you have a
linear model in the log-odds scale.

Why it is multiplied by say 20/ln(2)? How do we determine what
factor to multiply? Is 20 a standard factor?

Tell us where you got 20 and perhaps someone can answer.

--
David Winsemius

To do the calculation, intercept is not present in the model!

If that was a question, then let me see if I understand it. You want to
know if I left out or forgot to consider the intercept term in error?
Then no, I did not.

The full linear model on the log odds scale would be:

logit(Y) = ln(odds(Y)) = ln(Y/1-Y)=
beta0 +beta1*X1 + beta2*X2 + beta3*X3... (1)

After the model is estimated you can ask the question: "What is the
estimated odds ratio for two cases who differ only in variable X1. Let's
say that the two values are X1_a and X1_b. The answer to that question
is:
pred(Y/(1-Y)|X1=X1a)
OR = ----------------------- = exp(beta_1*(X1_a-X1_b)) (2)
pred(Y/(1-Y)|X1=X1b)

Why no intercept? The intercept term would appear in the initial
calculations starting from the model for eq 1 in both the numerator and
the denominator, but they start out inside ln(.) and when you convert to
the odds scale, you would get beta0/beta0 = 1 as one of the terms. Any of
the other variables with identical values assumed would also have ratios
of unity and dissappear in a cloud of magical logistic smoke. Taking logs
of (2), the expression offered in my initial response then appears
immediately.

If you had two variables under question, and the differences are now
delta(.) then:
OR=exp(beta1*(delta(X1) +beta2*(delta(X2))

I did leave out a few details, but they can be supplied if further proof
is needed.

--
David Winsemius- Hide quoted text -

- Show quoted text -

Since I was away I am giving the response lately. Sorry for this:
Quote:
From the explanation, I am able to make the following solution.

If there is one variable under question, and the differences is now 20
then:
{2* [P1/(1-P1)]}/ {[P1/(1-P1)]} = {exp(alpha+beta*(x+20))}/{exp(alpha
+beta*x)}
2 = exp (beta*20)
ln 2 = beta * 20
beta = (ln 2)/20

Is this correct?
If so, why are we multiplying the score by 20/(ln 2)
Please, explain.

Thanks & Best Regards
Panchanana
David Winsemius
Posted: Sat Mar 17, 2007 10:34 am
Guest
"Panchanana" <panchanana@gmail.com> wrote in
news:1173980386.980429.11080@y66g2000hsf.googlegroups.com:

Quote:
On Feb 14, 6:20 pm, David Winsemius <doe_s...@comcast.n0T> wrote:
"Panchanana" <panchan...@gmail.com> wrote
innews:1171446560.161487.81860@s48g2000cws.googlegroups.com:

On Feb 13, 8:24 am, David Winsemius <doe_s...@comcast.n0T> wrote:
"Panchu" <panchan...@gmail.com> wrote in
news:1171293538.248577.211090 @q2g2000cwa.googlegroups.com:

In "Credit Scoring for Risk Managers: The Handbook for Lenders"
by Elizabeth Mays, chapter 4 page 68, it says that the
coeffients from the logistic regression are multiplied by a
factor equal to the number of points for which we want a
doubling of the odds to occur, and then divided by ln(2). Is the
multiplication a standard practice?

snipped a bunch of stuff

Quote:
Since I was away I am giving the response lately. Sorry for this:
From the explanation, I am able to make the following solution.

If there is one variable under question, and the differences is now 20
then:
{2* [P1/(1-P1)]}/ {[P1/(1-P1)]} = {exp(alpha+beta*(x+20))}/{exp(alpha
+beta*x)}
2 = exp (beta*20)
ln 2 = beta * 20
beta = (ln 2)/20

Is this correct?

If 20 is the change in score that results in a doubling of the odds
estimate in your data, then that is correct.

Quote:
If so, why are we multiplying the score by 20/(ln 2)
Please, explain.

I have not figured out where the confusion is. First you wanted to know
whether the method made sense. It sounded sensible at least as far as one
could tell from a paraphrase. Then you wanted to know whether the 20 was a
standard factor. It is not standard, but is rather derived for the data.
The Ln(2) comes from the desire for OR doubling. Then (I think) you wanted
to know if I had forgotten the intercept. I hadn't.

Now you have derived the inverse of the expression given in your text as
equal to beta and I think that looks reasonable at least as I imagine the
problem, but I cannot go further. I do not have the text and the paraphrase
you originally offered did not really describe a complete procedure. To
decide whether the error is yours, mine, or the original author's, perhaps
you could provide a more complete quotation? You never really described
what the author's end point or goal was.

--
David Winsemius
 
Page 1 of 1       All times are GMT - 5 Hours
The time now is Mon Oct 06, 2008 12:19 pm