Main Page | Report this Page
 
   
Science Forum Index  »  Geology - Earthquakes Forum  »  Musings on evaluation
Page 3 of 4    Goto page Previous  1, 2, 3, 4  Next
Author Message
Guest
Posted: Mon Jan 28, 2008 2:52 pm
In article <n1uqp3pb156lfjetp2dvtap1djc7k99k79@4ax.com>,
Hatunen <hatunen@cox.net> wrote:

Quote:
we don't know enough about the subject to even know if there's
an error, much less gently point it out.

Why do you assume everybody here is as ignorant as you are?
Hatunen
Posted: Mon Jan 28, 2008 5:10 pm
Guest
On Mon, 28 Jan 2008 18:52:58 -0000, ellis@no.spam () wrote:

Quote:
In article <n1uqp3pb156lfjetp2dvtap1djc7k99k79@4ax.com>,
Hatunen <hatunen@cox.net> wrote:

we don't know enough about the subject to even know if there's
an error, much less gently point it out.

Why do you assume everybody here is as ignorant as you are?

Why shouldn't I?

--
************* DAVE HATUNEN (hatunen@cox.net) *************
* Tucson Arizona, out where the cacti grow *
* My typos & mispellings are intentional copyright traps *
Guest
Posted: Mon Jan 28, 2008 5:47 pm
On Jan 28, 6:06 pm, t...@sevak.isi.edu (Thomas A. Russ) wrote:
Quote:
rog...@lpbroadband.net writes:
Hi all;

I'm looking for input on this.

A number of predictors use bullseye predictive schemes, notably JOB
and Petra. This involves partial credit for quakes which fall outside
the primary prediction parameters by specified amounts.

OK, I haven't seen the form of these predictions, so that makes concrete
suggestions a bit tricky. Could you perhaps post some examples?

Greetings Dr Russ. It's nice to hear from those who have a reasonable
grasp of the problems.

Both JOB and Petra have predetermined boundaries for all 3 parameters.
JOB also defines the method of using the rings mathematically to
calculate his score. Unfortunately, his numbers are pulled out of his
(ahem!) ... hat. Petra simply uses an A, B, C rating which tells
nothing about probabilities.

You might wish to find an article of mine which was published in the
Skeptical Inquirer, vol 30 no 5 for a detailed analysis of his
predictions.

Quote:

I've tried several ways of dealing with this. I have insisted on
perfect hits with Petra and I've used multiple probabilities with JOB
but neither method satisfies anyone. It's difficult to deal with
because accurate probability is essential to determining significance
but dealing with 4 ranges with 3 parameters is a LOT of possible
answers.

From this it sounds like they want variation in all possible parameters?
That sounds like you should give up and require exact predictions. The
other solutions, given the combinatorics don't seem very palatable.

No, it's quite burdensome.


Quote:
I'm thinking that the proper way to handle this might be to consider
such a prediction to be multiple predictions, one for each possible
combination, since the predictor isn't specifying which combination to
expect. That would be devastating to their significance, as even a
perfect hit would also involve multiple misses as well. But it's
mathematically sound as far as I can tell.

Well, then perhaps that is the proper method. One would expect having a
wider prediction in time, space and size would devastate any expected
significance. It seems to be having the desired effect.

Although, the perfect hit causes "misses" seems to be at least
intuitively distasteful. Do the predictors specify how much partial
credit they want for events outside the range? Do they want 1/2 credit
if the magitude is off by no more than 1.0?

JOB wants a 10% reduction for each of 3 ranges which are 10% larger.

Then he averages the 3 individual scores to get the final score. But
it's a meaningless number .
Quote:
If so, then perhaps a possible methodology would be to compute some
awful combined probability. Using a simplified example, for a
prediction in area A, during time window T, with magnitude in the range
[M_l, M_h]:

Compute probability P_exact using those parameters.
Compute probability P_near using A, T, [M_l - 1, M_h + 1].

Combine the probabilities by using P_exact + W*(P_near - P_exact) where
W is the weight assigned to a near miss. The number in parentheses
would be the excess probability added by relaxing the boundaries.

Of course, this would have to be done over all combinations, so the
combinatorics of doing this start to look a bit horrendous, but perhaps
not any more so than if you treat them as multiple independent
hypotheses.

It amounts to 27 different possibilities for JOB to be calculated for

each prediction. Not impossible, since his predictions are pretty much
the same each month.

Quote:
In any case, since the particular dimension on which a near miss might
be claimed is not specified in advance, perhaps it would be enough to
just perform two computations:
1. The exact hit probability
2. The fully relaxed hit probability

That seems simple and also appealing, because it encapsulates the exact
and the approximate parts of the relationship. Again, a weighting
factor can be applied based on how much credit is claimed for near
misses.

The best solution is to make only specific predictions and accept only
perfect hits but that's not something I can control. It's up to the
predictors to quit wanting credit for "near enough" results.
Especially when "near enough" is quite a way away.

I agree with the best solution.

On the other hand, perhaps showing the devastating effect on
significance of including the near-misses may be what is needed to
demonstrate why it isn't a good idea to do the experiment that way.

Predictors are generally impervious to constructive criticism, which

is another way to distinguish the pseudoscientist from the real
scientist.

Quote:
Any opinions?

I think the grasping for near misses to be included is just a sign of
the self-delusion that the scientific method, controlled experimentation
and the search for statistical significance is supposed to counteract.
By allowing ex-post-facto excuses or inclusions, one is just encouraging
unscientific behavior, and it is perhaps wisest (although perhaps
quixotic) to fight against that.

Lord knows I've tried!


Quote:
Where near misses should figure in, is in the process of guiding
predictors to try to refine whatever methods they have for generating
the predictions. They should not be viewed as a way of getting credit
for misses, but rather as learning experiences from which to try to
improve in the next round of experimentation.

Agreed. I wrote an evaluation program called BESTFIT which finds the 3

quakes which best fit the given parameters for each prediction. It's
intended as a training device for would-be predictors to adjust their
ranges. Thus far, no one has expressed any interest in using it.

Quote:
Which actually brings up the next question: When the prediction method
changes, one really should start an entirely new set of experiments to
determine significance. I'm not sure that is being done.

No, it isn't.


Roger
Quote:
--
Thomas A. Russ, USC/Information Sciences Institute
Guest
Posted: Mon Jan 28, 2008 8:22 pm
In article <g4hsp358ge9ldl5i8mbdkr593vj24rtih1@4ax.com>,
Hatunen <hatunen@cox.net> wrote:

Quote:
Why shouldn't I?

Only because you'd be wrong. If that doesn't concern you,
knock yourself out.
Thomas A. Russ
Posted: Mon Jan 28, 2008 8:48 pm
Guest
rogerh@lpbroadband.net writes:
Quote:

Yes it is. That's why I need good probability numbers. But my problem
is with people who want partial credit for a 5 when they predict a 6
to 7.

Well, unfortunately, I don't think you can come up with anything
reasonable that could handle such ex-post facto claims. They should
just be treated as misses. Especially since those partial credit scores
are only claimed where there is a near miss -- in other words, quite
selectively.

If predictors really want credit for a 5, then they need to specify that
in advance, and take the hit in an increased chance of random
satisfaction.

Right now they want it both ways. The reduced random chance for a high
magnitude earthquake as well as the credit for a more commonly occuring
quake. That needs to be treated as a clear miss.


--
Thomas A. Russ, USC/Information Sciences Institute
Thomas A. Russ
Posted: Mon Jan 28, 2008 9:06 pm
Guest
rogerh@lpbroadband.net writes:

Quote:
Hi all;

I'm looking for input on this.

A number of predictors use bullseye predictive schemes, notably JOB
and Petra. This involves partial credit for quakes which fall outside
the primary prediction parameters by specified amounts.

OK, I haven't seen the form of these predictions, so that makes concrete
suggestions a bit tricky. Could you perhaps post some examples?

Quote:
I've tried several ways of dealing with this. I have insisted on
perfect hits with Petra and I've used multiple probabilities with JOB
but neither method satisfies anyone. It's difficult to deal with
because accurate probability is essential to determining significance
but dealing with 4 ranges with 3 parameters is a LOT of possible
answers.

From this it sounds like they want variation in all possible parameters?
That sounds like you should give up and require exact predictions. The
other solutions, given the combinatorics don't seem very palatable.

Quote:
I'm thinking that the proper way to handle this might be to consider
such a prediction to be multiple predictions, one for each possible
combination, since the predictor isn't specifying which combination to
expect. That would be devastating to their significance, as even a
perfect hit would also involve multiple misses as well. But it's
mathematically sound as far as I can tell.

Well, then perhaps that is the proper method. One would expect having a
wider prediction in time, space and size would devastate any expected
significance. It seems to be having the desired effect.

Although, the perfect hit causes "misses" seems to be at least
intuitively distasteful. Do the predictors specify how much partial
credit they want for events outside the range? Do they want 1/2 credit
if the magitude is off by no more than 1.0?

If so, then perhaps a possible methodology would be to compute some
awful combined probability. Using a simplified example, for a
prediction in area A, during time window T, with magnitude in the range
[M_l, M_h]:

Compute probability P_exact using those parameters.
Compute probability P_near using A, T, [M_l - 1, M_h + 1].

Combine the probabilities by using P_exact + W*(P_near - P_exact) where
W is the weight assigned to a near miss. The number in parentheses
would be the excess probability added by relaxing the boundaries.

Of course, this would have to be done over all combinations, so the
combinatorics of doing this start to look a bit horrendous, but perhaps
not any more so than if you treat them as multiple independent
hypotheses.

In any case, since the particular dimension on which a near miss might
be claimed is not specified in advance, perhaps it would be enough to
just perform two computations:
1. The exact hit probability
2. The fully relaxed hit probability

That seems simple and also appealing, because it encapsulates the exact
and the approximate parts of the relationship. Again, a weighting
factor can be applied based on how much credit is claimed for near
misses.

Quote:
The best solution is to make only specific predictions and accept only
perfect hits but that's not something I can control. It's up to the
predictors to quit wanting credit for "near enough" results.
Especially when "near enough" is quite a way away.

I agree with the best solution.

On the other hand, perhaps showing the devastating effect on
significance of including the near-misses may be what is needed to
demonstrate why it isn't a good idea to do the experiment that way.

Quote:
Any opinions?

I think the grasping for near misses to be included is just a sign of
the self-delusion that the scientific method, controlled experimentation
and the search for statistical significance is supposed to counteract.
By allowing ex-post-facto excuses or inclusions, one is just encouraging
unscientific behavior, and it is perhaps wisest (although perhaps
quixotic) to fight against that.

Where near misses should figure in, is in the process of guiding
predictors to try to refine whatever methods they have for generating
the predictions. They should not be viewed as a way of getting credit
for misses, but rather as learning experiences from which to try to
improve in the next round of experimentation.

Which actually brings up the next question: When the prediction method
changes, one really should start an entirely new set of experiments to
determine significance. I'm not sure that is being done.

--
Thomas A. Russ, USC/Information Sciences Institute
Skywise
Posted: Tue Jan 29, 2008 12:30 am
Guest
Hatunen <hatunen@cox.net> wrote in news:g4hsp358ge9ldl5i8mbdkr593vj24rtih1@
4ax.com:

Quote:
On Mon, 28 Jan 2008 18:52:58 -0000, ellis@no.spam () wrote:

In article <n1uqp3pb156lfjetp2dvtap1djc7k99k79@4ax.com>,
Hatunen <hatunen@cox.net> wrote:

we don't know enough about the subject to even know if there's
an error, much less gently point it out.

Why do you assume everybody here is as ignorant as you are?

Why shouldn't I?

BWAHHAHAHAHAHAHhahahahah!!! You're as funny as the other dolts
that frequent here.

Brian
--
http://www.skywise711.com - Lasers, Seismology, Astronomy, Skepticism
Seismic FAQ: http://www.skywise711.com/SeismicFAQ/SeismicFAQ.html
Quake "predictions": http://www.skywise711.com/quakes/EQDB/index.html
Sed quis custodiet ipsos Custodes?
Skywise
Posted: Tue Jan 29, 2008 12:32 am
Guest
tar@sevak.isi.edu (Thomas A. Russ) wrote in
news:ymi4pcxxxrv.fsf@blackcat.isi.edu:

Quote:
rogerh@lpbroadband.net writes:

Yes it is. That's why I need good probability numbers. But my problem
is with people who want partial credit for a 5 when they predict a 6
to 7.

Well, unfortunately, I don't think you can come up with anything
reasonable that could handle such ex-post facto claims. They should
just be treated as misses. Especially since those partial credit scores
are only claimed where there is a near miss -- in other words, quite
selectively.

Good point. The method of scoring "near hits" has to be agreed
upon in advance. Since we know this won't happen, it's almost
a moot point.

I have in the past given thought to a way of scoring close calls
and in all my ideas, both parties woudl have to agree to the
method.

We can't even get them to accept black & white hit & miss. Grey
zones would be like arguing politics with a 1 year old.

Brian
--
http://www.skywise711.com - Lasers, Seismology, Astronomy, Skepticism
Seismic FAQ: http://www.skywise711.com/SeismicFAQ/SeismicFAQ.html
Quake "predictions": http://www.skywise711.com/quakes/EQDB/index.html
Sed quis custodiet ipsos Custodes?
Guest
Posted: Tue Jan 29, 2008 11:09 am
Quote:
Why not allow amateur predictors the same latitude? Still require time,
place, and magnitude but add the requirement of confidence intervals. So
a range would be predicted as M5±.5, 90%, ±1 10% If an earthquake occurs
meeting the other two criteria and is M5, that gives a 90% score for
magnitude ... but if it happened with M4.4 or M5.6, that would give only
a 10% score.

IOW, draw a bullseye around the location and draw a two-dimensional
bullseye around the time and magnitude, and score for accuracy. Thus "I
was close" gets 1%.

That's what JOB does. He has a bullseye with 3 rings, each 10% larger.
The problem is how to evaluate it in a mathematically defensible way.

Quote:

And, of course, the confidence intervals would have to be tight enough
to clearly invalidate a miss that was no better than chance. Perhaps
even assign negative points for hits claimed that are too damned far
away. ("Well, it happened a week later, on the other side of the planet,
so I claim a hit!" would lose ten points.)


Yes. That's the problem. Significance calculations don't allow for it.
They require A probability, not an IF this THEN that gyration.
Quote:
--

Roger

Quote:
Timberwoof <me at timberwoof dot com>http://www.timberwoof.com
"When you post sewage, don't blame others for
emptying chamber pots in your direction." ‹Chris L.
Guest
Posted: Tue Jan 29, 2008 11:46 am
On Jan 29, 2:34 pm, Skywise <i...@oblivion.nothing.com> wrote:
Quote:
Hatunen <hatu...@cox.net> wrote in news:vu5vp3t4a4rfvehuph691cr7mr1dqev7vl@
4ax.com:

Well, of course. But close only counts in horseshoes and
handgrenades, as they say.

And nuclear war? I've always added that one to the saying.

What's needed is a clear explanation of the predictor's method,
though, before any further critique can be made.

Agreed. Like I said, both predictor and evaluator must be on
the same game table playing by the same rules.

LOL!

Nice in theory but impossible to achieve in reality. JOB will give
very specific rules but will then turn over rocks looking for ways to
get a hit. Especially where magnitude is concerned. He'll search the
world looking for the right number instead of sticking to one source.

Roger

Quote:

Brian
--http://www.skywise711.com- Lasers, Seismology, Astronomy, Skepticism
Seismic FAQ:http://www.skywise711.com/SeismicFAQ/SeismicFAQ.html
Quake "predictions":http://www.skywise711.com/quakes/EQDB/index.html
Sed quis custodiet ipsos Custodes?
Hatunen
Posted: Tue Jan 29, 2008 1:59 pm
Guest
On Tue, 29 Jan 2008 04:32:41 -0000, Skywise
<into@oblivion.nothing.com> wrote:

Quote:
tar@sevak.isi.edu (Thomas A. Russ) wrote in
news:ymi4pcxxxrv.fsf@blackcat.isi.edu:

rogerh@lpbroadband.net writes:

Yes it is. That's why I need good probability numbers. But my problem
is with people who want partial credit for a 5 when they predict a 6
to 7.

Well, unfortunately, I don't think you can come up with anything
reasonable that could handle such ex-post facto claims. They should
just be treated as misses. Especially since those partial credit scores
are only claimed where there is a near miss -- in other words, quite
selectively.

Good point. The method of scoring "near hits" has to be agreed
upon in advance. Since we know this won't happen, it's almost
a moot point.

I have in the past given thought to a way of scoring close calls
and in all my ideas, both parties woudl have to agree to the
method.

We can't even get them to accept black & white hit & miss. Grey
zones would be like arguing politics with a 1 year old.

Simply make a prediction as to range, like "between 6.0 and 7.0",
or even "6.0 or greater" and then ruthlessly exclude anything
outside the range. Specify that, e.g., only the final values from
USGS will be accepted (one has to note the various values given
for the Loma Prieta earthquake as time went by and the figures
were refined).

In most sciences one has data from a number of experimental runs
with the values recorded; The deviations from the predicted value
are processed according to fairly simple statistical methods. For
instance, if an experimental value is supposed to be 23.1 and you
have values scattered more or less around that value, you can
calculate the standard deviations and show mathematically the
confidence level for the results.

Unfortunately, this doesn't work with one-time events. It would
be possible to take all of a predictor's predictions and
calculate something similar to the standard deviation for the
accumulated results by normalizing the individual results, but
this won't give meaningful results for logarithmic values.



--
************* DAVE HATUNEN (hatunen@cox.net) *************
* Tucson Arizona, out where the cacti grow *
* My typos & mispellings are intentional copyright traps *
Skywise
Posted: Tue Jan 29, 2008 2:14 pm
Guest
Hatunen <hatunen@cox.net> wrote in
news:bppup39asup0fnqefnfsnq8p60hpf2gjno@4ax.com:

Quote:
Simply make a prediction as to range, like "between 6.0 and 7.0",
or even "6.0 or greater" and then ruthlessly exclude anything
outside the range.

And what happens when a M5.9 or M7.1 quake strikes within the time
and location of the perdiction?

Brian
--
http://www.skywise711.com - Lasers, Seismology, Astronomy, Skepticism
Seismic FAQ: http://www.skywise711.com/SeismicFAQ/SeismicFAQ.html
Quake "predictions": http://www.skywise711.com/quakes/EQDB/index.html
Sed quis custodiet ipsos Custodes?
Hatunen
Posted: Tue Jan 29, 2008 3:36 pm
Guest
On Tue, 29 Jan 2008 18:14:30 -0000, Skywise
<into@oblivion.nothing.com> wrote:

Quote:
Hatunen <hatunen@cox.net> wrote in
news:bppup39asup0fnqefnfsnq8p60hpf2gjno@4ax.com:

Simply make a prediction as to range, like "between 6.0 and 7.0",
or even "6.0 or greater" and then ruthlessly exclude anything
outside the range.

And what happens when a M5.9 or M7.1 quake strikes within the time
and location of the perdiction?

What does "ruthlessly" mean?

What if there's and M5.5? Or a M5.2? Should they count?

The predictor is free to make his bounds as loose or as tight as
he (or she) wishes, and if his method is any good he will be able
to set those bounds. If his method may lead to this sort of
result he/she needs to cover it, perhaps by explicitly defining
such deviations as may be acceptable. The important thing is
clear criteria which do not have to be fudged in any way, post
facto, to make the facts fit the theory.

One problem is that it is so easy to toss around simple numbers
like 8.0 without accounting for what they really stand for. In my
example of "between 6.0 and 7.0" energy values of over thirty
times are being covered. Your example of 5.9 would be an
earthquake of something like one-third the energy of the 6.0.

If you were to say "5.0 to 7.0" then you are covering an energy
range of 1000:1. Any proper theory should be able to fit into a
range as wide as these. In fact, from a sort of common sense
standpoint, it would seem that any predictive theory should be
able to cover an energy release range of no more than, say, 10:1.


--
************* DAVE HATUNEN (hatunen@cox.net) *************
* Tucson Arizona, out where the cacti grow *
* My typos & mispellings are intentional copyright traps *
Timberwoof
Posted: Tue Jan 29, 2008 4:54 pm
Guest
In article <13pur86stknrka9@corp.supernews.com>,
Skywise <into@oblivion.nothing.com> wrote:

Quote:
Hatunen <hatunen@cox.net> wrote in
news:bppup39asup0fnqefnfsnq8p60hpf2gjno@4ax.com:

Simply make a prediction as to range, like "between 6.0 and 7.0",
or even "6.0 or greater" and then ruthlessly exclude anything
outside the range.

And what happens when a M5.9 or M7.1 quake strikes within the time
and location of the perdiction?

When rocket scientists present the expected landing areas of their space
craft, they give them in terms of probabilities: 90% within this
ellipse, 99% within this larger one, 99.99% on the planet somewhere...
Likewise, when astronomers predict impacts of asteroids on Mars, they
give likelihoods within target areas.

Indeed, mainstream geological predictions of earthquakes are along the
lines of "66% likelihood of an earthquake M6 or greater on the Hayward
fault in the next 33 years." (Not the actual prediction, but it makes
the point.)

Why not allow amateur predictors the same latitude? Still require time,
place, and magnitude but add the requirement of confidence intervals. So
a range would be predicted as M5±.5, 90%, ±1 10% If an earthquake occurs
meeting the other two criteria and is M5, that gives a 90% score for
magnitude ... but if it happened with M4.4 or M5.6, that would give only
a 10% score.

IOW, draw a bullseye around the location and draw a two-dimensional
bullseye around the time and magnitude, and score for accuracy. Thus "I
was close" gets 1%.

And, of course, the confidence intervals would have to be tight enough
to clearly invalidate a miss that was no better than chance. Perhaps
even assign negative points for hits claimed that are too damned far
away. ("Well, it happened a week later, on the other side of the planet,
so I claim a hit!" would lose ten points.)

--
Timberwoof <me at timberwoof dot com> http://www.timberwoof.com
"When you post sewage, don't blame others for
emptying chamber pots in your direction." ‹Chris L.
Skywise
Posted: Tue Jan 29, 2008 5:06 pm
Guest
Timberwoof <timberwoof.spam@inferNOnoSPAMsoft.com> wrote in
news:timberwoof.spam-F92627.12543129012008@nnrp-virt.nntp.sonic.net:

Quote:
In article <13pur86stknrka9@corp.supernews.com>,
Skywise <into@oblivion.nothing.com> wrote:

Hatunen <hatunen@cox.net> wrote in
news:bppup39asup0fnqefnfsnq8p60hpf2gjno@4ax.com:

Simply make a prediction as to range, like "between 6.0 and 7.0",
or even "6.0 or greater" and then ruthlessly exclude anything
outside the range.

And what happens when a M5.9 or M7.1 quake strikes within the time
and location of the perdiction?

When rocket scientists present the expected landing areas of their space
craft, they give them in terms of probabilities: 90% within this
ellipse, 99% within this larger one, 99.99% on the planet somewhere...
Likewise, when astronomers predict impacts of asteroids on Mars, they
give likelihoods within target areas.

<Snipola of rest>

You make a good point, and I do think it can be done that way.

First, technically, how do we statistically analyse such
predictions to determine if they are better than chance, or not?

But more importantly, and this is the deal breaker, even if
we do develop such a method of analysis, how do we get the
predictors to accept it? So many of them dismiss anything a
"scientist" has to say.

Brian
--
http://www.skywise711.com - Lasers, Seismology, Astronomy, Skepticism
Seismic FAQ: http://www.skywise711.com/SeismicFAQ/SeismicFAQ.html
Quake "predictions": http://www.skywise711.com/quakes/EQDB/index.html
Sed quis custodiet ipsos Custodes?
 
Page 3 of 4    Goto page Previous  1, 2, 3, 4  Next   All times are GMT - 5 Hours
The time now is Thu Dec 04, 2008 7:44 pm