Main Page | Report this Page
Computers Forum Index  »  Computer Artificial Intelligence - Neural Nets  »  worry about "overfitting"?...
Page 1 of 1    

worry about "overfitting"?...

Author Message
MBALOVER...
Posted: Sun Oct 18, 2009 12:47 am
Guest
Hello,

I fit a 3-order polynomial (with 33 coefficients) to a training data
set that contains 50 samples. I am worried that the resulting model
may be overfitted since the training set is not too large.

So, I try to test the resulting model with another set of 50 samples.
There is no common sample between the training and testing sets. The
testing error still looks good.

Do you think that if I should still be worried about overfitting? Or
is the validation step shows that there is no overfitting problem with
my model?

Thanks
 
Greg Heath...
Posted: Sun Oct 18, 2009 1:19 am
Guest
On Oct 17, 8:47 pm, MBALOVER <mbalov... at (no spam) gmail.com> wrote:
Quote:
Hello,

I fit a 3-order polynomial (with 33 coefficients)

???

A 3rd order polynomial has 4 coefficients.


Hope this helps.

Greg
 
amzoti...
Posted: Sun Oct 18, 2009 1:22 am
Guest
On Oct 17, 5:47 pm, MBALOVER <mbalov... at (no spam) gmail.com> wrote:
Quote:
Hello,

I fit a 3-order polynomial (with 33 coefficients) to a training data
set that contains 50 samples. I am worried that the resulting model
may be overfitted since the training set is not too large.


Thanks

Did you mean with 33 data points here?

Your statement is very confusing otherwise.

~A
 
MBALOVER...
Posted: Sun Oct 18, 2009 1:50 am
Guest
I am sorry for making confusion. OK, let's forget about the order of
the polynomial.

Let's say that the model has 33 coefficients. I train the model with
50 training samples. I test the model with another 50 samples. (No
common sample between the training and testing sets).
The validation error still looks good.

My question is that if I should still care about overfitting. I hope I
should not since the validation error looks good.


Thanks,






The model is a matrix with

On Oct 17, 8:22 pm, amzoti <amz... at (no spam) gmail.com> wrote:
Quote:
On Oct 17, 5:47 pm, MBALOVER <mbalov... at (no spam) gmail.com> wrote:

Hello,

I fit a 3-order polynomial (with 33 coefficients) to a training data
set that contains 50 samples. I am worried that the resulting model
may be overfitted since the training set is not too large.

Thanks

Did you mean with 33 data points here?

Your statement is very confusing otherwise.

~A
 
Greg Heath...
Posted: Sun Oct 18, 2009 3:14 am
Guest
On Oct 17, 9:50 pm, MBALOVER <mbalov... at (no spam) gmail.com> wrote:
Quote:
I am sorry for making confusion. OK, let's forget about the order of
the polynomial.

Let's say that the model has 33 coefficients. I train the model with
50 training samples. I test the model with another 50 samples. (No
common sample between the training and testing sets).

Incorrect terminology.

You have 1 sample with 50 cases (or observations)
for training and another sample of 50 cases for validation.

Quote:
The validation error still looks good.

My question is that if I should still care about overfitting.
I hope I should not since the validation error looks good.

You should care. Even though the validation error "looks good",
whatever that is supposed to mean.

A more definitive answer depends on the topology
of the model, the coefficient estimation algorithm
and the probability distribution of the population.

Quote:
The model is a matrix with

33 what?

Greg
 
Phil Sherrod...
Posted: Sun Oct 18, 2009 10:20 pm
Guest
On 17-Oct-2009, Greg Heath <heath at (no spam) alumni.brown.edu> wrote:

Quote:
I fit a 3-order polynomial (with 33 coefficients)

???

A 3rd order polynomial has 4 coefficients.

Only if there is only a single independent variable. Maybe he has 11
independen variables and a constant (which would require 34, not 33).

--
Phil Sherrod
http://www.dtreg.com -- Neural networks, SVM, Decision trees
 
Greg Heath...
Posted: Mon Oct 19, 2009 8:33 am
Guest
On Oct 17, 9:50 pm, MBALOVER <mbalov... at (no spam) gmail.com> wrote:
Quote:
I am sorry for making confusion. OK, let's forget about the order of
the polynomial.

Let's say that the model has 33 coefficients. I train the model with
50 training samples. I test the model with another 50 samples. (No
common sample between the training and testing sets).
The validation error still looks good.

My question is that if I should still care about overfitting. I hope I
should not since the validation error looks good.

Thanks,

The model is a matrix with

It would help if you gave more details:
What kind of problem?
How many input variables?
How many output variables?
What kind of model?
What kind of estimation algorithm?

etc

Hope this helps.

Greg
 
MBALOVER...
Posted: Wed Oct 21, 2009 2:34 am
Guest
Hi Greg and others,

Your answers are helpful.

Greg, if I do K-cross validation and stilll get a good result. Do you
think if overfitting still probably occurs.

Sorry that I could not describe the model here, it is complicated to
describe by text.

Thanks



On Oct 19, 3:33 am, Greg Heath <he... at (no spam) alumni.brown.edu> wrote:
Quote:
On Oct 17, 9:50 pm, MBALOVER <mbalov... at (no spam) gmail.com> wrote:

I am sorry for making confusion. OK, let's forget about the order of
the polynomial.

Let's say that the model has 33 coefficients. I train the model with
50 training samples. I test the model with another 50 samples. (No
common sample between the training and testing sets).
The validation error still looks good.

My question is that if I should still care about overfitting. I hope I
should not since the validation error looks good.

Thanks,

The model is a matrix with

It would help if you gave more details:
What kind of problem?
How many input variables?
How many output variables?
What kind of model?
What kind of estimation algorithm?

etc

Hope this helps.

Greg
 
Greg Heath...
Posted: Wed Oct 21, 2009 4:44 am
Guest
On Oct 20, 10:34 pm, MBALOVER <mbalov... at (no spam) gmail.com> wrote:
Quote:
Hi Greg and others,

Your answers are helpful.

Greg, if I do K-cross validation and stilll get a good result. Do you
think if overfitting still probably occurs.

Sorry that I could not describe the model here, it is complicated to
describe by text.

Thanks

On Oct 19, 3:33 am, Greg Heath <he... at (no spam) alumni.brown.edu> wrote:



On Oct 17, 9:50 pm, MBALOVER <mbalov... at (no spam) gmail.com> wrote:

I am sorry for making confusion. OK, let's forget about the order of
the polynomial.

Let's say that the model has 33 coefficients. I train the model with
50 training samples. I test the model with another 50 samples. (No
common sample between the training and testing sets).
The validation error still looks good.

My question is that if I should still care about overfitting. I hope I
should not since the validation error looks good.

Thanks,

The model is a matrix with

It would help if you gave more details:
What kind of problem?
How many input variables?
How many output variables?
What kind of model?
What kind of estimation algorithm?

Please do not top post. It makes it difficult to follow the thread.

Hard to answer your question without more details.

Greg
 
 
Page 1 of 1    
All times are GMT
The time now is Tue Dec 08, 2009 5:46 pm