I'm using this library

http://pastebin.com/raw.php?i=aMtVv4RZ

to implement a learning agent.

I have generated the training cases, but I don't know exactly what the validation and test sets are. The teacher says:

70% should be training cases, 10% will be test cases, and the remaining 20% should be validation cases.
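For reference, one simple way to produce such a split is to shuffle the cases and cut them at 70% and 80%. This helper is purely illustrative (it is not part of the linked library):

```python
import random

def split_cases(cases, seed=0):
    """Shuffle cases, then cut them into 70% train / 10% test / 20% validation."""
    cases = list(cases)
    random.Random(seed).shuffle(cases)  # fixed seed for reproducibility
    n = len(cases)
    n_train = int(n * 0.7)
    n_test = int(n * 0.1)
    train = cases[:n_train]
    test = cases[n_train:n_train + n_test]
    validation = cases[n_train + n_test:]
    return train, test, validation

train, test, validation = split_cases(range(100))
print(len(train), len(test), len(validation))  # 70 10 20
```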

Thanks.

edit

I have this code for training, but I have no idea when to stop training:

  def train(self, train, validation, N=0.3, M=0.1):
    # N: learning rate
    # M: momentum factor
    accuracy = list()
    while True:
        error = 0.0
        for p in train:
            input, target = p
            self.update(input)
            error = error + self.backPropagate(target, N, M)
        print "validation"
        total = 0
        for p in validation:
            input, target = p
            output = self.update(input)
            # sum of absolute differences between target and output
            total += sum([abs(t - o) for t, o in zip(target, output)])

        accuracy.append(total)
        print min(accuracy)
        print sum(accuracy[-5:]) / 5
        #if i % 100 == 0:
        print 'error %-14f' % error
        if ? < ?:  # <- this is what I don't know: the stopping condition
            break
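Since the open question is the stopping condition, here is one common rule (my own hypothetical sketch, not from the linked library): stop when the validation error has not improved on its previous best by some margin for a few consecutive epochs. In Python 3 syntax:

```python
def should_stop(val_errors, patience=5, min_delta=1e-3):
    """Return True when the last `patience` validation errors have not
    improved on the best earlier error by at least `min_delta`."""
    if len(val_errors) <= patience:
        return False  # not enough history yet
    best_before = min(val_errors[:-patience])   # best error before the window
    recent_best = min(val_errors[-patience:])   # best error inside the window
    return recent_best > best_before - min_delta
```

In the loop above, `accuracy.append(total)` already collects the per-epoch validation error, so the `if ? < ?:` line could become something like `if should_stop(accuracy): break`.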

edit

I can get an average error of 0.2 on the validation data after maybe 20 training iterations. Does that correspond to 80% accuracy?

average error = sum of absolute differences between the validation targets and the outputs produced for the validation inputs, divided by the size of the validation set
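The formula above, written out as a standalone function (illustrative only; `targets` and `outputs` are parallel lists of per-case vectors):

```python
def average_error(targets, outputs):
    """Sum of absolute target/output differences, averaged over cases."""
    total = sum(
        sum(abs(t - o) for t, o in zip(target, output))
        for target, output in zip(targets, outputs)
    )
    return total / len(targets)

targets = [[1, 0, 0], [0, 1, 0]]
outputs = [[0.9, 0.1, 0.0], [0.2, 0.7, 0.1]]
print(average_error(targets, outputs))  # ~0.4
```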

1
        avg error 0.520395
        validation 0.246937882684
2
        avg error 0.272367
        validation 0.228832420879
3
        avg error 0.249578
        validation 0.216253590304
        ...
22
        avg error 0.227753
        validation 0.200239244714
23
        avg error 0.227905
        validation 0.199875013416
A: 

I believe that in training mode you allow the nodes of your network to alter their input or output weights, and you provide positive or negative feedback to drive those changes. In other words, you present an input set, compare the network's output against the known answer (effectively XOR the output with the known true/false and then NOT the result), and give positive feedback when the answers match and negative feedback when they disagree.

I'm not sure what the difference between test and validation cases is, other than that perhaps you know the answers to the validation cases and use them to validate the output, whereas you don't know the answers to the test cases and just accept the answers from your validated neural net...

Zak
+8  A: 

The training and validation sets are used during training.

for each epoch
    for each training data instance
        propagate error through the network
        adjust the weights
    calculate the accuracy over the training data
    for each validation data instance
        calculate the accuracy over the validation data
    if the threshold validation accuracy is met
        exit training
    else
        continue training
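A minimal Python sketch of the loop above, assuming a `net` object with the same `update`/`backPropagate` interface as the asker's code and a hypothetical `accuracy` helper that treats the largest output as the predicted class:

```python
def accuracy(net, data):
    """Fraction of cases where the largest output matches the target class."""
    correct = 0
    for inputs, target in data:
        output = net.update(inputs)
        if output.index(max(output)) == target.index(max(target)):
            correct += 1
    return correct / len(data)

def train_until_valid(net, train_set, validation_set,
                      N=0.3, M=0.1, threshold=0.9, max_epochs=1000):
    for epoch in range(max_epochs):
        for inputs, target in train_set:
            net.update(inputs)                # propagate
            net.backPropagate(target, N, M)   # adjust the weights
        if accuracy(net, validation_set) >= threshold:
            break                             # validation threshold met
    return net
```

`max_epochs` is a safety cap so training terminates even if the threshold is never reached.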

Once you're finished training, then you run against your testing set and verify that the accuracy is sufficient.

Training Set: this data set is used to adjust the weights on the neural network.

Validation Set: this data set is used to minimize overfitting. You're not adjusting the weights of the network with this data set; you're just verifying that any increase in accuracy over the training data set actually yields an increase in accuracy over a data set the network has not been trained on (i.e. the validation data set). If the accuracy over the training data set increases, but the accuracy over the validation data set stays the same or decreases, then you're overfitting your neural network and you should stop training.

Testing Set: this data set is used only for testing the final solution in order to confirm the actual predictive power of the network.

Lirik
can you give a look at my code?
Daniel
@Daniel, what language is that? I'm not familiar with that syntax...
Lirik
it's Python :x I just can't get a stopping criterion.. the values converge, but always with some fluctuation..
Daniel
@Daniel, does the training accuracy fluctuate, or the validation accuracy? It's possible that your validation accuracy fluctuates, but it's less likely that the training accuracy would. When you say "input, target = p", does that mean you're setting both to p?
Lirik
I'm not very good with python, so the code looks a little confusing to me... in general you want to stop training when your validation accuracy meets a certain threshold, say 70% or 90%, whatever makes sense for the domain of your data.
Lirik
p is a list, like [[1, 0, 1, 0, 1], [1, 0, 0]], so input, target = p is equivalent to input = p[0]; target = p[1], i.e. input = [1, 0, 1, 0, 1] and target = [1, 0, 0]
Daniel
posted some data
Daniel