validation sets vs test sets
Seemab Janjua
on 16 Dec 2015
Answered: Greg Heath
on 17 Dec 2015
What is the difference between validation and test datasets?
Accepted Answer
Sean de Wolski
on 16 Dec 2015
I assume you're talking about Neural Networks?
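If so, the validation set is monitored during training so the network can decide when training is complete and avoid overfitting. The test set is an independent set that plays no part in training and is used only to measure performance afterward.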
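To make the "decide when training is complete" part concrete, here is a minimal sketch of validation-based early stopping. It is not the toolbox's implementation; the function name `early_stopping_epoch`, the `patience` parameter, and the error values are all made up for illustration.

```python
def early_stopping_epoch(val_errors, patience=3):
    """Return (best_epoch, stop_epoch) for per-epoch validation errors.

    Training stops once the validation error has failed to improve on its
    best value for `patience` consecutive epochs (a hypothetical setting).
    """
    best_error = float("inf")
    best_epoch = -1
    fails = 0
    for epoch, err in enumerate(val_errors):
        if err < best_error:
            # New best validation error: remember it and reset the counter.
            best_error, best_epoch, fails = err, epoch, 0
        else:
            fails += 1
            if fails >= patience:
                return best_epoch, epoch
    # Never ran out of patience: training ran through the last epoch.
    return best_epoch, len(val_errors) - 1


# Illustrative validation curve: improves, then starts to get worse.
val_curve = [0.90, 0.60, 0.45, 0.40, 0.42, 0.43, 0.47, 0.50]
best, stopped = early_stopping_epoch(val_curve, patience=3)
print("best epoch:", best, "stopped at epoch:", stopped)
```

The weights kept would be those from the best epoch, not the epoch where training stopped. The test set is not consulted anywhere in this loop.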
More Answers (1)
Greg Heath
on 17 Dec 2015
Total = Design + Nondesign
Design = Training + Validation
Nondesign = Testing
Total = Training + Nontraining
Nontraining = Validation + Testing
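The partition above can be sketched in a few lines of Python. This is only an illustration: the 70/15/15 ratios, the function name `split_indices`, and the seed are assumptions, not something stated in this thread.

```python
import random


def split_indices(n_total, train_frac=0.70, val_frac=0.15, seed=0):
    """Randomly split sample indices into training, validation and test subsets."""
    idx = list(range(n_total))
    random.Random(seed).shuffle(idx)
    n_train = round(train_frac * n_total)
    n_val = round(val_frac * n_total)
    train = idx[:n_train]
    val = idx[n_train:n_train + n_val]
    test = idx[n_train + n_val:]  # whatever remains
    return train, val, test


train, val, test = split_indices(100)

design = train + val        # Design = Training + Validation
nondesign = test            # Nondesign = Testing
nontraining = val + test    # Nontraining = Validation + Testing

print(len(train), len(val), len(test))
```

The three subsets are disjoint and together cover every sample, so both decompositions of Total hold.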
Overfitting: Using more weights and biases than necessary
Overtraining: Improving performance on the training data at the expense of deteriorating performance on nontraining data
Training data subset: Used to DIRECTLY estimate weights and biases. Performance estimates are BIASED.
Validation data subset: Used to
(1) determine when overtraining an overfit net begins to occur AND
(2) rank multiple designs.
Performance estimates are SIGNIFICANTLY LESS BIASED than training data estimates.
Test data subset: Used to obtain UNBIASED ESTIMATES of performance on nontraining (INCLUDING UNSEEN!) data
HOPE THIS HELPS
GREG
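One way to see why validation estimates are less biased than training estimates but still not unbiased, while test estimates are, is a small simulation of ranking multiple designs. The sketch below is a toy under stated assumptions, not a real network: every candidate "design" has the same true error rate, and the function name `simulate_selection_bias` and all parameter values are invented for illustration.

```python
import random


def simulate_selection_bias(n_models=20, n_val=50, n_test=50,
                            true_error=0.5, n_trials=200, seed=0):
    """Pick the candidate with the lowest validation error in each trial and
    return the mean validation and mean test error of the picked candidate."""
    rng = random.Random(seed)

    def error_rate(n):
        # Fraction of n independent samples misclassified at the true error rate.
        return sum(rng.random() < true_error for _ in range(n)) / n

    val_sum = 0.0
    test_sum = 0.0
    for _ in range(n_trials):
        # Every candidate is equally good; store (val_error, test_error) for each.
        scores = [(error_rate(n_val), error_rate(n_test)) for _ in range(n_models)]
        # Rank the designs by validation error only.
        best_val, best_test = min(scores, key=lambda s: s[0])
        val_sum += best_val
        test_sum += best_test
    return val_sum / n_trials, test_sum / n_trials


mean_val, mean_test = simulate_selection_bias()
print(f"validation estimate of the selected design: {mean_val:.3f}")
print(f"test estimate of the selected design:       {mean_test:.3f}")
```

Because the validation set was used to choose the winner, its error for that winner comes out optimistic, while the test error, which played no role in the choice, stays close to the true rate.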