validation sets vs test sets
Show older comments
what is difference between validation and test datasets ?
Accepted Answer
More Answers (1)
Greg Heath
on 17 Dec 2015
Total = Design + Nondesign
Design = Training + Validation
Nondesign = Testing
Total = Training + Nontraining
Nontraining = Validation + Testing
Overfitting: Using more weights and biases than necessary
Overtraining: Improving the performance of the training data at the expense of deteriorating the performance
on nontraining data
Training data subset: Used to DIRECTLY estimate weights and biases. Performance estimates are BIASED.
Validation data subset: Used to
(1) determine when overtraining an overfit net begins to occur AND
(2) rank multiple designs.
Performance estimates are SIGNIFICANTLY LESS BIASED than training data estimates.
Test data subset: Used to obtain UNBIASED ESTIMATES of performance on nontraining (INCLUDING UNSEEN!) data
HOPE THIS HELPS
GREG
Categories
Find more on Deep Learning Toolbox in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!