DataSets
Get more data: can flip images, add random zooms and crops, and add distortion
Normalize:
Subtract mean: makes 0 mean
Normalize variance for every input
Normalize same for test and train
Train/Dev/Test Sets
Train with your training data
Dev to see which models perform bst
Test is to actually give you an accuracy rating for your chosen model rather than overfitting dev
With small data go 60/20/20
With huge data like 1,000,000 can leave 10,000 for dev and 10,000 for test giving a 98/1/1 split
Bias Vs. Variance
High Bias == underfitting
High Variance ==overfitting
Can simulteously have high bias and high variance
Train 1% | 15% | 15%
Dev 11% | 16% | 30%
high var high bi high var and bi
Changes based on optimal value, above assumes optimal is 0
High Bias?
Bigger Network
Train Longer
High Variance?
More data
Regularization
Last updated