Model Assessment & Selection
Model Complexity
Errτ​is a test error, and Erris an expected test error.

In this situation, we all talk about random data so we can't get the exact value of this error. Conditioned on T, random elements of Tbecome realization.
T={(x1​,y1​),…,(xN​,yN​)} It is a realization version of random quantity. X0,Y0refer to sample randomly chosen from test set. We want to predict Errτ​, but it is hard to predict. Instead of Errτ​, we'll predict the expected error Err.
We predict this error for two reasons:
Model selection: estimating the performance of different models in order to choose the best one.
Model assessment: having chosen a final model, estimating its prediction error(generalization error) on new data.
In an ideal situation, we split our data into three parts: Train(0.5), Validation(0.25), and Test(0.25). In train set we fit our model to data, and select model in validation set(Most well performed model in validation set). After that, we predict ErrT​ of our final model and assess this model.
Last updated