Homepage › Solution manuals › Yaser Abu-Mostafa › Learning from Data › Exercise 6.5
Exercise 6.5
Answers
- (a)
- Since we are selecting hypothesis from a fixed set of ,
there are
hypotheses and the validation data set is the ’input data set’, which has a
size of .
We apply the generalization bound equation (2.1), and for any ,
we have
If we assume , then we have
Since is the hypothesis with minimum validation error , so we have
On the other hand, minimizes , so we always have
Compare the two inequalities, we conclude .
- (b)
- If ,
according to Theorem 6.2, we can find a ,
such that
and ,
then we know that
and .
Since is the optimal out-of-sample error we can ever achieve, so we know that , by problem (a), we thus conclude
- (c)
- If we used the rule on the full data set , we would see performance improvement because the learning curve tells us we should use more data to achieve better performance.