
Exercise 7.12

Answers

"More data is better" applies to a fixed model $(H, A)$. With early stopping, however, we are selecting among nested hypothesis sets $H_{1} \subset H_{2} \subset \dots$ that are determined by $D_{train}$: at each step $t$, the weight $w_{t}$ is chosen as the one with minimal in-sample error on $D_{train}$. If we use the full data $D$ instead, the weights $w_{1}, w_{2}, \dots$ will be different and, as a result, the hypothesis sets will change even if we keep the step size $\eta$ the same.

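That's why "more data is better" doesn't apply here: the stopping point $t^{*}$ was validated for the hypothesis sets built from $D_{train}$, and training on $D$ for $t^{*}$ iterations would search a different sequence of sets.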
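The dependence of the iterates on the data can be checked numerically. The sketch below is only an illustration under assumptions that are not part of the exercise: a linear model with squared error, synthetic data, a fixed-length gradient step of size `eta`, and a hypothetical helper `gd_iterates`. It runs the same procedure from the same $w_{0}$ on $D_{train}$ and on $D$ and measures how far apart the two weight paths are at each step.

```python
import numpy as np

def gd_iterates(X, y, w0, eta, steps):
    """Return the path [w_0, ..., w_steps] of fixed-length gradient steps
    on the mean squared error (hypothetical helper, for illustration only)."""
    w = w0.copy()
    path = [w.copy()]
    for _ in range(steps):
        grad = 2.0 / len(y) * X.T @ (X @ w - y)
        # Move a distance of exactly eta along the negative gradient.
        w = w - eta * grad / np.linalg.norm(grad)
        path.append(w.copy())
    return np.array(path)

# Synthetic data set D (assumed, not from the exercise).
rng = np.random.default_rng(0)
N, d = 40, 3
X = rng.normal(size=(N, d))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.3 * rng.normal(size=N)

w0 = np.zeros(d)
eta, steps = 0.05, 20

# D_train is the first 30 points; the remaining 10 would be the validation set.
X_train, y_train = X[:30], y[:30]

path_train = gd_iterates(X_train, y_train, w0, eta, steps)
path_full = gd_iterates(X, y, w0, eta, steps)

# Distance between the two weight paths at each step t.
gap = np.linalg.norm(path_train - path_full, axis=1)
print(gap)
```

Both paths start at the same $w_{0}$, so the gap is zero at $t = 0$, but already $w_{1}$ differs because the two gradients are computed from different data. The regions explored after $t$ steps therefore differ between the two runs, even though `eta` is identical.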

niuers

2021-12-08 09:55
