Exercise 8.14

Answers

If we remove a data point (xn,yn) with αn = 0, suppose the previous optimal solution is α.

(a)
Since α is the optimal solution for the previous dual problem, It satisfies the constraints in (8.21). i.e. αi 0 for i = 1,,N. And

i=1Nyiαi = inNyiαi = 0 since αn = 0. The second part is exactly the new constraint for the problem with (xn,yn) removed.

So the solution α (after removing αn) is feasible for the new dual problem.

(b)
If there’s another feasible solution (α) for the new dual and it has a lower objective value than α. We construct a new solution for previous dual problem by adding αn = 0 into α, i.e. αc. It’s clear that αc is a feasible solution for the previous dual problem.

From (8.21), the objective value of αc for previous dual problem is thus:

V (αc) = 1 2 i=1N j=1Ny iyjαicα jcx iTx j i=1Nα ic = 1 2 inN jnNy iyjαicα jcx iTx j inNα ic < 1 2 inN jnNy iyjαiα jx iTx j inNα i = V (α)

This contradicts the fact that α is the optimal solution for the previous problem. So we conclude there’s no other feasible solution for the new dual problem that has a lower objective value than α.

(c)
Hence we showed that α (minus αn) is optimal for the new dual problem.
(d)
Since w = i=1Nyiαixi = inNyiαixi. w is the same as previous problem. Also b is computed using a point where αs > 0, it’s not affected by αn as well. So we conclude that the optimal hyperplane doesn’t change.
(e)
As the final hypothesis is not changed when we throw out any data point with αn = 0, after we throw out all such points, we are left with data points that have αn > 0, thus we shows that ECV = 1 N n=1Nen number of αn>0 N .
User profile picture
2021-12-08 10:13
Comments