Exercise 8.3

Answers

If h is the optimal hyperplane, its wh minimizes wTw. Also, for a separating hyperplane, the data point that is nearest to the hyperplance has distance 1 |wh|. Assume ρ+ > ρ, so we should have ρ = 1 |wh|.

Let l = ρ+ρ 2 , and ρ+ = ρ+ l = ρ++ρ 2 > 1 |wh|, ρ = ρ + l = ρ++ρ 2 > 1 |wh|.

If we let w = ρ+ = ρ, then we have |w| < |wh|, which contradicts the fact that wh minimizes wTw.

User profile picture
2021-12-08 10:01
Comments