Exercise 8.7

Answers

(a) Assume we have ρ1 < ρ2, consider the corresponding hypothesis sets Hρ1 and Hρ2. For a given data set (x1,y1),,(xN,yN), for any given dichotomy on the data set, if the dichotomy can be implemented by a hypothesis h2 Hρ2, then there must be a hypothesis h1 Hρ1 that can implement the dichotomy as well.

This is because we can choose the (b2,w2) from h2, since ρ2 = 1 |w2|, if we set (b1,w1) = ρ2 ρ1 (b2,w2) such that |w1| = 1 ρ1 , so h1 = (b1,w1) belongs to Hρ1 and has a margin of ρ1.

As the margin of h1 is smaller than the margin of h2 (ρ1 < ρ2), if h2 can implement the dichotomy, h1 certainly can do that too. The reverse is not necessarily true since if we add points (Assuming adding the proper signs on the two sides of h1) that are in the margin of h2 but outside the margin of h1, h2 won’t be able to implement the new dichotomy while h1 can still do.

So we see that dV C(ρ) is non-increasing in ρ.

(b) We first prove that for any 3 points in the unit disc, there must be two that are within distance 3 of each other. If this is not ture, then there exists 3 points in the unit disc, such that the distances between any two points are larger than 3.

The center of the disc can’t be one of such 3 points, because the distance from center to any point is less than or equal to 1. Now we draw lines from the center to 3 points and extend them to meet with the boundary of the disc. We check the lengths of the segments (formed by connecting the 3 points on the circle), they must be all larger than 3.

We can compute the angle that each segment corresponds to, e.g. cos (𝜃) = a2+b2c2 2ab , where a = b = 1 and c is the segment, 𝜃 is the corresponding angle. We have cos (𝜃) = 1 c2 2 < 0.5, we conclude that 𝜃 > 2π 3 . This is true for all three angles, so we have a total angle of larger than 3 ×2π 3 = 2π, which contradicts.

So for any 3 points in the unit disc, there must be two that are within distance of 3 of each other.

Suppose that when ρ > 3 2 , dV C(ρ) 3, that is the Hρ can shatter at least 3 points.

Now consider any 3 points in the unit disc, from what we have proved, there must be two that are within distance of 3 of each other. Consider the dichotomy when the two points is one positive, one negative, any hypothesis hyperplane (a line) that can implement this dichotomy needs to cross with the line between the two points. We see that of the two distances from the two points to the hyperplane, there is at least one that is in the margin, i.e. less than ρ. The distances are equal when the line is passing through the middle and perpendicular to the line between the two points. And the distances there are both 3 2 , which is smaller than ρ, meaning at least one point will always stay in the margin of any hyperplane that can implement the dichotomy.

This contradicts with the assumption that the hypothesis set Hρ can shatter any 3 points. Thus dV C(ρ) < 3 when ρ > 3 2

User profile picture
2021-12-08 10:08
Comments