Homepage › Solution manuals › Yaser Abu-Mostafa › Learning from Data › Exercise 8.7
Exercise 8.7
Answers
(a) Assume we have ,
consider the corresponding hypothesis sets
and
. For a given
data set , for
any given dichotomy on the data set, if the dichotomy can be implemented by a hypothesis
, then there must
be a hypothesis
that can implement the dichotomy as well.
This is because we can choose the
from , since
, if we set
such
that , so
belongs to
and has a
margin of .
As the margin of is
smaller than the margin of
(), if
can implement
the dichotomy,
certainly can do that too. The reverse is not necessarily true since if
we add points (Assuming adding the proper signs on the two sides of
) that are in the
margin of but outside
the margin of ,
won’t be able to implement the new dichotomy while
can
still do.
So we see that is non-increasing in .
(b) We first prove that for any 3 points in the unit disc, there must be two that are within
distance
of each other. If this is not ture, then there exists 3 points in the unit
disc, such that the distances between any two points are larger than
.
The center of the disc can’t be one of such 3 points, because the distance from center
to any point is less than or equal to 1. Now we draw lines from the center to 3 points
and extend them to meet with the boundary of the disc. We check the lengths of the
segments (formed by connecting the 3 points on the circle), they must be all larger
than .
We can compute the angle that each segment corresponds to, e.g.
, where
and
is the segment,
is the corresponding
angle. We have ,
we conclude that .
This is true for all three angles, so we have a total angle of larger than
,
which contradicts.
So for any 3 points in the unit disc, there must be two that are within distance of
of
each other.
Suppose that when ,
, that
is the
can shatter at least 3 points.
Now consider any 3 points in the unit disc, from what we
have proved, there must be two that are within distance of
of
each other. Consider the dichotomy when the two points is one positive, one negative,
any hypothesis hyperplane (a line) that can implement this dichotomy needs to cross
with the line between the two points. We see that of the two distances from the two
points to the hyperplane, there is at least one that is in the margin, i.e. less than
.
The distances are equal when the line is passing through the middle and
perpendicular to the line between the two points. And the distances there are both
, which is
smaller than ,
meaning at least one point will always stay in the margin of any hyperplane that can
implement the dichotomy.
This contradicts with the assumption that the hypothesis set can shatter any 3 points. Thus when