Suppose the nearest neighbor for
is
Observe that
When , we
expect
because when the data set gets very large, every point
has a nearest neighbor that is close by. That is, we have
for all
. This is
the case if
has bounded support.
By the continuity of ,
This indicates that
and since , it
follows that .
So we conclude that
when
with high probability according to the law of large number.
We first prove that for
numbers of , we have
. This can be proved
by notice that for ,
we always have
Since we also have , apply
above inequality to ,
we have
, add
to both sides we have
. Divide both
sides by ,
we have
.
Now take expectation w.r.t.
on ,
we have
Apply the above inequality, we have
From problem (a), we have ,
take this into above inequality, we have
Where and
we have used
As , we
have ,
so
This demonstrates that for multiclass problem, the simple nearest neighbor is
at most a factor of 2 from optimal.