Exercise 6.13

Answers

(a)
Ein(S1,,Sk;μ1,,μk) = j=1kEj = j=1k xnSjxn μj2. If we fix the clusters to S1,,Sk, take derivatives of Ein w.r.t. μj, we have

Ein μj = xnSj2μj 2xn

Let it equal to 0, we have μj = 1 |Sj| xnSjxn are the centroids of the clusters and minimize Ein.

(b)
If we fix the centers to μ1,,μk. Then the clusters which minimize Ein are obtained by placing into Sj all points for which the closest center is μj, breaking ties arbitrarily.

Suppose this is not true, there must exist a point that doesn’t belong to the cluster which has the closest center to it. Let’s call the point x, and its closest center μj, such that x μjx μl for l = 1,,k. Under our assumption, xSj.

There must exist another cluster, e.g. Si, such that x Si, and x μjx μi.

We can compare Ei,Ej components in Ein, where

Ei = xnSixn μi2

and Ej = xnSjxn μj2.

It’s clear that Ei + Ej doesn’t achieve its minimum value under the assumption. Because if we just put x into Sj, we replace x μi2 with x μj2, which is clearly less than the former. So we reduce the sum of Ei + Ej and thus Ein.

We conclude that for all points,we should put them into the cluster that has the closest center.

User profile picture
2021-12-08 09:49
Comments