Exercise 9.10

Answers

If all the singular values of X are distinct, then the eigenvalues of Σ are all distinct and positive or zero.

(a)
Let X has a dimension of N × d, and with SVD, let X = V T where U has a dimension of N × d and V has a dimension of d × d, so we have UTU = Id and V TV = V V T = Id. Also let V = [v1vd ] , where vi are the d × 1 column vector and they are orthonormal (basis). Γ is the diagonal matrix of the singular values of X, and by construction it’s ordered, i.e. λ1 λ2 λd 0.

Consider any direction v = i=1dqivi, it should have v2 = i=1dqi2 = 1.

var(z1,,zN) = vTΣv = vT 1 NXTXv = 1 NvT(V T)T(V T)v = 1 NvTV ΓUTV Tv = 1 NvTV Γ2V Tv = 1 NvTV Γ2 [ v1T vdT ] v = 1 NvTV [λ12 0 0 λ22 0 λd2 ] [ v1Tv vdTv ] = 1 NvT [ v1vd ] [ λ12v1Tv λ22v2Tv λd2vdTv ] = 1 N [vTv 1vTv d ] [ λ12v1Tv λ22v2Tv λd2vdTv ] = 1 N i=1dvTv iλi2v iTv = 1 N i=1dλ i2vTv iviTv = 1 N i=1dλ i2vTv i2 1 N i=1dλ 12vTv i2 = λ12 N i=1dvTv i2 = λ12 N i=1dq i2 = λ12 N = 1 Nv1Tv 1λ12v 1Tv 1 = 1 N i=1dv 1Tv iλ12v 1Tv i = v1TΣv 1

So the variance is highest when the principal direction is v1 , the top right singular vector of X.

(b)
Follow the proof in problem (a), we see that the top-1 principal direction is v1, next, if we select the next direction v that is orthogonal to v1, it’s clear that var(z1,,zN) = vTΣv = 1 N i=2dλi2vTvi2. It’s easy to see that the principal direction with the highest variance is v2. Continue this, we see that the top-k principal directions are v1,,vk.
(c)
If we don’t have the data matrix X, but knows Σ, since Σ = 1 NXTX = 1 NV Γ2V T

Σ is symmetric, we can do eigen-decomposition, the principal directions are the top-k eigenvectors of Σ.

User profile picture
2021-12-08 10:27
Comments