Exercise 9.4

Answers

(a)
variance(x1) = variance(x^1) = 1, variance(x2) = variance(1 𝜖2x^1+𝜖x^2) = (1𝜖2)variance(x^1)+𝜖2variance(x^2) = 1

covariance(x1,x2) = E[(x1x¯1)(x2x¯2)] = E[x1x2] = E[1 𝜖2x^12+𝜖x^1x^2] = 1 𝜖2

(b)

f(x) = w1x1 + w2x2 = w1x^1 + w2(1 𝜖2x^1 + 𝜖x^2) = (w1 + w21 𝜖2)x^1 + w2𝜖x^2 = ŵ1x^1 + ŵ2x^2

So if we set ŵ1 = w1 + w21 𝜖2,ŵ2 = w2𝜖, we see f is linear in x1,x2.

(c)
From problem (b), we have ŵ1 = ŵ2 = 1, so we have w1 = 𝜖1𝜖2 𝜖 ,w2 = 1 𝜖, so that C w12 + w22 = 21𝜖1𝜖2 𝜖2
(d)
As 𝜖 0, we have the minimum C . It means that we have to use a huge C to be able to implement the target function, which is impossible here.
(e)
If there is significant noise in the data, with correlated inputs, it’ll be hard to regularize the learning, and overfitting is likely. So var term can be high while bias can be low.
User profile picture
2021-12-08 10:21
Comments