Homepage › Solution manuals › Kevin P. Murphy › Machine Learning: a Probabilistic Perspective › Exercise 13.1 - Partial derivative of the RSS
Exercise 13.1 - Partial derivative of the RSS
Answers
For question (a), define:
Then we have straightforwardly:
From which we observe that ’s coefficient is:
while the rest irrelevent terms can be absorbed into:
The optimal value for is:
For question (b), (13.184) is obvious by plugging the definition of into (13.182)-(13.183) and the expression for .