Homepage › Solution manuals › Kevin P. Murphy › Machine Learning: a Probabilistic Perspective › Exercise 8.4 - Gradient and Hessian of log-likelihood for multinomial logistic regression
Exercise 8.4 - Gradient and Hessian of log-likelihood for multinomial logistic regression
Answers
For question (a), given a sample indexed we have,
Now we have:
what dominates is but the elementary calculus.
For question (b), recall that:
Let , we are now ready for reduction:
Summarizing over yields (8.126).
For question (c), we have by definition:
Hence we begin with the result from question (b):
where in the last step we have to adopt the outer product to span the Hessian. Summarizing over yields the desired result (8.127).