Exercise 3.7

Answers

Take derivative of Ein(w) = 1 N n=1N ln (1 + eynwTx n) with respect to w, we have:

Ein(w) = 1 N n=1NynxneynwTx n 1 + eynwTxn = 1 N n=1N ynxn 1 + eynwTxn = 1 N n=1N y nxn𝜃(ynwTx n)

When a sample is misclassified, ynwTxn0, so 𝜃(ynwTxn)0.5, while when a sample is correctly classified, 𝜃(ynwTxn)0.5, so the contribution of ’misclassified’ example is more to the gradient than a correctly classified one.

User profile picture
2021-12-07 22:14
Comments