Homepage › Solution manuals › Kevin P. Murphy › Machine Learning: a Probabilistic Perspective › Exercise 3.19 - Irrelevant features with naive Bayes
Exercise 3.19 - Irrelevant features with naive Bayes
Answers
The log-likelihood is defined by:
In a succint way:
where:
For question (a):
For question (b), with:
a word will not affect this posterior measure as long as:
Hence if:
then it cannot affect the classification decision. That is to say, appear in class 1 and 2 with the same frequency.
For question (c), we have:
They are different when so the bias effect remains. However, this bias reduces when grows large.
For question (d), using information theory would be a solid option.