Exercise 4.16 - Likelihood ratio for Gaussians

Answers

Consider a classifier for two classes whose generative distributions are two normal distributions, $p(x \mid y = i) = \mathcal{N}(x \mid \mu_i, \Sigma_i)$ for $i \in \{0, 1\}$. By Bayes' rule:

$$\log \frac{p(y=1 \mid x)}{p(y=0 \mid x)} = \log \frac{p(x \mid y=1)}{p(x \mid y=0)} + \log \frac{p(y=1)}{p(y=0)}.$$

The first term on the r.h.s. is the log-likelihood ratio; the evidence $p(x)$ cancels when taking the ratio of the two posteriors.

When we have arbitrary covariance matrices:

$$\frac{p(x \mid y=1)}{p(x \mid y=0)} = \sqrt{\frac{|\Sigma_0|}{|\Sigma_1|}} \exp\left\{ -\frac{1}{2}(x-\mu_1)^T \Sigma_1^{-1} (x-\mu_1) + \frac{1}{2}(x-\mu_0)^T \Sigma_0^{-1} (x-\mu_0) \right\}.$$

Since $\Sigma_0$ and $\Sigma_1$ are arbitrary, this expression cannot be simplified further:

$$\log \frac{p(x \mid y=1)}{p(x \mid y=0)} = \frac{1}{2} \log \frac{|\Sigma_0|}{|\Sigma_1|} - \frac{1}{2}(x-\mu_1)^T \Sigma_1^{-1} (x-\mu_1) + \frac{1}{2}(x-\mu_0)^T \Sigma_0^{-1} (x-\mu_0).$$
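As a numerical sanity check, this log-ratio can be evaluated directly and compared against the Gaussian log-densities from SciPy. This is a minimal sketch; the means, covariances, and test point below are arbitrary illustrative values, not part of the exercise:

```python
import numpy as np
from scipy.stats import multivariate_normal

rng = np.random.default_rng(0)
D = 3

# Illustrative parameters: random means and random SPD covariances.
mu0, mu1 = rng.normal(size=D), rng.normal(size=D)
A0 = rng.normal(size=(D, D))
A1 = rng.normal(size=(D, D))
Sigma0 = A0 @ A0.T + D * np.eye(D)  # SPD by construction
Sigma1 = A1 @ A1.T + D * np.eye(D)
x = rng.normal(size=D)

def quad(x, mu, Sigma):
    """Mahalanobis quadratic form (x - mu)^T Sigma^{-1} (x - mu)."""
    d = x - mu
    return d @ np.linalg.solve(Sigma, d)

# Closed-form log-likelihood ratio for arbitrary covariances.
llr = (0.5 * np.log(np.linalg.det(Sigma0) / np.linalg.det(Sigma1))
       - 0.5 * quad(x, mu1, Sigma1)
       + 0.5 * quad(x, mu0, Sigma0))

# Reference value computed from the log-densities themselves.
ref = (multivariate_normal.logpdf(x, mu1, Sigma1)
       - multivariate_normal.logpdf(x, mu0, Sigma0))

assert np.isclose(llr, ref)
```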

Note that the decision boundary ($\log \frac{p(x \mid y=1)}{p(x \mid y=0)} = 0$) is a quadratic surface in $D$-dimensional space.

When both covariance matrices are given by a shared $\Sigma$:

$$\frac{p(x \mid y=1)}{p(x \mid y=0)} = \exp\left\{ -\frac{1}{2}(x-\mu_1)^T \Sigma^{-1} (x-\mu_1) + \frac{1}{2}(x-\mu_0)^T \Sigma^{-1} (x-\mu_0) \right\},$$

so:

$$\begin{aligned} \log \frac{p(x \mid y=1)}{p(x \mid y=0)} &= -\frac{1}{2}(x-\mu_1)^T \Sigma^{-1} (x-\mu_1) + \frac{1}{2}(x-\mu_0)^T \Sigma^{-1} (x-\mu_0) \\ &= -\frac{1}{2} \operatorname{tr}\left( \Sigma^{-1} \left[ (x-\mu_1)(x-\mu_1)^T - (x-\mu_0)(x-\mu_0)^T \right] \right). \end{aligned}$$
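The second equality uses the identity $a^T M a = \operatorname{tr}(M a a^T)$. A quick numerical confirmation of the trace form, again with illustrative random parameters:

```python
import numpy as np

rng = np.random.default_rng(1)
D = 3
mu0, mu1, x = rng.normal(size=D), rng.normal(size=D), rng.normal(size=D)
A = rng.normal(size=(D, D))
Sigma = A @ A.T + D * np.eye(D)  # shared SPD covariance
Sinv = np.linalg.inv(Sigma)

# Direct form: difference of the two quadratic terms.
direct = (-0.5 * (x - mu1) @ Sinv @ (x - mu1)
          + 0.5 * (x - mu0) @ Sinv @ (x - mu0))

# Trace form with Phi = (x - mu1)(x - mu1)^T - (x - mu0)(x - mu0)^T.
Phi = np.outer(x - mu1, x - mu1) - np.outer(x - mu0, x - mu0)
trace_form = -0.5 * np.trace(Sinv @ Phi)

assert np.isclose(direct, trace_form)
```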

When $\Sigma = \Lambda = \operatorname{diag}(\lambda_1, \dots, \lambda_D)$ is a diagonal matrix, we have:

$$\log \frac{p(x \mid y=1)}{p(x \mid y=0)} = -\frac{1}{2} \operatorname{tr}\left( \Lambda^{-1} \Phi \right) = -\frac{1}{2} \sum_{i=1}^{D} \lambda_i^{-1} \Phi_{i,i},$$

where:

$$\Phi = (x-\mu_1)(x-\mu_1)^T - (x-\mu_0)(x-\mu_0)^T.$$
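Because $\Lambda^{-1}$ is diagonal, only the diagonal entries of $\Phi$ contribute to the trace. A short sketch of this reduction (the diagonal entries $\lambda_i$ below are illustrative):

```python
import numpy as np

rng = np.random.default_rng(2)
D = 4
mu0, mu1, x = rng.normal(size=D), rng.normal(size=D), rng.normal(size=D)
lam = rng.uniform(0.5, 2.0, size=D)  # diagonal entries of Sigma = Lambda

Phi = np.outer(x - mu1, x - mu1) - np.outer(x - mu0, x - mu0)

# tr(Lambda^{-1} Phi) reduces to a weighted sum over Phi's diagonal.
trace_form = -0.5 * np.trace(np.diag(1.0 / lam) @ Phi)
sum_form = -0.5 * np.sum(np.diag(Phi) / lam)

assert np.isclose(trace_form, sum_form)
```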

Finally, if $\Sigma = \sigma^2 I$, then:

$$\log \frac{p(x \mid y=1)}{p(x \mid y=0)} = -\frac{1}{2\sigma^2} \operatorname{tr}(\Phi).$$
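Since $\operatorname{tr}(vv^T) = \|v\|^2$, this case reduces to a comparison of squared Euclidean distances to the two means:

$$\operatorname{tr}(\Phi) = \|x - \mu_1\|^2 - \|x - \mu_0\|^2.$$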

Note that in the last three cases the decision boundary is a hyperplane, since the quadratic term in $x$ cancels in $\Phi$.
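For concreteness, expanding $\Phi$ makes the cancellation explicit:

$$\Phi = (x x^T - x \mu_1^T - \mu_1 x^T + \mu_1 \mu_1^T) - (x x^T - x \mu_0^T - \mu_0 x^T + \mu_0 \mu_0^T),$$

where the $x x^T$ terms cancel. Substituting back into the trace gives

$$\log \frac{p(x \mid y=1)}{p(x \mid y=0)} = (\mu_1 - \mu_0)^T \Sigma^{-1} x - \frac{1}{2} \left( \mu_1^T \Sigma^{-1} \mu_1 - \mu_0^T \Sigma^{-1} \mu_0 \right),$$

which is affine in $x$, so the decision boundary is a hyperplane.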
