Exercise 5.3 - Reject option in classifiers

Answers

For question (a), the posterior expected loss for choosing a non-reject action $ŷ$ given data $x$ is:

\begin{aligned} ρ (ŷ | x) & = \sum_{y = 1}^{C} p (y | x) \cdot L (ŷ, y) \\ & = 𝐩^{T} 𝐥 (ŷ), \end{aligned}

where $𝐩$ is the column vector encoding $p (y | x)$ and $𝐥 (ŷ)$ is a column vector whose elements are all $λ_{s}$ except for the $ŷ$-th one, which is zero. Thus the expected loss is $(1 - p (ŷ | x)) \cdot λ_{s}$ in this case, which is minimized by setting

ŷ = \arg \max_{y} (p (y | x)) .
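As a quick numerical check of this step, the following is a minimal sketch that builds the loss matrix for the non-reject actions and evaluates $𝐩^{T} 𝐥 (ŷ)$ for every class. The function name `expected_losses`, the posterior values and the choice $λ_{s} = 2$ are illustrative assumptions, not part of the exercise.

```python
import numpy as np

def expected_losses(p, lam_s):
    """Posterior expected loss rho(yhat | x) = p^T l(yhat) for each non-reject action."""
    p = np.asarray(p, dtype=float)
    C = p.size
    # Loss matrix L[yhat, y]: zero on the diagonal (correct class), lam_s elsewhere.
    L = lam_s * (np.ones((C, C)) - np.eye(C))
    return L @ p

# Hypothetical posterior p(y | x) over C = 3 classes.
p = np.array([0.2, 0.5, 0.3])
rho = expected_losses(p, lam_s=2.0)

# Each entry equals (1 - p[yhat]) * lam_s, so the smallest loss
# belongs to the most probable class.
print(rho)
print(np.argmin(rho) == np.argmax(p))
```

The matrix form makes it easy to see that every row of the loss matrix sums the posterior mass of the other classes, which is $1 - p (ŷ | x)$ scaled by $λ_{s}$.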

For the reject option, the loss is $λ_{r}$ regardless of the true class, so its expected loss is $λ_{r}$ .

Thus one should choose between the reject option and $ŷ$ by taking whichever attains the minimum:

\min (λ_{r}, (1 - p (ŷ | x)) \cdot λ_{s}) .

If

λ_{r} \leq (1 - p (ŷ | x)) \cdot λ_{s},

then we adopt the reject option. This condition is equivalent to what is to be proved:

p (ŷ | x) \leq 1 - \frac{λ_{r}}{λ_{s}} .
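The resulting decision rule can be sketched in a few lines. This is an illustrative implementation only: the function name `decide` and the sentinel `REJECT = -1` are assumptions made here, and ties are sent to the reject option to match the $\leq$ used in the derivation.

```python
import numpy as np

REJECT = -1  # hypothetical sentinel value standing for the reject action

def decide(p, lam_r, lam_s):
    """Return the most probable class index, or REJECT if its posterior is too low."""
    p = np.asarray(p, dtype=float)
    y_hat = int(np.argmax(p))
    # Reject when p(y_hat | x) <= 1 - lam_r / lam_s, i.e. when
    # lam_r <= (1 - p(y_hat | x)) * lam_s.
    if p[y_hat] <= 1.0 - lam_r / lam_s:
        return REJECT
    return y_hat

# With lam_r / lam_s = 0.4 the threshold on the top posterior is 0.6.
print(decide([0.2, 0.5, 0.3], lam_r=0.4, lam_s=1.0))  # top posterior 0.5: rejected
print(decide([0.1, 0.8, 0.1], lam_r=0.4, lam_s=1.0))  # top posterior 0.8: class 1
```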

For question (b), the minimum of the expected loss is:

λ_{s} \cdot \min \left\{ \frac{λ_{r}}{λ_{s}}, 1 - p (ŷ | x) \right\},

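where $ŷ$ is the most probable class. When $\frac{λ_{r}}{λ_{s}}$ is negligible, the rejection threshold $1 - \frac{λ_{r}}{λ_{s}}$ is close to 1, so the reject option is chosen almost always (always, when the ratio is 0). As $\frac{λ_{r}}{λ_{s}}$ increases, the threshold falls and fewer inputs are rejected. When $\frac{λ_{r}}{λ_{s}}$ reaches 1, the threshold is 0, and since $p (ŷ | x) \geq \frac{1}{C} > 0$ the reject option is never chosen.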
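To illustrate this qualitative behaviour, here is a small sketch that sweeps $\frac{λ_{r}}{λ_{s}}$ from 0 to 1 and measures how often the rule rejects. The posteriors are synthetic (drawn from a Dirichlet distribution purely for illustration), and the helper name `reject_fraction` is an assumption of this sketch.

```python
import numpy as np

def reject_fraction(P, ratio):
    """Fraction of posteriors (rows of P) rejected for a given lam_r / lam_s."""
    top = np.max(P, axis=1)  # p(y_hat | x) for each row
    return float(np.mean(top <= 1.0 - ratio))

# Synthetic posteriors over C = 3 classes, for illustration only.
rng = np.random.default_rng(0)
P = rng.dirichlet(np.ones(3), size=1000)

ratios = [0.0, 0.25, 0.5, 0.75, 1.0]
fractions = [reject_fraction(P, r) for r in ratios]
for r, f in zip(ratios, fractions):
    print(f"lam_r/lam_s = {r:.2f}: rejected fraction = {f:.3f}")
```

The rejected fraction starts at 1 when the ratio is 0 and cannot increase as the ratio grows. Because the top posterior is at least $\frac{1}{C}$, rejections already stop once the ratio exceeds $1 - \frac{1}{C}$.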

solour_lfq

2021-03-24 13:42
