Exercise 5.3 - Reject option in classifiers

Answers

For question (a), the posterior expected loss for choosing an non-reject action ŷ given data x is:

ρ ( ŷ | x ) = y = 1 C p ( y | x ) L ( ŷ , y ) = 𝐩 T 𝐥 ( ŷ ) ,

where 𝐩 is the column vector encoding p ( y | x ) and 𝐥 ( ŷ ) is a column vector whose elements are λ s except for the ŷ -th one. Thus the expected loss is ( 1 p ( ŷ | x ) ) λ s in this case, whose minimum is obtained by let

ŷ = arg max y ( p ( y | x ) ) .

For the reject option, the loss is uniform λ s .

Thus one should choose reject or ŷ by minimizing:

min ( λ r , ( 1 p ( ŷ | x ) ) λ s ) .

If

λ r ( 1 p ( ŷ | x ) ) λ s ,

then we readily adopt the reject option. This condition is tantamount to what is required to be prove:

p ( ŷ | x ) 1 λ r λ s .

For question (b), the minimum of the expected loss is:

λ s min { λ r λ s , 1 p ( ŷ | x ) } ,

where ŷ is the most probable class. When λ r λ s is negligable, the reject option would always be chosen. When λ r λ s , the reject option would never be chosen.

User profile picture
2021-03-24 13:42
Comments