Homepage › Solution manuals › Gilbert Strang › Linear Algebra and Learning from Data › Exercise 6.3.5
Exercise 6.3.5
Answers
- Payoff matrix 1: the optimal strategy for to pay the minimial value is to choose row 1. The optimal strategy for to get paid max is to choose column 2. So the payoff from to is 2.
- Payoff matrix 2:
If we follow the strategy of the previous payoff matrix. For , on each row, we look at the worst payoff (i.e. maximum payoff), which are 4 and 8 respectively. To optimize ’s worst payoff, he will choose row 1 to minimize the maximum payoff, which is 4 here.
Similary, for , on each row, he’ll look at the worst payoff (i.e. minimum payoff), which are 1 and 2 for column 1 and column 2. To optimize ’s worst payoff, he will choose column 2 to maximize the minimum payoff,which is 2 here.
We see that the optimal payoff for and are different in this case, which is not in the first payoff matrix. So it indicates can’t always choose row 1, and can’t always choose column 2. They need a probability distribution to choose different rows/columns each time.
Suppose choose the two rows with probabilities , and , then we have a mixed strategy with new row: , will choose to make the worst payoff as small as possible, i.e. the largest payoff. This will happen when the two largest payoffs are equal: , solve we have . Similarly, we have . So we see that the optimal payoff is , notice this is smaller than 4, if chooses row 1 and chooses column 2.