Exercise 1.1* (Concave convex functions are affine)

Suppose that a function f : n is both concave and convex. Prove that f is an affine function.

Answers

Proof. Our goal is to demonstrate that f is representable in the form

f ( 𝐱 ) = a 0 + i = 1 n a i x i

for some a 0 , 𝐚 n (see the definition on page 15).

The trick is to notice that f = f ( 0 ) + f for a linear function f ; thus, consider

f : n , f ( 𝐱 ) = f ( 𝐱 ) f ( 0 ) .

We demonstrate that f is indeed linear.

  • (convexity and concavity) We have f ( λ 𝐱 + ( 1 λ ) 𝐲 ) = λ f ( 𝐱 ) + ( 1 λ ) f ( 𝐲 ) .
    Proof. By theorem assumption, f is convex. By Definition 1.1.,

    f ( λ 𝐱 + ( 1 λ ) 𝐲 ) 𝜆𝑓 ( 𝐱 ) + ( 1 λ ) f ( 𝐲 ) .

    Similarly, since f is concave, we have

    f ( λ 𝐱 + ( 1 λ ) 𝐲 ) 𝜆𝑓 ( 𝐱 ) + ( 1 λ ) f ( 𝐲 ) .

    Combining both of these facts, we see that

    f ( λ 𝐱 + ( 1 λ ) 𝐲 ) = 𝜆𝑓 ( 𝐱 ) + ( 1 λ ) f ( 𝐲 ) .

    The same property must then also hold for f :

    f ( λ 𝐱 + ( 1 λ ) 𝐲 ) = f ( λ 𝐱 + ( 1 λ ) 𝐲 ) f ( 0 ) = 𝜆𝑓 ( 𝐱 ) + ( 1 λ ) f ( 𝐲 ) 𝜆𝑓 ( 0 ) ( 1 λ ) f ( 0 ) = λ f ( 𝐱 ) + ( 1 λ ) f ( 𝐲 ) .
  • (homogeneity) We have f ( c 𝐱 ) = c f ( 𝐱 ) for any c 0 .
    Proof. We break up this assertion into cases. The case when c = 0 or c = 1 follows trivially from convexity-concavity condition (also notice that f ( 0 ) = 0 ). If c ( 0 , 1 ) then

    f ( c 𝐱 ) = f ( c 𝐱 + ( 1 c ) 0 ) = c f ( 𝐱 ) + ( 1 c ) f ( 0 ) = 𝑐𝑓 ( 𝐱 ) .

    Finally, if c ( 1 , + ) , then 1 c ( 0 , 1 ) and so,

    f ( 𝐱 ) = f ( 1 c c 𝐱 + ( 1 1 c ) 0 ) = 1 c f ( 𝐱 ) .

    Multiplying both sides by c yields the desired result.

  • (additivity) We have f ( 𝐱 + 𝐲 ) = f ( 𝐱 ) + f ( 𝐲 ) for any 𝐱 , 𝐲 n .
    Proof. This is simply a matter of setting λ = 1 2 and following the convexity-concavity equation:

    f ( 𝐱 + 𝐲 ) = f ( 1 2 2 𝐱 + 1 2 2 𝐲 ) = 1 2 f ( 2 𝐱 ) + 1 2 f ( 2 𝐲 ) = f ( 𝐱 ) + f ( 𝐲 )

    where the last equality comes from the previous claim on homogeneity of f .

  • (homogeneity II) We have f ( c 𝐱 ) = c f ( 𝐱 ) for any c < 0 .
    Proof. To prove the result for the negative numbers, we consider the special case c = 1 . By homogeneity for non-negative scalars and by the additivity, we obtain

    f ( 𝐱 ) = f ( 2 𝐱 𝐱 ) = f ( 2 𝐱 ) + f ( 𝐱 ) = 2 f ( 𝐱 ) + f ( 𝐱 ) f ( 𝐱 ) = f ( 𝐱 ) .

    Using this, we see that for any c ( , 0 ) it is true that f ( c 𝐱 ) = 𝑐𝑓 ( 𝐱 ) .

  • (linearity) We have

    f ( i = 1 k a i 𝐱 i ) = i = 1 n a i f ( 𝐱 i )

    for any a 1 , , a n [ 0 , + ) and 𝐱 1 , , 𝐱 k n .
    Proof. This follows from homogeneity and pairwise additivity using induction on k .

Now we make use of these properties of f . For any 𝐱 n we make use of linearity to obtain:

f ( 𝐱 ) = f ( 0 ) + f ( 𝐱 ) = f ( 0 ) + f ( [ x 1 0 0 ] + [ 0 x 2 0 ] + + [ 0 0 x n ] ) = f ( 0 ) + f ( x 1 𝐞 1 + x 2 𝐞 2 + + x n 𝐞 n ) = f ( 0 ) + i = 1 n x i f ( 𝐞 i )

where 𝐞 1 , , 𝐞 n denote the basis vectors of n . Setting a 0 : = f ( 0 ) and a i : = f ( 𝐞 i ) , we obtain the desired result. □

User profile picture
2021-11-07 20:03
Comments
  • Remark: the part where I prove that $f^*$ is linear is often disregarded - feel free to ignore it if it seems trivial to you.
    eluthingol2022-01-30
  • I begin to self-study this book today and decide to go through the book from cover to cover. The solution you provided is the best companion in this journey!
    plodmatics2022-04-12
  • @plodmatics happy to hear that!
    eluthingol2022-08-16
  • Thank you! I found this exercise to be somewhat annoying...
    rhuq2022-08-26
  • should `` also notice that $\textbf{f}(\textbf{0})$" read ``also notice that $\textbf{f}^*(\textbf{0})$"?
    rhuq2022-08-26
  • @rhuq yes, it should be $\mathbf{f}^{*}(\mathbf{0})$. Correction added, thanks.
    eluthingol2022-08-27
  • When proving convexity and concavity for f*, at the end of that section I believe the (1-lambda)f(0) term needs to be negative.
    skunktrav2023-12-27
  • @skunktrav indeed. correction added, thanks.
    eluthingol2025-12-04