We want to prove that
A function f(x), which is differentiable, is convex if and only if its domain is a convex set and if the following inequality condition is satisfied:
Intuitively, this condition says that the tangent/first-order-taylor-series approximation is globally an under-estimator of f(x).
Proof:
Before beginning the proof, let us first review the fundamental definition of a convex function:
- Definition of a convex function: A function f(x) is said to be convex if and only if its domain is a convex set and if it satisfies the following inequality condition:
Let us assume that f(x) is convex. Then, according to the fundamental definition of convex functions, the following inequality condition must be satisfied:
Now by a simple re-arrangement of terms we can rewrite Eq.(1) as follows:
Now, let
We now express Eq.(2) in terms of g(t), as shown below:
where we have used the fact that g(0) = f(x).
Now taking the limit as t –> 0 on both sides of Eq. (3), we get:
Now, we need to find g’(o). Instead, let us compute the more general g’(t) which is given below:
Now substituting “t = 0” in the above equation, we get:
Now, substituting the above result into Eq. (4), we get:
Notice that this is exactly the inequality condition we wanted to prove.
We have thus proved that:
"if the function f(x) is convex and differentiable then it is necessary that the inequality condition shown in Eq. (5) must be satisfied".
However, our proof is not yet complete
We now have to prove the "sufficiency" part of this condition i.e. we want to prove that:
"if a function f(x) satisfies the inequality condition in Eq. (5) then it sufficient to conclude that f(x) is a convex function".
Showing this part will complete our proof.
To begin with, let us assume that the inequality condition shown in Eq. (5) is satisfied.
Now, consider
Notice that, since Domain(f) is a convex set, z should belong to Domain(f). Also, since we assumed that the inequality condition in Eq. (5) is satisfied, the following two inequalities must hold true:
Now multiplying the inequalities in Eqs. (6) and (7) with t and (1-t) respectively and adding the results we get:
Observe that this is exactly the inequality that f(x) must satisfy in order to be considered as a convex function. Hence, we have proved that, if f(x) satisfies the inequality condition in Eq.(5) then it is sufficient to conclude that f(x) is a convex function.
This ends our proof of the first-order condition of convexity.