Fenchel-Rockafellar and Linear Programming

Author

Micah Pedrick and Seyit Emre Duzoylum

The Fenchel-Rockafellar Theorem is a well-known result from convex analysis that establishes a minimax principle between convex functions and their convex conjugates under some regularity conditions. One fundamental application of this theorem is the characterization of the dual problem of a finite dimensional linear program.

The Fenchel-Rockafellar Theorem

Let \(\phi : X \to \mathbb{R} \cup \left\{+\infty\right\}\) and \(\psi : X \to \mathbb{R} \cup \left\{+\infty\right\}\) be convex, lower semicontinuous and proper functions. Suppose that there exists \(x_0 \in X\) such that \(\phi(x_0),\varphi(x_0) < \infty\), where \(\phi\) is continuous at \(x_0\). Then, it holds that

= {}.

Proof

We provide a sketch of the proof, the reader is referred to Brezis for further reading. Note that, by Young’s Inequality, for any x X and \(u \in X^*\) we have (x) + ^(-u) + (x) + ^(u) x, -u + x, u = 0. Hence, the infimum of the left hand side is greater than or equal to the supremum of the right hand side. If the infimum is \(-\infty\), equality must be obtained, and for every \(u \in X^*\) the supremum is realized; otherwise assume \(\phi(x) + \psi(x) > -\infty\) for all \(x \in X\), and let \(m\) be the value of the infimum.

Let \(A := \left\{(x, t) : \phi(x) \leq t\right\},\) and observe that since \(\phi\) is continuous at \(x_0\), the interior of \(C\) is nonempty. Likewise, let \(B := \left\{(x, t) : t \leq m - \psi(x)\right\}\). Both sets are convex, and by construction we have \(B \cap \text{int}A = \emptyset\). So a geometric Hahn-Banach Theorem implies that there is some nonzero \((f, k) \in X^* \times \mathbb{R}\) and \(\alpha \in \mathbb{R}\) such that

\[\begin{cases} \langle x, f \rangle + kt \geq \alpha, & (x, t) \in A, \\ \langle x, f \rangle + kt \leq \alpha, & (x, t) \in B.\end{cases}\]

Since \(\phi\) is finite at \(x_0\), letting \(t \to +\infty\) shows that \(k\) is nonnegative, and if \(k = 0\), continuity and joint finiteness imply \(\lVert f \rVert = 0\), a contradiction, so \(k > 0\). Thus, we can use the inequalities above with the definitions of \(\phi^*\) and \(\psi^*\) to conclude that ^(-f k) -k, ^(f k) k - m. Which implies that -^(-f k) - ^(f k) m. Finally, since the supremum includes this term, it must also be greater than or equal to the infimum, which yields their equality.

Application to Linear Programs

Let \(A_1 \in \mathbb{R}^{p \times m}\),\(A_2 \in \mathbb{R}^{q\times n}\),\(b_1 \in \mathbb{R}^{p}\), \(b_2 \in \mathbb{R}^{q}\),\(c_1 \in \mathbb{R}^{m}\) and \(c_2 \in \mathbb{R}^{m}\) and consider the following finite dimensional linear program

=

\[\begin{aligned} & {\text{ }\inf} & & \langle c_1, x \rangle + \langle c_2, y \rangle \\ & \text{subject to} & & A_1 x \geq b_1 \\ & & & A_2 y = b_2 \\ & & & x \geq 0 \\ \end{aligned}\]

where \(x \in \mathbb{R}^{m}\) and \(y \in \mathbb{R}^{n}\). Moreover, let us suppose that there exist at least one feasible solution \(\tilde{x} \in \mathbb{R}^{m},\tilde{y} \in \mathbb{R}^{n}\) such that \(A_1 \tilde{x} > b_1\) \(A_2 \tilde{y} = b_2\) and \(\tilde{x} > 0\). Then, dual problem of \(\mathcal{P}\) is given by

=

\[\begin{aligned} & {\text{ }\max} & & \langle b_1 , \alpha \rangle + \langle b_2 , \beta \rangle \\ & \text{subject to} & & A_1^T \alpha \leq c_1 \\ & & & A_2^T \beta = c_2 \\ & & & \alpha \geq 0 \\ \end{aligned}\]

where \(\alpha \in \mathbb{R}^{p}\) and \(\beta \in \mathbb{R}^{q}\). Furthermore, \(\mathcal{D}\) attains its optimal value and we have \(\mathcal{P} = \mathcal{D}\).

Proof

Note that, the linear program may be equivalently written as

where \(\Chi_A\) denotes the characteristic function of \(A \subseteq \mathbb{R}^n\). Further, let us define \(\phi(x,y) = \langle c_1, x \rangle + \langle c_2, y \rangle + \Chi_{ \mathbb{R}_+ }(x)\), and \(\varphi(x,y) = \Chi_{ \mathbb{R}_+ }(b_1 - A_1 x) + \Chi_{\{0\}} (b_2 - A_2 y)\). Note that, both \(\phi\) and \(\varphi\) are convex, lower semicontinuous and proper functions, where we have \(\phi ( \tilde{x},\tilde{y}),\varphi(\tilde{x},\tilde{y}) < \infty\). Moreover, since we have \(\tilde{x} > 0\), we observe that \(\phi\) is continuous at \((\tilde{x},\tilde{y})\). Therefore, by the Fenchel-Rockafellar Theorem we obtain

= .

In terms of convex conjugate of \(\phi\) we have

\[\begin{align} \phi^\star(-w,-t) & = \underset{ x \in \mathbb{R}^{m},y \in \mathbb{R}^{n} }{\sup} \left\{ -\langle x, w+c_1 \rangle - \langle y, t+c_2 \rangle - \Chi_{ \mathbb{R}_+ }(x) \right\} \\ & = \underset{ x \in \mathbb{R}^{m}_{-},y \in \mathbb{R}^{n} }{\sup} \left\{ -\langle x, w+c_1 \rangle - \langle y, t+c_2 \rangle \right\} \\ & = \Chi_{ \mathbb{R}_+ }(w+c_1) + \Chi_{\{0\}}(t+c_2). \end{align}\]

Furthermore, for the convex conjugate of \(\varphi\) we observe

\[\begin{align} \varphi^\star(w,t) & = \underset{ x \in \mathbb{R}^{m},y \in \mathbb{R}^{n} }{\sup} \left\{ \langle x, w \rangle + \langle y, t \rangle - \Chi_{ \mathbb{R}_+ }(b_1 - A_1 x) - \Chi_{\{0\}} (b_2 - A_2 y) \right\} \\ & = \underset{ x \in \mathbb{R}^{m},y \in \mathbb{R}^{n} }{\sup} \left\{ \langle x, w \rangle + \langle y, t \rangle - \underset{ \alpha \in \mathbb{R}^{p}_+ , \beta \in \mathbb{R}^{q} }{\sup} \left\{ \langle b_1 - A_1 x ,\alpha \rangle + \langle b_2 - A_2 y , \beta \rangle \right\} \right\} \\ & = \underset{ x \in \mathbb{R}^{m},y \in \mathbb{R}^{n} }{\sup} \left\{ \underset{ \alpha \in \mathbb{R}^{p}_+ , \beta \in \mathbb{R}^{q} }{\inf} \left\{ \langle x, w \rangle + \langle y, t \rangle + \langle A_1 x - b_1 , \alpha \rangle + \langle A_2 y - b2 , \beta \rangle \right\} \right\} \\ & = \underset{ x \in \mathbb{R}^{m},y \in \mathbb{R}^{n} }{\sup} \left\{ \underset{ \alpha \in \mathbb{R}^{p}_+ , \beta \in \mathbb{R}^{q} }{\inf} \left\{ \langle x, w+ A_1^T \alpha \rangle + \langle y, t+A_2^T\beta \rangle - \langle b_1 , \alpha \rangle - \langle b_2 , \beta \rangle \right\} \right\} \end{align}\]

Notice that, \(\varphi^\star\) is finite if and only if we have \(w = A_1^T \alpha\) and \(t = A_2^T\beta\) for some \(\alpha \in \mathbb{R}^{p}_+ , \beta \in \mathbb{R}^{q}\). Therefore, as \(\phi^\star = \Chi_{ \mathbb{R}_+ }(w+c_1) + \Chi_{\{0\}}(t+c_2)\), by combining these two results with the Fenchel-Rockafellar Theorem we obtain the dual linear program as follows.

References

H. Brezis, Functional Analysis, Sobolev Spaces, and Partial Differential Equations, Section 1.4 T. Rockafellar, Variational Analysis, Section 11.H