优化总结

Optimization Cheatsheet

Handout 1 Introduction

Basic Notions

Optimal Value: $\{f(x): x\in X\}$

Global Minimizer: $x^*$ $f(x^*) \le f(x)$

Note that optimal value can be finite even if global minimizer does not exist

Different Problems

LP Problems： $f(x)$ $X$ is a set defined by linear inequalities

$c^Tx$ $Ax \le b$

Note that eualities can be transformed to inequalities by:

$ax = b$ $\{ax \le b \quad -ax \le -b\}$

Quadratic Programming (QP) Problems: $X$ $f(x) = x^TQx$ $Q = \frac{Q+Q^T}{2}$ is symmertric without loss)

Note that this is different from Quadratically Constrained Quadratic Programming Problems.

Semidefinite Programming (SDP) Problems:

$b^Ty$ $C - \sum ^m _{i=1}y_iA_i \ge 0$ $y \in R^m$

$Q \ge 0$ $Q \in S^n _+$ $x^TQx \ge 0$ $x \in R^n$

How to combine multiple constraints?

$C_1 - \sum ^m _{i = 1} y_i A_i \ge 0$ $\left [ \begin{matrix}C_1 & \\ & C_2 \end{matrix} \right ] - \sum^m _{i=1} y_i \left [ \begin{matrix}A_i & \\ & B_i \end{matrix} \right ] \ge 0$

$C_2 - \sum ^m _{i=1} y_iB_i \ge 0$

Properties of Semidefinte Matrix

$A = \left [ \begin{matrix}A_1 & A_2\\ A_2^T & A_3 \end{matrix} \right ] \ge 0$ $A_1 \ge 0$ $A_3 \ge 0$

Handout 2 Elements of Convex Analysis

Affine and Convex Sets

Affine Set: $\alpha x+(1-\alpha)y \in S$

Convex Set: $\alpha x + (1-\alpha)y \in S$ $\alpha \in [0,1]$

$\empty$ is convex.

Proposition 1: $S$ is not empty

$S$ is affine

$S$ $S$

$S$ $V \subseteq R ^n$ $S$ $\{x\}+V = \{x+v\in R: v \in V\}$ $x \in R$

Proposition 2: $S \subseteq R^n$ is arbitary

$S$ is convex

$S$ $S$

Examples of Convex Sets: Non-negative orthant, hyperplane, halfspaces, euclidean ball, ellipsoid, simplex, convex cone, positive semidefinte cone

Cone: $\{\alpha x: \alpha \gt 0\} \in K$ $x \in K$

$S^n_+$ is a convex cone.

Affine Hull: $S$ $aff(S)$

Convex Hull: $S$ $conv(S)$ $conv(S) = S$ $S$ is convex

Proposition 3:

$aff(S)$ $S$

$conv(S)$ $S$

Convexity-Preserving Operations

Affine Functions: $A(\alpha x_1 + (1-\alpha)x_2) = \alpha A(x_1) + (1-\alpha)A(x_2)$

$A(x) = x + y$ $A(x) = Ux$ $U^TU = UU^T = I$ $A(x) = Px$ ) are all affine operations

Proposition 4: Affine functions operated on a convex set remains its convexity

Projection onto Closed Convex Sets

Note that Projection points do not always exist and are not always unique.

Note that every finite point set is closed since it has no limit points thus fulfill the conditon that every limit points belong to itself.

Theorem 4: $S$ $x \in R^n$ $z^* \in S$ $x$

Projection: $\prod _S (x) = arg \quad min _{z\in S} ||x - z||^2 _2$

Weierstrass's Theorem: $f$ $T$ $T$

Theorem 5: $S$ $z^* = \prod _S (x)$ $z^* \in S$ $(z-z^*)^T(x-z^*)\le0$ $z \in S$

Separation Theorems

Theorem 6 (Point-Set Separation): $S$ $x \in R^n \backslash S$ $y \in R^n$ $max _{z\in S} y^Tz \lt y^T x$

Theorem 7: $S \subseteq R^n$ $S$

$max _{z \in S_1} y^T z \lt min _{z \in S_2} y^Tz$ $\{(x_1,x_2): x_2 \ge \frac{1}{x_1}\}$ $R_-$

Theorem 8 (Set-Set Separation): $S_1$ $S_2 \subseteq R^n$ $S_1 \and S_2 = \empty$ $S_2$ $y\in R^n$ $max _{z\in S_1}y^Tz \lt min _{u \in S_2} y^T u$

Basic Definitions and Propeerties of Convex Functions

Convex Functions: $f(\alpha x_1 + (1-\alpha)x_2) \le \alpha f(x_1) + (1-\alpha)f(x_2)$

Concave Functions: $-f$ is convex

Epigraph: $epi(f) = \{(x,t) \in R^n\times R: f(x) \le t\}$

Effective Domain: $dom(f) = \{x \in R^n:f(x) \lt +\infty \}$

Proposition 9: $f:R^n \rightarrow R \bigcup \{+\infty\}$ $epi(f)$ is convex

$dom(f)$ $f$ is convex

Corollary 2: (Jensen's Inequality) $f:R^n \rightarrow R \bigcup \{+\infty\}$ $f(\sum ^k _{i=1} \alpha _i x_i) \le \sum ^k _{i=1} \alpha_i f(x_i)$ $x_1,...,x_k \in R^n$ $\alpha_1,...,\alpha_k \in [0,1]$ $\sum ^k _{i=1} \alpha _i = 1$

Convexity-Preserving Transformations

Theorem 11:

Non-Negative Combinations: $f(x) = \sum ^m _{i=1} \alpha _i f_i(x)$ $f_i$ $\alpha_i \ge 0$

Pointwise Supremum: $f(x) = sup _{i\in I}f_i(x)$

Affine Composition: $f(x) = g(A(x))$

Composition with an Increasing Convex Function: $h$ $dom(h)$ $f(x) = g(h(x))$

Restriction on Lines: $f$ $f(x_0 + th)$ $x_0$ $h$

Differentiable Convex Functions

Theorem 12: $f$ $f(x) \ge f(\bar{x}) + (\nabla f(\bar{x}))^T(x-\bar{x})$ $x,\bar{x} \in S$

Theorem 13: $f$ $S \subseteq R^n$ $f$ $S$ $\nabla ^2 f(x) \ge 0$ $x \in S$

Non-Differentiable Convex Functions

Subgradient: $s$ $f$ $\bar{x}$ $f(x) \ge f(\bar{x}) + s^T(x-\bar{x})$ $s$ $f$ $\bar{x}$ $\partial f(\bar{x})$

Theorem 14:

$f$ $x\in R^n$ $\partial f(x) = \{\nabla f(x)\}$

$f$ $f'(x,d) = \lim _{t \rightarrow 0} \frac{f(x+td) - f(x)}{t}$ $f$ $x$ $d \in R^n \backslash \{0\}$ $\partial f(x)$ $f'(x,d) = max _{s \in \partial f(x)}s^Td$ $d$

Calculus and Linear Algebra Preparations

Cachy-Schwarz Inequality: $(\sum ^n _{i=1} x_iy_i)^2 \le (\sum^n _{i=1} x_i ^2)(\sum ^n _{i=1} y_i^2)$

Vector Norm:

$||x||_1 = \sum ^m _{i=1} |x_i|$

$||x||_2 = \sqrt{\sum ^m _{i=1} x_i^2}$

$||x||_p = (\sum ^m _{i=1} |x_i|^p)^{\frac{1}{p}}$

$\infty$ $||x||_{\infty} = max _i |x_i|$

$-\infty$ $||x||_{-\infty} = min _i |x_i|$

$||x||_0 =$ $x$

Matrix Norm:

$||A||_1 = max _j \sum ^m _{i=1} |a _{i,j}|$

$||A||_2 = \sqrt{\lambda _1}$

$\infty$ $||A||_{\infty} = max_i \sum ^m _{j=1} |a_{i,j}|$

$||A||_F = (\sum ^m _{i=1} \sum ^n _{j=1} a_{i,j}^2)^{\frac{1}{2}}$

Taylor's Formula: $f(x) = f(a) + R_n(x)$ $R_n(x) = \frac{f^{(n+1)}(x)}{(n+1)!}(x-a) ^{(n+1)}$

Semidefinite:

$\left [ \begin{matrix} A & B \\ B^T & C \end{matrix} \right ] \ge 0$ $C - B^TA^{-1}B \ge 0$

Handout 3 Elements of Linear Programming

Basic Definitions and Properties

Polyhedron: The intersection of a finite set of halfspaces

Polytope: A bounded polyhedron

Note that a closed convex set is the intersection of all the halfspaces containing it but this does not mean that any closed convex set is a polyhedron

External Elements of a Polyheedron

Active Set: $a_i^T\bar{x} = b_i$

Theorem 1: The following are equivalent:

$\{a_i \in R: i \in I\}$ that are linearly independent

$\bar{x} \in R^n$ $a_i^Tx = b_i$

Basic Solution: $x$ $x$ $x$ is in the polyhedron, then it is a basic feasible solution

Note that not every polyhedron has basic feasible solution

Line: $P$ $x \in P$ $d \in R^n \backslash \{0\}$ $x+ \alpha d \in P$ $\alpha \in R$

Theorem 3: $P \subseteq R^n$ is a non-empty polyhedron, then the following are equivalent:

$P$ has at least one vertex

$P$ does not contain a line

$\{a_i\}^m _{i=1}$

Existence of Optimal Solutions to Linear Programs

Theorem 4: $min _{x\in P} h^Tx$ $P$ $-\infty$ or there exists a vertex that is optimal.

Note that there could be non-vertex optimal solutions but at least one vertex optimal solution exists

Corollary 1: $min _{x\in P} h^Tx$ $P$ $-\infty$ or there exists an optimal solution.

Standard LP:

$c^Ty$

$Ay = b$ $y \ge 0$

Theorems of Alternatives

Theorem 5: (Farkas' Lemma) $A \in R ^{m \times n}$ $b \in R^m$ be given. Then exactly one of the following systems has a solution:

$Ax = b$ $x \ge 0$

$A^Ty \le 0$ $b^Ty \gt 0$

Note that 2) is not a polyhedron since polyhedrons can not have strict inequalites

Corollary 2: (Gordan's Theorem) $A \in R^{m \times n}$ be given. Then exactly one of the following systems has a solution:

$Ax \gt 0$

$A^Ty = 0$ $y \ge 0$ $y \neq 0$

LP Duality Theory

Primal Problem and Dual Problem:

(P) $c^Tx$

$Ax = b$ $x \ge 0$

(D) $b^Ty$

$A^Ty \le c$

Theorem 6: (LP Weak Duality) $\bar{x} \in R^n$ $\bar{y} \in R^m$ $b^T\bar{y} \le c^T \bar{x}$ .

Corollary 3:

$-\infty$ , then (D) must be infeasible

$+\infty$ , then (P) must be infeasible

$\bar{x}$ $\bar{y}$ $\Delta(\bar{x},\bar{y})=c^T\bar{x}-b^T\bar{y} = 0$ $\bar{x}$ $\bar{y}$ are optimal solutions to (P) and (D)

Note that if (P) is infeasible, it is possible that (D) is also infeasible

Theorem 7: (LP Strong Duality) $x^* \in R^n$ $y^* \in R^m$ $c^Tx^* = b^Ty^*$

Corollary 4: Suppose both (P) and (D) are feasible. Then both (P) and (D) have optimal solutions and their respective optimal values are equal

Theorem 8: (Complementory Slackness) $\bar{x}$ $\bar{y}$ $\bar{x}_i(c-A^T\bar{y})_i = 0$ $i = 1,...,n$

Handout 5 Elements of Conic Linear Programming

Introduction

Pointed Cone:

$K$ is non-empty and closed under addition

$K$ is a cone

$K$ $u \in K$ $-u \in K$ $u = 0$

A pointed cone is automatically convex

Examples of Pointed Cone:

$R^n _+$

$Q^{n+1}= \{(t,x) \in R \times R^n:t \ge ||x||_2\}$

$S^n _+ = \{X \in S^n: u^TXu \ge 0 \quad for \quad all \quad u \in R^n\}$

Note that these three cone are all self-dual

$X \cdot Y = tr(XY)$

Proposition 1: $E_1,...,E_n$ $K_i \sube E_i$ $i = 1,...,n$ $K = K_1 \times ... \times K_n =\{(x_1,...,x_n) \in E_1 \times ... \times E_n: x_i \in K_i \}$ is a closed pointed cone with non-empty interior.

Conic Linear Programming

Standard Form：

$v^* _p$ = $c \cdot x$

$a_i \cdot x = b_i$ $i=1,...,m$

$x \ge _K 0$

Dual:

$v^* _d$ = $b^Ty$

$\sum ^m _{i=1} y_ia_i + s = c$

$y \in R^m$ $s \ge _{K^*} 0$

$K^* = \{w \in E: x \cdot w \ge 0 \quad for \quad all \quad x \in K\}$

Proposition 2: $K \sube E$ be a non-empty set. Then the following hold:

$K^*$ is a closed convex cone

$K$ $K^*$

$K$ $K^*$ is pointed

$K$ $K^*$ has a mon-empty interior

Examples:

Linear Programming:

(P) $c^Tx$

$a^T_i x = b_i$ $i=1,...,m$

$x \in R^n _+$

(D) $b^Ty$

$\sum ^m _{i=1} y_ia_i +s =c$

$y \in R^m$ $s \in R^n _+$

Second-Order Cone Programming (SOCP):

(P) $c^Tx$

$a^T_ix = b_i$ $i = 1,...,m$

$x \in Q^{n+1}$

(D) $b^Ty$

$(v - u^Ty, d - A^Ty) \in Q^{n+1}$

Semidefinite Programming (SDP):

(P) $C \cdot X$

$A_i \cdot X = b_i$ $i = 1,...,m$

$X \in S^n _+$

(D) $b^Ty$

$\sum ^m _{i=1} y_iA_i + S = C$

$y \in R^m$ $S \in S^n _+$

Theorem 1: (CLP Weak Duality) $\bar{x} \in K$ $(\bar{y}, \bar{s}) \in R^m \times K^*$ $b^T\bar{y} \le c \cdot \bar{x}$

Theorem 2: (CLP Farkas Lemma) $\bar{y} \in R^m$ $-\sum ^m _{i=1} \bar{y}_i a_i \in int(K^*)$ , then exactly one of the following holds:

$a_i \cdot x = b_i$ $i = 1,...,m$ $X \in K$

$-\sum ^m _{i=1} \bar{y}_i a_i \in K^*, b^Ty > 0$

Theorem 3: (CLP Strong Duality)

$v_p ^* = v^* _d$ $(\bar{y}, \bar{s})$ $b^T\bar{y} = v_p ^* = v^* _d$

$v_p ^* = v^* _d$ $(\bar{y}, \bar{s})$ $c \cdot \bar{x} = v_p ^* = v^* _d$

$\bar{x}$ $(\bar{y}, \bar{s})$ to (D), the following are equivalent:

They are optimal solutions
The duality gap is zero
$\bar{x} \cdot \bar{s} = 0$

Handout 6 Some Applications of Conic Linear Programming

Quadratically Constrained Quadratic Optimization

Original Problem:

$x^TCx$ ` $C\cdot X$

$x^TA_ix \ge b_i$ $i=1,...,m$ $A_i \cdot X \ge b_i$ $X \ge 0$ $rank(X)\le 1$

Semidefinite Relaxation:

$C\cdot X$

$A_i \cdot X \ge b_i$ $X \ge 0$

handout 7 Optimality Conditions and Lagrangian Duality

Unconstrained Optimizationm Problems

Proposition 1: $f: R^n \rightarrow R$ $\bar{x} \in R^n$ $d \in R^n$ $\nabla f (\bar{x})^Td < 0$ $\alpha _0 >0$ $f(\bar{x} + \alpha d) < f(\bar{x})$ $\alpha \in (0,\alpha_0)$ $d$ $f$ $\bar{x}$ .

Corollary 1: (First Order Necessary Condition for Unconstrained Optimization) $f: R^n \rightarrow R$ $\bar{x} \in R^n$ $\bar{x}$ $\nabla f(\bar{x}) = 0$ $\{d \in R^n: \nabla f(\bar{x}) ^Td < 0 \} \neq \empty$ .

Proposition 2: $S \sube R^n$ $f: R^n \rightarrow R$ $S$ $\bar{x} \in S$ $\bar{x}$ $S$ $\nabla f(\bar{x}) = 0$ .

Proposition 4: (Second Order Sufficient Condition for Unconstrained Optimization) $f: R^n \rightarrow R$ $\bar{x} \in R^n$ $\nabla f(\bar{x}) = 0$ $\nabla ^2 f(\bar{x})$ $\bar{x}$ is a local minimum.

Constrained Optimization Problems

Problem:

$f(x)$

$g_i(x) \le 0$ $i = 1,...,m$

$h_j(x) = 0$ $i = 1,...,m$

$x \in X$

Theorem 2: (The Fritz John Necessary Conditions) $\bar{x} \in S$ $u \in R$ $v_1,...,v_{m_1} \in R$ $w_1,...,w_{m_2} \in R$ such that

$u \nabla f(\bar{x}) + \sum ^{m_1} _{i=1} v_i \nabla g_i(\bar{x}) + \sum ^{m_2} _{j=1} w_j\nabla h_j (\bar{x}) = 0$

$u, v_i \ge 0$ $i = 1,...,m_1$

$(u, v_1,...,v_{m_1},w_1,...,w_{m_2}) \neq 0$

$v_ig_i(\bar{x}) = 0$ $i = 1,...,m_1$

Theorem 3: (The Karush-Kuhn-Tucker Necessary Conditions) $\bar{x} \in S$ $I = \{i \in \{1,...,m_1\}:g_i(\bar{x}) = 0\}$ $\bar{x}$ $\{\nabla g_i(\bar{x}) \} _{i \in I}$ $\{\nabla h_j(\bar{x}) \}_{j=1} ^{m_2}$ $v_1,....,v_{m_1} \in R$ $w_1,...,w_{m_2} \in R$ such that

$\nabla f(\bar{x}) + \sum ^{m_1} _{i=1} v_i \nabla g_i(\bar{x}) + \sum ^{m_2} _{j=1} w_j\nabla h_j (\bar{x}) = 0$

$v_i \ge 0$ $i = 1,...,m_1$

$v_ig_i(\bar{x}) = 0$ $i = 1,...,m_1$

Theorem 4: $g_1,...,g_{m_1}$ $h_1,...,h_{m_2}$ $\bar{x} \in S$ $I = \{i \in \{1,...,m_1\}:g_i(\bar{x}) = 0\}$ $x' \in S$ $g_i(x') <0$ $i \in I$ $\bar{x}$ satisfies the KKT conditions.

Theorem 5: $g_1,...,g_{m_1}$ $h_1,...,h_{m_2}$ $\bar{x} \in S$ $\bar{x}$ satisfies the KKT conditions.

Theorem 6: $X$ $f,g_1,...,g_{m_1}$ $X$ $h_1,...,h_{m_2}$ $(\bar{x},\bar{v},\bar{w}) \in X \times R^{m_1} \times R^{m_2}$ $\bar{x}$ is an optimal solution.

$\nabla f(\bar{x}) + \sum ^{m_1} _{i=1} \bar{v}_i \nabla g_i(\bar{x}) + \sum ^{m_2} _{j=1} \bar{w}_j\nabla h_j (\bar{x}) = 0$

$\bar{v}_i \ge 0$ $i = 1,...,m_1$

$\bar{v}_ig_i(\bar{x}) = 0$ $i = 1,...,m_1$

$g_i(\bar{x}) \le 0$ $i = 1,...,m_1$

$h_j(\bar{x}) = 0$ $i = 1,...,m_2$

Lagrangian Duality

Primal Problem:

$v^* _p$ = $f(x)$

$g_i(x) \le 0$ $i = 1,...,m_1$

$h_j(x) = 0$ $j = 1,...,m_2$

$x \in X$

Dual Problem:

$v^*_d$ $_{v \in R^{m_1}_+, w\in R^{m_2}}$ $\theta(v,w)$

$\theta(v,w) = inf _{x \in X} L(x,v,w)$

$L(x,v,w) = f(x) + v^TG(x) + w^TH(x)$

Theorem 7: (Weak Duality Theorem) $\bar{x} \in R^n$ $(\bar{v},\bar{w}) \in R^{m_1} \times R^{m_2}$ $\theta(\bar{v},\bar{w}) \le f(\bar{x})$

Theorem 8: (Strong Duality Theorem) $(\bar{x}, \bar{v}, \bar{w})$ $L$ $\bar{x}$ $(\bar{v}, \bar{w})$ is optimal for (D)

Saddle Point:

$\bar{x} \in X$

$\bar{v} \ge 0$

$x \in X$ $(v,w) \in R^{m_1}_+ \times R^{m_2}$ $L(\bar{x}, v, w) \le L(\bar{x}, \bar{v}, \bar{w}) \le L(x, \bar{v}, \bar{w})$

Posted in Technology by smartdubTags: study

Welcome!

Yun Peng

Chinese Name

Major

City

Age

Email

Research Interest

Software Engineering

Artificial Intelligence

Optimization Cheatsheet

Handout 1 Introduction

Basic Notions

Different Problems

Properties of Semidefinte Matrix

Handout 2 Elements of Convex Analysis

Affine and Convex Sets

Convexity-Preserving Operations

Projection onto Closed Convex Sets

Separation Theorems

Basic Definitions and Propeerties of Convex Functions

Convexity-Preserving Transformations

Differentiable Convex Functions

Non-Differentiable Convex Functions

Calculus and Linear Algebra Preparations

Handout 3 Elements of Linear Programming

Basic Definitions and Properties

External Elements of a Polyheedron

Existence of Optimal Solutions to Linear Programs

Theorems of Alternatives

LP Duality Theory

Handout 5 Elements of Conic Linear Programming

Introduction

Conic Linear Programming

Handout 6 Some Applications of Conic Linear Programming

Quadratically Constrained Quadratic Optimization

handout 7 Optimality Conditions and Lagrangian Duality

Unconstrained Optimizationm Problems

Constrained Optimization Problems

Lagrangian Duality