Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

KKT Conditions in Finite Dimensions

University of Pisa

Overview

In this lecture we show the core ideas needed to understand inequality constrained first-order and second-order optimality in finite dimensions.

We focus on the problem

minuRnf(u)s.t.φj(u)0,  j=1,,m,\min_{u\in\mathbb{R}^n} f(u) \quad\text{s.t.}\quad \varphi_j(u)\ge 0,\; j=1,\dots,m,

with f,φjC1(Rn)f,\varphi_j\in C^1(\mathbb{R}^n).

The take-home message is simple:

  1. at a constrained minimum there is no admissible descent direction;

  2. this forces the objective gradient to balance active constraint normals;

  3. this balance is exactly the KKT system;

  4. second order is curvature of the Lagrangian along directions that remain tangent to active constraints.

  5. with the convention L=fjλjφj\mathcal L=f-\sum_j\lambda_j\varphi_j and φj0\varphi_j\ge 0, active multipliers are nonnegative, and they vanish for inactive constraints (complementarity).


Problem Setting and Active Constraints

Define the feasible set

Uad:={uRn:φj(u)0 j}.\mathcal U_{\mathrm{ad}}:=\{u\in\mathbb R^n:\varphi_j(u)\ge 0\ \forall j\}.

Let uˉUad\bar u\in\mathcal U_{\mathrm{ad}} be a local minimizer.

Not all constraints matter equally at uˉ\bar u. Only the constraints that are exactly at the boundary can influence first-order optimality. So we define the active set

A(uˉ):={j{1,,m}:φj(uˉ)=0}.\mathcal A(\bar u):=\{j\in\{1,\dots,m\}:\varphi_j(\bar u)=0\}.

Inactive constraints satisfy φj(uˉ)>0\varphi_j(\bar u)>0 and do not contribute at first order. For example: if we have a contact constraint between two bodies (that cannot penetrate), then only the constraints corresponding to points in contact are active, while the others are inactive.


Admissible First-Order Directions

Near uˉ\bar u, a direction dRnd\in\mathbb R^n is first-order admissible if it does not decrease active constraints. Linearizing active constraints gives

φj(uˉ)d0jA(uˉ).\nabla\varphi_j(\bar u)\cdot d\ge 0 \quad\forall j\in\mathcal A(\bar u).

This condition is the local geometric ingredient we need.

Interpretation:


First-Order Necessary Condition

At a local minimizer, every admissible first-order direction must be non-descent:

f(uˉ)d0for every d such that φj(uˉ)d0 jA(uˉ).\nabla f(\bar u)\cdot d\ge 0 \quad\text{for every } d \text{ such that } \nabla\varphi_j(\bar u)\cdot d\ge 0\ \forall j\in\mathcal A(\bar u).

Why this is unavoidable:

This is the true origin of KKT.


From First-Order Geometry to KKT

For regular active constraints (the standard finite-dimensional nondegenerate case), there exist multipliers λj0\lambda_j\ge 0 such that

f(uˉ)jA(uˉ)λjφj(uˉ)=0.\nabla f(\bar u)-\sum_{j\in\mathcal A(\bar u)}\lambda_j\nabla\varphi_j(\bar u)=0.

Extending by λj=0\lambda_j=0 on inactive constraints yields the standard stationarity form

f(uˉ)j=1mλjφj(uˉ)=0.\nabla f(\bar u)-\sum_{j=1}^m\lambda_j\nabla\varphi_j(\bar u)=0.

Together with primal feasibility and complementarity, we obtain

f(uˉ)j=1mλjφj(uˉ)=0,φj(uˉ)0,j=1,,m,λj0,j=1,,m,λjφj(uˉ)=0,j=1,,m.\begin{aligned} \nabla f(\bar u)-\sum_{j=1}^m\lambda_j\nabla\varphi_j(\bar u) &= 0,\\ \varphi_j(\bar u) &\ge 0,\quad j=1,\dots,m,\\ \lambda_j &\ge 0,\quad j=1,\dots,m,\\ \lambda_j\,\varphi_j(\bar u) &= 0,\quad j=1,\dots,m. \end{aligned}

Complementarity meaning:


Box Constraints: The Prototype Case

Consider

minuRnf(u)s.t.aiuibi,i=1,,n.\min_{u\in\mathbb R^n} f(u) \quad\text{s.t.}\quad a_i\le u_i\le b_i,\quad i=1,\dots,n.

Equivalent inequality form:

uiai0,biui0.u_i-a_i\ge 0, \qquad b_i-u_i\ge 0.

The componentwise first-order condition becomes

{if(uˉ)0if uˉi=ai,if(uˉ)0if uˉi=bi,if(uˉ)=0if ai<uˉi<bi.\begin{cases} \partial_i f(\bar u)\ge 0 & \text{if } \bar u_i=a_i,\\ \partial_i f(\bar u)\le 0 & \text{if } \bar u_i=b_i,\\ \partial_i f(\bar u)=0 & \text{if } a_i<\bar u_i<b_i. \end{cases}

This is the exact finite-dimensional model behind bound-constrained control variables in PDE-constrained optimization.


Second-Order Conditions

Define the Lagrangian

L(u,λ):=f(u)j=1mλjφj(u).\mathcal L(u,\lambda):=f(u)-\sum_{j=1}^m\lambda_j\varphi_j(u).

After first-order conditions are satisfied, the next question is curvature along first-order feasible/tangent directions. A practical critical set is

C(uˉ):={d:φj(uˉ)d0 jA(uˉ), f(uˉ)d=0}.\mathcal C(\bar u):=\{d:\nabla\varphi_j(\bar u)\cdot d\ge 0\ \forall j\in\mathcal A(\bar u),\ \nabla f(\bar u)\cdot d=0\}.

Second-order necessary condition:

dTuu2L(uˉ,λ)d0dC(uˉ).d^T\nabla^2_{uu}\mathcal L(\bar u,\lambda)d\ge 0 \quad\forall d\in\mathcal C(\bar u).

Second-order sufficient condition (strict local minimality):

dTuu2L(uˉ,λ)d>0dC(uˉ){0}.d^T\nabla^2_{uu}\mathcal L(\bar u,\lambda)d>0 \quad\forall d\in\mathcal C(\bar u)\setminus\{0\}.

Interpretation:


Summary

At a constrained local minimizer:

  1. no admissible first-order direction can decrease ff;

  2. this forces a balance between objective gradient and active constraint normals;

  3. that balance is KKT with nonnegative multipliers and complementarity;

  4. second order is positivity of Lagrangian curvature on critical directions.

This is the essential finite-dimensional picture we need before moving to PDE-constrained settings.