
KKT conditions, Descent methods

CE 377K

April 23, 2015

REVIEW

What is the method of Lagrange multipliers?

What are three interpretations?


OUTLINE

Karush-Kuhn-Tucker conditions

Descent methods for convex optimization


INEQUALITY CONSTRAINTS

An equivalent way of phrasing the Lagrange multiplier technique is:

At an optimal solution x∗ to the problem min_x f(x) subject to h_i(x) = 0 for i ∈ {1, . . . , m}, we have

$$\nabla f(x^*) + \sum_{i=1}^{m} \lambda_i \nabla h_i(x^*) = 0$$

for some λ_i, and furthermore h_i(x∗) = 0 for all i ∈ {1, . . . , m}.

The Karush-Kuhn-Tucker conditions are as follows:

At an optimal solution x∗ to the problem min_x f(x) subject to h_i(x) = 0 for i ∈ {1, . . . , m} and g_j(x) ≤ 0 for j ∈ {1, . . . , ℓ}, we have

$$\nabla f(x^*) + \sum_{i=1}^{m} \lambda_i \nabla h_i(x^*) + \sum_{j=1}^{\ell} \mu_j \nabla g_j(x^*) = 0$$

for some λ_i and nonnegative µ_j, and furthermore h_i(x∗) = 0 for all i ∈ {1, . . . , m}, g_j(x∗) ≤ 0 for j ∈ {1, . . . , ℓ}, and µ_j = 0 for any inactive constraint.

Unpacking the KKT conditions:

A multiplier µ_j is introduced for each inequality constraint, just like a λ_i is introduced for each equality constraint.

We distinguish between an active and an inactive inequality constraint. The constraint g_j(x) ≤ 0 is active if g_j(x) = 0 and inactive if g_j(x) < 0.

Each multiplier µ_j must be nonnegative, and zero for each inactive constraint.

The second and third points warrant further explanation.


If a constraint is inactive at the optimal solution, it is essentially irrelevant, and changing its right-hand side by a small amount will not affect the optimal solution at all.

Therefore µ_j = 0 for any inactive constraint.

Increasing the right-hand side of the constraint g_j(x) ≤ 0 can only improve the optimal value of the objective function, since it enlarges the feasible region. So µ_j cannot be negative.

(Why is this not true for equality constraints?)


A simple example

Minimize f(x) = (x + 5)² subject to x ≤ 0.

The optimal solution is clearly x = −5. The inequality constraint is inactive (here g(x) = x, and g(−5) = −5 < 0), so µ = 0.

Here ∇f(x) = 2(x + 5) and ∇g(x) = 1; if we set x = −5 and µ = 0, then

∇f(x) + µ∇g(x) = 0

so this is (potentially) the optimal solution.
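As a sanity check, here is a minimal Python sketch (helper names are illustrative) that verifies all four KKT conditions at this candidate point:

```python
# KKT check for: min (x + 5)^2  s.t.  x <= 0, with g(x) = x.
def grad_f(x):          # gradient of f(x) = (x + 5)^2
    return 2 * (x + 5)

def grad_g(x):          # gradient of g(x) = x is constant
    return 1.0

x, mu = -5.0, 0.0
assert grad_f(x) + mu * grad_g(x) == 0   # stationarity
assert x <= 0                            # primal feasibility
assert mu >= 0                           # dual feasibility (mu nonnegative)
assert mu * x == 0                       # mu = 0 since the constraint is inactive
```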


A simple example

Minimize f(x) = (x − 5)² subject to x ≤ 0.

The optimal solution is now x = 0. The inequality constraint is active, so µ ≥ 0.

Here ∇f(x) = 2(x − 5) and ∇g(x) = 1; if we set x = 0 then

∇f (x) + µ∇g(x) = 0

is true when µ = 10.

The interpretation: by changing the constraint from x ≤ 0 to x ≤ 1, the optimal value of the objective function can be reduced by approximately 10 (the exact reduction is from 25 at x = 0 to 16 at x = 1, i.e. 9; the multiplier gives the first-order estimate).
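This sensitivity interpretation is easy to check numerically; a short sketch, assuming the relaxed problem min (x − 5)² s.t. x ≤ b is solved at x = b for small b ≥ 0:

```python
# The drop in optimal value from relaxing x <= 0 to x <= b is about mu * b.
f = lambda x: (x - 5) ** 2
mu = 10.0
for b in (1.0, 0.1, 0.01):
    exact_drop = f(0.0) - f(b)        # 25 - (b - 5)^2 = 10b - b^2
    print(b, exact_drop, mu * b)      # exact drop vs. first-order estimate
```

As b shrinks, the exact drop approaches the estimate µ·b.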


DESCENT METHODS

Assume that we are given a convex optimization problem.

A descent method is an iterative algorithm consisting of the following steps (a code sketch follows the list):

1 Choose an initial feasible solution x ← x0.

2 Identify a feasible “target” solution x∗ in a “downhill direction.”

3 Choose a step size λ ∈ [0, 1] and set x ← λx∗ + (1 − λ)x.

4 Test for termination, and return to step 2 if we need to improve further.
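Here is that loop as a minimal Python sketch, with the target-selection and step-size rules left as caller-supplied functions (the function names are illustrative; specific rules are discussed below):

```python
import numpy as np

def descent_method(x0, choose_target, choose_step, tol=1e-6, max_iter=1000):
    x = np.asarray(x0, dtype=float)           # step 1: initial feasible solution
    for k in range(1, max_iter + 1):
        x_star = choose_target(x)             # step 2: feasible "downhill" target
        lam = choose_step(x, x_star, k)       # step 3: step size in [0, 1]
        x_new = lam * x_star + (1 - lam) * x  # convex combination stays feasible
        if np.linalg.norm(x_new - x) < tol:   # step 4: termination test
            return x_new
        x = x_new
    return x
```

Because the feasible region of a convex problem is convex, the convex combination in step 3 remains feasible whenever x and x∗ are.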


Hiking analogy


The two key questions are:

1 How do I choose the target solution?

2 How do I choose the step size λ?

You might also ask how to choose the initial solution x0. For a convex optimization problem, it won’t matter too much, and if we answer the previous two questions correctly we will converge to the global optimum from any starting point.


Today and next class, I’ll give a few different ways these steps can be performed:

For choosing the target x∗, I will show you the conditional gradient and gradient projection methods.

For choosing the step size λ, I will show you the method of successive averages, the limited minimization rule, and the Armijo rule.

For a convex optimization problem, any combination of these will converge to the global optimum from any starting point.
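For orientation, here are minimal sketches of those three step-size rules in their standard textbook forms (a preview under simplifying assumptions; the details come next class):

```python
# Method of successive averages: a fixed, diminishing schedule.
def msa_step(x, x_star, k):
    return 1.0 / (k + 1)

# Limited minimization rule: pick the lambda in [0, 1] minimizing
# f(lam * x_star + (1 - lam) * x); a dense grid search keeps the sketch simple.
def limited_minimization_step(f, x, x_star, grid=1000):
    lams = [i / grid for i in range(grid + 1)]
    return min(lams, key=lambda lam: f(lam * x_star + (1 - lam) * x))

# Armijo rule: start at lambda = 1 and backtrack until the actual decrease
# is at least a fraction sigma of the decrease predicted by the linear model;
# grad_dot_d is the directional derivative grad f(x) . (x_star - x), negative
# for a downhill target.
def armijo_step(f, grad_dot_d, x, x_star, sigma=0.1, beta=0.5):
    lam = 1.0
    while f(lam * x_star + (1 - lam) * x) > f(x) + sigma * lam * grad_dot_d:
        lam *= beta
    return lam
```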


Target selection

Both of the target selection rules are based on the gradient at the current point, ∇f(x).

Remember that the gradient is a vector pointing in the direction of steepest ascent. Since our standard form is a minimization problem, we will want to move in the direction opposite to the gradient.

However, just using the gradient as a search direction creates problems, because it fails to account for the constraints.

The conditional gradient and gradient projection methods are designed to provide reasonable target points while accounting for constraints.


Conditional gradient

In the conditional gradient rule, we assume that the slope of the objective is the same throughout the feasible region. (This is not true, but gives us a direction to move in.)

This is equivalent to replacing the objective function with its linear approximation at the current point x.

We then solve the optimization problem for the approximate objective function f(x∗) = f(x) + ∇f(x) · (x∗ − x) with the same constraints, and use the optimal x∗ value for the target.

If all of the constraints are linear, the conditional gradient method is nothing more than solving a linear program.


Example

Transit frequency setting problem with just 2 routes, same data as before:

Minimize 1/n1 + 4/n2 subject to n1 + n2 = 6, n1 ≥ 0, n2 ≥ 0.
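A sketch of the conditional gradient iteration on this problem, with the linearized subproblem solved as an LP via scipy.optimize.linprog and the step size taken from the successive-averages schedule (one of several valid choices; the optimum n1 = 2, n2 = 4 follows from the Lagrange conditions):

```python
import numpy as np
from scipy.optimize import linprog

# Conditional gradient on: min 1/n1 + 4/n2
#   s.t.  n1 + n2 = 6,  n1 >= 0,  n2 >= 0.
def grad(n):
    return np.array([-1.0 / n[0] ** 2, -4.0 / n[1] ** 2])

n = np.array([3.0, 3.0])                      # any feasible starting point
for k in range(1, 201):
    # Linearized subproblem: an LP whose optimum is a vertex of the region.
    lp = linprog(c=grad(n), A_eq=[[1.0, 1.0]], b_eq=[6.0],
                 bounds=[(0.0, None), (0.0, None)])
    n_star = lp.x                             # the target solution
    lam = 1.0 / (k + 1)                       # successive-averages step size
    n = lam * n_star + (1.0 - lam) * n        # convex combination
print(n)                                      # tends toward [2. 4.]
```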


Gradient projection

In the gradient projection method, the target is found by calculating the point x − s∇f(x) (where s is another step size), and then calculating the projection of that point onto the feasible region.

The projection of a point x onto a set X is the point in X closest to x, and is denoted by Π_X x.


Examples of projection

Project the point (5, 10) onto the set defined by 0 ≤ x1 ≤ 7, 0 ≤ x2 ≤ 7.

Project the point (4, 2) onto the set defined by x1 + x2 = 5.

Minimize 1/n1 + 4/n2 subject to n1 + n2 = 6, n1 ≥ 0, n2 ≥ 0.
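Sketches of the first two projections in Python (helper names are illustrative; the box projection is componentwise clipping, and the hyperplane projection moves along the normal vector):

```python
import numpy as np

def project_box(x, lo, hi):
    """Projection onto {lo <= x <= hi}: clip each component."""
    return np.clip(x, lo, hi)

def project_hyperplane(x, a, b):
    """Projection onto {a . x = b}: step along the normal a."""
    a = np.asarray(a, dtype=float)
    return x - (np.dot(a, x) - b) / np.dot(a, a) * a

print(project_box(np.array([5.0, 10.0]), 0, 7))             # -> [5. 7.]
print(project_hyperplane(np.array([4.0, 2.0]), [1, 1], 5))  # -> [3.5 1.5]
```

For the transit problem, the feasible region combines the hyperplane n1 + n2 = 6 with nonnegativity bounds, so its projection combines both ideas.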
