Shabbir Ahmed [email protected]/2012/presentations/Ahmed.pdf · S. Ahmed (GA...

118
Risk Averse Stochastic Programming Shabbir Ahmed [email protected]

Transcript of Shabbir Ahmed [email protected]/2012/presentations/Ahmed.pdf · S. Ahmed (GA...

Page 1: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Risk Averse Stochastic Programming

Shabbir [email protected]

Page 2: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Outline

Classical stochastic programmingModeling issues

Risk aversionDistribution robustness

Algorithmic issuesSamplingOptimization

Based on work with J. Luedtke, A. Shapiro, and W. Wang.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 2 / 41

Page 3: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Classical Stochastic Programming

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 3 / 41

Page 4: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Stochastic Programming

SP : minx∈X

{f (x) := EP [F (x , ξ)]

}

x is the decision vector,X is the set of feasible solutions,ξ is a random vector with known distribution P,F is a “cost” function, andwe want to minimize expected cost.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 4 / 41

Page 5: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Examples

Newsvendorx : order quanity; ξ: demandF (x , ξ) := q+(x − ξ)+ + q−(ξ − x)+X := R+.

Portfolio selectionx : investment proportions; ξ: asset returnsF (x , ξ): portfolio loss function, e.g. F (x , ξ) := −ξ>xX := {x ∈ Rn : e>x = 1, x ≥ 0}

Two-stage stochastic programsx : first stage decisions; ξ: uncertain parameters; y : second stagedecisionsF (x , ξ := (q,h,T )) = min{q>y : Wy ≥ h − Tx}X := {x ∈ Rn : Ax ≥ b}

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 5 / 41

Page 6: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Examples

Newsvendorx : order quanity; ξ: demandF (x , ξ) := q+(x − ξ)+ + q−(ξ − x)+X := R+.

Portfolio selectionx : investment proportions; ξ: asset returnsF (x , ξ): portfolio loss function, e.g. F (x , ξ) := −ξ>xX := {x ∈ Rn : e>x = 1, x ≥ 0}

Two-stage stochastic programsx : first stage decisions; ξ: uncertain parameters; y : second stagedecisionsF (x , ξ := (q,h,T )) = min{q>y : Wy ≥ h − Tx}X := {x ∈ Rn : Ax ≥ b}

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 5 / 41

Page 7: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Examples

Newsvendorx : order quanity; ξ: demandF (x , ξ) := q+(x − ξ)+ + q−(ξ − x)+X := R+.

Portfolio selectionx : investment proportions; ξ: asset returnsF (x , ξ): portfolio loss function, e.g. F (x , ξ) := −ξ>xX := {x ∈ Rn : e>x = 1, x ≥ 0}

Two-stage stochastic programsx : first stage decisions; ξ: uncertain parameters; y : second stagedecisionsF (x , ξ := (q,h,T )) = min{q>y : Wy ≥ h − Tx}X := {x ∈ Rn : Ax ≥ b}

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 5 / 41

Page 8: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Examples

Newsvendorx : order quanity; ξ: demandF (x , ξ) := q+(x − ξ)+ + q−(ξ − x)+X := R+.

Portfolio selectionx : investment proportions; ξ: asset returnsF (x , ξ): portfolio loss function, e.g. F (x , ξ) := −ξ>xX := {x ∈ Rn : e>x = 1, x ≥ 0}

Two-stage stochastic programsx : first stage decisions; ξ: uncertain parameters; y : second stagedecisionsF (x , ξ := (q,h,T )) = min{q>y : Wy ≥ h − Tx}X := {x ∈ Rn : Ax ≥ b}

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 5 / 41

Page 9: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

2S-SP Example: Supply Chain Network Design

Strategic decisions: Locate DCsand warehouses

Operational decisions:Shipments through the networkto satisfy customer demands

Locate→ observe demand→ship.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 6 / 41

Page 10: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Issues

SP : minx∈X

{f (x) := EP [F (x , ξ)]

}

(Typically) evaluating f (x) = EP [F (x , ξ)] exactly is impossible.Large-scale optimization problem.

Stochastic Programming ≈ Sampling + (Deterministic) Optimization

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 7 / 41

Page 11: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Issues

SP : minx∈X

{f (x) := EP [F (x , ξ)]

}

(Typically) evaluating f (x) = EP [F (x , ξ)] exactly is impossible.Large-scale optimization problem.

Stochastic Programming ≈ Sampling + (Deterministic) Optimization

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 7 / 41

Page 12: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Issues

SP : minx∈X

{f (x) := EP [F (x , ξ)]

}

(Typically) evaluating f (x) = EP [F (x , ξ)] exactly is impossible.Large-scale optimization problem.

Stochastic Programming ≈ Sampling + (Deterministic) Optimization

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 7 / 41

Page 13: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Issues

SP : minx∈X

{f (x) := EP [F (x , ξ)]

}

(Typically) evaluating f (x) = EP [F (x , ξ)] exactly is impossible.Large-scale optimization problem.

Stochastic Programming ≈ Sampling + (Deterministic) Optimization

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 7 / 41

Page 14: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Sample Average Approximation

Generate i.i.d sample (ξ1, . . . , ξN) from P.Solve

SAAN : minx∈X

{fN(x) := N−1

N∑i=1

[F (x , ξi)]}

Let vN be the optimal value and XN be the set of optimal solutions.Let v∗ and X ∗ be the optimal value and optimal solution set for SP(assume these exist).

What is the relation between vN and v∗ and between XN and X ∗

w.r.t sample size N?

How to solve SAAN?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41

Page 15: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Sample Average Approximation

Generate i.i.d sample (ξ1, . . . , ξN) from P.Solve

SAAN : minx∈X

{fN(x) := N−1

N∑i=1

[F (x , ξi)]}

Let vN be the optimal value and XN be the set of optimal solutions.Let v∗ and X ∗ be the optimal value and optimal solution set for SP(assume these exist).

What is the relation between vN and v∗ and between XN and X ∗

w.r.t sample size N?

How to solve SAAN?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41

Page 16: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Sample Average Approximation

Generate i.i.d sample (ξ1, . . . , ξN) from P.Solve

SAAN : minx∈X

{fN(x) := N−1

N∑i=1

[F (x , ξi)]}

Let vN be the optimal value and XN be the set of optimal solutions.Let v∗ and X ∗ be the optimal value and optimal solution set for SP(assume these exist).

What is the relation between vN and v∗ and between XN and X ∗

w.r.t sample size N?

How to solve SAAN?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41

Page 17: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Sample Average Approximation

Generate i.i.d sample (ξ1, . . . , ξN) from P.Solve

SAAN : minx∈X

{fN(x) := N−1

N∑i=1

[F (x , ξi)]}

Let vN be the optimal value and XN be the set of optimal solutions.Let v∗ and X ∗ be the optimal value and optimal solution set for SP(assume these exist).

What is the relation between vN and v∗ and between XN and X ∗

w.r.t sample size N?

How to solve SAAN?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41

Page 18: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Sample Average Approximation

Generate i.i.d sample (ξ1, . . . , ξN) from P.Solve

SAAN : minx∈X

{fN(x) := N−1

N∑i=1

[F (x , ξi)]}

Let vN be the optimal value and XN be the set of optimal solutions.Let v∗ and X ∗ be the optimal value and optimal solution set for SP(assume these exist).

What is the relation between vN and v∗ and between XN and X ∗

w.r.t sample size N?

How to solve SAAN?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41

Page 19: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Sample Average Approximation

Generate i.i.d sample (ξ1, . . . , ξN) from P.Solve

SAAN : minx∈X

{fN(x) := N−1

N∑i=1

[F (x , ξi)]}

Let vN be the optimal value and XN be the set of optimal solutions.Let v∗ and X ∗ be the optimal value and optimal solution set for SP(assume these exist).

What is the relation between vN and v∗ and between XN and X ∗

w.r.t sample size N?

How to solve SAAN?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41

Page 20: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Sample Average Approximation

Generate i.i.d sample (ξ1, . . . , ξN) from P.Solve

SAAN : minx∈X

{fN(x) := N−1

N∑i=1

[F (x , ξi)]}

Let vN be the optimal value and XN be the set of optimal solutions.Let v∗ and X ∗ be the optimal value and optimal solution set for SP(assume these exist).

What is the relation between vN and v∗ and between XN and X ∗

w.r.t sample size N?

How to solve SAAN?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41

Page 21: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Convergence

Theorem (Shapiro, Wets, Birge etc.)As N →∞, vN and XN converges to their true counterparts v∗ and X ∗

... exponentially fast!

Implication: For problem with n variables, with

N = O( nε2

)an optimal solution to SAAN is an ε-optimal solution to SP withvery high probability.

SAAN also serves in a simple statistical procedure to validate thequality of a candidate solution.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 9 / 41

Page 22: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Convergence

Theorem (Shapiro, Wets, Birge etc.)As N →∞, vN and XN converges to their true counterparts v∗ and X ∗

... exponentially fast!

Implication: For problem with n variables, with

N = O( nε2

)an optimal solution to SAAN is an ε-optimal solution to SP withvery high probability.

SAAN also serves in a simple statistical procedure to validate thequality of a candidate solution.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 9 / 41

Page 23: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Convergence

Theorem (Shapiro, Wets, Birge etc.)As N →∞, vN and XN converges to their true counterparts v∗ and X ∗

... exponentially fast!

Implication: For problem with n variables, with

N = O( nε2

)an optimal solution to SAAN is an ε-optimal solution to SP withvery high probability.

SAAN also serves in a simple statistical procedure to validate thequality of a candidate solution.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 9 / 41

Page 24: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Convergence

Theorem (Shapiro, Wets, Birge etc.)As N →∞, vN and XN converges to their true counterparts v∗ and X ∗

... exponentially fast!

Implication: For problem with n variables, with

N = O( nε2

)an optimal solution to SAAN is an ε-optimal solution to SP withvery high probability.

SAAN also serves in a simple statistical procedure to validate thequality of a candidate solution.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 9 / 41

Page 25: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Optimization via Decomposition

x x x

),ˆ( 11 ξxFs ∂∈),ˆ( 11 ξxFv ∈

),ˆ( 22 ξxFs ∂∈),ˆ( 22 ξxFv ∈

),ˆ( NN xFs ξ∂∈),ˆ( NN xFv ξ∈

),( 11 sv ),( 22 sv ),( NN sv

. . . . . . . . 

Xxxf

∈s.t.)(ˆmin

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 10 / 41

Page 26: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Risk Aversion

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 11 / 41

Page 27: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Risk Aversion

SP : minx∈X

{f (x) := E[F (x , ξ)]

}

Why expected costs?Given x , the objective (cost) is a random variable F (x , ξ).

),( 1 ξxF

),( 2 ξxF

We need a scalar measure to compare solutions x1 and x2.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 12 / 41

Page 28: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Risk functions ρ

A risk function is a mapping that assigns a real number ρ[Z ] to arandom variable Z .

Examples:Expected value: ρ[Z ] = E[Z ]Expected (dis)utility: ρ[Z ] = E[u(Z )]Mean-Variance: ρ[Z ] = E[Z ] + λV[Z ]α-quantile or α-VaR: ρ[Z ] = Kα[Z ] = min{t : Pr(Z ≤ t) ≥ α}α-Conditional-VaR: ρ[Z ] = CVaRα = E[Z |Z ≥ Kα[Z ]]

α

1 − α

Kα CVaRα

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 13 / 41

Page 29: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Risk Averse SP

RASP : minx∈X

{f (x) := ρ[F (x , ξ)]

}

Choice of ρ is (mostly) a modeling issue.How to solve RASP? (Sampling + Optimization)Expected (dis)utility→ straight-forward (Rutenberg ’73).Utility function hard to elicit→ Dispersion statistics preferred.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 14 / 41

Page 30: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Risk Averse SP

RASP : minx∈X

{f (x) := ρ[F (x , ξ)]

}

Choice of ρ is (mostly) a modeling issue.How to solve RASP? (Sampling + Optimization)Expected (dis)utility→ straight-forward (Rutenberg ’73).Utility function hard to elicit→ Dispersion statistics preferred.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 14 / 41

Page 31: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Risk Averse SP

RASP : minx∈X

{f (x) := ρ[F (x , ξ)]

}

Choice of ρ is (mostly) a modeling issue.How to solve RASP? (Sampling + Optimization)Expected (dis)utility→ straight-forward (Rutenberg ’73).Utility function hard to elicit→ Dispersion statistics preferred.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 14 / 41

Page 32: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Risk Averse SP

RASP : minx∈X

{f (x) := ρ[F (x , ξ)]

}

Choice of ρ is (mostly) a modeling issue.How to solve RASP? (Sampling + Optimization)Expected (dis)utility→ straight-forward (Rutenberg ’73).Utility function hard to elicit→ Dispersion statistics preferred.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 14 / 41

Page 33: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Risk Averse SP

RASP : minx∈X

{f (x) := ρ[F (x , ξ)]

}

Choice of ρ is (mostly) a modeling issue.How to solve RASP? (Sampling + Optimization)Expected (dis)utility→ straight-forward (Rutenberg ’73).Utility function hard to elicit→ Dispersion statistics preferred.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 14 / 41

Page 34: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-Risk Optimization

We will consider risk functions of the form

ρ[Z ] = γE[Z ] + λD[Z ]

where D is a dispersion measure, and γ and λ are trade-offweights.Can analyze risk return tradeoff.What D, γ, and λ “makes sense”?How do we solve (sampling + optimization) the correspondingstochastic programs?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 15 / 41

Page 35: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-Risk Optimization

We will consider risk functions of the form

ρ[Z ] = γE[Z ] + λD[Z ]

where D is a dispersion measure, and γ and λ are trade-offweights.Can analyze risk return tradeoff.What D, γ, and λ “makes sense”?How do we solve (sampling + optimization) the correspondingstochastic programs?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 15 / 41

Page 36: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-Risk Optimization

We will consider risk functions of the form

ρ[Z ] = γE[Z ] + λD[Z ]

where D is a dispersion measure, and γ and λ are trade-offweights.Can analyze risk return tradeoff.What D, γ, and λ “makes sense”?How do we solve (sampling + optimization) the correspondingstochastic programs?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 15 / 41

Page 37: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-Risk Optimization

We will consider risk functions of the form

ρ[Z ] = γE[Z ] + λD[Z ]

where D is a dispersion measure, and γ and λ are trade-offweights.Can analyze risk return tradeoff.What D, γ, and λ “makes sense”?How do we solve (sampling + optimization) the correspondingstochastic programs?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 15 / 41

Page 38: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-Variance

Markowitz approach:ρ[Z ] = E[Z ] + λV[Z ].V[F (x , ξ)] is oftennon-convex.Provably NP-hard.Sampling theory hard toextend.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 16 / 41

Page 39: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-Variance

Markowitz approach:ρ[Z ] = E[Z ] + λV[Z ].V[F (x , ξ)] is oftennon-convex.Provably NP-hard.Sampling theory hard toextend.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 16 / 41

Page 40: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-Variance

Markowitz approach:ρ[Z ] = E[Z ] + λV[Z ].V[F (x , ξ)] is oftennon-convex.Provably NP-hard.Sampling theory hard toextend.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 16 / 41

Page 41: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-Variance

Markowitz approach:ρ[Z ] = E[Z ] + λV[Z ].V[F (x , ξ)] is oftennon-convex.Provably NP-hard.Sampling theory hard toextend.

Structure of Variance

0 2 4 6 8 10 12 14 16 18 20200

400

600

800

1000

1200

1400

1600

1800

2000

χ

V[f(χ,ω)]

V[f(x, ω)] is a non-convex function.

F (x , ξ) = q+(x − ξ)+ + q−(ξ − x)+

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 16 / 41

Page 42: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-Variance

Markowitz approach:ρ[Z ] = E[Z ] + λV[Z ].V[F (x , ξ)] is oftennon-convex.Provably NP-hard.Sampling theory hard toextend.

Structure of Variance

0 2 4 6 8 10 12 14 16 18 20200

400

600

800

1000

1200

1400

1600

1800

2000

χ

V[f(χ,ω)]

V[f(x, ω)] is a non-convex function.

F (x , ξ) = q+(x − ξ)+ + q−(ξ − x)+

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 16 / 41

Page 43: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Value-at-Risk Optimization

VaR Stochastic Programming:

min{f (x) := Kα[F (x , ξ)] : x ∈ X}.

Equivalent to a chance-constrained stochastic program:

min{t : Pr[F (x , ξ)− t ≤ 0] ≥ α, x ∈ X}

Non-convex even in the linear setting F (x , ξ) = −ξ>x .Sampling theory extended (Luedtke + A.’08)SAAN is NP-hard, solve via integer programming

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 17 / 41

Page 44: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Value-at-Risk Optimization

VaR Stochastic Programming:

min{f (x) := Kα[F (x , ξ)] : x ∈ X}.

Equivalent to a chance-constrained stochastic program:

min{t : Pr[F (x , ξ)− t ≤ 0] ≥ α, x ∈ X}

Non-convex even in the linear setting F (x , ξ) = −ξ>x .Sampling theory extended (Luedtke + A.’08)SAAN is NP-hard, solve via integer programming

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 17 / 41

Page 45: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Value-at-Risk Optimization

VaR Stochastic Programming:

min{f (x) := Kα[F (x , ξ)] : x ∈ X}.

Equivalent to a chance-constrained stochastic program:

min{t : Pr[F (x , ξ)− t ≤ 0] ≥ α, x ∈ X}

Non-convex even in the linear setting F (x , ξ) = −ξ>x .Sampling theory extended (Luedtke + A.’08)SAAN is NP-hard, solve via integer programming

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 17 / 41

Page 46: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Value-at-Risk Optimization

VaR Stochastic Programming:

min{f (x) := Kα[F (x , ξ)] : x ∈ X}.

Equivalent to a chance-constrained stochastic program:

min{t : Pr[F (x , ξ)− t ≤ 0] ≥ α, x ∈ X}

Non-convex even in the linear setting F (x , ξ) = −ξ>x .Sampling theory extended (Luedtke + A.’08)SAAN is NP-hard, solve via integer programming

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 17 / 41

Page 47: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Value-at-Risk Optimization

VaR Stochastic Programming:

min{f (x) := Kα[F (x , ξ)] : x ∈ X}.

Equivalent to a chance-constrained stochastic program:

min{t : Pr[F (x , ξ)− t ≤ 0] ≥ α, x ∈ X}

Non-convex even in the linear setting F (x , ξ) = −ξ>x .Sampling theory extended (Luedtke + A.’08)SAAN is NP-hard, solve via integer programming

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 17 / 41

Page 48: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Distribution Robustness

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 18 / 41

Page 49: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Distribution Robustness

SP : minx∈X

{f (x) := EP [F (x , ξ)]

}

SP assumes precise knowledge of underlying distribution P.In the SAA framework, a sampling oracle for P is needed.How to handle imprecision of the distribution?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 19 / 41

Page 50: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Distribution Robustness

SP : minx∈X

{f (x) := EP [F (x , ξ)]

}

SP assumes precise knowledge of underlying distribution P.In the SAA framework, a sampling oracle for P is needed.How to handle imprecision of the distribution?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 19 / 41

Page 51: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Distribution Robustness

SP : minx∈X

{f (x) := EP [F (x , ξ)]

}

SP assumes precise knowledge of underlying distribution P.In the SAA framework, a sampling oracle for P is needed.How to handle imprecision of the distribution?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 19 / 41

Page 52: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Distribution Families

Instead of knowing P exactly, suppose we only know a family P oflikely distributions.

E.g. We have estimated a nominal distribution P0 and to accountfor estimation errors we can consider

P = {P : (1− ε1)P0 � P � (1 + ε2)P0, EP [1] = 1},

where 0 ≤ ε1 < 1 and 0 ≤ ε2.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 20 / 41

Page 53: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Distribution Robust SP

DRSP : minx∈X

maxP∈P

EP [F (x , ξ)]

Convex problem in x , but evaluation may be difficult.Sampling theory does not immediately extend.Q. Can we extend classical SP methodology (sampling andoptimization) to DRSP?A. Yes (for some P) but indirectly.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 21 / 41

Page 54: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Distribution Robust SP

DRSP : minx∈X

maxP∈P

EP [F (x , ξ)]

Convex problem in x , but evaluation may be difficult.Sampling theory does not immediately extend.Q. Can we extend classical SP methodology (sampling andoptimization) to DRSP?A. Yes (for some P) but indirectly.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 21 / 41

Page 55: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Distribution Robust SP

DRSP : minx∈X

maxP∈P

EP [F (x , ξ)]

Convex problem in x , but evaluation may be difficult.Sampling theory does not immediately extend.Q. Can we extend classical SP methodology (sampling andoptimization) to DRSP?A. Yes (for some P) but indirectly.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 21 / 41

Page 56: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Distribution Robust SP

DRSP : minx∈X

maxP∈P

EP [F (x , ξ)]

Convex problem in x , but evaluation may be difficult.Sampling theory does not immediately extend.Q. Can we extend classical SP methodology (sampling andoptimization) to DRSP?A. Yes (for some P) but indirectly.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 21 / 41

Page 57: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Distribution Robust SP

DRSP : minx∈X

maxP∈P

EP [F (x , ξ)]

Convex problem in x , but evaluation may be difficult.Sampling theory does not immediately extend.Q. Can we extend classical SP methodology (sampling andoptimization) to DRSP?A. Yes (for some P) but indirectly.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 21 / 41

Page 58: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Distribution robustness and risk aversion

Theorem (Artzner et al.’99)If P is a closed convex family of distributions then there exists a(convex + ...) risk function ρ such that

maxP∈P

EP [Z ] = ρ[Z ],

and vice versa.

Follows from conjugate duality.The associated risk functions are “consistent” with rational choice(e.g. stochastic ordering, risk aversion, coherence etc.).

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 22 / 41

Page 59: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Distribution robustness and risk aversion

Theorem (Artzner et al.’99)If P is a closed convex family of distributions then there exists a(convex + ...) risk function ρ such that

maxP∈P

EP [Z ] = ρ[Z ],

and vice versa.

Follows from conjugate duality.The associated risk functions are “consistent” with rational choice(e.g. stochastic ordering, risk aversion, coherence etc.).

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 22 / 41

Page 60: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Distribution robustness and risk aversion

Theorem (Artzner et al.’99)If P is a closed convex family of distributions then there exists a(convex + ...) risk function ρ such that

maxP∈P

EP [Z ] = ρ[Z ],

and vice versa.

Follows from conjugate duality.The associated risk functions are “consistent” with rational choice(e.g. stochastic ordering, risk aversion, coherence etc.).

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 22 / 41

Page 61: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Distribution robustness and risk aversion

Theorem (Artzner et al.’99)If P is a closed convex family of distributions then there exists a(convex + ...) risk function ρ such that

maxP∈P

EP [Z ] = ρ[Z ],

and vice versa.

Follows from conjugate duality.The associated risk functions are “consistent” with rational choice(e.g. stochastic ordering, risk aversion, coherence etc.).

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 22 / 41

Page 62: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Example

Consider a setting with K scenarios:

maxp

{K∑

k=1

pkF (x , ξk ) :K∑

k=1

pk = 1, 0 ≤ pk ≤ (1 + ε)p0k

}

= minλ

maxp

{λ+

K∑k=1

pk (F (x , ξk )− λ) : 0 ≤ pk ≤ (1 + ε)p0k

}

= minλ

{λ+ (1 + ε)

K∑k=1

p0k (F (x , ξk )− λ)+

}= min

λ{λ+ (1 + ε)E(F (x , ξ)− λ)+}

= CVaR[F (x , ξ)]

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 23 / 41

Page 63: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Example

Consider a setting with K scenarios:

maxp

{K∑

k=1

pkF (x , ξk ) :K∑

k=1

pk = 1, 0 ≤ pk ≤ (1 + ε)p0k

}

= minλ

maxp

{λ+

K∑k=1

pk (F (x , ξk )− λ) : 0 ≤ pk ≤ (1 + ε)p0k

}

= minλ

{λ+ (1 + ε)

K∑k=1

p0k (F (x , ξk )− λ)+

}= min

λ{λ+ (1 + ε)E(F (x , ξ)− λ)+}

= CVaR[F (x , ξ)]

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 23 / 41

Page 64: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Example

Consider a setting with K scenarios:

maxp

{K∑

k=1

pkF (x , ξk ) :K∑

k=1

pk = 1, 0 ≤ pk ≤ (1 + ε)p0k

}

= minλ

maxp

{λ+

K∑k=1

pk (F (x , ξk )− λ) : 0 ≤ pk ≤ (1 + ε)p0k

}

= minλ

{λ+ (1 + ε)

K∑k=1

p0k (F (x , ξk )− λ)+

}= min

λ{λ+ (1 + ε)E(F (x , ξ)− λ)+}

= CVaR[F (x , ξ)]

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 23 / 41

Page 65: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Example

Consider a setting with K scenarios:

maxp

{K∑

k=1

pkF (x , ξk ) :K∑

k=1

pk = 1, 0 ≤ pk ≤ (1 + ε)p0k

}

= minλ

maxp

{λ+

K∑k=1

pk (F (x , ξk )− λ) : 0 ≤ pk ≤ (1 + ε)p0k

}

= minλ

{λ+ (1 + ε)

K∑k=1

p0k (F (x , ξk )− λ)+

}= min

λ{λ+ (1 + ε)E(F (x , ξ)− λ)+}

= CVaR[F (x , ξ)]

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 23 / 41

Page 66: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Example

Consider a setting with K scenarios:

maxp

{K∑

k=1

pkF (x , ξk ) :K∑

k=1

pk = 1, 0 ≤ pk ≤ (1 + ε)p0k

}

= minλ

maxp

{λ+

K∑k=1

pk (F (x , ξk )− λ) : 0 ≤ pk ≤ (1 + ε)p0k

}

= minλ

{λ+ (1 + ε)

K∑k=1

p0k (F (x , ξk )− λ)+

}= min

λ{λ+ (1 + ε)E(F (x , ξ)− λ)+}

= CVaR[F (x , ξ)]

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 23 / 41

Page 67: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Example

Consider a setting with K scenarios:

maxp

{K∑

k=1

pkF (x , ξk ) :K∑

k=1

pk = 1, 0 ≤ pk ≤ (1 + ε)p0k

}

= minλ

maxp

{λ+

K∑k=1

pk (F (x , ξk )− λ) : 0 ≤ pk ≤ (1 + ε)p0k

}

= minλ

{λ+ (1 + ε)

K∑k=1

p0k (F (x , ξk )− λ)+

}= min

λ{λ+ (1 + ε)E(F (x , ξ)− λ)+}

= CVaR[F (x , ξ)]

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 23 / 41

Page 68: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Equivalence

When ρ is a convex risk function and P is a closed convex set,

minx∈X

maxP∈P

EP [F (x , ξ)]⇔ minx∈X

ρ[F (x , ξ)]

Unified way of treating both distribution robustness and riskaversion.We focus on solving the risk model.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 24 / 41

Page 69: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Equivalence

When ρ is a convex risk function and P is a closed convex set,

minx∈X

maxP∈P

EP [F (x , ξ)]⇔ minx∈X

ρ[F (x , ξ)]

Unified way of treating both distribution robustness and riskaversion.We focus on solving the risk model.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 24 / 41

Page 70: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Equivalence

When ρ is a convex risk function and P is a closed convex set,

minx∈X

maxP∈P

EP [F (x , ξ)]⇔ minx∈X

ρ[F (x , ξ)]

Unified way of treating both distribution robustness and riskaversion.We focus on solving the risk model.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 24 / 41

Page 71: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Equivalence

When ρ is a convex risk function and P is a closed convex set,

minx∈X

maxP∈P

EP [F (x , ξ)]⇔ minx∈X

ρ[F (x , ξ)]

Unified way of treating both distribution robustness and riskaversion.We focus on solving the risk model.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 24 / 41

Page 72: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Band distribution family

Given a nominal distribution P0 consider the “band” distribution family

P = {P : (1− ε1)P0 � P � (1 + ε2)P0, EP [1] = 1}

with 0 ≤ ε1 ≤ 1 and 0 ≤ ε2.

P0

P

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 25 / 41

Page 73: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

The Mean-QDEV Risk Function

TheoremThe risk function corresponding to the band family is

ρ[Z ] = EP0 [Z ] + λQDEVα[Z ]

where λ = (ε1 + ε2) and α = ε2/(ε1 + ε2), and

QDEVα[Z ] = EP0 [α(Z − Kα[Z ])+ + (1− α)(Kα[Z ]− Z )+].

α

1 − α

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 26 / 41

Page 74: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-ASD

Note that ε1 + ε2 controls the “width” of the “band” and ε2/(ε1 + ε2)controls its “position.”If the worst position is allowed then the corresponding coherentrisk measure is

ρ[Z ] = EP0 [Z ] + λEP0 [(Z − EZ )+]

with λ = (ε1 + ε2).MASD[Z ] := EP0 [(Z − EZ )+].

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 27 / 41

Page 75: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Two Mean-Risk SP Models

We consider mean-risk modelsminx∈X{E[F (x , ξ)] + λD[F (x , ξ)]}

where D is

(i) QDEVα

(ii) MASD

Equivalent to RSP models corresponding to the “band” distributionfamily.Can we extend SAA? Can we optimize SAAN efficiently?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 28 / 41

Page 76: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Two Mean-Risk SP Models

We consider mean-risk modelsminx∈X{E[F (x , ξ)] + λD[F (x , ξ)]}

where D is

(i) QDEVα

(ii) MASD

Equivalent to RSP models corresponding to the “band” distributionfamily.Can we extend SAA? Can we optimize SAAN efficiently?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 28 / 41

Page 77: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Two Mean-Risk SP Models

We consider mean-risk modelsminx∈X{E[F (x , ξ)] + λD[F (x , ξ)]}

where D is

(i) QDEVα

(ii) MASD

Equivalent to RSP models corresponding to the “band” distributionfamily.Can we extend SAA? Can we optimize SAAN efficiently?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 28 / 41

Page 78: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Two Mean-Risk SP Models

We consider mean-risk modelsminx∈X{E[F (x , ξ)] + λD[F (x , ξ)]}

where D is

(i) QDEVα

(ii) MASD

Equivalent to RSP models corresponding to the “band” distributionfamily.Can we extend SAA? Can we optimize SAAN efficiently?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 28 / 41

Page 79: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Two Mean-Risk SP Models

We consider mean-risk modelsminx∈X{E[F (x , ξ)] + λD[F (x , ξ)]}

where D is

(i) QDEVα

(ii) MASD

Equivalent to RSP models corresponding to the “band” distributionfamily.Can we extend SAA? Can we optimize SAAN efficiently?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 28 / 41

Page 80: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Two Mean-Risk SP Models

We consider mean-risk modelsminx∈X{E[F (x , ξ)] + λD[F (x , ξ)]}

where D is

(i) QDEVα

(ii) MASD

Equivalent to RSP models corresponding to the “band” distributionfamily.Can we extend SAA? Can we optimize SAAN efficiently?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 28 / 41

Page 81: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-QDEV

TheoremA Mean-QDEV SP is equivalent to a standard (expectationminimization) SP.

QDEVα[Z ] = miny∈R E[α(Z − y)+ + (1− α)(y − Z )+]

minx∈X

E[F (x , ξ)] + λQDEVα[F (x , ξ)]

mmin

x∈X ,y∈RE[F (x , ξ)] + λα(F (x , ξ)− y)+ + λ(1− α)(y − F (x , ξ))+︸ ︷︷ ︸

φ(x ,y ,ξ)

]

mmin

x∈X ,y∈RE[φ(x , y , ξ)].

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 29 / 41

Page 82: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-QDEV

TheoremA Mean-QDEV SP is equivalent to a standard (expectationminimization) SP.

QDEVα[Z ] = miny∈R E[α(Z − y)+ + (1− α)(y − Z )+]

minx∈X

E[F (x , ξ)] + λQDEVα[F (x , ξ)]

mmin

x∈X ,y∈RE[F (x , ξ)] + λα(F (x , ξ)− y)+ + λ(1− α)(y − F (x , ξ))+︸ ︷︷ ︸

φ(x ,y ,ξ)

]

mmin

x∈X ,y∈RE[φ(x , y , ξ)].

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 29 / 41

Page 83: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-QDEV

TheoremA Mean-QDEV SP is equivalent to a standard (expectationminimization) SP.

QDEVα[Z ] = miny∈R E[α(Z − y)+ + (1− α)(y − Z )+]

minx∈X

E[F (x , ξ)] + λQDEVα[F (x , ξ)]

mmin

x∈X ,y∈RE[F (x , ξ)] + λα(F (x , ξ)− y)+ + λ(1− α)(y − F (x , ξ))+︸ ︷︷ ︸

φ(x ,y ,ξ)

]

mmin

x∈X ,y∈RE[φ(x , y , ξ)].

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 29 / 41

Page 84: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-QDEV

TheoremA Mean-QDEV SP is equivalent to a standard (expectationminimization) SP.

QDEVα[Z ] = miny∈R E[α(Z − y)+ + (1− α)(y − Z )+]

minx∈X

E[F (x , ξ)] + λQDEVα[F (x , ξ)]

mmin

x∈X ,y∈RE[F (x , ξ)] + λα(F (x , ξ)− y)+ + λ(1− α)(y − F (x , ξ))+︸ ︷︷ ︸

φ(x ,y ,ξ)

]

mmin

x∈X ,y∈RE[φ(x , y , ξ)].

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 29 / 41

Page 85: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Solving Mean-QDEV SP

When X is compact the domain of y is compact.Existing SAA analysis and method applies directly (dimensiongoes up by one).When F is linear and X is polyhedral, the corresponding SAANproblem is a linear program.When F is convex (e.g. two-stage stochastic linear programs) andX is polyhedral, a specialized parametric decomposition algorithmhas been developed to construct the efficient frontier (Mean vsQDEV).

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 30 / 41

Page 86: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Solving Mean-QDEV SP

When X is compact the domain of y is compact.Existing SAA analysis and method applies directly (dimensiongoes up by one).When F is linear and X is polyhedral, the corresponding SAANproblem is a linear program.When F is convex (e.g. two-stage stochastic linear programs) andX is polyhedral, a specialized parametric decomposition algorithmhas been developed to construct the efficient frontier (Mean vsQDEV).

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 30 / 41

Page 87: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Solving Mean-QDEV SP

When X is compact the domain of y is compact.Existing SAA analysis and method applies directly (dimensiongoes up by one).When F is linear and X is polyhedral, the corresponding SAANproblem is a linear program.When F is convex (e.g. two-stage stochastic linear programs) andX is polyhedral, a specialized parametric decomposition algorithmhas been developed to construct the efficient frontier (Mean vsQDEV).

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 30 / 41

Page 88: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Solving Mean-QDEV SP

When X is compact the domain of y is compact.Existing SAA analysis and method applies directly (dimensiongoes up by one).When F is linear and X is polyhedral, the corresponding SAANproblem is a linear program.When F is convex (e.g. two-stage stochastic linear programs) andX is polyhedral, a specialized parametric decomposition algorithmhas been developed to construct the efficient frontier (Mean vsQDEV).

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 30 / 41

Page 89: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Solving Mean-QDEV SP

When X is compact the domain of y is compact.Existing SAA analysis and method applies directly (dimensiongoes up by one).When F is linear and X is polyhedral, the corresponding SAANproblem is a linear program.When F is convex (e.g. two-stage stochastic linear programs) andX is polyhedral, a specialized parametric decomposition algorithmhas been developed to construct the efficient frontier (Mean vsQDEV).

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 30 / 41

Page 90: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-MASD SP

TheoremA Mean-MASD SP is equivalent to a minimax (expectationminimization) SP.

minx∈X{E[F (x , ξ)] + λMASD[F (x , ξ)]}

m

minx∈X

maxα∈[0,1]

{E[F (x , ξ)] + λQDEVα[F (x , ξ)]}

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 31 / 41

Page 91: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-MASD SP

TheoremA Mean-MASD SP is equivalent to a minimax (expectationminimization) SP.

minx∈X{E[F (x , ξ)] + λMASD[F (x , ξ)]}

m

minx∈X

maxα∈[0,1]

{E[F (x , ξ)] + λQDEVα[F (x , ξ)]}

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 31 / 41

Page 92: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-MASD SP⇔ Minmax SP (contd)

m

minx∈X

maxα∈[0,1]

miny∈R

E[F (x , ξ) + λα(F (x , ξ)− y)+ + λ(1− α)(y − F (x , ξ))+︸ ︷︷ ︸ψ(x ,y ,α,ξ)

]

m

minx∈X ,y∈R

maxα∈[0,1]

{E[ψ(x , y , α, ξ)]}

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 32 / 41

Page 93: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Mean-MASD SP⇔ Minmax SP (contd)

m

minx∈X

maxα∈[0,1]

miny∈R

E[F (x , ξ) + λα(F (x , ξ)− y)+ + λ(1− α)(y − F (x , ξ))+︸ ︷︷ ︸ψ(x ,y ,α,ξ)

]

m

minx∈X ,y∈R

maxα∈[0,1]

{E[ψ(x , y , α, ξ)]}

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 32 / 41

Page 94: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Stochastic Minimax Problems

The mean-ASD problem is equivalent to a stochastic minimaxproblem.Existing SAA approaches do not apply directly.Decomposition is not obvious.Consider general stochastic minimax problems

minx∈X

maxy∈Y{Ψ(x , y) := E[F (x , y , ξ)]}

where X and Y are compact convex sets, and F is convex in xand concave in y a.e.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 33 / 41

Page 95: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Stochastic Minimax Problems

The mean-ASD problem is equivalent to a stochastic minimaxproblem.Existing SAA approaches do not apply directly.Decomposition is not obvious.Consider general stochastic minimax problems

minx∈X

maxy∈Y{Ψ(x , y) := E[F (x , y , ξ)]}

where X and Y are compact convex sets, and F is convex in xand concave in y a.e.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 33 / 41

Page 96: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Stochastic Minimax Problems

The mean-ASD problem is equivalent to a stochastic minimaxproblem.Existing SAA approaches do not apply directly.Decomposition is not obvious.Consider general stochastic minimax problems

minx∈X

maxy∈Y{Ψ(x , y) := E[F (x , y , ξ)]}

where X and Y are compact convex sets, and F is convex in xand concave in y a.e.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 33 / 41

Page 97: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Stochastic Minimax Problems

The mean-ASD problem is equivalent to a stochastic minimaxproblem.Existing SAA approaches do not apply directly.Decomposition is not obvious.Consider general stochastic minimax problems

minx∈X

maxy∈Y{Ψ(x , y) := E[F (x , y , ξ)]}

where X and Y are compact convex sets, and F is convex in xand concave in y a.e.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 33 / 41

Page 98: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Stochastic Minimax Problems

The mean-ASD problem is equivalent to a stochastic minimaxproblem.Existing SAA approaches do not apply directly.Decomposition is not obvious.Consider general stochastic minimax problems

minx∈X

maxy∈Y{Ψ(x , y) := E[F (x , y , ξ)]}

where X and Y are compact convex sets, and F is convex in xand concave in y a.e.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 33 / 41

Page 99: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

SAA for Stochastic Minimax Problems

SAAN : vN = minx∈X

maxy∈Y{N−1

N∑i=1

[F (x , y , ξi)]}

TheoremAs N →∞, vN and XN converges to their true counterparts v∗ and X ∗

exponentially fast. So with

N = O(

nx + ny

ε2

)an optimal solution to SAAN is an ε-optimal solution to minmax SP withvery high probability.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 34 / 41

Page 100: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

SAA for Stochastic Minimax Problems

SAAN : vN = minx∈X

maxy∈Y{N−1

N∑i=1

[F (x , y , ξi)]}

TheoremAs N →∞, vN and XN converges to their true counterparts v∗ and X ∗

exponentially fast. So with

N = O(

nx + ny

ε2

)an optimal solution to SAAN is an ε-optimal solution to minmax SP withvery high probability.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 34 / 41

Page 101: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

SAA for Stochastic Minimax Problems

SAAN : vN = minx∈X

maxy∈Y{N−1

N∑i=1

[F (x , y , ξi)]}

TheoremAs N →∞, vN and XN converges to their true counterparts v∗ and X ∗

exponentially fast. So with

N = O(

nx + ny

ε2

)an optimal solution to SAAN is an ε-optimal solution to minmax SP withvery high probability.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 34 / 41

Page 102: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Solving SAAN for mean-MASD

Sharper sample size estimates for Mean-MASD stochasticprograms.We have developed statistical validation techniques to screencandidate solutions from SAAN .How to decompose?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 35 / 41

Page 103: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Solving SAAN for mean-MASD

Sharper sample size estimates for Mean-MASD stochasticprograms.We have developed statistical validation techniques to screencandidate solutions from SAAN .How to decompose?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 35 / 41

Page 104: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Solving SAAN for mean-MASD

Sharper sample size estimates for Mean-MASD stochasticprograms.We have developed statistical validation techniques to screencandidate solutions from SAAN .How to decompose?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 35 / 41

Page 105: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Solving SAAN for mean-MASD

Sharper sample size estimates for Mean-MASD stochasticprograms.We have developed statistical validation techniques to screencandidate solutions from SAAN .How to decompose?

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 35 / 41

Page 106: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Decomposition of mean-MASD SP

E[F (x , ξ)] + λMASD[F (x , ξ)]= E[F (x , ξ)] + λE[F (x , ξ)− E[F (x , ξ)]]+= (1− λ)E[F (x , ξ)] + λmax{F (x , ξ),E[F (x , ξ)]}= (1− λ)µ(x) + λν(x)

Let I(ξ) = 1 if F (x , ξ) > E[F (x , ξ)] and 0 otherwise. Givens(ξ) ∈ ∂F (x , ξ) and s = E[s(ξ)], let

s = E[I(ξ)s(ξ) + (1− I(ξ))s]

then s ∈ ∂ν(x).

Thus, evaluation of µ and ν and its subgradients can be done in adecomposed manner.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 36 / 41

Page 107: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Decomposition of mean-MASD SP

E[F (x , ξ)] + λMASD[F (x , ξ)]= E[F (x , ξ)] + λE[F (x , ξ)− E[F (x , ξ)]]+= (1− λ)E[F (x , ξ)] + λmax{F (x , ξ),E[F (x , ξ)]}= (1− λ)µ(x) + λν(x)

Let I(ξ) = 1 if F (x , ξ) > E[F (x , ξ)] and 0 otherwise. Givens(ξ) ∈ ∂F (x , ξ) and s = E[s(ξ)], let

s = E[I(ξ)s(ξ) + (1− I(ξ))s]

then s ∈ ∂ν(x).

Thus, evaluation of µ and ν and its subgradients can be done in adecomposed manner.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 36 / 41

Page 108: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Decomposition of mean-MASD SP

E[F (x , ξ)] + λMASD[F (x , ξ)]= E[F (x , ξ)] + λE[F (x , ξ)− E[F (x , ξ)]]+= (1− λ)E[F (x , ξ)] + λmax{F (x , ξ),E[F (x , ξ)]}= (1− λ)µ(x) + λν(x)

Let I(ξ) = 1 if F (x , ξ) > E[F (x , ξ)] and 0 otherwise. Givens(ξ) ∈ ∂F (x , ξ) and s = E[s(ξ)], let

s = E[I(ξ)s(ξ) + (1− I(ξ))s]

then s ∈ ∂ν(x).

Thus, evaluation of µ and ν and its subgradients can be done in adecomposed manner.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 36 / 41

Page 109: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Illustration: Inventory Problems

Considered Distribution robust newsvendor = Risk aversenewsendorAnalytical optimality equations.Analysis of sensitivity of order quantity to distributionalinaccuracies = risk aversion.For multi-period problems optimal policy structure is identical tothat of standard expectation minimization problems.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 37 / 41

Page 110: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Effect of distribution robustness

P0P− P+

cost

Classical SP solutionRobust SP solution

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 38 / 41

Page 111: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Illustration: Supply Chain Network Design

A chemical supply chain adapted from Tsiakis, Shah & Pantelides,2001Two-stage problem (Mixed-integer first stage)First-stage: Capacity of 6 warehouses and 8 DCsCustomer demand is uncertain (6 random variables)Second-stage: Ship to customers + Outsource penaltyMinimize (annualized) capacity costs + shipping costs +outsourcing penaltySolved Expectation + 0.5*MASD modelSample size = 500, Replications = 20, Evaluation sample size =10000

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 39 / 41

Page 112: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Results: Supply Chain Network Design

Model Deterministic Traditional SP Mean-MASDcost1 17541 22095 26556

E[cost2] 144834 130156 126408SD[cost2] 861 681 600

Pr{infeas}0.95(%) 43.52 13.29 4.39E[cost2|feas] 109076 110104 118032

total-time 7.57 1414.32 1694.10W1 (1300) 0 0 620W2 (1100) 0 0 0W3 (1200) 0 0 0W4 (1200) 0 0 0W5 (1100) 1100 1100 1100W6 (1300) 610 1300 1239D1 (1000) 800 1000 1000D2 ( 900) 0 0 0D3 ( 950) 0 0 0D4 (1050) 910 1050 1050D5 (1100) 0 0 0D6 (1000) 0 0 0D7 ( 980) 0 350 909D8 (1050) 0 0 0

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 40 / 41

Page 113: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Conclusions

Classical SP assumes accurate distribution and is risk-neutral.SP with risk functions offer a unifying treatment of thesedeficiencies.Classical sampling and decomposition algorithms extended tomean-MASD and mean-QDEV models.Extended models not much harder than classical SP.Additional research: Risk averse extensions of dynamic(multi-stage) stochastic inventory problems.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 41 / 41

Page 114: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Conclusions

Classical SP assumes accurate distribution and is risk-neutral.SP with risk functions offer a unifying treatment of thesedeficiencies.Classical sampling and decomposition algorithms extended tomean-MASD and mean-QDEV models.Extended models not much harder than classical SP.Additional research: Risk averse extensions of dynamic(multi-stage) stochastic inventory problems.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 41 / 41

Page 115: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Conclusions

Classical SP assumes accurate distribution and is risk-neutral.SP with risk functions offer a unifying treatment of thesedeficiencies.Classical sampling and decomposition algorithms extended tomean-MASD and mean-QDEV models.Extended models not much harder than classical SP.Additional research: Risk averse extensions of dynamic(multi-stage) stochastic inventory problems.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 41 / 41

Page 116: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Conclusions

Classical SP assumes accurate distribution and is risk-neutral.SP with risk functions offer a unifying treatment of thesedeficiencies.Classical sampling and decomposition algorithms extended tomean-MASD and mean-QDEV models.Extended models not much harder than classical SP.Additional research: Risk averse extensions of dynamic(multi-stage) stochastic inventory problems.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 41 / 41

Page 117: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Conclusions

Classical SP assumes accurate distribution and is risk-neutral.SP with risk functions offer a unifying treatment of thesedeficiencies.Classical sampling and decomposition algorithms extended tomean-MASD and mean-QDEV models.Extended models not much harder than classical SP.Additional research: Risk averse extensions of dynamic(multi-stage) stochastic inventory problems.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 41 / 41

Page 118: Shabbir Ahmed sahmed@isye.gatechfocapo.cheme.cmu.edu/2012/presentations/Ahmed.pdf · S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 8 / 41. Convergence Theorem (Shapiro, Wets, Birge

Conclusions

Classical SP assumes accurate distribution and is risk-neutral.SP with risk functions offer a unifying treatment of thesedeficiencies.Classical sampling and decomposition algorithms extended tomean-MASD and mean-QDEV models.Extended models not much harder than classical SP.Additional research: Risk averse extensions of dynamic(multi-stage) stochastic inventory problems.

S. Ahmed (GA Tech) Risk Averse SP FOCAPO 2012 41 / 41