Fast Robustness Quantification with Variational...

141
Fast Robustness Quantification with Variational Bayes ITT Career Development Assistant Professor, MIT Tamara Broderick With: Ryan Giordano, Rachael Meager, Jonathan Huggins, Michael I. Jordan

Transcript of Fast Robustness Quantification with Variational...

Page 1: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Fast Robustness Quantification with Variational Bayes

ITT Career Development Assistant Professor,

MIT

Tamara Broderick

With: Ryan Giordano, Rachael Meager, Jonathan Huggins, Michael I. Jordan

Page 2: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

1

Page 3: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

1

Page 4: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

1

Page 5: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

1

p(✓|x) /✓ p(x|✓)p(✓)

Page 6: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

1

p(✓|x) /✓ p(x|✓)p(✓)

Page 7: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

1

p(✓|x) /✓ p(x|✓)p(✓)

Page 8: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

1

p(✓|x) /✓ p(x|✓)p(✓)

Page 9: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

1

p(✓|x) /✓ p(x|✓)p(✓)

Page 10: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

1

p(✓|x) /✓ p(x|✓)p(✓)

Page 11: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

1

p(✓|x) /✓ p(x|✓)p(✓)

Bayes Theorem

Page 12: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

1

p(✓|x) /✓ p(x|✓)p(✓)

Bayes Theorem

Page 13: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

1

p(✓|x) /✓ p(x|✓)p(✓)

Bayes Theorem

Page 14: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

1

p(✓|x) /✓ p(x|✓)p(✓)

Bayes Theorem

Page 15: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

Robustness quantification

1

p(✓|x) /✓ p(x|✓)p(✓)

Bayes Theorem

Page 16: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Robustness • Global & local • Rarely used • Approximation,

MCMC • Our solution: linear

response variational Bayes

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

Robustness quantification

1

p(✓|x) /✓ p(x|✓)p(✓)

Bayes Theorem

Page 17: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Robustness • Global & local • Rarely used • Approximation,

MCMC • Our solution: linear

response variational Bayes

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

Robustness quantification

1

p(✓|x) /✓ p(x|✓)p(✓)

Bayes Theorem

Page 18: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Robustness • Global & local • Rarely used • Approximation,

MCMC • Our solution: linear

response variational Bayes

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

Robustness quantification

1

p(✓|x) /✓ p(x|✓)p(✓)

Bayes Theorem

Page 19: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

Robustness quantification

1

p(✓|x) /✓ p(x|✓)p(✓)

• Robustness • Global & local • Rarely used • Approximation

MCMC • Our solution: linear

response variational Bayes

Bayes Theorem

Page 20: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

Robustness quantification

1

p(✓|x) /✓ p(x|✓)p(✓)

• Robustness • Global & local • Rarely used • Approximation,

MCMC • Our solution: linear

response variational Bayes

Bayes Theorem

Page 21: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

Robustness quantification

1

p(✓|x) /✓ p(x|✓)p(✓)

• Robustness • Global & local • Rarely used • Approximation,

MCMC • Our solution: linear

response variational Bayes

Bayes Theorem

Page 22: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Bayesian inference • Complex, modular models; posterior distribution

• Have to express prior beliefs in a distribution: challenges • Time-consuming; subjective; complex models

Robustness quantification

1

p(✓|x) /✓ p(x|✓)p(✓)

• Robustness • Global & local • Rarely used • Approximation,

MCMC • Our solution: linear

response variational Bayes

Bayes Theorem

Page 23: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes as an alternative to MCMC • Challenges of VB • Accurate uncertainties from VB • Accurate robustness quantification from VB • Big idea: derivatives/perturbations are easy in VB

2

Robustness quantification

Page 24: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes as an alternative to MCMC • Challenges of VB • Accurate uncertainties from VB • Accurate robustness quantification from VB • Big idea: derivatives/perturbations are easy in VB

2

Robustness quantification

Page 25: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes as an alternative to MCMC • Challenges of VB • Accurate uncertainties from VB • Accurate robustness quantification from VB • Big idea: derivatives/perturbations are easy in VB

2

Robustness quantification

Page 26: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes as an alternative to MCMC • Challenges of VB • Accurate uncertainties from VB • Accurate robustness quantification from VB • Big idea: derivatives/perturbations are easy in VB

2

Robustness quantification

Page 27: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes as an alternative to MCMC • Challenges of VB • Accurate uncertainties from VB • Accurate robustness quantification from VB • Big idea: derivatives/perturbations are easy in VB

2

Robustness quantification

Page 28: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes as an alternative to MCMC • Challenges of VB • Accurate uncertainties from VB • Accurate robustness quantification from VB • Big idea: derivatives/perturbations are easy in VB

2

Robustness quantification

Page 29: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes (VB) • Approximation for

posterior • Minimize Kullback-Liebler

(KL) divergence:

p(✓|x)

KL(qkp(·|x))

• VB practical success • point estimates and prediction • fast

p(✓|x)q(✓)

q⇤(✓)

Variational Bayes

3

Page 30: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes (VB) • Approximation for

posterior • Minimize Kullback-Liebler

(KL) divergence:

p(✓|x)

KL(qkp(·|x))

• VB practical success • point estimates and prediction • fast

p(✓|x)q(✓)

q⇤(✓)

Variational Bayes

3

Page 31: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes (VB) • Approximation for

posterior • Minimize Kullback-Liebler

(KL) divergence:

p(✓|x)

KL(qkp(·|x))

• VB practical success • point estimates and prediction • fast

p(✓|x)q(✓)

q⇤(✓)

Variational Bayes

3

Page 32: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes (VB) • Approximation for

posterior • Minimize Kullback-Liebler

(KL) divergence:

p(✓|x)

KL(qkp(·|x))

• VB practical success • point estimates and prediction • fast

q⇤(✓)

Variational Bayes

3

Page 33: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes (VB) • Approximation for

posterior • Minimize Kullback-Liebler

(KL) divergence:

p(✓|x)

KL(qkp(·|x))

• VB practical success • point estimates and prediction • fast

q⇤(✓)

Variational Bayes

q(✓)

3

Page 34: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

q(✓)

• Variational Bayes (VB) • Approximation for

posterior • Minimize Kullback-Liebler

(KL) divergence:

p(✓|x)

KL(qkp(·|x))

• VB practical success • point estimates and prediction • fast

q⇤(✓)

Variational Bayes

p(✓|x)

3

Page 35: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes (VB) • Approximation for

posterior • Minimize Kullback-Liebler

(KL) divergence:

p(✓|x)

KL(qkp(·|x))

• VB practical success • point estimates and prediction • fast

q⇤(✓)

p(✓|x)

q⇤(✓)

Variational Bayes

3

Page 36: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes (VB) • Approximation for

posterior • Minimize Kullback-Liebler

(KL) divergence:

p(✓|x)

KL(qkp(·|x))

• VB practical success • point estimates and prediction • fast

q⇤(✓)

Variational Bayes

p(✓|x)

q⇤(✓)

3

Page 37: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes (VB) • Approximation for

posterior • Minimize Kullback-Liebler

(KL) divergence:

p(✓|x)

KL(qkp(·|x))

• VB practical success • point estimates and prediction • fast

q⇤(✓)

Variational Bayes

p(✓|x)

q⇤(✓)

3

Page 38: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes (VB) • Approximation for

posterior • Minimize Kullback-Liebler

(KL) divergence:

p(✓|x)

KL(qkp(·|x))

• VB practical success • point estimates and prediction • fast

q⇤(✓)

Variational Bayes

p(✓|x)

q⇤(✓)

3

Page 39: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes (VB) • Approximation for

posterior • Minimize Kullback-Liebler

(KL) divergence:

p(✓|x)

KL(qkp(·|x))

• VB practical success • point estimates and prediction • fast

q⇤(✓)

Variational Bayes

p(✓|x)

q⇤(✓)

3

Page 40: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes (VB) • Approximation for

posterior • Minimize Kullback-Liebler

(KL) divergence:

p(✓|x)

KL(qkp(·|x))

• VB practical success • point estimates and prediction • fast

q⇤(✓)

[Broderick, Boyd, Wibisono, Wilson, Jordan 2013]

Variational Bayes

p(✓|x)

q⇤(✓)

3

Page 41: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes (VB) • Approximation for

posterior • Minimize Kullback-Liebler

(KL) divergence:

p(✓|x)

KL(qkp(·|x))

• VB practical success • point estimates and prediction • fast, streaming, distributed

q⇤(✓)

[Broderick, Boyd, Wibisono, Wilson, Jordan 2013]

Variational Bayes

p(✓|x)

q⇤(✓)

3

Page 42: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes

!

• Mean-field variational Bayes (MFVB)

!

• Underestimates variance (sometimes severely)

• No covariance estimates

What about uncertainty?

[Bishop 2006]

q(✓) =JY

j=1

q(✓j)

KL(q||p(·|x)) =Z

✓q(✓) log

q(✓)

p(✓|x)d✓

✓1

✓2

4

Page 43: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes

!

• Mean-field variational Bayes (MFVB)

!

• Underestimates variance (sometimes severely)

• No covariance estimates

What about uncertainty?

q(✓) =JY

j=1

q(✓j)

4

Page 44: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes

!

• Mean-field variational Bayes (MFVB)

!

• Underestimates variance (sometimes severely)

• No covariance estimates

What about uncertainty?

[Bishop 2006]

q(✓) =JY

j=1

q(✓j)

✓1

✓2p(✓|x)

4

Page 45: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes

!

• Mean-field variational Bayes (MFVB)

!

• Underestimates variance (sometimes severely)

• No covariance estimates

What about uncertainty?

[Bishop 2006]

q(✓) =JY

j=1

q(✓j)

KL(q||p(·|x)) =Z

✓q(✓) log

q(✓)

p(✓|x)d✓

✓1

✓2p(✓|x)

4

Page 46: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes

!

• Mean-field variational Bayes (MFVB)

!

• Underestimates variance (sometimes severely)

• No covariance estimates

What about uncertainty?

[Bishop 2006]

q(✓) =JY

j=1

q(✓j)

KL(q||p(·|x)) =Z

✓q(✓) log

q(✓)

p(✓|x)d✓

✓1

✓2p(✓|x)

4

Page 47: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes

!

• Mean-field variational Bayes (MFVB)

!

• Underestimates variance (sometimes severely)

• No covariance estimates

What about uncertainty?

[Bishop 2006]

q(✓) =JY

j=1

q(✓j)

KL(q||p(·|x)) =Z

✓q(✓) log

q(✓)

p(✓|x)d✓

✓1

✓2p(✓|x)

q⇤(✓)

4

Page 48: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes

!

• Mean-field variational Bayes (MFVB)

!

• Underestimates variance (sometimes severely)

• No covariance estimates

What about uncertainty?

[Bishop 2006]

q(✓) =JY

j=1

q(✓j)

KL(q||p(·|x)) =Z

✓q(✓) log

q(✓)

p(✓|x)d✓

✓1

✓2p(✓|x)

q⇤(✓)

4

Page 49: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes

!

• Mean-field variational Bayes (MFVB)

!

• Underestimates variance (sometimes severely)

• No covariance estimates

What about uncertainty?

[Bishop 2006]

q(✓) =JY

j=1

q(✓j)

KL(q||p(·|x)) =Z

✓q(✓) log

q(✓)

p(✓|x)d✓

✓1

✓2p(✓|x)

q⇤(✓)

4

Page 50: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes

!

• Mean-field variational Bayes (MFVB)

!

• Underestimates variance (sometimes severely)

• No covariance estimates

What about uncertainty?

q(✓) =JY

j=1

q(✓j)

KL(q||p(·|x)) =Z

✓q(✓) log

q(✓)

p(✓|x)d✓

✓1

✓2

[MacKay 2003; Bishop 2006; Wang, Titterington 2004; Turner, Sahani 2011]

p(✓|x)

q⇤(✓)

4

Page 51: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Variational Bayes

!

• Mean-field variational Bayes (MFVB)

!

• Underestimates variance (sometimes severely)

• No covariance estimates

What about uncertainty?

q(✓) =JY

j=1

q(✓j)

KL(q||p(·|x)) =Z

✓q(✓) log

q(✓)

p(✓|x)d✓

✓1

✓2

[MacKay 2003; Bishop 2006; Wang, Titterington 2004; Turner, Sahani 2011]

p(✓|x)

q⇤(✓)

[Dunson 2014; Bardenet, Doucet, Holmes 2015]4

Page 52: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

5

Page 53: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

5

Page 54: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

5

Page 55: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

5

Page 56: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

p(✓|x)

[Bishop 2006]5

Page 57: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

5

Page 58: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

[Bishop 2006]

p(✓|x)

q⇤(✓)

5

Page 59: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

[Bishop 2006]

p(✓|x)

q⇤(✓)

5

Page 60: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

[Bishop 2006]

p(✓|x)

q⇤(✓)

5

Page 61: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

[Bishop 2006]

p(✓|x)

q⇤(✓)

5

Page 62: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

[Bishop 2006]

p(✓|x)

q⇤(✓)

5

Page 63: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

[Bishop 2006]

p(✓|x)

q⇤(✓)

5

Page 64: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

[Bishop 2006]

p(✓|x)

q⇤(✓)

5

Page 65: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

[Bishop 2006]

p(✓|x)

q⇤(✓)

5

Page 66: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

[Bishop 2006]

p(✓|x)

q⇤(✓)

5

Page 67: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

[Bishop 2006]

p(✓|x)

q⇤(✓)

⌃ =d

dtT

d

dtC

p(·|x)(t)

�����t=05

Page 68: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

[Bishop 2006]

p(✓|x)

q⇤(✓)

5

Page 69: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

[Bishop 2006]

p(✓|x)

q⇤(✓)

5

Page 70: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

[Bishop 2006]

p(✓|x)

q⇤(✓)

5

Page 71: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

[Bishop 2006]

p(✓|x)

q⇤(✓)

5

Page 72: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Cumulant-generating function

!

• True posterior covariance vs MFVB covariance

!

• “Linear response” !

• The LRVB approximation

V :=d2

dtT dtCq⇤(t)

����t=0

Linear response

mean =d

dtC(t)

����t=0

⌃ :=d2

dtT dtC

p(·|x)(t)

����t=0

log pt(✓) := log p(✓|x) + t

T✓ � C(t), MFVB q⇤t

⌃ =d

dtTEpt✓

����t=0

⇡ d

dtTEq⇤t ✓

����t=0

=: ⌃

[Bishop 2006]

C(t) := logEetT ✓

p(✓|x)

q⇤(✓)

5

Page 73: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• LRVB covariance estimate

• Suppose exponential family with mean parametrization

LRVB estimator⌃ :=

d

dtTEq⇤t ✓

����t=0

qt mt

6

Page 74: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• LRVB covariance estimate

• Suppose exponential family with mean parametrization

LRVB estimator⌃ :=

d

dtTEq⇤t ✓

����t=0

qt mt

6

Page 75: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• LRVB covariance estimate

• Suppose exponential family with mean parametrization

LRVB estimator⌃ :=

d

dtTEq⇤t ✓

����t=0

qt mt

= (I � V H)�1V

6

Page 76: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• LRVB covariance estimate

• Suppose exponential family with mean parametrization

LRVB estimator

⌃ =

✓@2KL

@m@mT

����m=m⇤

◆�1

⌃ :=d

dtTEq⇤t ✓

����t=0

qt mt

= (I � V H)�1V

6

Page 77: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• LRVB covariance estimate

• Suppose exponential family with mean parametrization

LRVB estimator

⌃ =

✓@2KL

@m@mT

����m=m⇤

◆�1

⌃ :=d

dtTEq⇤t ✓

����t=0

qt mt

= (I � V H)�1V

6

Page 78: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• LRVB covariance estimate

• Suppose exponential family with mean parametrization

LRVB estimator

⌃ =

✓@2KL

@m@mT

����m=m⇤

◆�1

⌃ :=d

dtTEq⇤t ✓

����t=0

qt mt

= (I � V H)�1V

6

Page 79: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• LRVB covariance estimate

• Suppose exponential family with mean parametrization

• Symmetric and positive definite at local min of KL

• The LRVB assumption:

LRVB estimator

⌃ =

✓@2KL

@m@mT

����m=m⇤

◆�1

⌃ :=d

dtTEq⇤t ✓

����t=0

qt mt

= (I � V H)�1V

6

Page 80: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• LRVB covariance estimate

• Suppose exponential family with mean parametrization

• Symmetric and positive definite at local min of KL

• The LRVB assumption: Ept✓ ⇡ Eq⇤t ✓

LRVB estimator

⌃ =

✓@2KL

@m@mT

����m=m⇤

◆�1

⌃ :=d

dtTEq⇤t ✓

����t=0

qt mt

= (I � V H)�1V

6

Page 81: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• LRVB covariance estimate

• Suppose exponential family with mean parametrization

• Symmetric and positive definite at local min of KL

• The LRVB assumption: Ept✓ ⇡ Eq⇤t ✓p(✓|x)

q⇤(✓)

[Bishop 2006]

LRVB estimator

⌃ =

✓@2KL

@m@mT

����m=m⇤

◆�1

⌃ :=d

dtTEq⇤t ✓

����t=0

qt mt

= (I � V H)�1V

6

Page 82: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• LRVB covariance estimate

• Suppose exponential family with mean parametrization

• Symmetric and positive definite at local min of KL

• The LRVB assumption: Ept✓ ⇡ Eq⇤t ✓p(✓|x)

q⇤(✓)

• LRVB estimate is exact when MFVB gives exact mean (e.g. multivariate normal)

[Bishop 2006]

LRVB estimator

⌃ =

✓@2KL

@m@mT

����m=m⇤

◆�1

⌃ :=d

dtTEq⇤t ✓

����t=0

qt mt

= (I � V H)�1V

6

Page 83: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

7

Page 84: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

7

Page 85: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

7

Page 86: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

7

Page 87: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

7

Page 88: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

yknindep⇠ N (µk + Tkn⌧k,�

2k)

profit

7

Page 89: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

yknindep⇠ N (µk + Tkn⌧k,�

2k)

profit

7

Page 90: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

yknindep⇠ N (µk + Tkn⌧k,�

2k)

profit

7

Page 91: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

yknindep⇠ N (µk + Tkn⌧k,�

2k)

profit

7

Page 92: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

yknindep⇠ N (µk + Tkn⌧k,�

2k)

profit1 if microcredit

7

Page 93: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

yknindep⇠ N (µk + Tkn⌧k,�

2k)

profit1 if microcredit

7

Page 94: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

yknindep⇠ N (µk + Tkn⌧k,�

2k)

profit1 if microcredit

7

Page 95: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

yknindep⇠ N (µk + Tkn⌧k,�

2k)

profit1 if microcredit

7

Page 96: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

yknindep⇠ N (µk + Tkn⌧k,�

2k)

profit1 if microcredit

7

Page 97: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

yknindep⇠ N (µk + Tkn⌧k,�

2k)

✓µk

⌧k

◆iid⇠ N

✓✓µ⌧

◆, C

profit1 if microcredit

7

Page 98: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

yknindep⇠ N (µk + Tkn⌧k,�

2k)

✓µk

⌧k

◆iid⇠ N

✓✓µ⌧

◆, C

��2k

iid⇠ �(a, b)

profit1 if microcredit

7

Page 99: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

yknindep⇠ N (µk + Tkn⌧k,�

2k)

✓µk

⌧k

◆iid⇠ N

✓✓µ⌧

◆, C

◆ ✓µ⌧

◆iid⇠ N

✓✓µ0

⌧0

◆,⇤�1

��2k

iid⇠ �(a, b)

profit1 if microcredit

7C ⇠ Sep&LKJ(⌘, c, d)

Page 100: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment

8

Page 101: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment

MFV

B

8

Page 102: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment

MFV

B

8

Page 103: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• One set of 2500

MCMC draws: 45 minutes

• All of MFVB optimization, LRVB uncertainties, all sensitivity measures: 58 seconds!

• Many other models and data sets: Mixture models, generalized linear mixed models, etc

MFV

B

8

Page 104: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• One set of 2500

MCMC draws: 45 minutes

• All of MFVB optimization, LRVB uncertainties, all sensitivity measures: 58 seconds!

• Many other models and data sets: Mixture models, generalized linear mixed models, etc

MFV

B

8

Page 105: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• One set of 2500

MCMC draws: 45 minutes

• All of MFVB optimization, LRVB uncertainties, all sensitivity measures: 58 seconds!

• Many other models and data sets: Mixture models, generalized linear mixed models, etc

MFV

B

LRVB,!MFVB

8

Page 106: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment• One set of 2500

MCMC draws: 45 minutes

• All of MFVB optimization, LRVB uncertainties, all sensitivity measures: 58 seconds!

• Many other models and data sets: Mixture models, generalized linear mixed models, etc

MFV

B

LRVB,!MFVB

8

Page 107: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Robustness quantification

• Variational Bayes as an alternative to MCMC • Challenges of VB • Accurate uncertainties from VB • Accurate robustness quantification from VB • Big idea: derivatives/perturbations are easy in VB

9

Page 108: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Robustness quantification

• Variational Bayes as an alternative to MCMC • Challenges of VB • Accurate uncertainties from VB • Accurate robustness quantification from VB • Big idea: derivatives/perturbations are easy in VB

9

Page 109: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Robustness quantification• Bayes Theorem

p(✓|x)/✓ p(x|✓)p(✓)

10

Page 110: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Robustness quantification• Bayes Theoremp↵(✓) := p(✓|x,↵)

/✓ p(x|✓)p(✓|↵)

10

Page 111: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Robustness quantification• Bayes Theoremp↵(✓) := p(✓|x,↵)

/✓ p(x|✓)p(✓|↵)

10

Page 112: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Robustness quantification• Bayes Theoremp↵(✓) := p(✓|x,↵)

/✓ p(x|✓)p(✓|↵)

• Sensitivity

10

Page 113: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Bayes Theorem

Robustness quantification• Bayes Theoremp↵(✓) := p(✓|x,↵)

/✓ p(x|✓)p(✓|↵)

• Sensitivity

10

Page 114: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Bayes Theorem

Robustness quantification• Bayes Theoremp↵(✓) := p(✓|x,↵)

/✓ p(x|✓)p(✓|↵)

• Sensitivity

S :=dEp↵ [g(✓)]

d↵

����↵

�↵

10

Page 115: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Bayes Theorem

Robustness quantification• Bayes Theoremp↵(✓) := p(✓|x,↵)

/✓ p(x|✓)p(✓|↵)

• Sensitivity

S :=dEp↵ [g(✓)]

d↵

����↵

�↵

10

Page 116: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Bayes Theorem

Robustness quantification• Bayes Theoremp↵(✓) := p(✓|x,↵)

/✓ p(x|✓)p(✓|↵)

• Sensitivity

S :=dEp↵ [g(✓)]

d↵

����↵

�↵

10

Page 117: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Bayes Theorem

Robustness quantification• Bayes Theoremp↵(✓) := p(✓|x,↵)

/✓ p(x|✓)p(✓|↵)

• Sensitivity

S :=dEp↵ [g(✓)]

d↵

����↵

�↵

10

Page 118: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Bayes Theorem

Robustness quantification• Bayes Theoremp↵(✓) := p(✓|x,↵)

/✓ p(x|✓)p(✓|↵)

• Sensitivity

S :=dEp↵ [g(✓)]

d↵

����↵

�↵

⇡dEq⇤↵ [g(✓)]

d↵

����↵

�↵ =: S

10

Page 119: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Bayes Theorem

Robustness quantification• Bayes Theoremp↵(✓) := p(✓|x,↵)

/✓ p(x|✓)p(✓|↵)

• Sensitivity

S :=dEp↵ [g(✓)]

d↵

����↵

�↵

⇡dEq⇤↵ [g(✓)]

d↵

����↵

�↵ =: S LRVB estimator

10

Page 120: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Bayes Theorem

Robustness quantification• Bayes Theoremp↵(✓) := p(✓|x,↵)

/✓ p(x|✓)p(✓|↵)

• Sensitivity

S :=dEp↵ [g(✓)]

d↵

����↵

�↵

⇡dEq⇤↵ [g(✓)]

d↵

����↵

�↵ =: S LRVB estimator

• When in exponential familyq⇤↵

10

Page 121: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Bayes Theorem

Robustness quantification• Bayes Theorem

S = A

✓@2KL

@m@mT

����m=m⇤

◆�1

B

p↵(✓) := p(✓|x,↵)/✓ p(x|✓)p(✓|↵)

• Sensitivity

S :=dEp↵ [g(✓)]

d↵

����↵

�↵

⇡dEq⇤↵ [g(✓)]

d↵

����↵

�↵ =: S LRVB estimator

• When in exponential familyq⇤↵

10

Page 122: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

C ⇠ Sep&LKJ(⌘, c, d)

Microcredit Experiment• Simplified from Meager (2015) • K microcredit trials (Mexico, Mongolia, Bosnia, India,

Morocco, Philippines, Ethiopia) • Nk businesses in kth site (~900 to ~17K) • Profit of nth business at kth site:

!

!• Priors and hyperpriors:

yknindep⇠ N (µk + Tkn⌧k,�

2k)

✓µk

⌧k

◆iid⇠ N

✓✓µ⌧

◆, C

◆ ✓µ⌧

◆iid⇠ N

✓✓µ0

⌧0

◆,⇤�1

��2k

iid⇠ �(a, b)

profit1 if microcredit

11

Page 123: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment

12

Page 124: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment

MFV

B

12

Page 125: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment

• Perturb Λ11: 0.03 ➔ 0.04

MFV

B

12

Page 126: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment

• Perturb Λ11: 0.03 ➔ 0.04

Sensitivity

MFV

BLR

VB

12

Page 127: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment

• Perturb Λ11: 0.03 ➔ 0.04

Sensitivity

MFV

BLR

VB

12

Page 128: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Microcredit Experiment

13

Page 129: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Sensitivity of the expected microcredit effect (τ)

• Normalized to be on scale of standard deviations in τ

• E.g.

Microcredit Experiment

13

Page 130: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Sensitivity of the expected microcredit effect (τ)

• Normalized to be on scale of standard deviations in τ

• E.g.

Microcredit Experiment

13

Page 131: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Sensitivity of the expected microcredit effect (τ)

• Normalized to be on scale of standard deviations in τ

• E.g.

Microcredit Experiment

13

Page 132: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Sensitivity of the expected microcredit effect (τ)

• Normalized to be on scale of standard deviations in τ

• E.g.

Microcredit Experiment

13

Page 133: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Sensitivity of the expected microcredit effect (τ)

• Normalized to be on scale of standard deviations in τ

• E.g.

Microcredit Experiment

13

Page 134: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Sensitivity of the expected microcredit effect (τ)

• Normalized to be on scale of standard deviations in τ

• E.g.

Microcredit Experiment

StdDevq⌧ = 1.8

13

Page 135: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Sensitivity of the expected microcredit effect (τ)

• Normalized to be on scale of standard deviations in τ

• E.g.

Microcredit Experiment

StdDevq⌧ = 1.8Eq⌧ = 3.7

13

Page 136: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Sensitivity of the expected microcredit effect (τ)

• Normalized to be on scale of standard deviations in τ

• E.g.

Microcredit Experiment

StdDevq⌧ = 1.8Eq⌧ = 3.7= 2.06 ⇤ StdDevq⌧

13

Page 137: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Sensitivity of the expected microcredit effect (τ)

• Normalized to be on scale of standard deviations in τ

• E.g.

Microcredit Experiment

StdDevq⌧ = 1.8Eq⌧ = 3.7= 2.06 ⇤ StdDevq⌧

⇤12 + = 0.03

13

Page 138: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

• Sensitivity of the expected microcredit effect (τ)

• Normalized to be on scale of standard deviations in τ

• E.g.

Microcredit Experiment

StdDevq⌧ = 1.8Eq⌧ = 3.7= 2.06 ⇤ StdDevq⌧

⇤12 + = 0.03

13Eq⌧ < 1.0 ⇤ StdDevq⌧

Page 139: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

Conclusion• We provide linear response variational Bayes:

supplements MFVB for fast & accurate covariance estimate

• More from LRVB: fast & accurate robustness quantification

• Interested in your data and models: • Sensitivity to prior perturbations • Sensitivity to data perturbations

14

Page 140: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

T. Broderick, N. Boyd, A. Wibisono, A. C. Wilson, and M. I. Jordan. Streaming variational Bayes. NIPS, 2013. !R Giordano, T Broderick, and MI Jordan. Linear response methods for accurate covariance estimates from mean field variational Bayes. NIPS, 2015.!!R Giordano, T Broderick, and MI Jordan. Robust Inference with Variational Bayes. NIPS AABI Workshop, 2015. ArXiv:1512.02578.!! https://github.com/rgiordan/MicrocreditLRVB! !J. Huggins, T. Campbell, and T. Broderick. Core sets for scalable Bayesian logistic regression. Under review. ArXiv:1605.06423.

References

15

Page 141: Fast Robustness Quantification with Variational …people.csail.mit.edu/tbroderick/files/broderick_2016...Fast Robustness Quantification with Variational Bayes ITT Career Development

References

R Bardenet, A Doucet, and C Holmes. On Markov chain Monte Carlo methods for tall data. arXiv, 2015.

CM Bishop. Pattern Recognition and Machine Learning.

D Dunson. Robust and scalable approach to Bayesian inference. Talk at ISBA 2014.

DJC MacKay. Information Theory, Inference, and Learning Algorithms. Cambridge University Press, 2003.

R Meager. Understanding the impact of microcredit expansions: A Bayesian hierarchical analysis of 7 randomised experiments. ArXiv:1506.06669, 2015.

RE Turner and M Sahani. Two problems with variational expectation maximisation for time-series models. In D Barber, AT Cemgil, and S Chiappa, editors, Bayesian Time Series Models, 2011.

B Wang and M Titterington. Inadequacy of interval estimates corresponding to variational Bayesian approximations. In AISTATS, 2004.

16