Non-myopic Informative Path Planning in Spatio-Temporal Models Alexandra Meliou Andreas Krause...

Non-myopic Informative Path Planning in Spatio-Temporal

Models

Alexandra Meliou

Andreas Krause

Carlos Guestrin

Joe Hellerstein

Collection Tours

Approximate Queries Approximate representation of the world:

Discrete locations Lossy communication Noisy measurements

Applications do not expect accurate values (tolerance to noise)

Monitored phenomena usually demonstrate strong correlationsCorrelation makes approximation cheap

Example: Return the temperature at all locations ±1C, with 95% confidence

Optimizing Information

: sensing nodes on path

Approximate answers

Search for most informative paths

Continuous Queries

Repeated at periodic intervals Finite horizon

Example: Return the temperature at all locations ±1C, with 95% confidence,

every 10 minutes for the next 5 hours.

Myopic vs Nonmyopic tradeoff

Myopic approach: repeat optimization for every timestep

Timestep 1Timestep 2

Myopic vs Nonmyopic tradeoff

Nonmyopic approach: optimize for all timesteps

Timestep 1Timestep 2 No work! Extra node

Quantify Informativeness

Entropy [Shewry & Wynn ‘87]

Mutual Information [Caselton & Zidek ‘84]

Reduction of predictive variance [Chaloner & Verdinelli ‘95]

Measuring Information

Observing 1 gives information on 3 and 4

Observing 2 gives information on 3 and 5

After observing 2, observing 3 becomes less useful

Diminishing Returns

Submodular Functions

)()()()( BFXBFAFXAF −∪≥−∪BA⊆

More reward

Less reward

Entropy, mutual information and reduction of predictive variance are all submodular.

Non-myopic Spatio-Temporal Path Planning (NSTP)

Given: A collection of submodular functions ft

• ft only depends on data collected at times 1..t

A set of accuracy constraints kt

Find: A collection of paths Pt with

( )( ) ttt

= ∑=

minarg

Minimize cost

Subject to reward constraints

Planning for multiple timesteps

Harder than planning for one

First idea : Solve an equivalent single step problem

instead!

obviously

Nonmyopic Planning Graph

t=1 t=2 t=3

A solution path on the NPG = collection of paths for multiple timesteps

Solve the single step problem

NP hard No good known approximation guarantees

Dual: Submodular Orienteering Problem

P* = argmaxP f P( )

s.t. C P( ) ≤ B

P* = argminP C P( )

s.t. f P( ) ≥ K

dual: primal:Maximize reward

Subject to budget constraints

Minimize cost

Subject to reward constraints

Good News

The dual algorithm [Chekuri & Pal ’05] provides an O(logn) factor approximation

f ( ˆ P ) ≥f (OPT)

(where n is the size of the network)

Covering Algorithm

Transform a dual blackbox solution to a primal solution

P* = argmaxP f P( )

s.t. C P( ) ≤ B

P* = argminP C P( )

s.t. f P( ) ≥ K

primal:

Reward required to “cover”

(with α approximation factor)

Call with BOPT

Return solution with reward ≥K/α

CoveringAlgorithm

Transform a dual blackbox solution to a primal solution

Reward required to “cover”

• Call SOP for increasing budgets

• Guaranteed to cover K/α reward when called for BOPT

• Update chosen set and repeat for uncovered reward

• Terminate when ε portion left

Guaranteed to use at most budget

2logε

log 1−1

⎝ ⎜

⎠ ⎟BOPT

• Call for budget 1 : insufficient reward• Call for budget 2• Call for budget BOPT: reward sufficient!

uncovered reward

Bad News

On the unrolled graph the Chekuri-Pal guarantee becomes O(log(nT))

The running time on the unrolled graph is O((BnT)log(nT))

Addressing Computation Complexity

DP Algorithm Algorithm details in proceedings Bug in proof of guarantees. Not fixed (yet)

New algorithm: Nonmyopic Greedy Details on my webpage… Guaranteed to provide O(logn) approximation

Better than the previous O(log(nT))

Approach

Replace expensive blackbox, with cheaper blackbox

Covering transformation

Chekuri-Pal

SOP on NPG

Blackbox for dual

Nonmyopic greedy

algorithm

Blackbox for dual

More efficient:Nonmyopic greedy calls the dual on the smaller network graph instead of the unrolled

Nonmyopic Greedy

dual(b,Gt)

R = 2C = 1

R = 1C = 1

R = 3C = 2

R = 5C = 4

R = 4C = 2

R = 3C = 2

R = 6C = 4

R = 3C = 2

R = 5C = 3

R = 4C = 3

R = 5C = 4

budget

Cost = 2

Time = 2

Best greedy choice condition on A1

R = 2C = 1

R = 1C = 1

R = 2C = 2

R = 1C = 2

XX X XX

Cost = 1

Time = 1

R = 0C = 1

R = 1C = 1

X X XP3

Cost = 1

Time = 3

Best ratio R/C1. Condition on picked data2. Recompute matrix

Return best of A1, A2

dual(budget=4,time=1)

dual(budget=1,time=3)

For border cases were A1 is bad, A2 is guaranteed to be good

Nonmyopic Greedy Guarantees

Nonmyopic greedy Chekuri-Pal on NPG

O(B2T(nB)logn) O((nBT)log(nT))

f (P) ≥1− e−1

2log nf (OPT)

f (P) ≥1

log(nT)f (OPT)

Myopic and Nonmyopic evaluation

Varying Constraints

Setup: 46 nodes on the Intel Berkeley Lab deployment 7 days of data (5 for learning, 2 for testing)

Cost and Runtime

Varying Horizon

Effect of greedy parameters

Varying budget levels

Conclusions Transform any blackbox solution to

nonmyopic

Obtain primal from dual

Nonmyopic greedy provides significant runtime improvements and better theoretical guarantees

Non-myopic Informative Path Planning in Spatio-Temporal Models Alexandra Meliou Andreas Krause...

Documents

Transcript of Non-myopic Informative Path Planning in Spatio-Temporal Models Alexandra Meliou Andreas Krause...

Metro Maps of Dafna Shahaf Carlos Guestrin Eric Horvitz.

Federated Facts and Figures Joseph M. Hellerstein UC Berkeley.

Carnegie Mellon University Joseph Gonzalez Joint work with Yucheng Low Aapo Kyrola Danny Bickson Carlos Guestrin Joe Hellerstein Alex Smola The Next Generation.

Carnegie Mellon GraphLab A New Framework for Parallel Machine Learning Yucheng Low Aapo Kyrola Carlos Guestrin Joseph Gonzalez Danny Bickson Joe Hellerstein.

Copyright ©2004 Carlos Guestrin guestrin VLDB 2004 Efficient Data Acquisition in Sensor Networks Presented By Kedar Bellare (Slides adapted.

Online Aggregation Joseph M. Hellerstein Peter J. Haas Helen J. Wang.

The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou.

Markov Decision Processes (MDPs) (cont.)guestrin/Class/15781/slides/mdps-rl...1 Markov Decision Processes (MDPs) (cont.) Machine Learning – 10701/15781 Carlos Guestrin Carnegie Mellon

Eddies: Continuously Adaptive Query Processing Ron Avnur Joseph M. Hellerstein UC Berkeley.

PCA - Carnegie Mellon School of Computer Scienceguestrin/Class/15781/slides/pca-mdps...©2005-2007 Carlos Guestrin 1 PCA Machine Learning – 10701/15781 Carlos Guestrin Carnegie Mellon

Dynamic Bayesian Networks Beyond 10708./guestrin/Class/10708-F06/Slides/dbn... · 2006. 12. 26. · 1 1 Dynamic Bayesian Networks Beyond 10708 Graphical Models – 10708 Carlos Guestrin

08 Leket Hellerstein Against Girl Songs A

A Decision Stump - cs.cmu.eduguestrin/Class/15781/slides/... · A Decision Stump. 2 3 ©Carlos Guestrin 2005-2007 The final tree 4 ©Carlos Guestrin 2005-2007 Basic Decision Tree

JournalofUrbanEconomics - UCI Social Sciencesdneumark/Hellerstein et al JUE 2008.pdf · 466 J.K. Hellerstein et al. / Journal of Urban Economics 64 (2008) 464–479 city) residence

Model-driven Data Acquisition in Sensor Networks Amol Deshpande 1,4 Carlos Guestrin 4,2 Sam Madden 4,3 Joe Hellerstein 1,4 Wei Hong 4 1 UC Berkeley 2 Carnegie.

Daniel Hellerstein (ERS) and Sean Sylvia (AREC/UMD)

Carlos Guestrin- Neural Networks

Towards Adaptive Dataflow Infrastructure Joe Hellerstein, UC Berkeley.

Learning Tree Conditional Random Fields Joseph K. Bradley Carlos Guestrin.

A Sketch of Regres Mike Carey Joey Hellerstein Michael Stonebraker.