Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S...

56
W I S S E N T E C H N I K L E I D E N S C H A F T www.tugraz.at Recent Algorithmic Advances in Topological Data Analysis Michael Kerber Gudhi workshop, Porquerolles, France, Oct 19, 2016

Transcript of Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S...

Page 1: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

W I S S E N T E C H N I K L E I D E N S C H A F T

www.tugraz.at

Recent Algorithmic Advancesin Topological Data AnalysisMichael KerberGudhi workshop, Porquerolles, France, Oct 19, 2016

Page 2: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

2 Computational Topology@TU Graz

Arnur Nigmetov Hannah Schreiber Aruni Choudhary(MPI Saarbrucken)

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 3: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

3 Our mission

“. . . to boldlycompute what notopologists hascomputed before.”

Algorithmic foundations, implementations, andsoftware in computational topology and geometry.

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 4: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

3 Our mission

“. . . to boldlycompute what notopologists hascomputed before.”

Algorithmic foundations, implementations, andsoftware in computational topology and geometry.

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 5: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

4 The algorithmic pipeline

1. Turn input into multi-scale representation

2. Compute topological invariants

3. Draw conclusions about the input

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 6: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

4 The algorithmic pipeline

1. Turn input into multi-scale representation

2. Compute topological invariants

3. Draw conclusions about the input

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 7: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

5 Cech filtrations

Nested sequence of simplicial complexes

Important in topological data analysis

Vietoris-Rips complexes: Closely related

Problem: Size of k -skeleton is( n

k+1

)= O(nk+1)

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 8: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

6 Topological approximation

Persistence diagram of the Cech filtration:

birth

death

Question: Can we find a small filtration whosepersistence diagram is provably close to the Cechdiagram?

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 9: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

6 Topological approximation

Persistence diagram of the Cech filtration:

birth

death

Question: Can we find a small filtration whosepersistence diagram is provably close to the Cechdiagram?

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 10: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

7 Previous work

Sparse Rips complex [Sheehy 2012]

(1 + ε)-approximation of size

n ·(

)O(∆k)

with ∆ the doubling dimension of the point set

Various related approaches [Dey, Fan, Wang 2012]

[K., Sharathkumar 2013] [Botnan, Spreemann 2015]

[Buchet et al. 2015] [Cavanna, Jahanseir, Sheehy 2015]

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 11: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

8 Our contributions [Choudhary,K.,Raghvendra, SoCG 2016]

6(d + 1)-approximation of size

n · 2O(d log k)

per scale for n points in Rd .

Combined with dimension reduction:O(log3/2 n)-approximation of size nO(1).

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 12: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

8 Our contributions [Choudhary,K.,Raghvendra, SoCG 2016]

6(d + 1)-approximation of size

n · 2O(d log k)

per scale for n points in Rd .

Combined with dimension reduction:O(log3/2 n)-approximation of size nO(1).

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 13: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

9 Approximation by lattices I

Diameter of a cell: α ·√

d

Two non-adjacent cells areat least α apart

⇒√

d-approximation!

But highly degenerate: 2d

cells intersect in a point(leads to size n · 2O(dk))

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 14: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

9 Approximation by lattices I

Diameter of a cell: α ·√

d

Two non-adjacent cells areat least α apart

⇒√

d-approximation!

But highly degenerate: 2d

cells intersect in a point(leads to size n · 2O(dk))

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 15: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

9 Approximation by lattices I

Diameter of a cell: α ·√

d

Two non-adjacent cells areat least α apart

⇒√

d-approximation!

But highly degenerate: 2d

cells intersect in a point(leads to size n · 2O(dk))

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 16: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

9 Approximation by lattices I

Diameter of a cell: α ·√

d

Two non-adjacent cells areat least α apart

⇒√

d-approximation!

But highly degenerate: 2d

cells intersect in a point(leads to size n · 2O(dk))

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 17: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

9 Approximation by lattices I

Diameter of a cell: α ·√

d

Two non-adjacent cells areat least α apart

⇒√

d-approximation!

But highly degenerate: 2d

cells intersect in a point(leads to size n · 2O(dk))

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 18: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

10 Approximation by lattices II

Hexagonal grid

How to generalize in higherdimensions?

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 19: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

11 The permutahedron

Voronoi region of A∗d -lattice

Diameter: O(α ·√

d)

Lemma: Non-intersectingcells are at leastα ·√

2/(d + 1) apart.

Size of the dual k -skeleton:n2O(d log k)

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 20: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

11 The permutahedron

Voronoi region of A∗d -lattice

Diameter: O(α ·√

d)

Lemma: Non-intersectingcells are at leastα ·√

2/(d + 1) apart.

Size of the dual k -skeleton:n2O(d log k)

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 21: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

11 The permutahedron

Voronoi region of A∗d -lattice

Diameter: O(α ·√

d)

Lemma: Non-intersectingcells are at leastα ·√

2/(d + 1) apart.

Size of the dual k -skeleton:n2O(d log k)

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 22: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

12 Interleaving

φ

φ

φ

φ

ψ

ψ

φ

· · · //Rβ2(d+1)

φ

&&

g //Rβ8(d+1)3 // · · ·

· · · // Xβ

ψ;;

θ // Xβ4(d+1)2

ψ88

// · · ·

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 23: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

13 The Johnson-Lindenstrauss Lemma

For a point set S ⊂ Rd of n points and 0 < ε < 1, thereis a map

f : Rd → Rm

with m = O( log nε2 ) such that for any two points s, t ∈ S:

(1− ε)‖s − t‖ ≤ ‖f (s)− f (t)‖ ≤ (1 + ε)‖s − t‖.

Moreover, a random (scaled) projection from Rd toRm has that property with a probability of at least 1

2 .

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 24: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

13 The Johnson-Lindenstrauss Lemma

For a point set S ⊂ Rd of n points and 0 < ε < 1, thereis a map

f : Rd → Rm

with m = O( log nε2 ) such that for any two points s, t ∈ S:

(1− ε)‖s − t‖ ≤ ‖f (s)− f (t)‖ ≤ (1 + ε)‖s − t‖.

Moreover, a random (scaled) projection from Rd toRm has that property with a probability of at least 1

2 .

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 25: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

14 Dimension reduction

Size per scale n · 2O(d log k)

[Johnson, Lindenstrauss 1984]: d ≈ log n (constant distortion)⇒ size nO(log k), total distortion O(log n)

[Matousek 1990]: d ≈ log nlog log n ((log n)-distortion)

⇒ size nO(1), total distortion O(log2 n)

[Bourgain 1985]: General metric space: Embed toO(log2 n) dimensions with distortion O(log n)

⇒size nO(1) and total distortion O(log3 n)

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 26: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

14 Dimension reduction

Size per scale n · 2O(d log k)

[Johnson, Lindenstrauss 1984]: d ≈ log n (constant distortion)⇒ size nO(log k), total distortion O(log n)

[Matousek 1990]: d ≈ log nlog log n ((log n)-distortion)

⇒ size nO(1), total distortion O(log2 n)

[Bourgain 1985]: General metric space: Embed toO(log2 n) dimensions with distortion O(log n)

⇒size nO(1) and total distortion O(log3 n)

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 27: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

14 Dimension reduction

Size per scale n · 2O(d log k)

[Johnson, Lindenstrauss 1984]: d ≈ log n (constant distortion)⇒ size nO(log k), total distortion O(log n)

[Matousek 1990]: d ≈ log nlog log n ((log n)-distortion)

⇒ size nO(1), total distortion O(log2 n)

[Bourgain 1985]: General metric space: Embed toO(log2 n) dimensions with distortion O(log n)

⇒size nO(1) and total distortion O(log3 n)

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 28: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

15 Our contributions [Choudhary,K.,Raghvendra, SoCG 2016]

6(d + 1)-approximation of size

n · 2O(d log k)

per scale for n points in Rd .

Combined with dimension reduction:O(log3/2 n)-approximation of size nO(1).

Lower bound: Any (1 + δ)-approximation scheme hasto be of size nΩ(log log n) if δ < 1

96 log1.001 n.

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 29: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

16 The algorithmic pipeline

1. Turn input into multi-scale representation2. Compute topological invariants3. Draw conclusions about the input

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 30: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

17 Distances between persistence diagrams

A1 A2 · · · An

A1 A2 . . . An−1 An

A1A2... d(Ai ,Aj)

An−1An

n diagrams⇒(n

2

)distances

Often the computationalbottleneck

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 31: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

18 Distance measures

One-to-one pairing of points

Every point must be paired

Pairing with diagonal allowed

Cost of a pair (p,q): ‖p − q‖∞Bottleneck distance: Minimize maximal cost1-Wasserstein distance: Minimize Σ of the costsStability [Cohen-Steiner et al. 2007]

Other distances

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 32: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

19 From diagram distance to graph matching

Weighted complete bipartite graph GWeight of an edge: L∞ distance of the pointsEXCEPT: weight is zero if both points are on diagonalUse graph matching algorithm (Hopcroft-Karp,Hungarian,. . .)

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 33: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

20 Does geometry help?

G is “almost” metric (modulo diagonal)Asymptotically faster algorithms are known for thiscase[Efrat, Itai, Katz: Geometry Helps in Bottleneck Matching... 2001]

[Vaidya: Geometry Helps in Matchings. 1989] (opt. assignment)Adaption to persistence diagrams straight-forward[Folklore? Mentioned in Edelsbrunner, Harer 2010]

AnswerYes, in theory

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 34: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

20 Does geometry help?

G is “almost” metric (modulo diagonal)Asymptotically faster algorithms are known for thiscase[Efrat, Itai, Katz: Geometry Helps in Bottleneck Matching... 2001]

[Vaidya: Geometry Helps in Matchings. 1989] (opt. assignment)Adaption to persistence diagrams straight-forward[Folklore? Mentioned in Edelsbrunner, Harer 2010]

AnswerYes, in theory

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 35: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

21 Does geometry help?

Our contributionYes, also in practice! [K., Morozov, Nigmetov, ALENEX 2016]

We compare geometric and non-geometricimplementations of bottleneck matchings and optimalassignment (for R2)We show experimentally that geometry improvesperformanceWe outperform Dionysus, the only publically availablesoftware for distances of persistence diagramsOur code is freely available:https://bitbucket.org/grey_narn/hera

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 36: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

22 The bottleneck case

Let G[α] be the graph G with all edges of weight > αdeleted

Observation: If G[α] has a perfect matching, thebottleneck distance is at most α.

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 37: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

23 The Wasserstein case

Assignment problem: Findperfect matching with minimalsum of costs.

Discrete optimal transport

Hungarian algorithm

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 38: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

23 The Wasserstein case

Assignment problem: Findperfect matching with minimalsum of costs.

Discrete optimal transport

Hungarian algorithm

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 39: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

24 The auction algorithm [Bertsekas 1988]

n bidders (left), n objects (right)

Maintain (partial) matchingObjects have (global) pricesBidders have (individual) appreciations

Repeat:

Pick an unassigned bidderValue of object: appreciation - pricePick best and 2nd-best valued objectsAssign bidder to best valued objectIncrease price by difference of values,plus ε > 0

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 40: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

24 The auction algorithm [Bertsekas 1988]

n bidders (left), n objects (right)Maintain (partial) matching

Objects have (global) pricesBidders have (individual) appreciations

Repeat:

Pick an unassigned bidderValue of object: appreciation - pricePick best and 2nd-best valued objectsAssign bidder to best valued objectIncrease price by difference of values,plus ε > 0

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 41: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

24 The auction algorithm [Bertsekas 1988]

5

8

8

3

6

n bidders (left), n objects (right)Maintain (partial) matchingObjects have (global) prices

Bidders have (individual) appreciations

Repeat:

Pick an unassigned bidderValue of object: appreciation - pricePick best and 2nd-best valued objectsAssign bidder to best valued objectIncrease price by difference of values,plus ε > 0

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 42: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

24 The auction algorithm [Bertsekas 1988]

5

8

8

3

6

3

11

7

2

7

n bidders (left), n objects (right)Maintain (partial) matchingObjects have (global) pricesBidders have (individual) appreciations

Repeat:

Pick an unassigned bidderValue of object: appreciation - pricePick best and 2nd-best valued objectsAssign bidder to best valued objectIncrease price by difference of values,plus ε > 0

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 43: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

24 The auction algorithm [Bertsekas 1988]

5

8

8

3

6

1

9

3

6

8

n bidders (left), n objects (right)Maintain (partial) matchingObjects have (global) pricesBidders have (individual) appreciations

Repeat:

Pick an unassigned bidderValue of object: appreciation - pricePick best and 2nd-best valued objectsAssign bidder to best valued objectIncrease price by difference of values,plus ε > 0

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 44: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

24 The auction algorithm [Bertsekas 1988]

5

8

8

3

6

n bidders (left), n objects (right)Maintain (partial) matchingObjects have (global) pricesBidders have (individual) appreciations

Repeat:

Pick an unassigned bidderValue of object: appreciation - pricePick best and 2nd-best valued objectsAssign bidder to best valued objectIncrease price by difference of values,plus ε > 0

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 45: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

24 The auction algorithm [Bertsekas 1988]

5

8

8

3

6

3

11

7

2

7

n bidders (left), n objects (right)Maintain (partial) matchingObjects have (global) pricesBidders have (individual) appreciations

Repeat:

Pick an unassigned bidder

Value of object: appreciation - pricePick best and 2nd-best valued objectsAssign bidder to best valued objectIncrease price by difference of values,plus ε > 0

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 46: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

24 The auction algorithm [Bertsekas 1988]

5

8

8

3

6

3

11

7

2

7

−2

−1

1

−1

3

n bidders (left), n objects (right)Maintain (partial) matchingObjects have (global) pricesBidders have (individual) appreciations

Repeat:

Pick an unassigned bidderValue of object: appreciation - price

Pick best and 2nd-best valued objectsAssign bidder to best valued objectIncrease price by difference of values,plus ε > 0

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 47: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

24 The auction algorithm [Bertsekas 1988]

5

8

8

3

6

1

3

n bidders (left), n objects (right)Maintain (partial) matchingObjects have (global) pricesBidders have (individual) appreciations

Repeat:

Pick an unassigned bidderValue of object: appreciation - pricePick best and 2nd-best valued objects

Assign bidder to best valued objectIncrease price by difference of values,plus ε > 0

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 48: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

24 The auction algorithm [Bertsekas 1988]

5

8

8

3

6

1

3

n bidders (left), n objects (right)Maintain (partial) matchingObjects have (global) pricesBidders have (individual) appreciations

Repeat:

Pick an unassigned bidderValue of object: appreciation - pricePick best and 2nd-best valued objectsAssign bidder to best valued object

Increase price by difference of values,plus ε > 0

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 49: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

24 The auction algorithm [Bertsekas 1988]

5

8

3

6

1

310+eps

n bidders (left), n objects (right)Maintain (partial) matchingObjects have (global) pricesBidders have (individual) appreciations

Repeat:

Pick an unassigned bidderValue of object: appreciation - pricePick best and 2nd-best valued objectsAssign bidder to best valued objectIncrease price by difference of values,plus ε > 0

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 50: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

25 Why auction?

Theorem [Bertsekas 1988]

Let opt denote the cost of the optimal assignment, and dthe cost returned by the auction. Then

opt− nε ≤ d ≤ opt

Large ε: Fast, rough approximationSmall ε: Slow, accurate approximationε-scaling [Bertsekas, Castanon 1991]

Remark: Getting exact result possible, but very slow

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 51: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

26 How geometry helps

−8

−5−3

−5−4

Appreciation = -distance to object

Crucial query: Find the best andsecond best object for an unassignedbidder.

Our approach

k-d-tree with weight per node

Weight=minimal price

Prune search in subtrees if bettercandidates are known

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 52: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

26 How geometry helps

−8

−5−3

−5−4

Appreciation = -distance to object

Crucial query: Find the best andsecond best object for an unassignedbidder.

Our approach

k-d-tree with weight per node

Weight=minimal price

Prune search in subtrees if bettercandidates are known

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 53: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

27 Experimental comparison

500 1,000 1,500 2,000

10−1

100

101

102

103

# points on a 4-sphere

Sec

onds

Dionysusnon-geom.

geom.

Linear space (geometric) vs quadratic space(non-geometric)Exact distance (Dionysus) vs relative1%-approximation

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 54: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

28 The Hera library

URL: https://bitbucket.org/grey_narn/hera

Code for bottleneck (LGPL) andWasserstein (BSD)

Supports q-Wasserstein distanceand different choice of innermetric (instead of L∞)

Download it! Use it! Tell us yourexperience!

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 55: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

29 The algorithmic pipeline

1. Turn input into multi-scale representation2. Compute topological invariants3. Draw conclusions about the input

Gudhi workshop, Porquerolles, France, Oct 19, 2016

Page 56: Recent Algorithmic Advances in Topological Data Analysis · W I S S E N T E C H N I K L E I D E N S C H A F T Recent Algorithmic Advances in Topological Data Analysis Michael Kerber

30

Gudhi workshop, Porquerolles, France, Oct 19, 2016