Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate...

41
Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20, 2008, Duke University joint work with Gunnar Carlsson (Stanford), Peter Kim and Zhiming Luo (Guelph), and Moo Chung (Wisconsin–Madison) Peter Bubenik Multivariate Topological Data Analysis

Transcript of Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate...

Page 1: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Multivariate Topological Data Analysis

Peter Bubenik

Cleveland State University

November 20, 2008, Duke Universityjoint work with Gunnar Carlsson (Stanford), Peter Kim and

Zhiming Luo (Guelph), and Moo Chung (Wisconsin–Madison)

Peter Bubenik Multivariate Topological Data Analysis

Page 2: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Framework

Ideal Truth(Parameter)

Observed Reality(Statistic)

Function on a compactmanifold

Sampled Data

Topological Descriptionvia Persistent Homology

measurement

Filter using level sets

Peter Bubenik Multivariate Topological Data Analysis

Page 3: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Framework

Ideal Truth(Parameter)

Observed Reality(Statistic)

Function on a compactmanifold

Sampled Data

Topological Descriptionvia Persistent Homology

Filtered Simplicial Complex

Topological Descriptionvia Persistent Homology

measurement

Filter using level sets

How close?

Peter Bubenik Multivariate Topological Data Analysis

Page 4: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Framework

Ideal Truth(Parameter)

Observed Reality(Statistic)

Function on a compactmanifold

Sampled Data

Topological Descriptionvia Persistent Homology

Estimated Function

Topological Descriptionvia Persistent Homology

measurement

Filter using level sets Statistical techniques

Filter using level setsHow close?

Peter Bubenik Multivariate Topological Data Analysis

Page 5: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Persistent homology

Persistent homology describes the homological features whichpersist as a single parameter changes.

Here, we take this parameter to be a threshold on the function onthe space from which we are sampling.

Peter Bubenik Multivariate Topological Data Analysis

Page 6: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Functions on R

Let f : R → R and assume that if f ′(x) = 0, then f ′′(x) 6= 0, ie, f

has non-degenerate critical points.

This means each critical point is a local minimum or a localmaximum.

Peter Bubenik Multivariate Topological Data Analysis

Page 7: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Functions on R

Let f : R → R and assume that if f ′(x) = 0, then f ′′(x) 6= 0, ie, f

has non-degenerate critical points.

This means each critical point is a local minimum or a localmaximum.

Define the sublevel sets Rf ≤t = f −1(−∞, t], t ∈ R.

Remark

As t increases, the topology of Rf ≤t does not change as long aswe do not pass a critical value.

Peter Bubenik Multivariate Topological Data Analysis

Page 8: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Critical Points of Functions on R

At the critical points we have the following effects:

At a local minimum, a new component is added.

At a local maximum, two components are merged.

Peter Bubenik Multivariate Topological Data Analysis

Page 9: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Critical Points of Functions on R

At the critical points we have the following effects:

At a local minimum, a new component is added.

At a local maximum, two components are merged.

Pairing critical points: Pair a local maximum, with the higher(younger) of the two local minimum associated with the twocomponents which it joins.

Peter Bubenik Multivariate Topological Data Analysis

Page 10: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Critical Points of Functions on R

At the critical points we have the following effects:

At a local minimum, a new component is added.

At a local maximum, two components are merged.

Pairing critical points: Pair a local maximum, with the higher(younger) of the two local minimum associated with the twocomponents which it joins.

If we graph all pairs of critical points we obtain the (Reduced)Persistence Diagram.

Peter Bubenik Multivariate Topological Data Analysis

Page 11: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Persistence Diagram for Functions on R

For example:

1 2 3 4

1

2

3

4

x

y y = f (x)

Peter Bubenik Multivariate Topological Data Analysis

Page 12: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Persistence Diagram for Functions on R

For example:

1 2 3 4

1

2

3

4

x

y y = f (x)

Rf ≤1

b

b

Peter Bubenik Multivariate Topological Data Analysis

Page 13: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Persistence Diagram for Functions on R

For example:

1 2 3 4

1

2

3

4

x

y y = f (x)

Rf ≤2

b

b

Peter Bubenik Multivariate Topological Data Analysis

Page 14: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Persistence Diagram for Functions on R

For example:

1 2 3 4

1

2

3

4

x

y y = f (x)

Rf ≤3

Peter Bubenik Multivariate Topological Data Analysis

Page 15: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Persistence Diagram for Functions on R

For example:

1 2 3 4

1

2

3

4

x

y y = f (x)

Rf ≤4

Peter Bubenik Multivariate Topological Data Analysis

Page 16: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Persistence Diagram for Functions on R

For example:

1 2 3 4

1

2

3

4

x

y y = f (x)

1 2 3 4

1

2

3

4

birth

death

b

b

Peter Bubenik Multivariate Topological Data Analysis

Page 17: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Persistent homology for densities on manifolds

Let (M, g) be a compact, connected Riemannian manifold, with adominating measure ν (the invariant measure which in localcoordinates is

|g |dx1...dxd ).

Let f : M → [0,∞] such that∫

Mfdν = 1.

This probability density gives an increasing filtration of M bysublevel sets

Mf ≤r = {x ∈ M | f (x) ≤ r}.

Peter Bubenik Multivariate Topological Data Analysis

Page 18: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Persistent homology for densities on manifolds

Let (M, g) be a compact, connected Riemannian manifold, with adominating measure ν (the invariant measure which in localcoordinates is

|g |dx1...dxd ).

Let f : M → [0,∞] such that∫

Mfdν = 1.

This probability density gives an increasing filtration of M bysublevel sets

Mf ≤r = {x ∈ M | f (x) ≤ r}.

This induces an increasing filtration on C∗(M),

the Morse filtration: Fr (C∗(M)) = C∗(Mf ≤r ),

from which we can calculate the persistent homology of f .

Peter Bubenik Multivariate Topological Data Analysis

Page 19: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

The statistical viewpoint

We assume that f = fθ belongs to a family of densities

{fθ | θ ∈ Θ}

where θ is the parameter and Θ is the parameter space which canbe finite dimensional (parametric), or, infinite dimensional(nonparametric).

Goal

Find an estimate θ of θ so that the persistent homology of fθ isclose to the persistent homology of fθ.

Peter Bubenik Multivariate Topological Data Analysis

Page 20: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Functions on M – Review of Morse Theory

Let f : M → R be a Morse function with distinct critical valuest0 < t1 < · · · < tk . Recall that Mf≤t = f −1(∞, t].

Peter Bubenik Multivariate Topological Data Analysis

Page 21: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Functions on M – Review of Morse Theory

Let f : M → R be a Morse function with distinct critical valuest0 < t1 < · · · < tk . Recall that Mf≤t = f −1(∞, t].

The index p of f at the critical point associated to the criticalvalue tj is the number of negative eigenvalues of the Hessian.

Peter Bubenik Multivariate Topological Data Analysis

Page 22: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Functions on M – Review of Morse Theory

Let f : M → R be a Morse function with distinct critical valuest0 < t1 < · · · < tk . Recall that Mf≤t = f −1(∞, t].

The index p of f at the critical point associated to the criticalvalue tj is the number of negative eigenvalues of the Hessian.

Morse theory says that Mf ≤tj is homotopy equivalent to ∂Mf ≤tj−1

together with an attached p−dimensional cell.Consequently, βp increases by one (positive) or βp−1 decreases byone (negative).

Peter Bubenik Multivariate Topological Data Analysis

Page 23: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Functions on M – Review of Morse Theory

Let f : M → R be a Morse function with distinct critical valuest0 < t1 < · · · < tk . Recall that Mf≤t = f −1(∞, t].

The index p of f at the critical point associated to the criticalvalue tj is the number of negative eigenvalues of the Hessian.

Morse theory says that Mf ≤tj is homotopy equivalent to ∂Mf ≤tj−1

together with an attached p−dimensional cell.Consequently, βp increases by one (positive) or βp−1 decreases byone (negative).

Pairing of the positive critical points of index p and the negativecritical points of index p + 1 produces the degree–p PersistenceDiagram, denoted by Dp(f ).

Peter Bubenik Multivariate Topological Data Analysis

Page 24: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

A Function

For example

−5.0

−2.5

0.0 x

2.50.0

−5.0 −2.5

0.5

0.0 5.02.5y 5.0

1.0

1.5

2.0

Peter Bubenik Multivariate Topological Data Analysis

Page 25: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Sublevel sets and H1

−5.0

−2.5

0.0 x

2.50.0

−5.0 −2.5

0.25

0.0 5.02.5y 5.0

0.5

0.75

1.0

−5.0

−2.5

0.0 x

2.50.0

−5.0

0.25

−2.5 0.0 5.02.5

0.5

y 5.0

0.75

1.0

y

−2.5

−2.5

−5.0

x 0.0

0.0

−5.0

2.5

2.5

5.0

5.0

−5.0

y

5.0

5.0

x

−5.0

2.5

−2.5 0.0 2.5

0.0

−2.5

5.0

−2.5

x

2.5

2.5 5.0

−5.0

−2.5

0.0

−5.0

y

0.0

Peter Bubenik Multivariate Topological Data Analysis

Page 26: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Persistence Diagram of the function

1 2

1

2

birth

death

b

b

Peter Bubenik Multivariate Topological Data Analysis

Page 27: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Bottleneck Distance

We now want to define a metric on the space of PersistenceDiagrams.

Peter Bubenik Multivariate Topological Data Analysis

Page 28: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Bottleneck Distance

We now want to define a metric on the space of PersistenceDiagrams.

Let f , g : M → R be two Morse functions, with associatedPersistence Diagrams Dp(f ) and Dp(g).

Definition

The Bottleneck distance is given by

dB(Dp(f ),Dp(g)) = infη

supx‖x − η(x)‖∞,

where the infimum is taken over all bijections η : Dp(f ) → Dp(g)and the supremum is taken over all points x ∈ Dp(f ).

Peter Bubenik Multivariate Topological Data Analysis

Page 29: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Bottleneck Distance

For example:

1 2 3 4

1

2

3

4

x

y y = f (x)

Peter Bubenik Multivariate Topological Data Analysis

Page 30: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Bottleneck Distance

For example:

1 2 3 4

1

2

3

4

x

y y = f (x)

1 2 3 4

1

2

3

4

birth

death

b

b

b

b

bb

b

Peter Bubenik Multivariate Topological Data Analysis

Page 31: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Stability

The following fundamental result bounds the bottleneck distancefor persistence diagrams with the better-known supremum norm.

Theorem (Cohen-Steiner, Edelsbrunner, Harer)

dB(Dp(f ),Dp(g)) ≤ ‖f − g‖∞

Peter Bubenik Multivariate Topological Data Analysis

Page 32: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Stability

The following fundamental result bounds the bottleneck distancefor persistence diagrams with the better-known supremum norm.

Theorem (Cohen-Steiner, Edelsbrunner, Harer)

dB(Dp(f ),Dp(g)) ≤ ‖f − g‖∞

For us, this result is crucial as it allows us to connect topology tostatistics.

Take f to be an unknown function and f to its statisticalestimator. Then,

dB(Dp(f ),Dp(f )) ≤ ‖f − f ‖∞

Peter Bubenik Multivariate Topological Data Analysis

Page 33: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Nonparametric regression

Let M be a manifold. We would like to be able to solve thefollowing Nonparametric Regression Problem.

Problem

We assume that there exists a function f : M → R such that

y = f (x) + ǫ, x ∈ M

where ǫ is a normal random variable with mean zero and variance

σ2 > 0. Given a sample (y1, x1), . . . , (yn, xn), find an estimator f

of f .

We want to find an estimator f that minimizes ‖f − f ‖∞ andcalculate the asymptotics as n → ∞.

Peter Bubenik Multivariate Topological Data Analysis

Page 34: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

The precise setup

Let M be a compact, connected m-dimensional Riemannianmanifold with Riemannian metric ρ(·, ·) (given by geodesicdistance) and volume element dω.

Peter Bubenik Multivariate Topological Data Analysis

Page 35: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

The precise setup

Let M be a compact, connected m-dimensional Riemannianmanifold with Riemannian metric ρ(·, ·) (given by geodesicdistance) and volume element dω.

As the parameter space we take the Holder class of functions on M

Λ(β,L) ={

f : M → R | |f (x) − f (y)| ≤ Lρ(x , y)β , x , y ∈ M

}

,

where 0 < β ≤ 1.

Peter Bubenik Multivariate Topological Data Analysis

Page 36: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Expected loss (risk)

Definition

The expected loss (risk) of an estimator fε is given by

Ef ‖fε − f ‖∞.

Peter Bubenik Multivariate Topological Data Analysis

Page 37: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

A lower bound

Theorem (B-C-C-K-L)

For the regression model,

supf ∈Λ(β,L)

Ef ‖f − f ‖∞ ≥ Cψn

as n → ∞, where ψn =

(

log(n)

n

)β/(2β+d)

and

C = Ld/(2β+d)(

σ2 vol(M)(β+d)d2

vol(Sd−1)β2

(2β+d)

Peter Bubenik Multivariate Topological Data Analysis

Page 38: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

A upper bound

In fact, this lower bound can be achieved.

Partition M by Ai ⊂ M, i = 1, . . . ,N and letfε(x) =

∑Ni=1 ai IAi

(x).

Peter Bubenik Multivariate Topological Data Analysis

Page 39: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

A upper bound

In fact, this lower bound can be achieved.

Partition M by Ai ⊂ M, i = 1, . . . ,N and letfε(x) =

∑Ni=1 ai IAi

(x).By a suitable choice of Ai , ai , i = 1, . . . ,N (constructive) one has

Theorem (B-C-C-K-L)

Using this estimator,

supf ∈Λ(β,L)

E‖fε − f ‖∞ ≤ Cψε

as ε→ 0.

Peter Bubenik Multivariate Topological Data Analysis

Page 40: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Estimating persistent homology

Corollary

In the white noise model

EdB(Dp(fε),Dp(f )) ≤ Cψε

as ε→ 0.

Peter Bubenik Multivariate Topological Data Analysis

Page 41: Multivariate Topological Data Analysisacademic.csuohio.edu/bubenik_p/talks/duke.pdf · Multivariate Topological Data Analysis Peter Bubenik Cleveland State University November 20,

Summary

Topological features of functions can be described usingtopological persistence.

For nonparametric regression problems, statistical estimatorscan be used to recover the topology of the function.

Peter Bubenik Multivariate Topological Data Analysis