Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory:...

61
Mikael Vejdemo-Johansson Introduction to Topological Data Analysis

Transcript of Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory:...

Page 1: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

Mikael Vejdemo-Johansson

Introduction toTopological Data Analysis

Page 2: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

• Data has shape

• Homology: linear algebra measures shape

• Persistence: squinting with mathematics

• Cohomology

• Applications

Outline

Page 3: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

• Data has shape

• Homology: linear algebra measures shape

• Persistence: squinting with mathematics

• Cohomology

• Applications

Outline

Page 4: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

Data has shape

Page 5: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

Data has shape

Page 6: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

Data has shape

Page 7: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

Data has shape

Page 8: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

Data has shape

Page 9: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

• Data has shape

• Homology: linear algebra measures shape

• Persistence: squinting with mathematics

• Cohomology

• Applications

Outline

Page 10: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 11: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 12: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

C0

Page 13: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

C1

Page 14: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

C2

Page 15: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

Linear maps C2 C1

- +

C1 C0

-

∂:

∂:

Page 16: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

Boundary of paths telescope

∂ =

-

-

-

+

+

+

Page 17: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

Boundary of paths telescope

∂ =

-

-

-

+

+

+

Page 18: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

Boundary of paths telescope

∂ =

-

-

+

+

Page 19: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

Boundary of paths telescope

∂ =

-

-

+

+

Page 20: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

Boundary of paths telescope

∂ =

-

Note: if the path z is a cycle, endpoints coincide, so ∂z=0

Page 21: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

• A chain is a linear combination of simplices

• A cycle is an element of ker ∂ Something that looks like a closed path

• A boundary is an element of img ∂ Something that should look like a closed path

• Homology is the quotient vector space ker ∂ / img ∂ Essential (non-obvious) cycles

Definitions

Page 22: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

• Data has shape

• Homology: linear algebra measures shape

• Persistence: squinting with mathematics

• Cohomology

• Applications

Outline

Page 23: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 24: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 25: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 26: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 27: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 28: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 29: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 30: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 31: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 32: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

3x intersection

Page 33: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

3x intersection

Page 34: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

3x intersection

Page 35: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 36: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 37: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 38: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 39: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 40: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

What parameter to pick?

Page 41: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

• Look at all parameters at once

• Scale-independent

• Compressed data summary

• As the parameter grows, complexes include Inclusions create linear maps on homology

• Track homology classes as the parameter growsHomology is born, lives, and dies.

Persistent homology

Page 42: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

• Category theory: Homology is a functor.Sequence of spaces X1 → X2 → X3 → X4.Maps to sequence of vector spaces with induced linear maps:HkX1 → HkX2 → HkX3 → HkX4.

• Module theory (over 𝕜[t] or over an An quiver or…): Sequence of vector spaces and linear maps decomposes into direct sum of interval modules.These are defined by birth index and death index.

Persistent homology

Page 43: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

Persistent barcodesPersistent diagrams

Page 44: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

Persistent barcodesPersistent diagrams

Page 45: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

• Data has shape

• Homology: linear algebra measures shape

• Persistence: squinting with mathematics

• Cohomology

• Applications

Outline

Page 46: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

• Instead of chains Cn, use cochains Cn=Hom(Cn,𝕜)

• Instead of boundary ∂ use coboundary δ=∂T

• Cocycles: follow a generalized path invariance:f(x,y) + f(y,z) = f(x,z) for any paths x→y, x→z, y→z

• Coboundaries: path invariance follows from a generalized potential function construction:f = g(y)-g(x)

Cohomology: dualize everything

Page 47: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

• Fun fact:H1(X; ℤ) is bijective with homotopy equivalence classes of functions X → S1

• Construction for [f]∈ H1(X; ℤ):

• Send vertices to 0

• Use f(e) as a wrapping number: wrap e around the circle f(e) times

• All higher simplices work out bc path invariance

1-Cohomology isCircle-valued functions

Page 48: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

• Construction adapts to data: Pretend [f] is instead from H1(X; ℝ).arg ming | f - δg |2 mod 1.0is a smooth function C0 → S1.

• Can compute intrinsic phase variables from geometry of time series

1-Cohomology isCircle-valued functions

Page 49: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence

• Data has shape

• Homology: linear algebra measures shape

• Persistence: squinting with mathematics

• Cohomology

• Applications

Outline

Page 50: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 51: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 52: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 53: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 54: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 55: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 56: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 57: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 58: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 59: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 60: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence
Page 61: Introduction to Topological Data Analysismvj/GC-BigData... · 4/18/2019  · •Category theory: Homology is a functor. Sequence of spaces X1 → X2 → X3 → X4. Maps to sequence