Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf ·...

88
Lecture 10: Discussion and Challenges NSF/CBMS Conference Sayan Mukherjee Departments of Statistical Science, Computer Science, Mathematics Duke University www.stat.duke.edu/sayan May 31, 2016

Transcript of Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf ·...

Page 1: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Lecture 10: Discussion and ChallengesNSF/CBMS Conference

Sayan Mukherjee

Departments of Statistical Science, Computer Science, Mathematics

Duke University

www.stat.duke.edu/⇠sayan

May 31, 2016

Page 2: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Software and computing

Page 3: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

A review paper

Page 4: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Topological Data AnalysisA software survey

Mikael Vejdemo-JohanssonAI Laboratory, Jozef Stefan Institute, Slovenia

Wednesday, March 19, 14

Page 5: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Cubical homologyPixels and voxels

• Cellular homology theoryBuilding blocks are n-cubes

• Admits very efficient matrix processing methods

• Homotopy reduction techniques reduce to matrix traversals

• Well adapted for 2d and 3d images or pixel/voxel clouds

Wednesday, March 19, 14

Page 6: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

ChomP

• Cubical homology — with or without persistence

• GUI, command line interface, and C++ library

• Encodes a wide range of both space and mapping analyses

• Includes a wide range of homotopy-based optimizations

http://chomp.rutgers.edu/Software.html

Wednesday, March 19, 14

Page 7: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

HAP

• Module for the GAP computer algebra system

• Primarily focused on research programming into group cohomology

• Includes support for cubical persistent homology

http://www.gap-system.org/Packages/hap.html

Wednesday, March 19, 14

Page 8: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Plex / jPlex / javaPlex

• Family of software packages developed at Stanford, adapted for use from Matlab

• Implements a range of algorithms — both for constructing complexes and computing their persistent (co)homology

• Current recommended incarnation: javaPlexhttp://javaplex.googlecode.com

Wednesday, March 19, 14

Page 9: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Dionysus

• Library for computational homology

• Contains example applications implementing persistent homology and cohomology, as well as time-varying persistence (vineyards) & low-dimensional optimizations

• Relies on Boost, and optionally on CGAL for low-dimensional optimizations

• Includes a Python interface through Boost::Python

http://www.mrzv.org/software/dionysusWednesday, March 19, 14

Page 10: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

pHat

• Recent released software package and C++ library

• Implements several optimizations to the persistence algorithm

• Does not (currently) construct the complex for you

• (currently) restricted to �2 coefficients

• Some support for SMP parallelization using OpenMP

http://phat.googlecode.com

Wednesday, March 19, 14

Page 11: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Perseus

• Cubical and simplicial complex representation and several different construction methods

• Uses discrete morse theory to speed up computation

http://www.math.rutgers.edu/~vidit/perseus

Wednesday, March 19, 14

Page 12: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

ToMaTo

• C++ library for topological analysis

• Relies on libANN for approximate nearest neighbors

http://geometrica.saclay.inria.fr/data/ToMATo/

Wednesday, March 19, 14

Page 13: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

GAP Persistence

• Persistent homology and complex construction in the GAP computer algebra system

http://www-circa.mcs.st-and.ac.uk/~mik/persistence/

Wednesday, March 19, 14

Page 14: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Python Mapper

• Open source solution

• Developed by Müllner & Babu at Stanford University

• Focused on being a research tool

• Exports graph structure in several formats: GraphViz .dotd3.js JSON graph representation

http://math.stanford.com/~muellner/mapper

Wednesday, March 19, 14

Page 15: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Packages

Page 16: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Computing persistence homology

Given boundary matrix D

Find R = DV ,where V is upper-triangularR is reduced, no columns have lowest nonzeros in the same row.

The reduction is via Gaussian elimination, reduce to Smith Normalform.Rank of R are the number of o↵ diagonal ones.

Page 17: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Computing persistence homology

Given boundary matrix D

Find R = DV ,where V is upper-triangularR is reduced, no columns have lowest nonzeros in the same row.

The reduction is via Gaussian elimination, reduce to Smith Normalform.Rank of R are the number of o↵ diagonal ones.

Page 18: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Computing persistence homology

Given boundary matrix D

Find R = DV ,where V is upper-triangularR is reduced, no columns have lowest nonzeros in the same row.

The reduction is via Gaussian elimination, reduce to Smith Normalform.Rank of R are the number of o↵ diagonal ones.

Page 19: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Computing persistence homology

Page 20: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Reduction

60,

Page 21: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Reduction

61,

Page 22: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Reduction

62,

Page 23: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Reduction

63,

Page 24: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Reduction

64,

Page 25: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Reduction

65,

Page 26: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Adding Geometry

Building a complex - single linkage graph

Cech complexFor balls of radius centered at the points, a -simplex is inthe complex iff there is intersection of the balls.

40,

Page 27: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Adding Geometry

If we only have pairwise information

Vietoris-Rips complexFor balls of radius centered at the points, a -simplex is inthe complex iff there is clique in the Cech graph.

43,

Page 28: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Adding geometry

•  

Page 29: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Adding geometry

A simplex � = {q0

, ..., qk} is weakly witnessed by a point x ifd(qi , x) < d(q, x) for any i 2 [0, k] and q 2 Q \ {qo , ..., qk}

I is strongly witnessed if in additiond(qi , x) = d(qj , x), 8i , j 2 [0, k].

Given a set of points P = {p1

, p2

, ..., pn} 2 IRd and a subsetQ ✓ P ,

I The witness complex W (P ,Q) is the collection of simplifieswith vertices from Q with all subsimplices weakly witnessed bya point in P .

I Can be defined for a general metric space.

Page 30: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Adding geometry

A simplex � = {q0

, ..., qk} is weakly witnessed by a point x ifd(qi , x) < d(q, x) for any i 2 [0, k] and q 2 Q \ {qo , ..., qk}

I is strongly witnessed if in additiond(qi , x) = d(qj , x), 8i , j 2 [0, k].

Given a set of points P = {p1

, p2

, ..., pn} 2 IRd and a subsetQ ✓ P ,

I The witness complex W (P ,Q) is the collection of simplifieswith vertices from Q with all subsimplices weakly witnessed bya point in P .

I Can be defined for a general metric space.

Page 31: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Adding geometry

A simplex � = {q0

, ..., qk} is weakly witnessed by a point x ifd(qi , x) < d(q, x) for any i 2 [0, k] and q 2 Q \ {qo , ..., qk}

I is strongly witnessed if in additiond(qi , x) = d(qj , x), 8i , j 2 [0, k].

Given a set of points P = {p1

, p2

, ..., pn} 2 IRd and a subsetQ ✓ P ,

I The witness complex W (P ,Q) is the collection of simplifieswith vertices from Q with all subsimplices weakly witnessed bya point in P .

I Can be defined for a general metric space.

Page 32: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Adding geometry

Page 33: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Comparison

Page 34: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Multipqrameter persistence

Page 35: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Multiparameter persistence

Page 36: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Multiparameter persistence

Page 37: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Multiparameter persistence

Page 38: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Multiparameter persistence

Page 39: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Multiparameter persistence

Page 40: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Multiparameter persistence

Page 41: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Multiparameter persistence

Page 42: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Multiparameter persistence

Page 43: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Multiparameter persistence

Page 44: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Multiparameter persistence

Page 45: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Challenges

I Certificates for various approximate filtrations

I Distributed computing

I Discrete Morse approaches

I Randomized algorithms

I Multiscale persistence

I Multidimensional persistence

I Sampling and distribution properties of persistence

Page 46: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Challenges

I Certificates for various approximate filtrations

I Distributed computing

I Discrete Morse approaches

I Randomized algorithms

I Multiscale persistence

I Multidimensional persistence

I Sampling and distribution properties of persistence

Page 47: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Challenges

I Certificates for various approximate filtrations

I Distributed computing

I Discrete Morse approaches

I Randomized algorithms

I Multiscale persistence

I Multidimensional persistence

I Sampling and distribution properties of persistence

Page 48: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Challenges

I Certificates for various approximate filtrations

I Distributed computing

I Discrete Morse approaches

I Randomized algorithms

I Multiscale persistence

I Multidimensional persistence

I Sampling and distribution properties of persistence

Page 49: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Challenges

I Certificates for various approximate filtrations

I Distributed computing

I Discrete Morse approaches

I Randomized algorithms

I Multiscale persistence

I Multidimensional persistence

I Sampling and distribution properties of persistence

Page 50: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Challenges

I Certificates for various approximate filtrations

I Distributed computing

I Discrete Morse approaches

I Randomized algorithms

I Multiscale persistence

I Multidimensional persistence

I Sampling and distribution properties of persistence

Page 51: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Challenges

I Certificates for various approximate filtrations

I Distributed computing

I Discrete Morse approaches

I Randomized algorithms

I Multiscale persistence

I Multidimensional persistence

I Sampling and distribution properties of persistence

Page 52: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Inference

Page 53: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigms

Data Filtration Barcodes Interpretation

Paradigm 1: EDA

Data Filtration Barcodes Modeling

Paradigm 2: Modeling

Page 54: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigms

StatisticalTheory ApplicationsProbability

Theory

• Hypothesis  testing  

• Bootstrapping  

• Bayesian  estimation  

• Kalman  filtering  

• E-­‐M  

• Some  idea

• Normal  distribution  

• Central  Limit  Theorem  

• Gaussian  processes  

• Markov  chains  

• Bayes  theorem  

• Developing  ideas

• Test  drugs  effect  

• Noise  filtering  

• Tracking  

• Pattern  recognition  

• Classification  

• Many  ideasTDA

Page 55: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigm 1

Study the shape of data, what is the (multiscale) topology of thedata.

Why and what summaries ?

(1) Extracting �k can provide intuition

(2) Projecting onto IRP2 can provide intuition

(3) Persistence landscapes and diagrams can provide informationpause

(4) Statistical guarantees on these summaries(i) minmax results(ii) confidence/credible intervals(iii) consistency(iv) central limit theorems(v) extreme value theory

Page 56: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigm 1

Study the shape of data, what is the (multiscale) topology of thedata.

Why and what summaries ?

(1) Extracting �k can provide intuition

(2) Projecting onto IRP2 can provide intuition

(3) Persistence landscapes and diagrams can provide informationpause

(4) Statistical guarantees on these summaries(i) minmax results(ii) confidence/credible intervals(iii) consistency(iv) central limit theorems(v) extreme value theory

Page 57: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigm 1

Study the shape of data, what is the (multiscale) topology of thedata.

Why and what summaries ?

(1) Extracting �k can provide intuition

(2) Projecting onto IRP2 can provide intuition

(3) Persistence landscapes and diagrams can provide informationpause

(4) Statistical guarantees on these summaries(i) minmax results(ii) confidence/credible intervals(iii) consistency(iv) central limit theorems(v) extreme value theory

Page 58: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigm 1

Study the shape of data, what is the (multiscale) topology of thedata.

Why and what summaries ?

(1) Extracting �k can provide intuition

(2) Projecting onto IRP2 can provide intuition

(3) Persistence landscapes and diagrams can provide informationpause

(4) Statistical guarantees on these summaries

(i) minmax results(ii) confidence/credible intervals(iii) consistency(iv) central limit theorems(v) extreme value theory

Page 59: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigm 1

Study the shape of data, what is the (multiscale) topology of thedata.

Why and what summaries ?

(1) Extracting �k can provide intuition

(2) Projecting onto IRP2 can provide intuition

(3) Persistence landscapes and diagrams can provide informationpause

(4) Statistical guarantees on these summaries(i) minmax results(ii) confidence/credible intervals(iii) consistency(iv) central limit theorems(v) extreme value theory

Page 60: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigm 2

Summaries as features for downstream analysis.

(1) Machine learning perspective(i) features for classification and regression

(ii) features for dimension reduction(iii) kernel models/kernel engineering(iv) bias variance tradeo↵(v) function approximation questions

(2) Sampling distribution perspective(i) Su�ciency(ii) Pseudolikelihoods and empirical likelihoods(iii) Je↵rey’s conditioning(iv) Distributions of summaries under null models and hypothesis

testing(v) Understanding topological noise

Page 61: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigm 2

Summaries as features for downstream analysis.

(1) Machine learning perspective(i) features for classification and regression(ii) features for dimension reduction

(iii) kernel models/kernel engineering(iv) bias variance tradeo↵(v) function approximation questions

(2) Sampling distribution perspective(i) Su�ciency(ii) Pseudolikelihoods and empirical likelihoods(iii) Je↵rey’s conditioning(iv) Distributions of summaries under null models and hypothesis

testing(v) Understanding topological noise

Page 62: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigm 2

Summaries as features for downstream analysis.

(1) Machine learning perspective(i) features for classification and regression(ii) features for dimension reduction(iii) kernel models/kernel engineering

(iv) bias variance tradeo↵(v) function approximation questions

(2) Sampling distribution perspective(i) Su�ciency(ii) Pseudolikelihoods and empirical likelihoods(iii) Je↵rey’s conditioning(iv) Distributions of summaries under null models and hypothesis

testing(v) Understanding topological noise

Page 63: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigm 2

Summaries as features for downstream analysis.

(1) Machine learning perspective(i) features for classification and regression(ii) features for dimension reduction(iii) kernel models/kernel engineering(iv) bias variance tradeo↵

(v) function approximation questions

(2) Sampling distribution perspective(i) Su�ciency(ii) Pseudolikelihoods and empirical likelihoods(iii) Je↵rey’s conditioning(iv) Distributions of summaries under null models and hypothesis

testing(v) Understanding topological noise

Page 64: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigm 2

Summaries as features for downstream analysis.

(1) Machine learning perspective(i) features for classification and regression(ii) features for dimension reduction(iii) kernel models/kernel engineering(iv) bias variance tradeo↵(v) function approximation questions

(2) Sampling distribution perspective(i) Su�ciency

(ii) Pseudolikelihoods and empirical likelihoods(iii) Je↵rey’s conditioning(iv) Distributions of summaries under null models and hypothesis

testing(v) Understanding topological noise

Page 65: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigm 2

Summaries as features for downstream analysis.

(1) Machine learning perspective(i) features for classification and regression(ii) features for dimension reduction(iii) kernel models/kernel engineering(iv) bias variance tradeo↵(v) function approximation questions

(2) Sampling distribution perspective(i) Su�ciency(ii) Pseudolikelihoods and empirical likelihoods

(iii) Je↵rey’s conditioning(iv) Distributions of summaries under null models and hypothesis

testing(v) Understanding topological noise

Page 66: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigm 2

Summaries as features for downstream analysis.

(1) Machine learning perspective(i) features for classification and regression(ii) features for dimension reduction(iii) kernel models/kernel engineering(iv) bias variance tradeo↵(v) function approximation questions

(2) Sampling distribution perspective(i) Su�ciency(ii) Pseudolikelihoods and empirical likelihoods(iii) Je↵rey’s conditioning

(iv) Distributions of summaries under null models and hypothesistesting

(v) Understanding topological noise

Page 67: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigm 2

Summaries as features for downstream analysis.

(1) Machine learning perspective(i) features for classification and regression(ii) features for dimension reduction(iii) kernel models/kernel engineering(iv) bias variance tradeo↵(v) function approximation questions

(2) Sampling distribution perspective(i) Su�ciency(ii) Pseudolikelihoods and empirical likelihoods(iii) Je↵rey’s conditioning(iv) Distributions of summaries under null models and hypothesis

testing

(v) Understanding topological noise

Page 68: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Paradigm 2

Summaries as features for downstream analysis.

(1) Machine learning perspective(i) features for classification and regression(ii) features for dimension reduction(iii) kernel models/kernel engineering(iv) bias variance tradeo↵(v) function approximation questions

(2) Sampling distribution perspective(i) Su�ciency(ii) Pseudolikelihoods and empirical likelihoods(iii) Je↵rey’s conditioning(iv) Distributions of summaries under null models and hypothesis

testing(v) Understanding topological noise

Page 69: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Where we are

StatisticalTheory ApplicationsProbability

Theory

• Hypothesis  testing  

• Bootstrapping  

• Bayesian  estimation  

• Kalman  filtering  

• E-­‐M  

• Some  idea

• Normal  distribution  

• Central  Limit  Theorem  

• Gaussian  processes  

• Markov  chains  

• Bayes  theorem  

• Developing  ideas

• Test  drugs  effect  

• Noise  filtering  

• Tracking  

• Pattern  recognition  

• Classification  

• Many  ideas

TDA

Page 70: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Open questions

(1) Principled approaches to filtration selection.

(2) Quantification of ✏-su�ciency for di↵erent models/moduloinvariants.

(3) Summaries for graphs

(4) Information geometry for spaces with singularities andstratified spaces

(5) MCMC for models of di↵erent dimensions and algebraicstructures

(6) Signal processing and dictionary learning for shapes

(7) Summaries of complex objects as vector spaces ?

(8) Distribution theory for topological and geometric summaries.

Page 71: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Open questions

(1) Principled approaches to filtration selection.

(2) Quantification of ✏-su�ciency for di↵erent models/moduloinvariants.

(3) Summaries for graphs

(4) Information geometry for spaces with singularities andstratified spaces

(5) MCMC for models of di↵erent dimensions and algebraicstructures

(6) Signal processing and dictionary learning for shapes

(7) Summaries of complex objects as vector spaces ?

(8) Distribution theory for topological and geometric summaries.

Page 72: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Open questions

(1) Principled approaches to filtration selection.

(2) Quantification of ✏-su�ciency for di↵erent models/moduloinvariants.

(3) Summaries for graphs

(4) Information geometry for spaces with singularities andstratified spaces

(5) MCMC for models of di↵erent dimensions and algebraicstructures

(6) Signal processing and dictionary learning for shapes

(7) Summaries of complex objects as vector spaces ?

(8) Distribution theory for topological and geometric summaries.

Page 73: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Open questions

(1) Principled approaches to filtration selection.

(2) Quantification of ✏-su�ciency for di↵erent models/moduloinvariants.

(3) Summaries for graphs

(4) Information geometry for spaces with singularities andstratified spaces

(5) MCMC for models of di↵erent dimensions and algebraicstructures

(6) Signal processing and dictionary learning for shapes

(7) Summaries of complex objects as vector spaces ?

(8) Distribution theory for topological and geometric summaries.

Page 74: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Open questions

(1) Principled approaches to filtration selection.

(2) Quantification of ✏-su�ciency for di↵erent models/moduloinvariants.

(3) Summaries for graphs

(4) Information geometry for spaces with singularities andstratified spaces

(5) MCMC for models of di↵erent dimensions and algebraicstructures

(6) Signal processing and dictionary learning for shapes

(7) Summaries of complex objects as vector spaces ?

(8) Distribution theory for topological and geometric summaries.

Page 75: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Open questions

(1) Principled approaches to filtration selection.

(2) Quantification of ✏-su�ciency for di↵erent models/moduloinvariants.

(3) Summaries for graphs

(4) Information geometry for spaces with singularities andstratified spaces

(5) MCMC for models of di↵erent dimensions and algebraicstructures

(6) Signal processing and dictionary learning for shapes

(7) Summaries of complex objects as vector spaces ?

(8) Distribution theory for topological and geometric summaries.

Page 76: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Open questions

(1) Principled approaches to filtration selection.

(2) Quantification of ✏-su�ciency for di↵erent models/moduloinvariants.

(3) Summaries for graphs

(4) Information geometry for spaces with singularities andstratified spaces

(5) MCMC for models of di↵erent dimensions and algebraicstructures

(6) Signal processing and dictionary learning for shapes

(7) Summaries of complex objects as vector spaces ?

(8) Distribution theory for topological and geometric summaries.

Page 77: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Open questions

(1) Principled approaches to filtration selection.

(2) Quantification of ✏-su�ciency for di↵erent models/moduloinvariants.

(3) Summaries for graphs

(4) Information geometry for spaces with singularities andstratified spaces

(5) MCMC for models of di↵erent dimensions and algebraicstructures

(6) Signal processing and dictionary learning for shapes

(7) Summaries of complex objects as vector spaces ?

(8) Distribution theory for topological and geometric summaries.

Page 78: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Mathematics

Page 79: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Spectral simplicial theory

(1) Cheeger inequalities for middle dimensions

(2) Higher-dimensional version of pageRank

(3) Limits of random walks as Brownian motion of forms

(4) Graph sparsification with L

1

(5) Synchronization and learning maps, and multicommodity flows

(6) SLE on simplicial complexes, loop erased random surfaces

Page 80: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Spectral simplicial theory

(1) Cheeger inequalities for middle dimensions

(2) Higher-dimensional version of pageRank

(3) Limits of random walks as Brownian motion of forms

(4) Graph sparsification with L

1

(5) Synchronization and learning maps, and multicommodity flows

(6) SLE on simplicial complexes, loop erased random surfaces

Page 81: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Spectral simplicial theory

(1) Cheeger inequalities for middle dimensions

(2) Higher-dimensional version of pageRank

(3) Limits of random walks as Brownian motion of forms

(4) Graph sparsification with L

1

(5) Synchronization and learning maps, and multicommodity flows

(6) SLE on simplicial complexes, loop erased random surfaces

Page 82: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Spectral simplicial theory

(1) Cheeger inequalities for middle dimensions

(2) Higher-dimensional version of pageRank

(3) Limits of random walks as Brownian motion of forms

(4) Graph sparsification with L

1

(5) Synchronization and learning maps, and multicommodity flows

(6) SLE on simplicial complexes, loop erased random surfaces

Page 83: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Spectral simplicial theory

(1) Cheeger inequalities for middle dimensions

(2) Higher-dimensional version of pageRank

(3) Limits of random walks as Brownian motion of forms

(4) Graph sparsification with L

1

(5) Synchronization and learning maps, and multicommodity flows

(6) SLE on simplicial complexes, loop erased random surfaces

Page 84: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Spectral simplicial theory

(1) Cheeger inequalities for middle dimensions

(2) Higher-dimensional version of pageRank

(3) Limits of random walks as Brownian motion of forms

(4) Graph sparsification with L

1

(5) Synchronization and learning maps, and multicommodity flows

(6) SLE on simplicial complexes, loop erased random surfaces

Parsons

Page 85: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Stochastic topology

Page 86: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Stochastic topology

Page 87: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are
Page 88: Lecture 10: Discussion and Challenges - Statistical Sciencesayan/CBMS/Mukherjee/lecture10.pdf · Cubical homology Pixels and voxels • Cellular homology theory Building blocks are

Acknowledgements

Many people.

Funding:

I Center for Systems Biology at Duke

I NSF DMS, CCF, IIS

I AFOSR, DARPA

I NIH