PersLay: a Neural Network for persistence diagrams
6th SmartData@Polito Workshop
Jan. 30th, 2020.
DataShape - Inria Saclay, theo.lacombe@inria.fr
Higher Methods in Data Science and ML and related topics.
M. Carriere, F. Chazal, Y. Ike, T. Lacombe, M. Royer, Y. Umeda
Preliminary ideas about TDA
TDA is about:
• Providing topological descriptors of a given object
• In a quantitative way.
A global overview of TDA
TDA can be applied to any type of structured data, e.g. point clouds in R^d, weighted graphs, time series...
The TDA pipeline: persistent homology
Reminder: We want to encode the topology of an object at all scales.
Homology: Encodes the topological features (connected components, loops, holes...).
Simplicial homology: Homology on a computer.
Example: β0 = 1 connected component, β1 = 2 loops, β2 = 1 hole.
The TDA pipeline: persistent homology
All we need is an increasing sequence of topological spaces (a filtration).
Reminder: We want to encode the topology of an object at all scales.
Example: Given a function f : X → R, consider the sublevel sets F_t := f^{-1}((−∞, t]), i.e. x ∈ F_t ⇔ f(x) ≤ t. The family (F_t)_t is the sublevel-set filtration of f.
Example: Given a point cloud X = {x_1, ..., x_n} ⊂ R^d, take f(x) = dist_X(x), the distance to the point cloud.
Persistence: looking at scales.
The TDA pipeline: persistent homology
• Compute, at all scales, the homology of the current topological space.
• Record the births and deaths of topological features (loops, holes, etc.).
• Plot the output as the persistence diagram of the filtration (one axis for births, one for deaths).
[Figure: snapshots of the filtration at t = 0, 1, 1.5, 2, 3, 5, together with the growing diagram in dimension 1 (loops).]
The TDA pipeline: persistent homology
Summary: Input item + filtration f → sequence of increasing topological spaces → persistence diagram.
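The dimension-0 part of this pipeline fits in a few lines: every connected component is born at t = 0 and dies when it merges with another, which a union-find pass over the sorted edges of the Rips filtration records. Below is a minimal self-contained sketch (illustrative only; `h0_diagram` is a hypothetical helper, not the Gudhi implementation):

```python
# Persistence in dimension 0 (connected components) of a point cloud,
# via Kruskal / union-find on the Rips filtration: each component is
# born at t = 0 and dies at the edge length that merges it away.
import numpy as np

def h0_diagram(points):
    pts = np.asarray(points, dtype=float)
    n = len(pts)
    d = np.linalg.norm(pts[:, None] - pts[None, :], axis=-1)
    edges = sorted((d[i, j], i, j) for i in range(n) for j in range(i + 1, n))
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    deaths = []
    for t, i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:            # two components merge: one of them dies at t
            parent[ri] = rj
            deaths.append(t)
    # One component survives forever: an essential point with death = infinity.
    return [(0.0, t) for t in deaths] + [(0.0, float("inf"))]

# Two well-separated pairs: one finite H0 point should die late (~4.9).
dgm = h0_diagram([(0, 0), (0.1, 0), (5, 0), (5.1, 0)])
print(dgm)
```

For higher-dimensional features (loops, cavities) one would use a full persistent-homology library such as Gudhi rather than this toy.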
Persistence Diagrams as topological features
Given (X_1, ..., X_n), consider (µ_1, ..., µ_n) their respective diagrams.
• Can we use the (µ_i)'s as inputs in a ML task?
→ PDs can be compared with a metric d_p (theoretically motivated).
→ PDs are "point clouds" with (possibly) different cardinalities.
Pbm: The space of PDs equipped with d_p is ill-structured: no addition, no scalar multiplication, hard to define a mean/barycenter.
Idea: Embed PDs into linear spaces via a feature map Φ.
Persistence Diagrams as topological features
• Two principal approaches: finite-dimensional vectorizations, or infinite-dimensional ones (kernels). The latter do not scale well on large sets of large diagrams.
Persistence Diagrams as topological features
Persistence Images [Adams et al. 2017]
Pipeline: compute PD → rotate PD → compute persistence surface → discretize.
Persistence surface: ρ_µ(·) = Σ_{p∈µ} w_t(p) · exp(−‖· − p‖² / (2σ²)), where w_t is a weight function, e.g. the piecewise-linear w_t((x, y)) = min(y/t, 1).
Discretize the plane into a grid of pixels: for each pixel P, compute I(P) = ∬_P ρ_µ, then concatenate all the I(P) into a single vector PI(µ) (a 2D image).
Prop: [Adams et al. 2017] (stability)
• ‖PI(µ) − PI(ν)‖_2 ≤ √d · C(w, φ_p) · d_1(µ, ν)
Observations: PI(µ) depends on hyperparameters (grid size) but also on trainable parameters (σ, w_t, ...).
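The persistence-surface construction above can be sketched in a few lines of numpy. `persistence_image` below is a hypothetical helper (not the reference implementation of Adams et al.), and it uses a common simplification: it evaluates ρ_µ at pixel centers instead of integrating over each pixel.

```python
# Minimal sketch of a persistence image: rotate the diagram to
# (birth, persistence) coordinates, weight each point, then sum
# Gaussian bumps evaluated on a pixel grid.
import numpy as np

def persistence_image(diagram, grid=20, sigma=0.1, t=1.0, lims=(0.0, 1.0)):
    """diagram: iterable of (birth, death) pairs -> (grid x grid) image."""
    pts = np.asarray(diagram, dtype=float)
    # Rotate: (birth, death) -> (birth, persistence = death - birth).
    pts = np.c_[pts[:, 0], pts[:, 1] - pts[:, 0]]
    # Piecewise-linear weight in the persistence coordinate: min(y/t, 1).
    w = np.minimum(pts[:, 1] / t, 1.0)
    xs = np.linspace(lims[0], lims[1], grid)
    X, Y = np.meshgrid(xs, xs)
    img = np.zeros((grid, grid))
    for (bx, py), wt in zip(pts, w):
        img += wt * np.exp(-((X - bx) ** 2 + (Y - py) ** 2) / (2 * sigma ** 2))
    return img

img = persistence_image([(0.1, 0.9), (0.2, 0.3)])
print(img.shape)
```

The grid size is the hyperparameter mentioned above, while σ and the weight function are the (potentially trainable) parameters.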
Persistence Diagrams as topological features
Persistence Landscapes [Bubenik 2015]: rotate the PD, compute the rank function, then use the boundaries of the rank function.
Persistence Diagrams as topological features
Observation: These vectorizations are all of the form
Φ(µ) = op{φ_θ(p), p ∈ µ},
where op = Σ, max, top-k, etc., θ is a set of trainable parameters, and φ_θ : R² → R^d.
Why? Idea: a PD is a point cloud µ = {p_1, ..., p_n} ⊂ R². We want a function Φ that is permutation invariant:
Φ(p_1, ..., p_n) = Φ(p_π(1), ..., p_π(n)) for any permutation π ∈ S_n.
Deep Sets [Zaheer et al. 2017]
Question: What are the permutation-invariant functions?
[Zaheer et al. 2017]: Φ is permutation invariant iff there exist ρ, φ such that
Φ({p_1, ..., p_n}) = ρ(Σ_{i=1}^n φ(p_i)),
provided (p_i) ⊂ K compact (or countable).
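The structure ρ(Σ φ(p_i)) is easy to check numerically; the particular φ and ρ below are arbitrary illustrative choices, not learned maps:

```python
# Minimal sketch of the Deep Sets decomposition: any function of the
# form rho(sum_i phi(p_i)) is invariant to reordering the inputs.
import numpy as np

def phi(p):                       # pointwise map R^2 -> R^3
    x, y = p
    return np.array([x, y, x * y])

def rho(v):                       # readout on the summed representation
    return float(np.tanh(v).sum())

def deep_set(points):
    return rho(sum(phi(p) for p in points))

pts = [(0.1, 0.9), (0.2, 0.3), (0.5, 0.7)]
print(deep_set(pts), deep_set(pts[::-1]))
```

In a learned model, φ and ρ would be small neural networks; the sum between them is what enforces permutation invariance.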
Reminder: Neural networks
A NN of depth n ∈ N* is a chain of layers f^(0)_{θ0} : R^{d0} → R^{d1}, ..., f^(n)_{θn} : R^{dn} → R^{dn+1}, mapping an input x_i to a predicted label y_i.
Final predictor: F_θ = f^(n)_{θn} ∘ ··· ∘ f^(0)_{θ0}
Goal: Minimize ℓ(θ) = Σ_i ‖F_θ(x_i) − y_i‖²_2 w.r.t. θ.
Requirement: Know ∇f^(i)_{θi} for each i ⇒ chain rule (backpropagation).
Observation: If f^(0)_{θ0} is permutation invariant, so is the whole NN.
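The objective ℓ(θ) above can be made concrete with the smallest possible network, a single scalar layer F_θ(x) = θ·x, trained by gradient descent on the squared loss. This is an illustrative toy, not part of the talk's experiments:

```python
# Gradient descent on l(theta) = sum_i (theta * x_i - y_i)^2 for a
# depth-1 "network" F_theta(x) = theta * x; the gradient comes from
# the chain rule, exactly as in the multi-layer case.
import numpy as np

xs = np.array([1.0, 2.0, 3.0])
ys = 2.0 * xs                    # labels generated with theta* = 2
theta = 0.0
for _ in range(200):
    grad = np.sum(2 * (theta * xs - ys) * xs)   # d l / d theta
    theta -= 0.01 * grad                        # gradient step
print(round(theta, 3))           # converges to theta* = 2
```

With deeper networks the only change is that each layer's gradient is obtained by composing the per-layer Jacobians (backpropagation).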
Application to persistence diagrams
Permutation-invariant layers generalize several TDA approaches:
→ persistence landscapes → silhouettes → Betti curves
PersLay(µ) = ρ (op{w(p) · φ(p)}_{p∈µ}),
where w is a weight function, φ a point transformation, and op a permutation-invariant operation.
Our contribution: Implementing (and sharing) it.
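A minimal numpy sketch of the PersLay formula with op = Σ; the specific w, φ and ρ below are illustrative choices, not the ones shipped in the official implementation:

```python
# PersLay(mu) = rho(op{w(p) * phi(p)}_{p in mu}) with op = sum:
# weight each diagram point, transform it, aggregate with a
# permutation-invariant op, then apply a readout rho.
import numpy as np

def perslay(diagram, op=np.sum):
    pts = np.asarray(diagram, dtype=float)           # n x 2 point cloud
    pers = pts[:, 1] - pts[:, 0]                     # persistence of each point
    w = np.minimum(pers, 1.0)                        # weight function w(p)
    phi = np.stack([pts[:, 0], pts[:, 1], pers], 1)  # point transformation phi(p)
    v = op(w[:, None] * phi, axis=0)                 # permutation-invariant op
    return np.tanh(v)                                # readout rho

out = perslay([(0.1, 0.9), (0.2, 0.3)])
print(out.shape)
```

Swapping op between np.sum and np.max (or making w and φ small trainable networks) recovers the different vectorizations listed above.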
Application to persistence diagrams
Goal: classify orbits of the linked twisted map (flow dynamics).
Orbits are described by (depending on a parameter r):
x_{n+1} = x_n + r y_n (1 − y_n) mod 1
y_{n+1} = y_n + r x_{n+1} (1 − x_{n+1}) mod 1
[Figure: sample orbits for labels 1 to 5.]
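The orbits can be generated directly from the recurrence above; the value of r and the starting point below are arbitrary illustrative choices (each class label corresponds to a specific value of r, not listed here):

```python
# Orbit of the linked twisted map:
#   x_{n+1} = x_n + r * y_n * (1 - y_n)        mod 1
#   y_{n+1} = y_n + r * x_{n+1} * (1 - x_{n+1}) mod 1
import numpy as np

def orbit(r, x0=0.3, y0=0.4, n=1000):
    xs, ys = [x0], [y0]
    for _ in range(n - 1):
        x = (xs[-1] + r * ys[-1] * (1 - ys[-1])) % 1   # note: uses y_n
        y = (ys[-1] + r * x * (1 - x)) % 1             # note: uses x_{n+1}
        xs.append(x)
        ys.append(y)
    return np.c_[xs, ys]       # n x 2 point cloud in [0, 1)^2

pts = orbit(r=3.5)
print(pts.shape)
```

Each orbit is a point cloud in the unit square, so it can be fed to the persistent-homology pipeline above and its diagram to PersLay for classification.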
Take home messages
• Topological Data Analysis builds a topological summary of a given object: the persistence diagram.
• PDs are not straightforward to use in ML pipelines.
• A workaround is to vectorize PDs, adaptively to a given learning task (e.g. classification).
• The "Deep Sets" architecture can be used to incorporate PDs in NN architectures while encompassing existing vectorizations.

Supplementary material
• https://github.com/MathieuCarriere/perslay → soon integrated into the Gudhi library (hopefully).
• PersLay: A Neural Network Layer for Persistence Diagrams and New Graph Topological Signatures, AISTATS 2020.