The Persistent Homology of Distance Functions under Random Projection

69
The Persistent Homology of Distance Functions under Random Projection Don Sheehy University of Connecticut

Transcript of The Persistent Homology of Distance Functions under Random Projection

Page 1: The Persistent Homology of Distance Functions under Random Projection

The Persistent Homology of Distance Functions

under Random Projection

Don Sheehy University of Connecticut

Page 2: The Persistent Homology of Distance Functions under Random Projection

Unions of Balls

Page 3: The Persistent Homology of Distance Functions under Random Projection

Unions of Balls

Finite Point Set

Page 4: The Persistent Homology of Distance Functions under Random Projection

Unions of Balls

Finite Point Set Union of Balls

Page 5: The Persistent Homology of Distance Functions under Random Projection

Unions of Balls

Finite Point Set Union of Balls

Topologically uninteresting Potentially Interesting

Page 6: The Persistent Homology of Distance Functions under Random Projection

Unions of Balls

Finite Point Set Union of Balls

Topologically uninteresting Potentially Interesting

Idea: Fill in the gaps in the ambient space. Examples: Molecules and Manifolds

Page 7: The Persistent Homology of Distance Functions under Random Projection

Unions of balls are sublevels of the distance.

Page 8: The Persistent Homology of Distance Functions under Random Projection

Unions of balls are sublevels of the distance.

P ⇢ RdInput:

Page 9: The Persistent Homology of Distance Functions under Random Projection

Unions of balls are sublevels of the distance.

P

↵ =[

p2P

ball(p,↵) = {x 2 Rd | d(x, P ) ↵}

P ⇢ RdInput:

Page 10: The Persistent Homology of Distance Functions under Random Projection

Unions of balls are sublevels of the distance.

P

↵ =[

p2P

ball(p,↵) = {x 2 Rd | d(x, P ) ↵}

Persistent Homology was invented to track changesin the homology of P↵ as ↵ ranges from 0 to 1.

P ⇢ RdInput:

Page 11: The Persistent Homology of Distance Functions under Random Projection

Unions of balls are sublevels of the distance.

P

↵ =[

p2P

ball(p,↵) = {x 2 Rd | d(x, P ) ↵}

Persistent Homology was invented to track changesin the homology of P↵ as ↵ ranges from 0 to 1.

Pers({P↵})

P ⇢ RdInput:

Page 12: The Persistent Homology of Distance Functions under Random Projection

Unions of balls are sublevels of the distance.

P

↵ =[

p2P

ball(p,↵) = {x 2 Rd | d(x, P ) ↵}

Persistent Homology was invented to track changesin the homology of P↵ as ↵ ranges from 0 to 1.

Pers({P↵})

P ⇢ RdInput:

Page 13: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

Page 14: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

For � ✓ P ,

rad(�) = radius of the min. encl. ball of �.diam(�) = max

p,q2Pkp� qk2.

Page 15: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

For � ✓ P ,

rad(�) = radius of the min. encl. ball of �.diam(�) = max

p,q2Pkp� qk2.

ˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}

Page 16: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

For � ✓ P ,

rad(�) = radius of the min. encl. ball of �.diam(�) = max

p,q2Pkp� qk2.

ˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Page 17: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

For � ✓ P ,

rad(�) = radius of the min. encl. ball of �.diam(�) = max

p,q2Pkp� qk2.

ˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}

Rips Complex: RP (↵) = {� ✓ P | diam(�) ↵}

ˇ

Cech Filtration: {CP (↵)}↵�0

Page 18: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

For � ✓ P ,

rad(�) = radius of the min. encl. ball of �.diam(�) = max

p,q2Pkp� qk2.

ˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}

Rips Complex: RP (↵) = {� ✓ P | diam(�) ↵}

ˇ

Cech Filtration: {CP (↵)}↵�0

Rips Filtration: {RP (↵)}↵�0

Page 19: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

For � ✓ P ,

rad(�) = radius of the min. encl. ball of �.diam(�) = max

p,q2Pkp� qk2.

ˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}

Rips Complex: RP (↵) = {� ✓ P | diam(�) ↵}

ˇ

Cech Filtration: {CP (↵)}↵�0

Rips Filtration: {RP (↵)}↵�0

CP (↵) ✓ RP (↵) ✓ CP (p2↵)

Page 20: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

For � ✓ P ,

rad(�) = radius of the min. encl. ball of �.diam(�) = max

p,q2Pkp� qk2.

ˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}

Rips Complex: RP (↵) = {� ✓ P | diam(�) ↵}

ˇ

Cech Filtration: {CP (↵)}↵�0

Rips Filtration: {RP (↵)}↵�0

CP (↵) ✓ RP (↵) ✓ CP (p2↵)

Pers({RP (↵)}) is ap2-approximation to Pers({CP (↵)}).

Page 21: The Persistent Homology of Distance Functions under Random Projection

Representing sublevels of distances

Page 22: The Persistent Homology of Distance Functions under Random Projection

Representing sublevels of distances

ˇ

Cech Complex: size O(nd+1).

↵-complex (a.k.a. Delaunay Filtration): size O(ndd/2e).

Quality Meshes: size 2

(d2)n.(Sparse

ˇ

Cech Complex: 2

(d2)n).*

Page 23: The Persistent Homology of Distance Functions under Random Projection

Representing sublevels of distances

ˇ

Cech Complex: size O(nd+1).

↵-complex (a.k.a. Delaunay Filtration): size O(ndd/2e).

Quality Meshes: size 2

(d2)n.(Sparse

ˇ

Cech Complex: 2

(d2)n).*

Page 24: The Persistent Homology of Distance Functions under Random Projection

Representing sublevels of distances

ˇ

Cech Complex: size O(nd+1).

↵-complex (a.k.a. Delaunay Filtration): size O(ndd/2e).

Quality Meshes: size 2

(d2)n.(Sparse

ˇ

Cech Complex: 2

(d2)n).*

Key Point: Ambient Dimension Matters!

Page 25: The Persistent Homology of Distance Functions under Random Projection

Johnson Lindenstrauss Projection

Page 26: The Persistent Homology of Distance Functions under Random Projection

Johnson Lindenstrauss Projection

Idea: Project to lower dimensions. Preserve pairwise distances.

Page 27: The Persistent Homology of Distance Functions under Random Projection

Johnson Lindenstrauss Projection

Idea: Project to lower dimensions. Preserve pairwise distances.

Let f : RD ! Rdbe a linear map where d = O(log n/"2) such that:

Page 28: The Persistent Homology of Distance Functions under Random Projection

Johnson Lindenstrauss Projection

Idea: Project to lower dimensions. Preserve pairwise distances.

(1� ")ka� bk2 kf(a)� f(b)k2 (1 + ")ka� bk2Squared distances preserved up to multiplicative factor.1

Let f : RD ! Rdbe a linear map where d = O(log n/"2) such that:

Page 29: The Persistent Homology of Distance Functions under Random Projection

Johnson Lindenstrauss Projection

Idea: Project to lower dimensions. Preserve pairwise distances.

(1� ")ka� bk2 kf(a)� f(b)k2 (1 + ")ka� bk2Squared distances preserved up to multiplicative factor.1

|(b� a)>(c� a)� (f(b)� f(a))>(f(b)� f(a))| "kb� akkc� ak.Inner products preserved up to additive factor.2

Let f : RD ! Rdbe a linear map where d = O(log n/"2) such that:

Page 30: The Persistent Homology of Distance Functions under Random Projection

Johnson Lindenstrauss Projection

Idea: Project to lower dimensions. Preserve pairwise distances.

a

b

c f(c)

f(b)

f(a)

(1� ")ka� bk2 kf(a)� f(b)k2 (1 + ")ka� bk2Squared distances preserved up to multiplicative factor.1

|(b� a)>(c� a)� (f(b)� f(a))>(f(b)� f(a))| "kb� akkc� ak.Inner products preserved up to additive factor.2

Let f : RD ! Rdbe a linear map where d = O(log n/"2) such that:

Page 31: The Persistent Homology of Distance Functions under Random Projection

Can we use JL for P.H. of distances?

Page 32: The Persistent Homology of Distance Functions under Random Projection

Can we use JL for P.H. of distances?

Yes, for Rips filtrations, but not a tight approximation.

Page 33: The Persistent Homology of Distance Functions under Random Projection

Can we use JL for P.H. of distances?

Yes, for Rips filtrations, but not a tight approximation.Distance function itself is not preserved.

Page 34: The Persistent Homology of Distance Functions under Random Projection

Can we use JL for P.H. of distances?

Yes, for Rips filtrations, but not a tight approximation.Distance function itself is not preserved.Pairwise distances in sublevels are not preserved.

Page 35: The Persistent Homology of Distance Functions under Random Projection

Can we use JL for P.H. of distances?

Yes, for Rips filtrations, but not a tight approximation.Distance function itself is not preserved.Pairwise distances in sublevels are not preserved.Is topology preserved? Maybe yes, maybe no.

Page 36: The Persistent Homology of Distance Functions under Random Projection

Can we use JL for P.H. of distances?

Yes, for Rips filtrations, but not a tight approximation.Distance function itself is not preserved.Pairwise distances in sublevels are not preserved.Is topology preserved? Maybe yes, maybe no.Is persistent homology preserved? YES.

Page 37: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximation

Page 38: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximationˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Page 39: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximationˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Let P ⇢ RDand let f be any map from RD

to Rd.

Page 40: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximationˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Let P ⇢ RDand let f be any map from RD

to Rd.

Idea: If f “preserves M.E.B. radii”, then it preserves

the persistent homology of the distance function.

Page 41: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximationˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Let P ⇢ RDand let f be any map from RD

to Rd.

Idea: If f “preserves M.E.B. radii”, then it preserves

the persistent homology of the distance function.

Page 42: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximationˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Let P ⇢ RDand let f be any map from RD

to Rd.

Idea: If f “preserves M.E.B. radii”, then it preserves

the persistent homology of the distance function.

For S ✓ P , (1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Page 43: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximationˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Let P ⇢ RDand let f be any map from RD

to Rd.

Idea: If f “preserves M.E.B. radii”, then it preserves

the persistent homology of the distance function.

For S ✓ P , (1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

For all ↵ � 0, CP (p1� 4") ✓ Cf(P )(↵) ✓ CP (

p1� 4")

Page 44: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximationˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Let P ⇢ RDand let f be any map from RD

to Rd.

Idea: If f “preserves M.E.B. radii”, then it preserves

the persistent homology of the distance function.

For S ✓ P , (1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

For all ↵ � 0, CP (p1� 4") ✓ Cf(P )(↵) ✓ CP (

p1� 4")

So, Pers(d(·, f(P ))) is a (1 +O("))-approximation

to Pers(d(·, P )).

Page 45: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projection

Page 46: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projection

Let S = {p1, . . . , pr} and let x 2 conv(S).

Page 47: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projection

x =rX

i=1

�ipi, whererX

i=1

�i = 1.

Let S = {p1, . . . , pr} and let x 2 conv(S).

Page 48: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projection

x =rX

i=1

�ipi, whererX

i=1

�i = 1.

kx� pk2 =

�����

rX

i=1

�i(pi � p)

�����

2

=rX

i=1

rX

j=1

�i�j(pi � p)>(pj � p).For any p 2 S,

Let S = {p1, . . . , pr} and let x 2 conv(S).

Page 49: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projection

��kp� xk2 � kf(p)� f(x)k2�� =

rX

i=1

rX

j=1

�i�j

��(pi � p)>(pj � p)� (f(pi)� f(p))>(f(pj)� f(p))��

rX

i=1

rX

j=1

�i�j"kpi � pkkpj � pk

rX

i=1

rX

j=1

�i�j4" rad(S)2

= 4" rad(S)2.

x =rX

i=1

�ipi, whererX

i=1

�i = 1.

kx� pk2 =

�����

rX

i=1

�i(pi � p)

�����

2

=rX

i=1

rX

j=1

�i�j(pi � p)>(pj � p).For any p 2 S,

Let S = {p1, . . . , pr} and let x 2 conv(S).

Page 50: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projection

Page 51: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Page 52: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Let x = center(S).

Page 53: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Upper Bound:

Let x = center(S).

Page 54: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Upper Bound: rad(f(S))

2 max

p2P(kx� pk2 + 4" rad(S)

2)

max

p2P((1 + 4")rad(S)

2)

= (1 + 4")rad(S)

2.

Let x = center(S).

Page 55: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Upper Bound: rad(f(S))

2 max

p2P(kx� pk2 + 4" rad(S)

2)

max

p2P((1 + 4")rad(S)

2)

= (1 + 4")rad(S)

2.

Lower Bound:

Let x = center(S).

Page 56: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Upper Bound: rad(f(S))

2 max

p2P(kx� pk2 + 4" rad(S)

2)

max

p2P((1 + 4")rad(S)

2)

= (1 + 4")rad(S)

2.

Lower Bound:

Let x = center(S).

Let q 2 S be such that kq � xk = rad(S) andkf(q)� center(f(S))k � kf(q)� f(x)k.

Page 57: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Upper Bound: rad(f(S))

2 max

p2P(kx� pk2 + 4" rad(S)

2)

max

p2P((1 + 4")rad(S)

2)

= (1 + 4")rad(S)

2.

Lower Bound:

Let x = center(S).

Let q 2 S be such that kq � xk = rad(S) andkf(q)� center(f(S))k � kf(q)� f(x)k.

Page 58: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Upper Bound: rad(f(S))

2 max

p2P(kx� pk2 + 4" rad(S)

2)

max

p2P((1 + 4")rad(S)

2)

= (1 + 4")rad(S)

2.

Lower Bound:

Let x = center(S).

Let q 2 S be such that kq � xk = rad(S) andkf(q)� center(f(S))k � kf(q)� f(x)k.

rad(f(S))2 � kf(q)� center(f(S))k2

� kf(q)� f(x)k2

� kq � xk2 � 4" rad(S)2

= (1� 4")rad(S)2.

Page 59: The Persistent Homology of Distance Functions under Random Projection

Extension to k-NN distances.

Page 60: The Persistent Homology of Distance Functions under Random Projection

Extension to k-NN distances.

Page 61: The Persistent Homology of Distance Functions under Random Projection

Extension to k-NN distances.

d

kP (x) = distance from x to k points of P .

Page 62: The Persistent Homology of Distance Functions under Random Projection

Extension to k-NN distances.

d

kP (x) = distance from x to k points of P .

Corollary: If f is an "-JL projection then for all k,Pers(d

kf(P )) is a 1 +O(") approximation to Pers(d

kP ).

Page 63: The Persistent Homology of Distance Functions under Random Projection

Extension to k-NN distances.

d

kP (x) = distance from x to k points of P .

Corollary: If f is an "-JL projection then for all k,Pers(d

kf(P )) is a 1 +O(") approximation to Pers(d

kP ).

Bonus: Also works for weighted points.

Page 64: The Persistent Homology of Distance Functions under Random Projection

Going forward…

Page 65: The Persistent Homology of Distance Functions under Random Projection

Going forward…

• Eliminate inner product condition.

Page 66: The Persistent Homology of Distance Functions under Random Projection

Going forward…

• Eliminate inner product condition.• Eliminate constant factor (4)

Page 67: The Persistent Homology of Distance Functions under Random Projection

Going forward…

• Eliminate inner product condition.• Eliminate constant factor (4)• Eliminate linearity condition.

Page 68: The Persistent Homology of Distance Functions under Random Projection

Going forward…

• Eliminate inner product condition.• Eliminate constant factor (4)• Eliminate linearity condition.• Extend to distances to measures.

Page 69: The Persistent Homology of Distance Functions under Random Projection

Going forward…

• Eliminate inner product condition.• Eliminate constant factor (4)• Eliminate linearity condition.• Extend to distances to measures.

Thank you.