
Tensors and graphical models

Mariya Ishteva with Haesun Park, Le Song

Dept. ELEC, VUB; Georgia Tech, USA

INMA Seminar, May 7, 2013, LLN

Outline

Tensors

Random variables and graphical models

Tractable representations

Structure learning


Tensors

A ∈ R^{M×N×P} (a third-order tensor)

Ranks

• Multilinear rank (R1, R2, R3)

• Rank R

Rank-1 tensor: outer product of vectors, A(i, j, k) = u(i) v(j) w(k)

R = min r   such that   A = ∑_{i=1}^{r} {rank-1 tensor}_i

Matrix representations of tensors

• Mode-1 unfolding: A(1)
• Mode-2 unfolding: A(2)
• Mode-3 unfolding: A(3)
• Multilinear rank: (rank(A(1)), rank(A(2)), rank(A(3))); see the sketch below
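As a concrete illustration (not from the slides; the sizes and the column-major unfolding convention are assumptions), the three unfoldings and the multilinear rank of a small tensor can be computed in MATLAB with permute and reshape:

% Mode-n unfoldings of a 3-way tensor (illustrative sketch)
M = 3; N = 4; P = 5;
A = randn(M, N, P);

A1 = reshape(A, M, N*P);                    % mode-1 unfolding A(1), size M x (N*P)
A2 = reshape(permute(A, [2 1 3]), N, M*P);  % mode-2 unfolding A(2), size N x (M*P)
A3 = reshape(permute(A, [3 1 2]), P, M*N);  % mode-3 unfolding A(3), size P x (M*N)

mlrank = [rank(A1), rank(A2), rank(A3)]     % multilinear rank (R1, R2, R3)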

Tensor-matrix multiplication

• Tensor-matrix product

• Contraction: for A ∈ R^{I×J×M} and B ∈ R^{K×L×M},

  C = 〈A, B〉_3,   C(i, j, k, l) = ∑_{m=1}^{M} a_{ijm} b_{klm},

  a 4th-order tensor C ∈ R^{I×J×K×L} (sketch below)
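A minimal sketch of this contraction (sizes are assumptions), using the fact that summing over the shared mode is an ordinary matrix product of two unfoldings:

% Contraction <A, B>_3 over the shared third mode (illustrative sketch)
I = 2; J = 3; K = 4; L = 5; M = 6;
A = randn(I, J, M);
B = randn(K, L, M);

% C(i,j,k,l) = sum_m A(i,j,m) * B(k,l,m), a 4th-order tensor of size I x J x K x L
C = reshape(reshape(A, I*J, M) * reshape(B, K*L, M).', [I, J, K, L]);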

Basic decompositions

Singular value decomposition (SVD)

MLSVD / HOSVD

CP / CANDECOMP / PARAFAC
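The slide only names these decompositions; as a rough MLSVD/HOSVD sketch (an illustration with assumed sizes, not the speaker's implementation), the factor matrices are left singular vectors of the three unfoldings and the core follows by multiplying the tensor with their transposes in each mode:

% MLSVD / HOSVD of a 3-way tensor via SVDs of its unfoldings (no truncation)
A = randn(4, 5, 6);
[I, J, K] = size(A);
[U1, ~, ~] = svd(reshape(A, I, J*K), 'econ');
[U2, ~, ~] = svd(reshape(permute(A, [2 1 3]), J, I*K), 'econ');
[U3, ~, ~] = svd(reshape(permute(A, [3 1 2]), K, I*J), 'econ');

% Core tensor S = A x_1 U1' x_2 U2' x_3 U3', computed mode by mode via unfoldings
S = reshape(U1' * reshape(A, I, J*K), [I, J, K]);
S = permute(reshape(U2' * reshape(permute(S, [2 1 3]), J, I*K), [J, I, K]), [2 1 3]);
S = permute(reshape(U3' * reshape(permute(S, [3 1 2]), K, I*J), [K, I, J]), [2 3 1]);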


Outline

Tensors

Random variables and graphical models

Tractable representations

Structure learning


Discrete random variables

• Random variable X with values 1, . . . , n and probabilities Px(1), . . . , Px(n); Px ∈ R^n, entries in [0, 1] (Px ∈ R^n_+)

• X1, X2; P(X1, X2), P12 ∈ R^{n×n}

           x2 = 1       · · ·    x2 = n
x1 = 1   P12(1, 1)    · · ·    P12(1, n)
  ...        ...                   ...
x1 = n   P12(n, 1)    · · ·    P12(n, n)

• P(x1, x2) := P(X1 = x1,X2 = x2)


2 random variables

X1, X2; P(X1, X2), P12 ∈ R^{n×n}

X1 ⊥ X2

P(x1, x2) = P(x1) P(x2)  →  rank-1 matrix

[Figure: latent variable H with observed children X1 and X2]

P(x1, x2) = ∑_h P(x1|h) P(x2|h) P(h)  →  low-rank matrix (rank k < n)

Conditional probability tables (CPTs) P(X1|H),P(X2|H)
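A small numerical sketch of this factorization (the state counts and variable names below are assumptions, not from the slides): the joint matrix equals P1|H · diag(P(h)) · P2|H^⊤ and therefore has rank at most k.

% Latent variable model with two observed variables -> rank-k joint matrix
n = 10; k = 3;                                % observed / hidden state counts (assumed)
Ph  = rand(k, 1);  Ph  = Ph / sum(Ph);        % P(h)
P1H = rand(n, k);  P1H = P1H ./ sum(P1H, 1);  % columns: P(X1 | H = h)
P2H = rand(n, k);  P2H = P2H ./ sum(P2H, 1);  % columns: P(X2 | H = h)

P12 = P1H * diag(Ph) * P2H';                  % P(x1, x2) = sum_h P(x1|h) P(x2|h) P(h)
rank(P12)                                     % at most k (here 3), while the matrix is 10 x 10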


3 random variables

X1, X2, X3; P(X1, X2, X3), P123 ∈ R^{n×n×n}

X1,X2,X3 independent

P(x1, x2, x3) = P(x1) P(x2) P(x3)  →  rank-1 tensor

[Figure: latent variable H with observed children X1, X2 and X3]

P(x1, x2, x3) = ∑_h P(x1|h) P(x2|h) P(x3|h) P(h)  →  rank-k tensor, k < n
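The same construction in the 3-variable case (again an illustrative sketch with assumed sizes) builds the joint as a sum of k rank-1 tensors, i.e. a CP decomposition with nonnegative factors:

% Latent variable model with three observed variables -> rank-k joint tensor
n = 10; k = 3;                                % assumed state counts
Ph  = rand(k, 1);  Ph  = Ph / sum(Ph);        % P(h)
P1H = rand(n, k);  P1H = P1H ./ sum(P1H, 1);  % P(X1 | H)
P2H = rand(n, k);  P2H = P2H ./ sum(P2H, 1);  % P(X2 | H)
P3H = rand(n, k);  P3H = P3H ./ sum(P3H, 1);  % P(X3 | H)

P123 = zeros(n, n, n);
for h = 1:k
    % weighted rank-1 term P(h) * P(.|h) o P(.|h) o P(.|h)
    T = reshape(kron(P3H(:, h), kron(P2H(:, h), P1H(:, h))), [n, n, n]);
    P123 = P123 + Ph(h) * T;
end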

4 random variables

• X1, X2, X3, X4; P(X1, X2, X3, X4), P1234 ∈ R^{n×n×n×n}

• X1,X2,X3,X4 independent

• [Figure: latent variable H with observed children X1, X2, X3 and X4]

  P(x1, x2, x3, x4) = ∑_h P(x1|h) P(x2|h) P(x3|h) P(x4|h) P(h)

• more variables

• more hidden variables


Challenges

• 10 variables, 10 states each −→ 10^10 entries

• We need tractable representations
• Latent variable models / low-rank factors
• # parameters: exponential −→ polynomial (rough count below)
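For a rough sense of the gap (the exact count depends on the chosen structure, so this is only an illustration): the full joint table over 10 variables with 10 states each has 10^10 entries, whereas a latent tree whose hidden variables also take 10 states needs only a 10 × 10 conditional probability table per edge, and a tree on 10 leaves has on the order of 20 edges, i.e. roughly 2000 parameters instead of 10 billion.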

[Figure: latent tree model, hidden variables connecting the observed variables]

• Challenges:
  • Choose a good representation ✓
  • Learn the correct structure ✓
  • Estimate the parameters ✗


Outline

Tensors

Random variables and graphical models

Tractable representations

Structure learning


Tensors and graphical models

• CP / CANDECOMP / PARAFAC  ↔  a single hidden variable H with observed children X1, X2, . . . , Xn
• Tensor train  ↔  hidden Markov model (HMM): hidden chain H1, H2, H3, . . . , Hn with observations X1, X2, X3, . . . , Xn
• Hierarchical Tucker  ↔  latent tree model
• Tucker / MLSVD, block term decomposition  ↔  ✗ (no graphical-model counterpart shown)

Tensor train (TT) decomposition

A(i1, . . . , id) = ∑_{α0, . . . , αd} G1(α0, i1, α1) G2(α1, i2, α2) · · · Gd(αd−1, id, αd)

[I. V. Oseledets, SIAM J. Scientific Computing, 2011]

• Avoids the curse of dimensionality
• Small number of parameters compared to the Tucker model
• Slightly more parameters than CP, but more stable
• Gk(αk−1, ik, αk) has dimensions rk−1 × nk × rk, with r0 = rd = 1
• rk are called compression ranks:
  Ak = A(i1, . . . , ik ; ik+1, . . . , id), rank(Ak) = rk
• Computation based on SVD (sketch below)
• Computation: top → bottom

[Figure: tensor-train graphical model, a hidden chain H1, H2, H3, . . . , Hn with observed variables X1, X2, X3, . . . , Xn]
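To make the "computation based on SVD" bullet concrete, here is a rough TT-SVD sketch in the spirit of Oseledets' algorithm (the sizes, the rank-selection rule and the variable names are assumptions):

% TT-SVD sketch: sequential SVDs of reshaped remainders (illustrative)
n = [4 5 6 7];                     % mode sizes n1..nd (assumed)
A = randn(n);                      % tensor to decompose
d = numel(n);
G = cell(1, d);                    % TT cores G_k of size r_{k-1} x n_k x r_k
C = reshape(A, n(1), []);          % first mode vs the rest
r = 1;                             % r0 = 1
for k = 1:d-1
    C = reshape(C, r * n(k), []);  % group (alpha_{k-1}, i_k) against the remaining modes
    [U, S, V] = svd(C, 'econ');
    rk = rank(S);                  % compression rank (could also truncate by a tolerance)
    U = U(:, 1:rk);  S = S(1:rk, 1:rk);  V = V(:, 1:rk);
    G{k} = reshape(U, r, n(k), rk);
    C = S * V';                    % carry the remainder to the next mode
    r = rk;
end
G{d} = reshape(C, r, n(d), 1);     % last core, rd = 1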


Hierarchical Tucker decomposition

[L. Grasedyck, SIMAX, 2010]

• Similar properties to the TT decomposition
• Computation: bottom → top

[Figure: hierarchical Tucker graphical model, a tree of hidden variables with the observed variables as leaves]

Potential advantages of tensor approach

• Real data are often multi-way

• Provides higher-level view

• Flexibility: different ranks in each mode: Tucker

• Uniqueness: CP, Block term decomposition

• No curse of dimensionality: Tensor train, hierarch. Tucker


Outline

Tensors

Random variables and graphical models

Tractable representations

Structure learning


Structure learning

• Given: (samples of) observed variables

• Assumption: the variables can be connected via hidden variables in a tree structure in a meaningful way

• Find: the tree / the relationships between the variables

• Additional difficulty: unknown number of hidden states

[Figure: candidate latent trees connecting the observed variables (X1, X2, X3, X5, . . .) through hidden nodes; which tree is the right one?]

Quartet relationships: topologies

[Figure: the three possible quartet topologies over X1, X2, X3, X4, each with two hidden nodes H and G: {X1, X2 | X3, X4}, {X1, X3 | X2, X4} and {X1, X4 | X2, X3}]

P(x1, x2, x3, x4) = ∑_{h,g} P(x1|h) P(x2|h) P(h, g) P(x3|g) P(x4|g)

Building trees based on quartet relationships

Choose 3 variables and form a tree

Add all other variables, one by one

• Split the current tree into 3 subtrees
• Choose 3 variables from different subtrees
• Resolve the quartet relation with the current and chosen variables
• Insert the current variable in a subtree or connect it to the tree

[For simplicity, assume each latent variable has 3 neighbors]


Tensor view of quartets

[Figure: quartet topology {X1, X2 | X3, X4} with hidden nodes H and G]

P(X1, X2, X3, X4) = [tensor-network figure: P1|H and P2|H attached to a diagonal tensor IH, P3|G and P4|G attached to a diagonal tensor IG, with IH and IG linked through PHG]

% Matrix unfoldings of the n x n x n x n joint probability table P:
A = reshape(P, n^2, n^2);                        % groups (X1, X2) vs (X3, X4)
B = reshape(permute(P, [1, 3, 2, 4]), n^2, n^2); % groups (X1, X3) vs (X2, X4)
C = reshape(permute(P, [1, 4, 2, 3]), n^2, n^2); % groups (X1, X4) vs (X2, X3)

Notation: P1|H , P2|H , etc. stand for P(X1|H), P(X2|H), etc.


Rank properties of matrix representations

A = (P2|H ⊙ P1|H) PHG (P4|G ⊙ P3|G)^⊤

B = (P3|G ⊗ P1|H) diag(PHG(:)) (P4|G ⊗ P2|H)^⊤

(⊙: Khatri–Rao, i.e. columnwise Kronecker, product; ⊗: Kronecker product)

• rank(A) = rank(PHG) = k,   rank(B) = rank(C) = nnz(PHG)

  rank(A) ≪ rank(B) = rank(C)

• Sampling noise → nuclear norm relaxation:

  ‖A‖∗ = ∑_{i=1}^{n^2} σi(A)

Resolving quartet relations

Algorithm 1 i∗ = Quartet(X1, X2, X3, X4)

1: Estimate P(X1, X2, X3, X4) from a set of m i.i.d. samples.
2: Unfold P into matrices A, B and C, and compute a1 = ‖A‖∗, a2 = ‖B‖∗ and a3 = ‖C‖∗.
3: Return i∗ = arg min_{i∈{1,2,3}} ai.

• Easy to compute

• Recovery conditions

• Finite sample guarantees

• Agnostic to the number of hidden states

• Compares favorably to alternatives
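As an illustrative sanity check of this quartet test (all sizes, variable names and the random CPTs below are assumptions, and the exact joint is used in place of samples), one can build a quartet with topology {X1, X2 | X3, X4} and verify that the corresponding unfolding tends to have the smallest nuclear norm:

% Synthetic quartet with topology {X1, X2 | X3, X4} (illustrative sketch)
n = 5; k = 2;                                   % observed / hidden state counts (assumed)
normcols = @(M) M ./ sum(M, 1);                 % normalize columns to sum to 1
P1H = normcols(rand(n, k));  P2H = normcols(rand(n, k));   % P(X1|H), P(X2|H)
P3G = normcols(rand(n, k));  P4G = normcols(rand(n, k));   % P(X3|G), P(X4|G)
PHG = rand(k, k);  PHG = PHG / sum(PHG(:));     % joint P(H, G)

% Exact joint P(x1, x2, x3, x4) for this topology
P = zeros(n, n, n, n);
for h = 1:k
  for g = 1:k
    T = reshape(kron(P4G(:, g), kron(P3G(:, g), kron(P2H(:, h), P1H(:, h)))), [n n n n]);
    P = P + PHG(h, g) * T;
  end
end

% The three unfoldings from the previous slide and their nuclear norms
A = reshape(P, n^2, n^2);
B = reshape(permute(P, [1, 3, 2, 4]), n^2, n^2);
C = reshape(permute(P, [1, 4, 2, 3]), n^2, n^2);
nuc = @(M) sum(svd(M));
[~, istar] = min([nuc(A), nuc(B), nuc(C)])      % expected: istar = 1 under the recovery conditions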


Example: stock data

Given: stock prices (25 years, discretized into 10 values)

Find: relations between stocks

Finance:
• C (Citigroup)
• JPM (JPMorgan Chase)
• AXP (American Express)
• F (Ford Motor: Automotive and Financial Services)

Retailers:
• TGT (Target)
• WMT (WalMart)
• RSH (RadioShack)

Conclusions

• Tensor decompositions are related to graphical models

• A common goal: tractable representations

• Tensors can be used for structure learning


Thank you!

mariya.ishteva@vub.ac.be
