BOOLEAN TENSOR FACTORIZATIONS
Pauli Miettinen 14 December 2011
BACKGROUND: TENSORS AND TENSOR FACTORIZATIONS
[Figure: a three-way tensor X approximated by factor matrices A, B, and C.]
BACKGROUND: BOOLEAN MATRIX FACTORIZATIONS
• Given a binary matrix X and a positive integer R, find two binary matrices A and B such that A has R columns, B has R rows, and X ≈ A ∘ B.
• A ∘ B is the Boolean matrix product of A and B,
$(A \circ B)_{ij} = \bigvee_{r=1}^{R} a_{ir} b_{rj}$
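As an illustration of the definition above (the example matrices are made up), the Boolean product can be computed in NumPy by taking the ordinary integer product and clamping the sums at 1, which is exactly the 1 + 1 = 1 rule:

```python
import numpy as np

# Boolean matrix product: (A ∘ B)_ij = OR over r of (a_ir AND b_rj).
# Hypothetical example matrices, for illustration only.
A = np.array([[1, 0],
              [1, 1],
              [0, 1]])          # 3-by-2: R = 2 columns
B = np.array([[1, 1, 0, 0],
              [0, 1, 1, 0]])    # 2-by-4: R = 2 rows

# The integer product counts how many rank-1 terms cover each cell;
# thresholding at 1 enforces the Boolean rule 1 + 1 = 1.
X = ((A @ B) > 0).astype(int)
print(X)
```

Each 1 in X is covered by at least one column of A paired with the corresponding row of B.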
BOOLEAN TENSOR FACTORIZATIONS: THE IDEA
1. Take an existing (normal) tensor factorization.
2. Make everything binary and define summation as 1 + 1 = 1.
3. Try to understand what you just did.
Research problem. What can we say about Boolean tensor factorizations and how do they relate to normal tensor factorizations and Boolean matrix factorizations?
RANK-1 (BOOLEAN) TENSORS
[Figure: a rank-1 matrix X as the outer product of vectors a and b.]
$X = a \times b$
RANK-1 (BOOLEAN) TENSORS
[Figure: a rank-1 3-way tensor X as the outer product of vectors a, b, and c.]
$X = a \times_1 b \times_2 c$
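A rank-1 3-way tensor can be sketched in NumPy as the outer product of three vectors (the vectors below are made-up examples); note that for binary vectors the normal and Boolean constructions coincide, since no summation is involved:

```python
import numpy as np

# Rank-1 3-way tensor: x_ijk = a_i * b_j * c_k.
# Hypothetical binary example vectors.
a = np.array([1, 0, 1])
b = np.array([1, 1])
c = np.array([0, 1])
X = np.einsum('i,j,k->ijk', a, b, c)   # shape (3, 2, 2)
```

The nonzero entries form a combinatorial "box": exactly the cells (i, j, k) with a_i = b_j = c_k = 1.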
THE CP TENSOR DECOMPOSITION
$x_{ijk} \approx \sum_{r=1}^{R} a_{ir} b_{jr} c_{kr}$
[Figure: X approximated as the sum of R rank-1 tensors built from $(a_r, b_r, c_r)$, $r = 1, \dots, R$.]
THE CP TENSOR DECOMPOSITION
[Figure: the same decomposition written with factor matrices A, B, and C.]
$x_{ijk} \approx \sum_{r=1}^{R} a_{ir} b_{jr} c_{kr}$
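The two views of CP shown on these slides can be checked numerically; this sketch (with arbitrary random factors) reconstructs X from the factor matrices and verifies that it equals the sum-of-rank-1 view:

```python
import numpy as np

# CP reconstruction: x_ijk = sum_r a_ir * b_jr * c_kr.
rng = np.random.default_rng(0)
n, m, k, R = 4, 3, 2, 2
A, B, C = rng.random((n, R)), rng.random((m, R)), rng.random((k, R))

# Factor-matrix view of the decomposition ...
X = np.einsum('ir,jr,kr->ijk', A, B, C)
# ... equals the sum of R rank-1 tensors.
X2 = sum(np.einsum('i,j,k->ijk', A[:, r], B[:, r], C[:, r]) for r in range(R))
assert np.allclose(X, X2)
```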
THE BOOLEAN CP TENSOR DECOMPOSITION
[Figure: X approximated as the Boolean sum (∨) of R binary rank-1 tensors built from $(a_r, b_r, c_r)$.]
$x_{ijk} \approx \bigvee_{r=1}^{R} a_{ir} b_{jr} c_{kr}$
THE BOOLEAN CP TENSOR DECOMPOSITION
[Figure: the same decomposition with binary factor matrices A, B, and C combined under the Boolean product ∘.]
$x_{ijk} \approx \bigvee_{r=1}^{R} a_{ir} b_{jr} c_{kr}$
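A minimal sketch of Boolean CP reconstruction, with made-up binary factors: the integer einsum counts how many rank-1 tensors cover each cell, and thresholding at 1 turns the count into the Boolean disjunction:

```python
import numpy as np

# Boolean CP: x_ijk = OR over r of (a_ir AND b_jr AND c_kr).
# Hypothetical binary factor matrices.
A = np.array([[1, 0], [1, 1], [0, 1]])   # 3-by-2
B = np.array([[1, 0], [0, 1]])           # 2-by-2
C = np.array([[1, 1], [0, 1]])           # 2-by-2
X = (np.einsum('ir,jr,kr->ijk', A, B, C) > 0).astype(int)
```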
DIGRESSION: FREQUENT TRI-ITEMSET MINING
• Rank-1 N-way binary tensors define an N-way itemset
• Particularly, rank-1 binary matrices define an itemset
• In itemset mining the induced sub-tensor must be full of 1s
• Here, the itemsets can have holes
• Boolean CP decomposition = lossy N-way tiling
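The itemset connection can be illustrated with a hypothetical rank-1 binary matrix: the outer product a bᵀ is 1 exactly on the rectangle picked out by the supports of a and b, so the induced submatrix is full of 1s, i.e. a tile:

```python
import numpy as np

# Hypothetical rank-1 binary matrix: a tile / 2-way itemset.
a = np.array([1, 0, 1, 0])     # rows (transactions) in the tile
b = np.array([0, 1, 1])        # columns (items) in the tile
tile = np.outer(a, b)
# The submatrix induced by the supports of a and b is all ones.
sub = tile[np.ix_(a.nonzero()[0], b.nonzero()[0])]
assert (sub == 1).all()
```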
TENSOR RANK
The rank of a tensor is the minimum number of rank-1 tensors needed to represent the tensor exactly.
[Figure: X written as the sum of R rank-1 tensors built from $(a_r, b_r, c_r)$.]
BOOLEAN TENSOR RANK
The Boolean rank of a binary tensor is the minimum number of binary rank-1 tensors needed to represent the tensor exactly using Boolean arithmetic.
[Figure: X written as the Boolean sum (∨) of R binary rank-1 tensors.]
SOME RESULTS ON RANKS
• Normal tensor rank is NP-hard to compute
• Normal tensor rank of an n-by-m-by-k tensor can be more than min{n, m, k}
• But no more than min{nm, nk, mk}
• Boolean tensor rank is NP-hard to compute
• Boolean tensor rank of an n-by-m-by-k tensor can be more than min{n, m, k}
• But no more than min{nm, nk, mk}
SPARSITY
• A binary matrix X of Boolean rank R with |X| 1s has a Boolean rank-R decomposition A ∘ B such that |A| + |B| ≤ 2|X| [M., ICDM ’10]
• A binary N-way tensor X of Boolean tensor rank R has a Boolean rank-R CP decomposition with factor matrices $A_1, A_2, \dots, A_N$ such that $\sum_i |A_i| \le N|X|$
• Both results are existential only and extend to approximate decompositions
THE TUCKER TENSOR DECOMPOSITION
[Figure: X approximated by a core tensor G multiplied in each mode by factor matrices A, B, and C.]
$x_{ijk} \approx \sum_{p=1}^{P} \sum_{q=1}^{Q} \sum_{r=1}^{R} g_{pqr} a_{ip} b_{jq} c_{kr}$
THE BOOLEAN TUCKER TENSOR DECOMPOSITION
[Figure: X approximated by a binary core tensor G and binary factor matrices A, B, and C under Boolean arithmetic.]
$x_{ijk} \approx \bigvee_{p=1}^{P} \bigvee_{q=1}^{Q} \bigvee_{r=1}^{R} g_{pqr} a_{ip} b_{jq} c_{kr}$
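Boolean Tucker reconstruction can be sketched the same way as Boolean CP (the sizes, core, and factors below are arbitrary random examples): count the covering core-and-factor combinations with an integer einsum, then threshold at 1:

```python
import numpy as np

# Boolean Tucker: x_ijk = OR over p,q,r of (g_pqr AND a_ip AND b_jq AND c_kr).
# Sizes and contents are arbitrary illustrations.
rng = np.random.default_rng(1)
G = (rng.random((2, 2, 2)) < 0.5).astype(int)   # P-by-Q-by-R binary core
A = (rng.random((4, 2)) < 0.5).astype(int)      # n-by-P
B = (rng.random((3, 2)) < 0.5).astype(int)      # m-by-Q
C = (rng.random((3, 2)) < 0.5).astype(int)      # k-by-R
X = (np.einsum('pqr,ip,jq,kr->ijk', G, A, B, C) > 0).astype(int)
```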
THE ALGORITHMS
• The normal CP decomposition can be solved using matricization and ALS
• ⊙ is the Khatri–Rao matrix product
• $(C \odot B)^T$ is R-by-mk
• For normal matrices, we can use standard least-squares projections
• One projection per mode
• Similar algorithms exist for the Tucker decomposition

$X_{(1)} = A(C \odot B)^T$
$X_{(2)} = B(C \odot A)^T$
$X_{(3)} = C(B \odot A)^T$
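The mode-1 matricization identity can be verified numerically. This sketch implements the Khatri–Rao product by hand (column-wise Kronecker product) and assumes the unfolding convention in which the mode-2 index varies fastest along the columns; other conventions permute the columns:

```python
import numpy as np

def khatri_rao(U, V):
    # Column-wise Kronecker product: column r is kron(U[:, r], V[:, r]).
    return np.stack([np.kron(U[:, r], V[:, r]) for r in range(U.shape[1])],
                    axis=1)

rng = np.random.default_rng(2)
n, m, k, R = 4, 3, 2, 2
A, B, C = rng.random((n, R)), rng.random((m, R)), rng.random((k, R))
X = np.einsum('ir,jr,kr->ijk', A, B, C)         # exact rank-R CP tensor

# Mode-1 unfolding: rows are mode-1 fibers; columns ordered so that
# the mode-2 index j varies fastest.
X1 = X.transpose(0, 2, 1).reshape(n, k * m)
assert np.allclose(X1, A @ khatri_rao(C, B).T)  # X_(1) = A (C ⊙ B)^T
```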
THE ALGORITHMS
• For the Boolean case, the matrix product must be changed
• The Khatri–Rao product stays as it is
• Finding the optimal projection is NP-hard, even to approximate
• Good initial values are needed due to multiple local minima
• They are obtained by applying Boolean matrix factorization to the matricizations

$X_{(1)} = A \circ (C \odot B)^T$
$X_{(2)} = B \circ (C \odot A)^T$
$X_{(3)} = C \circ (B \odot A)^T$
THE TUCKER CASE
• The core tensor has global effects
• Updates are hard
• The core tensor is usually small
• We can afford more time per element
• In the Boolean case many changes make no difference
[Figure: the Tucker decomposition of X with core G and factors A, B, and C, repeated for reference.]
SYNTHETIC EXPERIMENTS
[Figure: two panels. Left: reconstruction error (×10⁴) versus CP rank r ∈ {4, 8, 16, 32, 64} for BCP_ALS, CP_ALSB, CP_ALSF, CP_OPTB, and CP_OPTF. Right: reconstruction error (×10⁴) versus Tucker ranks (4, 4, 4), (8, 4, 4), (8, 8, 8), and (16, 8, 4) for BTucker_ALS, Tucker_ALSB, and Tucker_ALSF.]
REAL-WORLD EXPERIMENTS
[Figure: two panels. Left: reconstruction error versus CP rank r ∈ {5, 10, 15, 30} for BCP_ALS, CP_ALSB, CP_ALSF, CP_OPTB, and CP_OPTF. Right: reconstruction error versus Tucker ranks (5, 5, 5), (10, 10, 10), (15, 15, 15), and (30, 30, 15) for BTucker_ALS, Tucker_ALSB, and Tucker_ALSF.]
CONCLUSIONS
• Boolean tensor decompositions are a bit like normal tensor decompositions
• And a bit like Boolean matrix factorizations
• They generalize other data mining techniques in many ways
• The playing field between Boolean and normal tensor factorizations is more level
Thank You!