Co clustering by-block_value_decomposition

Click here to load reader

download Co clustering by-block_value_decomposition

of 15

Transcript of Co clustering by-block_value_decomposition

  • 1. Author / Bo Long, Zhongfei Zhang and Philip S. Yu
    Source / ACM KDD05, August 21-24, 2005, pp. 635 pp. 640
    Presenter / Allen Wu
    Co-clustering by Block Value Decomposition
    1

2. Outline
Introduction
Block value decomposition
Derivation of the Algorithm
Empirical Evaluation
Conclusion
2
3. Introduction
Dyadic data refer to a domain with two finite sets of objects in which observations are made for dyads.
Co-clustering can effectively deal with the high dimensional and sparse data between rows and columns.
In this paper, a new co-clustering framework, Block Value Decomposition(BVD), had been proposed.
3
4. Introduction (cont.)
This paper develop a specific novel co-clustering algorithm for a special yet very popular case non-negative dyadic data.
The algorithm performs an implicitly adaptive dimensionality reduction, which works well for typical sparse data.
The dyadic data matrix is factorized into three components.
The row-coefficient matrix R
The block value matrix B
The column-coefficient matrix C
4
5. The definition of dyadic data
5
The notion dyadic refers to a domain with two sets of objects X={x1, , xn} and Y={y1, , ym}
The data can be organized as an n by m two-dimensional matrix Z.
Each w(x,y) corresponds to one element of Z.
6. 6


=
nk
k l
lm
nm
k l
7. 7
y1 y2y3y4
x1
x2
x3
x4


C
B
R
y1 y2y3 y4
=
y1 y2y3 y4
x1
x2
x3
x4
x1
x2
x3
x4
RBC
Z
8. Block value decomposition definition
8
Non-negative block value decomposition of a non-negative data matrix Z nm(i.e. ij: Zij 0) is given by the minimization of
f(R, B, C) = ||Z RBC||2
subject to the constraints ij: Rij 0, Bij 0 and Cij 0, where R nk, B kl, C lm, k