Geometric Aspects of LSA

15
Geometric Aspects of Semantic Indexing Dr. Heinrich Hartmann (Oxford) WeST Koblenz, 31.08.2012

description

Talk given at Oberseminar WeST in 2012

Transcript of Geometric Aspects of LSA

Page 1: Geometric Aspects of LSA

Geometric Aspects of Semantic IndexingDr. Heinrich Hartmann (Oxford)

WeST Koblenz, 31.08.2012

Page 2: Geometric Aspects of LSA

Outline

1. Latent Semantic Analysis

2. Geometry of SVD

3. Common Ground

Page 3: Geometric Aspects of LSA

Part 1Latent Semantic Analysis

Problem. Given a set of documents(D1,...,DN)

which documents have a similar topic?

Answer. Latent Semantic Analysis (Deerwester, et. al. 1990)

Page 4: Geometric Aspects of LSA

Latent Semantic Analysis - 2

Page 5: Geometric Aspects of LSA

Latent Semantic Analysis - 3

Page 6: Geometric Aspects of LSA

Latent Semantic Analysis - 4

Page 7: Geometric Aspects of LSA

Latent Semantic Analysis - 5

Page 8: Geometric Aspects of LSA

Part 2Geometry of SVD

Study geometry of the associated map!

Page 9: Geometric Aspects of LSA
Page 10: Geometric Aspects of LSA

Part 3Common Ground

Page 11: Geometric Aspects of LSA

Common GroundThe extremal vector

Which meaning has the extremal vector?

Inside the term-space.

Page 12: Geometric Aspects of LSA

Common GroundExample 1

Page 13: Geometric Aspects of LSA

Common GroundExample 2

Page 14: Geometric Aspects of LSA

Common GroundExample 3

Page 15: Geometric Aspects of LSA

Common GroundConclusion

The extremal vector represents the Dominant Topic.

Thank You