Structured Learning of Two-Level Dynamic Rankings

20
STRUCTURED LEARNING OF TWO-LEVEL DYNAMIC RANKINGS Karthik Raman , Thorsten Joachims & Pannaga Shivaswamy Cornell University 1

description

Structured Learning of Two-Level Dynamic Rankings. Karthik Raman , Thorsten Joachims & Pannaga Shivaswamy Cornell University. Handling Query Ambiguity. Non-Diversified. Diversified. Ranking Metrics: Two Extremes. Non-Div . Diversified. d 1. d 2. d 3. d 4. ………………??…….………. - PowerPoint PPT Presentation

Transcript of Structured Learning of Two-Level Dynamic Rankings

Page 1: Structured Learning of       Two-Level Dynamic Rankings

1

STRUCTURED LEARNING OF TWO-LEVEL DYNAMIC RANKINGS

Karthik Raman, Thorsten Joachims & Pannaga Shivaswamy

Cornell University

Page 2: Structured Learning of       Two-Level Dynamic Rankings

2

HANDLING QUERY AMBIGUITYDiversifiedNon-Diversified

Page 3: Structured Learning of       Two-Level Dynamic Rankings

3

RANKING METRICS: TWO EXTREMES

d1

d2

d3

d4

4 3 0

4 0 0

0 3 0

0 0 3

8 6 3

Non-Div.

Rnkgd

tdU )|(

= 8/2 + 6/3 + 3/6

t1 t2 t3

1/2 1/3 1/6

4 3 0

4 0 0

0 3 0

0 0 3

Diversified

4 3 3 )|(max Rnkgd tdU

= 4/2 + 3/3 + 3/6

………………??…….……….

t1 t2 t3

P(t1) =1/2

P(t2) =1/3

P(t3) =1/6

U(d1|t)

U(d2|t)

U(d3|t)

U(d4|t)

Page 4: Structured Learning of       Two-Level Dynamic Rankings

KEY: DIMINISHING RETURNS

0 1 2 3 4 5 6 7 8 9 100

1

2

3

4

5

g(x)=x

g(x)=min(x,1)

# of rel. documents for given intent

4

Non-Diversified

Diversified

Page 5: Structured Learning of       Two-Level Dynamic Rankings

KEY: DIMINISHING RETURNS

For a given query and user intent: Marginal benefit of seeing additional relevant doc. decreases.

0 1 2 3 4 5 6 7 8 9 100

1

2

3

4

5g(x)=xg(x)=log(1+x)g(x)=√xg(x)=min(x,2)g(x)=min(x,1)

5

Page 6: Structured Learning of       Two-Level Dynamic Rankings

6

GENERAL RANKING METRIC

d1

d2

d3

d4

Given ranking θ = (d1, d2,…. dk) and concave function g

t1 t2 t3

1/2 1/3 1/64 3 0

4 0 0

0 3 0

0 0 3

√8 √6 √3

= √8 /2 + √6/3 + √3/6

g(x) = √x

Rankingd

tdU )|(

ki

iig tdUgktU

1

)|(@)|(

- Can incorporate position discount also

Page 7: Structured Learning of       Two-Level Dynamic Rankings

7

COMPUTING OPTIMAL RANKINGS

These utility functions are submodular. Given the utility function, can find ranking that

optimizes it using a greedy algorithm: At each iteration: Choose Document that Maximizes

Marginal Benefit

Algorithm has (1 – 1/e) approximation bound.

d1 Look at Marginal Benefitsd1 2.2

d2 1.7 1.4

d3 0.4 0.2

d4 1.9 1.7

d4?

d2?

d1 2.2

d2 1.7 1.4

d3 0.4 0.2 1.3

d4 1.9 1.7 0.1

?d1 2.2

d2 1.7

d3 0.4

d4 1.9

Page 8: Structured Learning of       Two-Level Dynamic Rankings

8

EXPERIMENTAL SETTINGS

Experiments run on TREC and WEB datasets: Both datasets have document-intent annotations.

Avg. Intents per Doc: 20 vs. 4.5 Prob(Dominant Intent) : 0.28 vs. 0.52

Different Utility functions compared: PREC : g(x) =x SQRT : √x LOG :

log(1+x) SAT-2 : min(x,2) SAT-1 : min(x,1)

Page 9: Structured Learning of       Two-Level Dynamic Rankings

9

TURNING THE KNOB

2 3 4 5 6 7 8 9 10 110

0.51

1.52

2.53

3.54

4.5

TRECWEB

# of Intents Covered

# O

f Doc

s. P

er In

tent

Co

vere

d

?No Diversification

Maximum Diversity

LOG

SQRT

SAT-2LOG

SQRT

SAT-2

Page 10: Structured Learning of       Two-Level Dynamic Rankings

10

HOW DO WE BREAK THIS TRADE-OFF?

KEY IDEA: Using user interaction as feedback to disambiguate the query on-the-fly, so as to present more results relevant to the user.

(Brandt et. al. WSDM ‘11)

Page 11: Structured Learning of       Two-Level Dynamic Rankings

11TWO-LEVEL DYNAMIC RANKING: EXAMPLE

First-Level: Start with a Diverse Ranking

User interested in the ML Method.

Expands to see more such results.

Page 12: Structured Learning of       Two-Level Dynamic Rankings

12

TWO-LEVEL DYNAMIC RANKING: EXAMPLE

New ranking related to ML Method.

Second-Level Ranking

Page 13: Structured Learning of       Two-Level Dynamic Rankings

13

TWO-LEVEL DYNAMIC RANKING: EXAMPLE

Page 14: Structured Learning of       Two-Level Dynamic Rankings

14

TWO-LEVEL DYNAMIC RANKING: EXAMPLE

Page 15: Structured Learning of       Two-Level Dynamic Rankings

15

TWO-LEVEL DYNAMIC RANKING : EXAMPLE

expandd1 d1,1 d1,2 d1,3

d2 d2,1 d2,2 d2,3

d3 d3,1 d3,2 d3,3

d4 d4,1 d4,2 d4,3

skip

skip

skip

expand

expand

expand

Two-Level Ranking (Θ)

Head

Tail Documents

Page 16: Structured Learning of       Two-Level Dynamic Rankings

16

UTILITY OF TWO-LEVEL RANKING

Can extend definition of Utility for two-level rankings: Concave function applied to (weighted) sum over all

rows. Resulting measure is nested submodular function.

Can similarly extend the greedy algorithm to a nested greedy algorithm for Two-Level Rankings: (1): Given a head document, can greedily build a row. (2): Each iteration, amongst all rows from (1), choose

the one with highest marginal benefit.

Can still prove an approximation bound: ≈ 0.47

)11()11(ee

Page 17: Structured Learning of       Two-Level Dynamic Rankings

17

STATIC VS. TWO-LEVEL DYNAMIC RANKINGS

Ser0.2

0.25

0.3

0.35

0.4

0.45 PREC

Ser0.9

0.951

1.051.1

1.151.2

1.25 SQRT

Ser0.5

0.6

0.7

0.8

0.9

1 LOG

Ser0.9

11.11.21.31.41.5

SAT-2

Static

Dy-namic

TREC

Ser0.6

0.65

0.7

0.75

0.8PREC

Ser1.5

1.6

1.7

1.8

1.9

2SQRT

Ser1.2

1.251.3

1.351.4

1.451.5

1.551.6 LOG

Ser1.6

1.651.7

1.751.8

1.851.9

1.952

2.05SAT-2

Static

Dy-namic

WEB

Page 18: Structured Learning of       Two-Level Dynamic Rankings

18

STRUCTURAL-SVM BASED LEARNING FRAMEWORK What if we do not have the document-intent labels? Solution: Use TERMS as a substitute for intents.

Structural SVMs used to predict complex outputs.

Utility Measure with Words replacing

intents

Similarity between head and tail documents.

Page 19: Structured Learning of       Two-Level Dynamic Rankings

19

LEARNING EXPERIMENTS Compare with different static-learning

baselines:

Page 20: Structured Learning of       Two-Level Dynamic Rankings

20

MAIN TAKE-AWAYS

1) Using Submodular Performance Measures to smoothly bridge the gap between depth and diversity.

2) Breaking the Trade-off between depth and diversity via user interaction using Two Level Dynamic Rankings.

3) Given training data, being able to predict dynamic rankings using the Structural SVM-based Learning Algorithm.

Code up on : http://www.cs.cornell.edu/~karthik/code/svm-dyn.html