Challenges for Supporting Faceted Search in Large...

1
Contacts: Jaime Teevan (teevan), Susan Dumais (sdumais), Zachary Gutt (zachg) Challenges for Supporting Faceted Search in Large, Heterogeneous Corpora like the Web Jaime Teevan, Susan Dumais, Zachary Gutt Microsoft Faceted Search Today Examples Challenges with the Web Scale, Heterogeneity Generating metadata: Which facets? Scale (docs and facets) Accuracy vs. coverage Uncertainty filtering vs. ranking vs. interacting Identifying which facets to surface: Predictability vs. relevance to query Discriminability vs. relevance to searcher Computing previews: Accurately predicting counts, without examining all results Location Shoppin g Etc. HomePage Domain Time Genr e Topic

Transcript of Challenges for Supporting Faceted Search in Large...

Page 1: Challenges for Supporting Faceted Search in Large ...people.csail.mit.edu/teevan/work/publications/workshops/...Category Appliances (33) Bath (13) Building Materials (325) Decor (385)

Contacts: Jaime Teevan (teevan), Susan Dumais (sdumais), Zachary Gutt (zachg)

Challenges for Supporting Faceted Search in

Large, Heterogeneous Corpora like the Web

Jaime Teevan, Susan Dumais, Zachary Gutt

Microsoft

Faceted Search Today – Examples

Challenges with the Web – Scale, Heterogeneity

Generating metadata:

• Which facets?

• Scale (docs and facets)

• Accuracy vs. coverage

• Uncertainty – filtering vs. ranking vs. interacting

Identifying which facets to surface:

• Predictability vs. relevance to query

• Discriminability vs. relevance to searcher

Computing previews:

• Accurately predicting counts, without examining all results

LocationShoppin

g Etc.

HomePage

Domain

Time

Genr

eTopic