Extending faceted search to the general web

Extending Faceted Search to the General Web

2014/11/25 (Tue.)�Chang Wei-Yuan @ MakeLab Group Meeting

Weize Kong, James Allan �CIKM‘14

+Outline

n Introduction �

n Method �n Facet Generation �n Facet Feedback �n Evaluation �

n Experiment �

n Conclusion �

n Thought

+Outline

n Experiment �

n Conclusion �

n Thought

+ Introduction

n Faceted search helps users by offering drill-down options as a complement to the keyword input box.

+ Introduction

n However, this idea is not well explored for general web search. �n heterogeneous nature �

+ Introduction

baggage allowance

所有航線

國內航線

國際航線

貨運公司行李類型

+ Introduction

baggage allowance

所有航線

國內航線

國際航線

貨運公司行李類型

← query

← facet

← facet term

↓ search result ( ducument)

+ Introduction

n Goal : �n query-dependent automatic facet generation �n user feedback on these query facets into

document ranking

+Outline

n Experiment �

n Conclusion �

n Thought

+Flow Chart 10

Search Result

Candidate Facets Facets

Selected Terms

Top-ranked Documents

Search Result

Query Extracting

Candidates Refining

Candidates

Facet Feedback

+Flow Chart 11

Search Result

Selected Terms

Search Result

Query Extracting

Candidates Refining

Candidates

Facet Feedback

+Facet

Generation Facet

Feedback Evaluation

n Input : Query and Search Result�

n Step 1 : Extracting Candidates �

n Step 2 : Refining Candidates �

n Output : Query Facet �

+Facet

Generation Facet

Feedback Evaluation

n Step 1 : Extracting Candidates �n applied both textual and HTML patterns on

the top search results �

+Facet

Generation Facet

Feedback Evaluation

n query : “mars landing”�

n search results �n “ Mars rovers such as Curiosity, Opportunity

and Spirit ”�

n candidate facets �n C : { Curiosity, Opportunity, Spirit } �

+Facet

Generation Facet

Feedback Evaluation

n the candidate query facets extracted. �n noisy�n non-relevant to the issued query�n terms be not members of the same class �

+Facet

Generation Facet

Feedback Evaluation

n candidate facets : �

+Facet

Generation Facet

Feedback Evaluation

+Facet

Generation Facet

Feedback Evaluation

n Refine �

+Facet

Generation Facet

Feedback Evaluation

n Step 2 : Refining Candidates �n re-cluster the query facets or their facet

terms into higher quality query facets �

+Facet

Generation Facet

Feedback Evaluation

n Topic modeling �n pLSA, LDA�

n Unsupervised clustering method �n QDMiner, QDM �

n Super-vised methods based on a graphical model �n QF-I, QF-J �

+Facet

Generation Facet

Feedback Evaluation

n Input : Query and Search Result�

n Output : Facet : { a set of terms } �n Year : { 2007, 2011, 2012 } �n Lab : { NASA, Mars Science Lab, Curiosity Lab } �

+Flow Chart 22

Search Result

Selected Terms

Search Result

Query Extracting

Candidates Refining

Candidates

Facet Feedback

+Facet

Generation Facet

Feedback Evaluation

n Input : Document, Query, User Selection �n Document = one of search result �

n Boolean Filtering Model �

n Soft Ranking Model �

n Output : the score of each document

+Facet

Generation Facet

Feedback Evaluation

n Fu denotes the set of feedback facets which user selected �

n condition B can be either AND, OR, or A+O �n S(D, Q) is the score returned by the original

retrieval model �

+Facet

Generation Facet

Feedback Evaluation

n λ is a parameter for adjusting the weight �n SE(D, Fu) is the expansion part which captures

the relevance between the document and feedback facet�

+Facet

Generation Facet

Feedback Evaluation

n Input : Documents, Query, User Selection �

n Output : the score of each document

+Facet

Generation Facet

Feedback Evaluation

n Intrinsic Evaluation �n Ground Truth: query facets are constructed

by human annotators �n annotators are asked to group or re-group

terms in the pool into preferred query facets. �n  pooling facets generated by the different systems �

n compared with facets generated by different systems �

+Facet

Generation Facet

Feedback Evaluation

n Extrinsic Evaluation �n User Model �

n  The user model describes how a user selects feedback terms from facets, based on which we can estimate the time cost for the user.

↑ time for scanning facet

time for selecting terms

+Facet

Generation Facet

Feedback Evaluation

n Extrinsic Evaluation �n Oracle Feedback and Annotator Feedback �

n  Oracle feedback model only selected effective terms as feedback. �

n  The annotator is asked to select all the terms from the facets that would help address the information need. �

+Outline

n Experiment �

n Conclusion �

n Thought

+Experiment Settings

n Dataset �n  For the document corpus, we use the ClueWeb09

Category-B collection. �n  196 queries and 678 query subtopics �

n Facet Generation Models �n  pLSA, LDA, QDM, QF-I and QF-J �

n Facet Feedback Models �n  Boolean filtering models, soft ranking models �

n Baseline Retrieval Model �n  SDM, and its MAP(Mean average precision) = 0.185 �

+Facet Generation Models 32

based on annotator feedback and SF feedback model

based on oracle feedback and SF feedback model.

based on annotator feedback and SF feedback model

based on oracle feedback and SF feedback model.

Our experiments testify to the potential of Faceted Web Search.

+Facet Feedback Models 35

+Facet Feedback Models 36

Our experiments show feedback models effective.

+Outline

n Experiment �

n Conclusion �

n Thought

+Conclusion

n This paper proposed Faceted Web Search. �n an extension of faceted search to the general

Web �

n query-dependent automatic facet generation �

n feedback on these query facets into document ranking

+Outline

n Experiment �

n Conclusion �

n Thought

+Thanks for listening. 2014 / 11 / 25 (Tue.) @ MakeLab Group Meeting �v123582@gmail.com�

Extending faceted search to the general web

Data & Analytics

Transcript of Extending faceted search to the general web

Web Du Faceted Search V3 Alt

Extending Faceted Search with Automated Object Rankingusers.ics.forth.gr/...FacetedSearchAutoRanking.pdf · Faceted Search (FS) is the de facto query paradigm in e-commerce for more

Hearst Faceted Metadata for Site Navigation and Search

OERScout: Widening Access to OER through Faceted Search

Faceted Search

User-centric Faceted Search for Semantic Portals · faceted ontology queries that can be processed with a semantic faceted search engine such as Ontogator [13]. Figure 1 depicts how

Dynamic Taxonomies and Faceted Search - Semantic Scholar · Extending traditional models of Information Retrieval, search for digital resources has lately been widely recognized as

Fast Faceted Search

Faceted Metadata for Image Search and Browsingpeople.ischool.berkeley.edu/~hearst/papers/flamenco.pdf · Faceted Metadata for Image Search and Browsing Ka-Ping Yee, Kirsten Swearingen,

Flexible Search and Navigation using Faceted Metadataflamenco.berkeley.edu/papers/flamenco02.pdf · Flexible Search and Navigation using Faceted Metadata Jennifer English, Marti Hearst,

FaSet: A Set Theory Model for Faceted Search

Faceted search using Solr and Ontopia

Dynamic Taxonomies and Faceted Search

Faceted Metadata for Image Search and Browsing

Beyond Basic Faceted Search Ben-Yitzhak, et al.

Faceted Metadata for Site Navigation and Search

Faceted search of heterogeneous geographic information for ...liliana/EC/Information_Processing... · Several works promote the development of faceted search interfaces (Hearst, 2006)

Faceted Metadata in Search Interfaces

Faceted Search over Ontology-Enhanced RDF Data · Faceted search is a prominent approach for end-user data access, and several RDF-based faceted search systems have been devel-oped.

IRI Data Library Faceted Search : an example of