8th Language and Computation Day - University of Essex

Research OverviewInteractive Experimentation

Bootstrapping ExperimentationGoing Forward

Explorations in Bootstrapping Guided Search

8th Language and Computation Day

Deirdre Lungleydmlung@essex.ac.uk

October 8, 2009

Deirdre Lungley

Research ContributionMethodology

Research Contribution

1 Automatically acquire a domain model for a document collection

2 Allow for user adaptation through the incorporation of log data

3 Provide an insight into the different nature of general search, e.g.,WWW search versus intranet search

Deirdre Lungley

Methodology

Formal Concept Analysis (FCA) lattice based domain model

Navigational qualitiesCoatoms provide initial query refinement suggestions

Deriving lattice document descriptors (index terms)

Lattice structure dependant on good document descriptorsUse combination of NLP and mining of query logsNLP techniques:

Noun phrase terms which occur in at least 2 contexts are included.Also extract terms which co-occur with query term(s)

Query log mining:

Machine learning through relative relevanceLearn the URLs relevant to a query term(s)Attach query term(s) to these URLs

Deirdre Lungley

Methodology

Lattice structure dependant on good document descriptorsUse combination of NLP and mining of query logs

NLP techniques:

Query log mining:

Deirdre Lungley

Methodology

Query log mining:

Deirdre Lungley

Methodology

Query log mining:

Deirdre Lungley

Early Interactive Intranet Experiment

Early Interactive Intranet Experiment1

Simulate log data transactions for some frequent queries

Evaluate generated query refinement suggestions over two baselines:Lattice based solely on text processing of documentsFrequent terms

Results:

Adapted Lattice B1:Unadapted Lattice B2:Frequent Terms

% suggestions

judged relevant 73% 32% 42%

Results confirm our assumption that users would prefer queryrefinement suggestions learnt from user queries over contentgenerated terms

1Lungley, D. and Kruschwitz, U., Automatically Maintained Domain Knowledge:Initial Findings. In proceedings of the 31st European Conference on IR Research, ECIR2009

Deirdre Lungley

Results:

% suggestions

Deirdre Lungley

Results:

% suggestions

Deirdre Lungley

Results:

% suggestions

Deirdre Lungley

WWW Bootstrapping ExperimentObservationsMLE-based Query Suggestions

World Wide Web Bootstrapping Experiment

MSN Search Asset Data Collection

15 million queries and related clicks

TREC topics, 1 low frequency, 3 medium and 6 high

Results of UK evaluation:

Adapted Lattice B1:Unadapted Lattice B2:Noun Count

% suggestions

Results of Mechanical Turk evaluation:

% suggestions

Deirdre Lungley

% suggestions

Deirdre Lungley

% suggestions

Deirdre Lungley

% suggestions

Deirdre Lungley

Observations

Can we say deriving suggestions from logs works better on intranetdata?

Influencing factors:

Limitation to simple term pair evaluation - WWW requires morecontextTemporal dimension - log data dated May 2006

Can we say deriving suggestions from historic queries works betterthan from historic queries and clicks? Useful since:

Query data more readily availableSensitive nature of click data

Suggests evaluation of query-only adaptation

Intranet experimentAdapt relative relevance learningHighly dependant on good precision (P@1/P@2/P@5)Nutch (VSM) to Lucene (BM25F)

Deirdre Lungley

Observations

Can we say deriving suggestions from logs works better on intranetdata? Influencing factors:

Deirdre Lungley

Observations

Can we say deriving suggestions from historic queries works betterthan from historic queries and clicks?

Useful since:

Deirdre Lungley

Observations

Deirdre Lungley

Observations

Deirdre Lungley

Deriving query suggestions from Intranet Query Logs using MLEResearch Questions:

Usefulness of dialogue log componentSuitability of Web derived suggestions for domain-specific searchGeneral Web user perception of ”usefulness” of extracted suggestions

Query bigram MLE – max P(qn+1|q) over (q, qn+1)Experimental setup:

Suggestions generated for top 25 most frequently submitted queries67 participants for both evaluations

Method Relevant – Local Relevant – MTMLE-Session 71.0% 63.6%MLE-Dialogue 75.7% 68.9%MLE-Dialogue-Add 72.1% 63.6%MLE-Dialogue-Replace 75.2% 73.1%Baseline-Snippets 54.9% 51.3%Baseline-Google 35.6% 58.3%

Deirdre Lungley

Query bigram MLE – max P(qn+1|q) over (q, qn+1)

Experimental setup:Suggestions generated for top 25 most frequently submitted queries67 participants for both evaluations

Deirdre Lungley

Going Forward

Revisit lattice document descriptors

Move from ”Related searches” to ”concepts”Conceptual representation to map a specific URL into some spaceLatent Semantic Analysis (LSA) kernel

Deirdre Lungley

Questions?

Deirdre Lungley

Figure: Automade - UoE Intranet

Deirdre Lungley

8th Language and Computation Day - University of Essex

Documents

Transcript of 8th Language and Computation Day - University of Essex

Essex Learning Partnership - ELP Driving school Improvement for Essex Learners.

25 August, 2014Copyright Edward Tsang1 Evolutionary Computation Edward Tsang, University of Essex 1. Overview of EA 2. Analysis of GA Fundamental Theorem,

Issue 663 Beekeeper March 2020 - Essex Beekeepers ......The Essex Beekeeper Monthly Magazine of the Essex Beekeepers’ Association Furthering the Craft of Beekeeping in Essex Registered

County of EssEx EmployEE Handbook - Essex County · ESSEX COUNTY GOVERNMENT Essex County is governed under the County Executive Plan of New Jersey’s Optional County Charter Law.

lfd. Nr Neptun Name VB 1 1310 Essex Essex 29,00files.homepagemodules.de/b628943/f23t388p1036n2_kYsxRTfi.pdf · lfd. Nr Neptun Name VB 1 1310 Essex "Essex" 29,00 € 2 1310 Essex "Franklin"

University of Essex Undergraduate foundation courses at Essex Your pathway to success International Office Wivenhoe Park Colchester Essex,

VILLAGE OF ESSEX JUNCTION TRUSTEES TOWN OF ESSEX ... · 12/12/2019 · TOWN OF ESSEX SELECTBOARD Subcommittee on Governance Special Meeting Agenda 2 Lincoln Street Essex Junction,

THE ESSEX BEEKEEPER...THE ESSEX BEEKEEPER Monthly Magazine of the Essex Beekeepers’ Association Registered Charity number 1031419 Furthering the Craft of Beekeeping in Essex No.

Essex Boys

Landscape Character Assessment of the Essex Coast · Nigel Brown - Essex County Council Terry Coehlo - Essex County Council Lynn Dyson-Bruce - Essex County Council Debbie Knopp -

Essex climatechangeflooding_ScottHursley

8TH CONFERENCE ON COMPUTATIONAL BIOLOGY & …8TH CONFERENCE ON COMPUTATIONAL BIOLOGY & BIOINFORMATICS April 3-4, 2020 LSI-I Digital Media Center - C:-....puting Center for Computation

Essex Unite

Essex Record Office | Essex at War, 1914-1918

Essex Mums

Essex Vehicle Equip Asset Strategy - Welcome to Essex County

Bedfordshire, Essex

Essex Cares: Wellbeing & Activity Centres in Mid Essex

Evolutionary Computation and Aritficial Life (ECAL) cousre - CS BGU - July 8th, 2009 1 Evolving Efficient List Search Algorithms Kfir Wolfson Moshe Sipper.

2013~2016 Essex Fire Authority BUILDING A SAFER ESSEX