Information Re-Retrieval: Repeat Queries in Yahoo’s Logs

6

Information Re- Retrieval: Repeat Queries in Yahoo’s Logs Jaime Teevan, Eytan Adar, Rosie Jones, Michael A. S. Potts SIGIR 2007

Upload
laura-vincent
Category

Documents
view
13
download
3

Embed Size (px):

description

Information Re-Retrieval: Repeat Queries in Yahoo’s Logs. Jaime Teevan, Eytan Adar, Rosie Jones, Michael A. S. Potts SIGIR 2007. Motivation. Re-finding information is a common activity of W e b search What is the intention of re-finding information? - PowerPoint PPT Presentation

Transcript of Information Re-Retrieval: Repeat Queries in Yahoo’s Logs

Page 1: Information Re-Retrieval: Repeat Queries in Yahoo’s Logs

Information Re-Retrieval: Repeat Queries in Yahoo’s Logs

Jaime Teevan, Eytan Adar, Rosie Jones, Michael A. S. Potts

SIGIR 2007

Page 2: Information Re-Retrieval: Repeat Queries in Yahoo’s Logs

Motivation

• Re-finding information is a common activity of Web search

• What is the intention of re-finding information?

• What factors favor/indicate user’s re-finding of information?

Page 3: Information Re-Retrieval: Repeat Queries in Yahoo’s Logs

Dataset

• 114 Yahoo users search trace over 1 year (Aug 2004 – July 2005)– 115 queries / trace– Considered as repeat

when separated > 30 minutes

• 119 volunteers in a controlled experiment– users are asked to repeat

one query made 30 mins to 1 hour ago

Page 4: Information Re-Retrieval: Repeat Queries in Yahoo’s Logs

Techniques used

• Normalizing query terms– Capitalization, stop words removal, duplicate words removal, extra white

space, stemming

– Word order (e.g. “new york department of state” and “department of state new york”)

– Non-alphanumerics (e.g. “sub-urban” vs “sub urban”)

– Word merge (e.g. “wal mart” vs “walmart”)

– Domain (e.g. hotmail vs hotmail.com)

– Words swap (e.g. “american embassy london” vs “american consulate london”)

• SVM classifier– Applied to predict whether a result will be clicked again

Page 5: Information Re-Retrieval: Repeat Queries in Yahoo’s Logs

Discovery

• Navigation query is one major type of re-finding information– Bank, news, mail– .com, .edu, .net

• Rank changes affects re-finding

Page 6: Information Re-Retrieval: Repeat Queries in Yahoo’s Logs

Discovery

• Memory fades– Control experiment

30% are mis-remembered (36/119)27 out of 36 are equivalent after normalization

– Yahoo Logs

• Indicators of repeat click– # clicks in first query– # clicks in previous query– # unique clicks in previous query

Logs Miner : Portal for Data Mining Web Access Logs

Logs Miner : Portal for Data Mining Web Access Logs

Jeopardy $100 Facts About Logarithms Exponentials to Logs Evaluating Logs Expanding Logs Condensing Logs $200 $300 $400 $300 $200 $100 $400 $300 $200 $100.

Jeopardy $100 Facts About Logarithms Exponentials to Logs Evaluating Logs Expanding Logs Condensing Logs $200 $300 $400 $300 $200 $100 $400 $300 $200 $100.

Case 5:16-md-02752-LHK Document 495 Filed 07/21/20 Page 1 of … · 2020-07-22 · Yahoo’s failures to timely disclose, Yahoo’s misleading public statements, and the sale of the

Case 5:16-md-02752-LHK Document 495 Filed 07/21/20 Page 1 of … · 2020-07-22 · Yahoo’s failures to timely disclose, Yahoo’s misleading public statements, and the sale of the

CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-trill.pdf · - Real-time streaming, temporal queries on logs, progressive queries etc. Language Integration

CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-trill.pdf · - Real-time streaming, temporal queries on logs, progressive queries etc. Language Integration

Logs – Solve USING LOGS METHOD

Logs – Solve USING LOGS METHOD

Why Yahoo’s identity crisis could finish it off

Why Yahoo’s identity crisis could finish it off

Identifying Slow Queries, and Fixing Them!...log autovacuum min duration 5/40. log min duration statement log_min_duration_statement = 0 Zero Logs every statement sent Number is in

Identifying Slow Queries, and Fixing Them!...log autovacuum min duration 5/40. log min duration statement log_min_duration_statement = 0 Zero Logs every statement sent Number is in

Logs, Logs, Every Where, Nor Any Byte to Grok

Logs, Logs, Every Where, Nor Any Byte to Grok

Mining related queries from Web search engine query logs ... · in utilizing Web search engine query logs for mining related queries. Cui, Wen, Nie, and Ma (2002) proposed a method

Mining related queries from Web search engine query logs ... · in utilizing Web search engine query logs for mining related queries. Cui, Wen, Nie, and Ma (2002) proposed a method

INTRODUCTION TO PEOPLESOFT QUERY. AGENDA Overview PeopleSoft Query Running Queries Writing Queries Advanced Topics –Multiple Table Queries –Prompted Queries.

INTRODUCTION TO PEOPLESOFT QUERY. AGENDA Overview PeopleSoft Query Running Queries Writing Queries Advanced Topics –Multiple Table Queries –Prompted Queries.

Threat Hunting with Application Logs and Sigma - owasp.org · Threat Hunting with Application Logs and Sigma ... – Rule Format – Rule Examples – Conversion to SIEM queries How

Threat Hunting with Application Logs and Sigma - owasp.org · Threat Hunting with Application Logs and Sigma ... – Rule Format – Rule Examples – Conversion to SIEM queries How

CREATING COMPLEX QUERIES WITH NESTED QUERIES CS1100: Data, Databases, and Queries CS1100Microsoft Access1.

CREATING COMPLEX QUERIES WITH NESTED QUERIES CS1100: Data, Databases, and Queries CS1100Microsoft Access1.

Queries Ms. Jaimie Barbé. Queries DoDAAC and UIC Queries in AESIP.

Queries Ms. Jaimie Barbé. Queries DoDAAC and UIC Queries in AESIP.

Making Apache Hadoop Secure Devaraj Das ddas@apache.org Yahoo’s Hadoop Team.

Making Apache Hadoop Secure Devaraj Das [email protected] Yahoo’s Hadoop Team.

Mashups - Information Sciences Institute · •Data Extraction –Simile, Dapper, D.Mix [Hartman 2007], OpenKapow •Widget Approach –Yahoo’s Pipes, Microsoft’s Popfly, IBM’s

Mashups - Information Sciences Institute · •Data Extraction –Simile, Dapper, D.Mix [Hartman 2007], OpenKapow •Widget Approach –Yahoo’s Pipes, Microsoft’s Popfly, IBM’s

06 Resistivity Logs Induction Logs

06 Resistivity Logs Induction Logs

Information Re-Retrieval Repeat Queries in Yahoo’s Logs Jaime Teevan (MSR), Eytan Adar (UW), Rosie Jones and Mike Potts (Yahoo) Presented by Hugo Zaragoza.

Information Re-Retrieval Repeat Queries in Yahoo’s Logs Jaime Teevan (MSR), Eytan Adar (UW), Rosie Jones and Mike Potts (Yahoo) Presented by Hugo Zaragoza.

Collect Logs - cisco.com fileCollect Logs CiscoPrimeCollaborationenablesyoutocollectcalllogstoidentifyfaultsinthecallsforCiscoVoicePortal (CVP),UnifiedContactCenterEnterprise(UnifiedCCE

Collect Logs - cisco.com fileCollect Logs CiscoPrimeCollaborationenablesyoutocollectcalllogstoidentifyfaultsinthecallsforCiscoVoicePortal (CVP),UnifiedContactCenterEnterprise(UnifiedCCE

Mining related queries from Web search engine query logs using an

Mining related queries from Web search engine query logs using an

Whose Logs, What Logs, Why Logs - Your Quickest Path to Security Visibility

Whose Logs, What Logs, Why Logs - Your Quickest Path to Security Visibility

Languages

Pages

Legal

Copyright © 2022 FDOCUMENTS