Expertise Finding for Question Answering (QA) Services

Expertise Finding for Question Answering (QA) ServicesOctober 16, 2014Department of Knowledge Service EngineeringProf. Jae-Gil Lee

210/16/2014

Brief Bio• Currently, an associate professor at Department of

Knowledge Service Engineering, KAIST• Homepage: http://dm.kaist.ac.kr/jaegil

• Lab homepage: http://dm.kaist.ac.kr/

• Previously, worked at IBM Almaden Research Center and University of Illinois at Urbana-Champaign

• Areas of Interest: Data Mining and Big Data

310/16/2014

Table of Contents• Community-based Question Answering (CQA) Services• Background and Motivation

• Methodology Overview

• Evaluation Results

• Social Search Engines for Location-Based Questions• Background and Motivation

• System Architecture and User Interface

410/16/2014

Question Answering (QA) Services

QA services are good at Recently updated information Personalized information Advice & opinion[Budalakoti et al., 2010]

Questions Answers KnowledgeBase

Search

Experts

510/16/2014

Community-based Question Answering (CQA) Services

Naver Knowledge-In Yahoo! Answers

50,000 questions per day 160,000 questions per day

610/16/2014

Motivation of Our Study• Most contributions (i.e.,

answers) in CQA services are made by a small number of heavy users

• Recently-joined users are prone to leave CQA ser-vices very soon

Only 8.4% of answerers remained after a year

Making the long tail stay longer before they leave is of prime importance towards the success of the services

710/16/2014

Problem Setting• To whom does the service provider need to pay special at-

tention? Recently-joined (i.e., light) users who are likely to become contributive (i.e., heavy) users

• Goal: estimating the likelihood of a light user becoming a heavy user (mainly by his/her expertise)

• Challenges: lack of information about the light user

어장관리 ?

810/16/2014

Intuition behind Our Methodology• A person’s active vocabulary reveals his/her

knowledge

• Vocabulary has sharable characteristics so that domain-specific words are repeatedly used by expert answerers

Device

Memory

Computer

RAMSSD

Operation

Q&A 1 by Answerer 1 Q&A 2 by Answerer 2

Domain-SpecificVocabularies

CommonVocabularies

LevelDifference

SharableCharacteristics

910/16/2014

Estimated Expertise

Heavy Users Words Light Users

The more expert a user is, the higher the level of words he/she used is.

1010/16/2014

Availability• Simply measuring the number of a user’s answers with

their importance proportional to their recency

1110/16/2014

Answer Affordance• Being defined as the likelihood of a light user becom-

ing a heavy user if he/she is treated specially

• Considering both expertise and availability

𝐴𝑓𝑓𝑜𝑟𝑑𝑎𝑛𝑐𝑒 (𝑢𝑙 )=¿

1210/16/2014

Data Set• Collected from Naver Knowledge-In (KiN, 지식인 )

• Spanning ten years (from Sept. 2002 to Aug. 2012)

• Including two categories: Computers and Travel• Computers: factual information, Travel: subjective opinions

• The entropy was used for measuring the expertise of a user, working well especially for the categories where factual exper-tise is primarily sought after [Adamic et al., 2008]

• StatisticsComputers Travel

# of answers 3,926,794 585,316

# of words 191,502 232,076

# of users 228,369 44,866

1310/16/2014

Evaluation Setting (1/2)• Finding the top-k users by

Affordance() for light users our methodology

• Retrieving the top-k directoryexperts managed by KiN competitor

• Measuring the two measuresfor the next one month• User availability: the ratio of the number of the top-k users who

appeared on the day to the total number of users who appeared on that day

• Answer possession: the ratio of the number of the answers posted by the top-k users on the day to the total number of answers posted on that day

1410/16/2014

Evaluation Setting (2/2)

Ten year period

Sept. 2002 July 2011 July 2012 Aug. 2012

Used for deriving the word levels Used for finding top-k experts by our methodology

Picked up the top-k directory experts managed by KiN

Monitored the user availability and answer possession

1510/16/2014

The result of the answer possession

The result of the user availability (a) Computers (b) Travel

(a) Computers (b) Travel

top-400 top-200

See the paper for the technical details.

Sung, J., Lee, J., and Lee, U., "Booming Up the Long Tails: Discovering Potentially Contributive Users in Community-Based Question Answering Services," In Proc. 7th Int'l AAAI Conf. on Weblogs and Social Media (ICWSM), Cambridge, Massachusetts, July 2013.

This paper received the Best Paper Award at AAAI ICWSM-13.

1710/16/2014

Table of Contents• Community-based Question Answering (CQA) Services• Background and Motivation

• Methodology Overview

• Social Search Engines for Location-Based Questions• Background and Motivation

• System Architecture and User Interface

1810/16/2014

Social Search (1/2)• A new paradigm of knowledge acquisition that relies

on the people of a questioner’s social network

1910/16/2014

Social Search (2/2)

If you want to get some opinions or advices from your online friends, what do you do?

Not knowing whom to ask Knowing whom to ask

Taking advantage of both approaches

Social Search

2010/16/2014

KiN Here ( 지식인 위치질문 )• A query is routed by finding a match between a target

location of a query and a relevant location of a user

동 단위로 추가

2110/16/2014

Location-Based Questions• Informally defined as “search for a business or place of in-

terest that is tied to a specific geographical location”[Amin et al., 2009]

• Very popular especially in mobile search and typically sub-jective• Mobile search is estimated to comprise 10% 30% of all searches ∼• About 9 10% of the queries from Yahoo! mobile search∼ ,

over 15% of 1 million Google queries from PDA devices , and about 10% of 10 million Bing mobile queries were identified as location-based questions

• In a set of location-based questions, 63% of them were non-factual, and the remaining 37% of them were factual

Mobile social search is the best way to process location-based questions

2210/16/2014

Glaucus: A Social Search Engine for Location-Based Questions

1. Asking a question to Glaucus2. Selecting proper experts3. Routing the question to the experts4. Returning an answer to the questioner5. (Optional) Rating the answer

GlaucusSocial Search

Engine

User Database

1: Query

2: Selected Experts

3: Query

Answer 4: Answer

5: Feedback

Crawling

Questioner

2310/16/2014

User Interface• An Android app has been developed and is under

(closed) beta testing

Questioner Answerer

2410/16/2014

Data Collection• Being able to collect who visited where and when on

geosocial networking services such as Foursquare• Users check-in to a venue and also may leave a tip

• Our crawler collects such information upon user approval

2510/16/2014

Expert Finding

Location

Expertise Finding for Question Answering (QA) Services

Documents

Transcript of Expertise Finding for Question Answering (QA) Services

QA to AQ Part Five - Rebecca Wirfs-Brockwirfs-brock.com/PDFs/QA2AQPartFive.pdf · QA to AQ Part Five: Being Agile at Quality ﹘ “Growing Quality Awareness and Expertise” - 2

NExT-QA: Next Phase of Question-Answering to Explaining ......NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions Junbin Xiao, Xindi Shang, Angela Yao, Tat-Seng

Results of the BioASQ Track of the Question Answering Lab ...ceur-ws.org/Vol-1180/CLEF2014wn-QA-BalikasEt2014.pdf · Answering Lab at CLEF 2014 George Balikas, Ioannis Partalas, Axel-Cyrille

Question Answering Systems - Syracuse Universityclasses.ischool.syr.edu/ist664/NLPFall2015/QA.2015.ppt.pdf · 2015. 3. 13. · Question Answering (QA) • IR assumes that the user

Question Answering using Constraint Satisfaction: QA-by-Dossier-with-Constraints John Prager, Jennifer Chu-Carroll, Krzysztof Czuba Watson Research Ctr.

(Some) Research Trends in Question Answering (QA)mausam/courses/col864/spring2017/slides/16-qa.pdf · (Some) Research Trends in Question Answering (QA) ... • IR-QB: Maps questions

Example-Driven Question Answering di.pdf · Open-domain question answering (QA) is an emerging information-seeking paradigm, which automatically generates accurate and concise answers

CASIA@V2: A MLN-based Question Answering System over Linked …ceur-ws.org/Vol-1180/CLEF2014wn-QA-ShizhuEt2014.pdf · 2014-09-06 · CASIA@V2: A MLN-based Question Answering System

© Johan Bos November 2005 Question Answering Lecture 1 (two weeks ago): Introduction; History of QA; Architecture of a QA system; Evaluation. Lecture 2.

Visual7W: Grounded Question Answering in Images · PDF filenew task of visual question answering (QA) has been pro-posed to evaluate a model’s capacity for deep image under-standing

Question Answering and Reading Comprehension · Question Answering (QA) vs. Information Retrieval (IR) • QA and IR are related, but satisfy diﬀerent info needs • In QA, questions

IMTKU Question Answering System for World History …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...World History Exams at NTCIR-12 QA Lab2 myday@mail.tku.edu.tw NTCIR-12

A Semantic Question Answering Framework for Large Data Sets · Answering over Linked Data [28] has helped in data sharing and development of robust systems. Most approaches to QA

A Semantic Question Answering Framework for Large Data Setssemantics to various degrees. In this article, we describe a purely semantic question answering (QA) framework for large

Question Classification in Question Answering Systems23705/FULLTEXT01.pdf · 1.1 Question answering The goal of question answering (QA) is to provide a succinct answer, given a question

SEMINAR ON MACHINE ANSWERING - DFKIneumann/ML4QAseminar2016/presentations/LT1-QA-Ne… · SEMINAR ON MACHINE LEARNING AND QUESTION ... machine learning! ... It was the worst peacetime

Question Answering and Reading Comprehensioncs.jhu.edu/~kevinduh/notes/introhlt19-kduh-qa.pdf · Question Answering (QA) vs. Information Retrieval (IR) • QA and IR are related,

Deploying Semantic Resources for Open Domain Question Answeringisl.anthropomatik.kit.edu/pdf/Schlaefer2007a.pdf · 2017. 3. 7. · curacy of an open domain question answering (QA)

QA Lab-4: QALab-PoliInfo QA Lab so far QA Lab is aimed at complex real-world question answering (QA) technologies lNTCIR-11 QA Lab lNTCIR-12 QA Lab-2 lNTCIR-13 QA Lab-3 Previous tasks

150626 QA EA Answering Guidelines