
KBQA: An Online Template Based Question Answering System over Freebase

Wanyun Cui, Yanghua Xiao*, Wei Wang
Shanghai Key Laboratory of Data Science, School of Computer Science, Fudan University

wanyuncui1@gmail.com, shawyh@fudan.edu.cn, weiwang1@fudan.edu.cn

Abstract

Question answering (QA) has become a popular way for humans to access billion-scale knowledge bases. QA systems over knowledge bases produce accurate and concise answers. The key to QA over knowledge bases is to map the question to a certain substructure in the knowledge base. To do this, KBQA (Question Answering over Knowledge Bases) uses a new kind of question representation: templates, learned from a million-scale QA corpus. For example, for questions about a city's population, KBQA learns templates such as What's the population of $city? and How many people are there in $city?. It learns 1,171,303 templates for 4,690 relations overall. Based on these templates, KBQA effectively and efficiently supports binary factoid questions and complex questions.

1 Introduction

Question Answering (QA) has drawn a lot of research interest. A QA system is designed to answer a particular type of question [2]. One of the most important types of questions is the factoid question (FQ), which asks about objective facts of an entity. A particular type of FQ, known as the binary factoid question (BFQ) [1], asks about a property of an entity. For example, q1: How many people are there in Honolulu? In this paper, we focus on answering these BFQs over Freebase.

KBQA understands a question by templates. As an example, How many people are there in $city? is the template for q1. No matter whether $city refers to Honolulu or another city, the template always asks about the population of the city in question. KBQA learns each template's corresponding predicate from Yahoo! Answers, a large-scale QA corpus consisting of millions of QA pairs.
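To make the template idea concrete, here is a minimal sketch (hypothetical code, not KBQA's actual implementation) of how a question becomes a template by replacing its entity mention with a concept placeholder:

```python
# Hypothetical sketch: a question becomes a template by replacing its
# entity mention with a concept placeholder such as $city.
def to_template(question: str, entity: str, concept: str) -> str:
    return question.replace(entity, "$" + concept)

template = to_template("How many people are there in Honolulu?",
                       "Honolulu", "city")
print(template)  # How many people are there in $city?
```

Any question about another city's population reduces to the same template, which is what lets one learned template answer many concrete questions.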

*Corresponding author. This work was supported by Xiaoi Research, by Shanghai Internet Big Data Engineering and Technology Center, by the National NSFC (No. 61472085, 61171132, 61033010), by the National Key Basic Research Program of China under No. 2015CB358800, and by the Shanghai Municipal Science and Technology Commission foundation key project under No. 15JC1400900.

We show the number of predicates and templates KBQA learned in Table 1, with a comparison to bootstrapping [4], which uses BOA patterns to represent questions. From the result, KBQA finds significantly more templates and predicates even though the corpus size of bootstrapping is larger. This implies that KBQA is more effective: (1) the large number of templates ensures that KBQA understands diverse questions; (2) the large number of predicates ensures that KBQA understands diverse question intents.

                            KBQA            Bootstrapping
Corpus                      41M QA pairs    256M sentences
Templates                   1,171,303       471,920
Predicates (BOA patterns)   4,690           283

Table 1: Coverage of Templates

2 Anatomy

Architecture. Figure 1 shows the architecture of KBQA. The offline procedure learns the mapping from templates to predicates in the Template Extraction module using a maximum likelihood (ML) estimator. This estimator is trained on the results of the Entity & Value Identification module. Furthermore, KBQA understands complex predicate forms of the knowledge base in the Predicate Expansion module. When a question comes into the online procedure, KBQA first parses it into one or several BFQs. For each BFQ, KBQA extracts its template and looks up the predicates in the template repository. Finally, KBQA returns the value of the entity and predicate retrieved from the knowledge base as the answer.
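The final online step can be sketched as a pair of lookups, assuming a learned template-to-predicate map and a knowledge base keyed by (entity, predicate). All names and the population value below are toy data, not KBQA's or Freebase's actual contents:

```python
# Toy sketch of the online answering step: template -> predicate -> value.
# The map, the KB, and the population value are all hypothetical.
TEMPLATE_TO_PREDICATE = {"How many people are there in $city?": "population"}
KB = {("Honolulu", "population"): 337256}  # toy value

def answer(template: str, entity: str):
    predicate = TEMPLATE_TO_PREDICATE[template]
    return KB[(entity, predicate)]

print(answer("How many people are there in $city?", "Honolulu"))  # 337256
```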

Question Parsing. The core of the online procedure is to convert a question into templates. To do this, KBQA identifies the question's entity with Stanford NER [3], and replaces the entity with its concepts using conceptualization [5]. The conceptualization mechanism is based on a large semantic network (Probase [6]) that consists of millions of entities and concepts, so KBQA has enough granularity to represent diverse questions.
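Since an entity typically conceptualizes to several concepts with different probabilities, one question yields several candidate templates. A toy illustration (the concept list and probabilities are made up; Probase holds millions of real entity-concept pairs):

```python
# Toy conceptualization table; real probabilities come from Probase.
CONCEPTS = {"Honolulu": [("city", 0.9), ("destination", 0.1)]}

def candidate_templates(question: str, entity: str):
    """One candidate template per concept, most probable first."""
    concepts = sorted(CONCEPTS[entity], key=lambda c: -c[1])
    return [question.replace(entity, "$" + c) for c, _ in concepts]

print(candidate_templates("How many people are there in Honolulu?",
                          "Honolulu"))
```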

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16)

Figure 1: System Architecture

We elaborate on the Template Extraction component. KBQA learns templates and their mappings to knowledge base predicates from Yahoo! Answers. First, for each QA pair in Yahoo! Answers, KBQA extracts the entity in the question and the corresponding value in the answer. Then, KBQA looks up the predicate between the entity and value in the knowledge base. The basic idea is that KBQA uses the QA pairs as training data. For each template, if most of its question instances in the QA corpus share the same predicate, KBQA maps the template to this predicate.
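The paper estimates this mapping with a maximum likelihood estimator; as a simplified, hypothetical sketch, the intuition that "most question instances share the same predicate" reduces to a majority vote per template:

```python
from collections import Counter, defaultdict

# Simplified sketch (majority vote, standing in for the ML estimator):
# for each template, count the KB predicates observed between the
# question's entity and the answer's value, and keep the most frequent.
def learn_mapping(observations):
    """observations: iterable of (template, predicate) pairs."""
    counts = defaultdict(Counter)
    for template, predicate in observations:
        counts[template][predicate] += 1
    return {t: c.most_common(1)[0][0] for t, c in counts.items()}

obs = [
    ("How many people are there in $city?", "population"),
    ("How many people are there in $city?", "population"),
    ("How many people are there in $city?", "area"),  # noisy QA pair
]
print(learn_mapping(obs))
```

The noisy pair is outvoted, which is how a large corpus tolerates wrong entity-value matches in individual QA pairs.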

3 Demonstration

As shown in Figure 2, KBQA provides a simple interface for users on the web page, which consists of three parts: (1) the QA part displays the main result of the question; (2) the feedback part provides a way for users to vote for the answer; (3) the explanation part displays how the answer is extracted and why KBQA can extract it.

Diverse Question Types. KBQA satisfies users' different requirements and supports different question types.

• Simple BFQ. Users can ask simple BFQs, i.e., questions about a property of an entity. Figure 2 shows a question asking about Shakespeare's birthday.

• Questions relying on expanded predicates. The "predicate expansion" module enables KBQA to understand expanded predicates. For example, Freebase represents the "spouse" relation by the expanded predicate "spouse_s!spouse!name".

• Complex questions. The "question parsing" module decomposes a complex question into several BFQs. This enables KBQA to understand and answer complex questions. Figure 3 shows a complex question.
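The last two question types can be illustrated together: an expanded predicate is a chain of simple predicates traversed through intermediate mediator nodes, and a complex question is a chain of BFQs where each answer feeds the next lookup. The KB fragment, mediator ids, and values below are toy data, not actual Freebase contents:

```python
# Toy KB fragment: (node, predicate) -> node or value. Mediator ids
# (m.123, m.456) and all values are made up for illustration.
KB = {
    ("Shakespeare", "spouse_s"): "m.123",   # mediator node
    ("m.123", "spouse"): "m.456",
    ("m.456", "name"): "Anne Hathaway",
    ("Anne Hathaway", "date_of_birth"): "1556",
}

def follow(node: str, expanded_predicate: str) -> str:
    """Follow an expanded predicate such as 'spouse_s!spouse!name'."""
    for p in expanded_predicate.split("!"):
        node = KB[(node, p)]
    return node

def answer_complex(entity: str, predicates: list) -> str:
    """Answer a complex question as a chain of BFQs, left to right."""
    for p in predicates:
        entity = follow(entity, p)
    return entity

print(follow("Shakespeare", "spouse_s!spouse!name"))       # Anne Hathaway
print(answer_complex("Shakespeare",
                     ["spouse_s!spouse!name", "date_of_birth"]))  # 1556
```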

User Feedback. In the feedback part, KBQA enables users to vote for the answer. User feedback is routed back as input and helps improve the system.

Figure 2: KBQA for simple BFQ

Figure 3: KBQA for complex questions

References

[1] E. Agichtein, S. Cucerzan, and E. Brill. Analysis of factoid questions for effective relation extraction. In SIGIR, pages 567–568, 2005.

[2] J. Burger, C. Cardie, V. Chaudhri, R. Gaizauskas, S. Harabagiu, D. Israel, C. Jacquemin, C.-Y. Lin, S. Maiorano, G. Miller, et al. Issues, tasks and program structures to roadmap research in question & answering (Q&A). In DUC Roadmapping Documents, 2001.

[3] J. R. Finkel, T. Grenager, and C. Manning. Incorporating non-local information into information extraction systems by Gibbs sampling. In ACL, pages 363–370, 2005.

[4] D. Gerber and A. Ngonga Ngomo. Bootstrapping the linked data web. In 1st Workshop on Web Scale Knowledge Extraction, ISWC, 2011.

[5] Y. Song, H. Wang, Z. Wang, H. Li, and W. Chen. Short text conceptualization using a probabilistic knowledgebase. In IJCAI, pages 2330–2336, 2011.

[6] W. Wu, H. Li, H. Wang, and K. Q. Zhu. Probase: A probabilistic taxonomy for text understanding. In SIGMOD, pages 481–492, 2012.
