What’s Missing: A Knowledge Gap Guided Approach for Multi...

What’s Missing: A Knowledge Gap Guided Approach for Multi-hop Question Answering Tushar Khot and Ashish Sabharwal and Peter Clark EMNLP 2019

Krunal Shah (shahkr@seas.upenn.edu)

20th April 2020

Example (from Open Book Question Answering)

Question: Which of these would let the most heat travel through?A) a new pair of jeans.B) a steel spoon in a cafeteria. C) a cotton candy at a store. D) a calvin klein cotton hat.

Core Fact: (annotated as part of dataset)

Metal lets heat travel through.

Knowledge Gap (similar gaps for other choices): steel spoon in a cafeteria _____ metal.

Motivation

• This work concentrates on multi-hop QA in partial knowledge setting• Most existing QA datasets assume availability of all required knowledge• However, partial knowledge setting is quite intuitive, natural and

challenging

Example:Which of these would let the most heat travel through?a) Osmium

Hints?

Knowledge Gaps

Broadly (inexhaustively) classified into three types:1. Question-to-fact Gap:

2. Fact-to-Answer Gap (this work)

3. Question-to-Answer (Fact) Gap

* image from the original paper

Brief Approach

1. Identify “key span” in core fact

Question: Which of these would let the most heat travel through?A) a new pair of jeans.B) a steel spoon in a cafeteria. C) a cotton candy at a store.

D) a calvin klein cotton hat.

Brief Approach

2. Retrieve some knowledge to help fill knowledge gap

Brief Approach

(steel spoon, isA, metal)

3. Identify relations between key span and answer choices

(using retrieved knowledge)

Open Book QA-Short

Narrow down questions as:1. Fact supports correct answer

- answer-fact gap2. Questions with small answer choices

- Long answer choices lead to noisy gaps

Knowledge Gap Dataset (KGD)

Starting with questions included in Open Book QA-Short:

Annotate:1. Key span in core fact that could answer the question

- because answer-fact gap2. One or more relations that satisfy knowledge gap

Data point: {question, fact, spans, relations}Only include questions with agreement greater than 2/3.

* image from the original paper

Proposed Model (GapQA)

Question: Which of these would let the most heat travel through?A) a new pair of jeans.B) a steel spoon in a cafeteria. C) a cotton candy at a store. D) a calvin klein cotton hat.

Core Fact:Metal lets heat travel through.

Repeat subsequent steps for all options

1. Run RC Model with (Question, Core Fact as context) to predict “key span”

Question: Which of these would let the most heat travel through?A) a new pair of jeans.

2. Knowledge Retrieval

Tuple Search:Subject matching sObject matching c

Text Search: Elastic Search using “s + c”

Convert everything to text.

3. Scores the option choice- Fact relevance score:- Relation prediction score

B weighted representation of A (attB(A))

Ai . Bjmax

softmaxA

x + x + x + x =

A’s token representations

Compare rep A and rep B (comp(A, B))

representation A

representation B

concatenation

Fact Relevance

Question and choice attended fact representation:Squestion, choice(fact) = avg(attquestion(fact), attchoice(fact))

Aggregate fact representation:Rfact = avg(fact representations)

scorefact(choice) = FF( comp(Rfact,Squestion, choice(fact)) )

Relation Prediction

Span attended knowledge sentence representation:Sspan (knowledge sent) = attspan(knowledge sent)

Choice attended knowledge sentence representation:Schoice (knowledge sent) = attchoice(knowledge sent)

Partial relation representation= FF( comp(Sspan (knowledge sent), Schoice (knowledge sent)) )

Relation Prediction Score

Relation RepresentationRrelation = avgknowledge sentences(Partial Relation representation)

Question Fact Composed RepresentationRfact,question = comp(max pooled question rep., max pooled fact rep.)

Relation Prediction Score= FF( [Rfact,question; Rrelation])

Training methodology

1. Use BiDAF trained on SQuAD + finetuned on Knowledge Gap Dataset to predict spans

2. Relation loss:- Project relation representation to multilabel relation classification- Binary cross-entropy loss

3. Train model on- Knowledge Gap Dataset- Open Book QA-Short using predicted spans and ignoring relation loss

Results and analysis

Impressive results:~6.5% improvement in partial knowledge setting~3% improvement on complete knowledge setting

* denotes results are statistically significant

Ablation study results

Personal thoughts

About the paper:+ Propose an excellent (realistic) task+ Thorough analysis of results- Only handle two hop questions

§ Jansen et al. show that even elementary science questions require 4 to 6 sentences to answer and explain on an average [3].

- Need a more general framework for dealing with gaps§ Handle more kinds of knowledge gap

• Results and limitations of the work suggest we are still quite far away from truly solving two hop questions

Personal thoughts

• The ability to answer the following is integral to reasoning“What more do I need to know (to achieve something)?”

• An observation that can possibly help towards a more general framework

• Previously explored idea of multi-hop QA as path finding! [4][5]

• Recognize where a node is missing

References

1. What's Missing: A Knowledge Gap Guided Approach for Multi-hop Question Answering. Tushar Khot, Ashish Sabharwal, Peter Clark. EMNLP 2019

2. QASC: A Dataset for Question Answering via Sentence Composition. Tushar Khot, Peter Clark, Michal Guerquin, Peter Jansen, Ashish Sabharwal. AAAI 2020

3. What's in an Explanation? Characterizing Knowledge and Inference Requirements for Elementary Science Exams. Jansen, Peter and Balasubramanian, Niranjan and Surdeanu, Mihai and Clark, Peter. COLING 2016.

4. Question Answering via Integer Programming over Semi-Structured Knowledge. Daniel Khashabi, Tushar Khot, Ashish Sabharwal, Peter Clark, Oren Etzioni, Dan Roth. IJCAI 2016.

5. Question Answering as Global Reasoning over Semantic Abstractions. Daniel Khashabi, Tushar Khot, Ashish Sabharwal, Dan Roth. AAAI 2018.

Analyzing results using different knowledge sources

* denotes results are statistically significant

Contributions

- Annotate a subset of Open Book QA for partial knowledge setting- Propose a novel approach of identify missing knowledge and filling it for

multi hop QA- Propose a model that learns to fill missing knowledge from external

knowledge and compose it with exisiting knowledge- State of the art results on QA with partial knowledge

Another related dataset

QASC: Question Answering via Sentence Composition [2]

* image from [2]

4. Question

1. Seed fact2. Fact from large corpus

3. Composed fact

Outline:

• Problem and Motivation• Brief overview and dataset• Proposed model (GapQA)• Experiments• Personal thoughts

What’s Missing: A Knowledge Gap Guided Approach for Multi...

Documents

Transcript of What’s Missing: A Knowledge Gap Guided Approach for Multi...

The Quality in Acute Stroke Care (QASC) Implementation Project€¦ · • Stroke unit eligibility: Category A or B acute stroke units in NSW, Australia • Patient eligibility: •

Spring20 Javaday

The Quality in Acute Stroke Care Project (QASC)

File Systems Overviewlass.cs.umass.edu › ~shenoy › courses › spring20 › lectures › Lec17.pdfFile Systems Overview Remember the high-level view of the OS as a translator from

The Quality in Acute Stroke Care (QASC) Implementation Project Local Stroke Champion Presentation.

Stewart Spring20 2020/Stewart_Spring20_BuyAgro.pdf · Title Stewart_Spring20.indd Created Date 5/6/2020 1:44:11 PM

CS165: Priority Queues, Heapscs165/.Spring20/slides/11-PrioQsHeaps.pdfHeap Operations -heapInsert n Step 1: put a new value into first open position (maintaining completeness), i.e.

AtHomeCapeCod Spring20 Digital

DRAFT - Department of Computer Science › ~ac › Teach › CS35-Spring20 › Notes … · 2/7/2020 · Sagar Kale, and Jelani Nelson for numerous comments and contributions. Of

Voskhod, Soyuz and Zond - users.umiacs.umd.eduusers.umiacs.umd.edu/~oard/teaching/154/spring20/slides/24/154f191… · 9. April 1967: First inflight fatality (Soyuz 1) 15. October

Chapter 11: Inheritance and Polymorphismcs165/.Spring20/slides/06... · 2020-02-06 · Liang, Introduction to Java Programming, Tenth Edition, (c) 2015 Pearson Education, Inc. All

6.003: Signal Processingsigproc.mit.edu/_static/spring20/lectures/lec04b.pdf6.003: Signal Processing Discrete-Time Fourier Transform • De nition • Examples • Properties • Relations

EE565:Mobile Robotics Lecture 1 - Lahore University of ...web.lums.edu.pk/~akn/Files/Other/teaching/mobile robotics/spring20… · EE565: Mobile Robotics Module 1: Mobile Robot Kinematics

[] UNDECIDABLE PROBLEMS [] ABOUT CONTEXT-FREE LANGUAGESabhij/course/theory/FLAT/Spring20/slides/... · W → ε|AbcV FLAT, Spring 2020 Abhijit Das. CFL Fullness is Undecidable Theorem

appendix a - Colorado State Universitycs270/.Spring20/resources/PattPatelAppA.pdfJSR FigureA.2 FormatoftheentireLC-3instructionset.Note:+ indicatesinstructionsthat modifyconditioncodes

Combining Natural Logic and Shallow Reasoning for Question ...cis700dr/Spring20/files/04-08-02.pdf · – Natural Logic : Icard and Moss semantics – NaturalLI framework • Improvements

QASC: A Dataset for Question Answering via Sentence ...

Application Layer - Tulane Universityzzheng3/teaching/cmps6750/spring20/app.pdfInternet apps: application, transport protocols 14 application e-mail remote terminal access Web file

test1 soln › marzban › 390 › spring20 › test1_soln.pdfTitle: test1_soln.jnt Author: Administrator

Map Projections & Coordinate Systemscourses.geo.utexas.edu/courses/371c/Lectures/Spring20/... · 2020. 1. 28. · Geo327G/386G: GIS & GPS Applications in Earth Sciences Jackson School