Natural Language Based Reformulation Resource and Web Exploitation for Question Answering Ulf...

Natural Language Based Reformulation Resource and Web Exploitation for Question

Answering

Ulf Hermjakob, Abdessamad Echihabi, Daniel MarcuUniversity of Southern California

Presented By:Soobia Afroz

Introduction

The degree of difficulty How closely a given corpus matches the question and NOT on the question itself

Q: When was the UN founded?

A: The UN was formed in January 1942.

A: The name "United Nations", coined by United States President Franklin D. Roosevelt, was first used in the "Declaration by United Nations" of 1 January 1942, during the Second World War, when representatives of 26 nations pledged their Governments to

continue fighting together against the Axis Powers.

Larger text => Good Answers => Validation in original text

Paraphrasing questions:

Create semantically equivalent paraphrases of the questions Match Answer/string with any of the paraphrases

• Question paraphrases + Retrieval engine Find documents containing correct answers

• Rank and select better answers• Automatically paraphrase questions by TextMap.

Example:

“How did Mahatma Gandhi die?”

“How deep is Crater Lake?”

“Who invented the cotton gin?”

Automatic Paraphrases of questions:

How the system works:

• Parse questions

• Identify the answer type of the question

• Reformulate the questionaverage reformulations: 3.14

• Match at parse-tree level

1. Syntactic reformulations

• Turn a question into declarative form, e.g.,

2. Inference Reformulations

3. Reformulation Chains

4. Generation

Information Retrieval and the Web

TREC (Text Retrieval Conference)

IR system for Webclopedia

Web based IR system

Query Reformulation module

Web Search engine

Sentence Ranking module

1. Query Reformulation module

Previous attempts:• Simple, exhaustive string-based manipulations• Transformation grammars• Learning algorithms

Current attempt:• Analyze how people naturally form queries to find answers• Randomly selected 50 TREC8 questions• Manually produced simplest queries that yield the most Web pages containing

answers• Analyzed the manually-produced queries and categorized them into seven ‘natural’

techniques that were used to form a natural language question• Derived algorithms that replicate each of the observed technique

Query Reformulation Techniques

2. Sentence Ranking module

• Produce a list of Boolean queries for each question using all the query reformulation techniques

• Retrieve the top ten results for each query using a web search engine• Retrieve the documents, strip HTML, segment the text into sentences• Each sentence is ranked according to 2 schemas:

Score w.r.t. queries terms:-- Each word in query assigned a weight-- Each quoted term in the query has a weight equal to the sum of the weights of its

words-- Each sentence has a weight equal to the weighted overlap with queries terms

Score w.r.t. answers:-- Tag sentences using BBN’s IdentiFinder (a hidden Markov model that learns to recognize and classify names,

dates, times, and numerical quantities.)-- Score sentences according to the overlap with answer type, checked against the

answer type and the semantic entities found by IdentiFinder

Evaluation of the results:

Reformulations led to more correct answers when used in conjunction with a large corpus like the Web.

Conclusion

Likelihood of finding correct answers is increased by QR

IR module produces higher quality answer candidates

Scoring precision is increased for answer candidates

A strong match with a reformulation provides additional confidence in the correctness of the answer

Natural Language Based Reformulation Resource and Web Exploitation for Question Answering Ulf...

Documents

Transcript of Natural Language Based Reformulation Resource and Web Exploitation for Question Answering Ulf...

KPMG Inc. · 4 promos: 330.53 a and g inc. b9227: 17.99 a.t. cross: 964.49 abankwa, thelma abbadi, dalia abbas-zadeh-halabi, ashkan: abburi, srilekha abdelsaied, hany: abdessamad

Discovering and Linking Public ‘Omics’ Datasets 2 HENNING...OmicsDI – Discovering and Linking Public ‘Omics’ Datasets Henning Hermjakob European Bioinformatics Institute.

Département: Sciences économiques Liste des candidas ...candidaturemaster-fsjes.uca.ma/te/FB_TE_2019.pdf · 394 CHOUH ILIAS 403 Mesaaf Lobna 405 AIT OUMGHAR ABDESSAMAD 448 Fohmi

HIF 2014 Paris - FAVORISER L'INNOVATION AVEC LE CLOUD & LE BIG DATA Hicham Abdessamad, Executive Vice-President Global Services, Hitachi Data Systems

Proceedings of BioCreative III Workshop · Anna Divoli, University of Chicago, USA . Henning Hermjakob, EBI, UK . Eivind Hovig, Oslo University Hospital, Norwey . Lars Juhl Jensen,

Abstract Meaning Representation (AMR) 1.2 Speciﬁcationulf/amr/help/amr-guidelines.pdfLaura Banarescu, Claire Bonial, Shu Cai, Madalina Georgescu, Kira Grifﬁtt, Ulf Hermjakob, Kevin

Turning NMT research into commercial products · • Team members have published +100 on SMT and related tech – Bill Byrne, Abdessamad Echihabi, Dragos Munteanu ... • Summer internships,

Daniel Novotný, Abdessamad Belhaj, Marek ... - anatem.info · 2 Research Paper 3/2011 The Changing Security Situation in the Maghreb – April 2011 Executive Summary The Maghreb

TextMap: An Intelligent Question- Answering Assistant Project Members:Abdessamad Echihabi Ulf Hermjakob Eduard Hovy Kevin Knight Daniel Marcu Deepak Ravichandran.

fpn.ump.mafpn.ump.ma/uploads/files/1/Masters 2019/5d94ff8d7b2ef.pdf8/ EL HADDAD IKRAM 9/ HAMMADI Hakima 10/ HAYANI Abdessamad 11/El MARRAKI Fadwa 12/ EL Nawal 13/ AANAN Dina 14/ ASSOUFI

StudentID Term Level LastName FirstName Grade … · 155 628 june 2018 beg1 boukrim t.ezzahra 13 fail 155 641 june 2018 beg1 abidar fatima 57 fail 155 680 june 2018 beg1 barhoun abdessamad

Chapter - Information Sciences Institutemarcu/papers/textmap7-final-2005.doc · Web viewChapter # How to Select an Answer String? Abdessamad Echihabi, Ulf Hermjakob, Eduard Hovy,

Introducing the PRIDE Archive RESTful web services...Introducing the PRIDE Archive RESTful web services Florian Reisinger, Noemi del-Toro, Tobias Ternent, Henning Hermjakob and Juan

fpl.mafpl.ma/images/fpl-2012-2013/mht2.pdf · Ouafae Yassir Hind Ayoud HIND Yassine Khaoula Safae SALKA Jihad Jamal Abdessamad IMANE FATIMAEZZAHRA Rida AYMANE Ooussam Zakia WAFA HANAE

DAS Advance Search and its prototype implementation in MyDas Gustavo Adolfo Salazar Orejuela Supervised by: Nicola Mulder Henning Hermjakob DAS workshop.

doc amazon (Zure +Abdessamad+ Óscar + Carlos)

fpn.ump.mafpn.ump.ma/ftp/etudiants/candidatures/SKMBT_28316103120150.pdf · chait mimoune boujdadi lamia el mansouri abdelmoghit khoubi nour el houda maafi mustapha bouziani abdessamad

Marcu & Echihabi (2002). - Alice - Artificial Intelligence ...spenader/public_docs/ESSLLI_Empirical_Approaches_t… · – the English Gigaword Corpus ... When LM is trained and tested

BAOJ Urology & Nephrology - Bio AccentBAOJ Urol Nephrol, an open access journal Volume 1; Issue 1; 004 Arnaud Tayiri 1* , Venceslas Amboulou Ibarra 2 , Abdessamad El Bahri 3 , Abdelatif

EL MADIDI saï , EL BERKAOUI Abdessamad , BEN ELMAALEM … phenotypic... · Min Mean Max SD CV P (%) 2008 22 73 126 21.78 35.64 2009 15 78 230 37.17 47.71 I. Analysis of phenotypic