Overview of the Multilingual Question Answering Track

Alicante, September, 22, 20006QA@CLEF 2006 Workshop

Overview of the Multilingual Question Answering

Danilo Giampiccolo

Outline

Tasks Test set preparation Participants Evaluation Results Final considerations Future perspectives

QA 2006: Organizing Committee

ITC-irst (Bernardo Magnini): main coordinator CELCT (D. Giampiccolo, P. Forner): general coordination, Italian DFKI (B. Sacalenau): German ELDA/ELRA (C. Ayache): French Linguateca (P. Rocha): Portuguese UNED (A. Penas): Spanish U. Amsterdam (Valentin Jijkoun): Dutch U. Limerick (R. Sutcliff): English Bulgarian Academy of Sciences (P. Osenova): Bulgarian

♦ Only Source Languages:♦ Depok University of Indonesia (M. Adriani): Indonesian♦ IASI, Romania (D. Cristea): Romanian♦ Wrocław University of Technology (J. Pietraszko): Polish

QA@CLEF-06: Tasks

Main task:♦ Monolingual: the language of the question (Source language) and the

language of the news collection (Target language) are the same

♦ Cross-lingual: the questions were formulated in a language different from that of the news collection

One pilot task:♦ WiQA: coordinated by Maarten de Rijke

Two exercises: Answer Validation Exercise (AVE): coordinated by Anselmo Peñas Real Time: a “time-constrained” QA exercise coordinated by the

University of Alicante (coordinated by Fernando Llopis)

Data set: Question format

200 Questions of three kinds FACTOID (loc, mea, org, oth, per, tim; ca. 150): What party did Hitler belong to? DEFINITION (ca. 40): Who is Josef Paul Kleihues?

♦ reduced in number (-25%)♦ two new categories added:

– Object: What is a router?

– Other: What is a tsunami?

LIST (ca. 10): Name works by Tolstoy

♦ Temporally restricted (ca. 40): by date, by period, by event♦ NIL (ca. 20): questions that do not have any known answer in the target

document collection

input format: question type (F, D, L) not indicated

Multiple answers: from one to ten exact answers per question

♦ exact = neither more nor less than the information required

♦ each answer has to be supported by– docid

– one to ten text snippets justifying the answer (substrings of the specified document giving the actual context)

Data set: run format

Activated Tasks (at least one registered participant)

BG DE EN ES FR IN IT NL PT PL RO

11 Source languages (10 in 2005) 8 Target languages (9 in 2005) No Finnish task / New languages: Polish and Romanian

Activated Tasks

MONOLINGUAL CROSS-LINGUAL TOTAL

CLEF 2003 3 5 8

CLEF 2004 6 13 19

CLEF 2005 8 15 23

CLEF 2006 7 17 24

questions were not translated in all the languages Gold Standard: questions in multiple languages only for tasks were there was at least one registered participant

More interest in cross-linguality

Participants

America Europe Asia TOTALRegisteredparticipants

New comers Veterans

Absentveterans

CLEF 2003 3 5 - 8

CLEF 2004 1 17 -18

(+125%)22 13 5 3

CLEF 2005 1 22 124

(+33%)27 9 15 4

CLEF 2006 4 24 230

(+25%)36 10 20 4

List of participants

ACRONYM NAME COUNTRY

SYNAPSE SYNAPSE Developpement France

Ling-Comp U.Rome-La Sapienza Italy

Alicante U.Alicante- Informatica Spain

Hagen U.Hagen-Informatics Germany

Daedalus Daedalus Consortium Spain

Jaen U.Jaen-Intell.Systems Spain

ISLA U.Amsterdam Netherlands

INAOE Inst.Astrophysics,Optics&Electronics Mexico

DEPOK U.Indonesia-Comp.Sci. Indonesia

DFKI DFKI-Lang.Tech. Germany

FURUI Lab. Tokyo Inst Technology Japan

Linguateca Linguateca-Sintef Norway

LIC2M-CEA Centre CEA Saclay France

LINA U.Nantes-LINA France

Priberam Priberam Informatica Portugal

U.Porto U.Porto- AI Portugal

U.Groningen U.Groningen-Letters Netherlands

ACRONYM NAME COUNTRY

Lab.Inf.D‘Avignon

Lab.Inf. D'Avignon France

U.Sao Paulo U.Sao Paulo – Math Brazil

Vanguard Vanguard Engineering Mexico

LCC Language Comp. Corp. USA

UAIC U.AI.I Cuza" Iasi Romania

Wroclaw U. Wroclaw U.of Tech Poland

RFIA-UPV Univ.Politècnica de Valencia Spain

LIMSI CNRS Lab-Orsay Cedex France

U.Stuttgart U.Stuttgart-NLP Germany

ITC ITC-irst, Italy

JRC-ISPRA

Institute for the Protection and the Security of the Citizen

BTB BulTreeBank Project Sofia

dltg University of Limerick Ireland

Industrial Companies

Submitted runs

#Monolingual

#Cross-lingual

CLEF 2003 17 6 11

CLEF 2004 48 (+182%) 20 28

CLEF 2005 67 (+39.5%) 43 24

CLEF 2006 77 (+13%) 42 35

Number of answers and snippets per question

Number of RUNS with respect to number of answers

1 answer

Overview of the Multilingual Question Answering Track

Documents

Transcript of Overview of the Multilingual Question Answering Track

Multilingual Glossary

Web based Multilingual Question Answering - dfki.de

A multilingual framework for transforming online services to truly multilingual

1 Insights into Multilingual and Multimedia Question Answering G. Ciany *, A. Kulman +, P. Schone +, C. Van Ess-Dykema + * Dragon Development, + U.S. Dept.

Towards End-to-End Multilingual Question Answeringneumann/publications/new-ps/... · tion retrieval and question answering. Still, few systems e ciently integrate and present knowledge

Track B: Sales, Marketing & Business Development Workshop B-2: Customer Service: It’s About More Than Just Answering the Phone.

Co-funded by the European Union The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering E. Cabrio, M. Kouylekov,

QALL-ME: Question Answering Learning technologies … Question Answering Learning technologies in a multiLingual and multiModal Envinroment ⁄ Rub en Izquierdo, Oscar Ferr andez,

Multilingual brain

AMUSE: Multilingual Semantic Parsing for Question ...€¦ · AMUSE: Multilingual Semantic Parsing for Question Answering over Linked Data Sherzod Hakimov, Souﬁan Jebbara, ... as

Multilingual speech communities Language Choice in Multilingual Communities.

1 CLEF 2011, Amsterdam QA4MRE, Question Answering for Machine Reading Evaluation Question Answering Track Overview Main Task Anselmo Peñas Eduard Hovy.

MultiLingual Technologies Inc. (MLT) - Multilingual Translation & Localization Service Provider

CLEF 2008 Multilingual Question Answering Track

Improving multilingual catalog search services by means of multilingual … · 2020-04-20 · IMPROVING MULTILINGUAL CATALOG SEARCH SERVICES BY MEANS OF MULTILINGUAL THESAURUS DISAMBIGUATION

OVERVIEW OF THE CLEF 2008 MULTILINGUAL QUESTION …clef.isti.cnr.it/2008/working_notes/CLEF08Working_Notes_QA_Overview.pdfOVERVIEW OF THE CLEF 2008 MULTILINGUAL QUESTION ANSWERING

TRANSLATION - MultiLingual Computing, Inc. - Multilingual Magazine

A model of multilingual digital library - SciELO · multilingual digital library is defined as: Definition 01: Multilingual digital library A multilingual digital library is a digital

Overview of the INFILE track at CLEF 2009 multilingual INformation FILtering Evaluation

CLEF 2005: Multilingual Retrieval by Combining Multiple Multilingual Ranked Lists

1 Insights into Multilingual and Multimedia Question Answering G. Ciany , A. Kulman +, P. Schone +, C. Van Ess-Dykema + Dragon Development, + U.S. Dept.