The OPAL System at NTCIR 8 MOAT

Balahur, A., Boldrini, E., Montoyo, A., Mar3nez-‐Barco, P.

NTICIR 8 MOAT, 18 June, 2009 Tokyo

Outline

•  Mo:va:on

•  Contribu:on •  Related Work

•  English Monolongual Subtask •  English-‐Chinese Cross-‐Lingual Subtask •  Evalua:on •  Discussion

Mo:va:on Challenges

Growth in the access to technology + Development of the Social Web

Exponen:al increase of the subjec:ve data on the Web: factoid & opinionated info

Big impact on lives/business

NEED FOR AUTHOMATIC PROCESSING METHODS FOR SUBJECTIVE TEXT : Sen:ment Analysis (SA)

Mo:va:on

•  Factod Ques:on Answering (QA): long tradi:on

•  Opinionated QA: remains a challenge

•  Need for OQA

•  Need for specialised tools and methods

Contribu:on

•  Deep study of the performance of new sen:ment-‐topic detec:on methods

•  Introduc:on of specialised tools –  temporal expressions and –  anaphora resolu:on and analysis of their effect

•  New retrieval techniques (using Wikipedia with LSA), improving the performance.

Related Work •  Cardie et al., 2003: separated opinions from facts and summarized them as answer to opinion

ques:ons. •  Kim and Hovy, 2005: iden:fied opinion holders, which are frequently asked in opinion

ques:ons.

•  TAC 2008 Opinion QA track: a collec:on of factoid and opinion queries -‐“rigid list” (factoid) and “squishy list” (opinion) -‐ to which the tradi:onal systems had to be adapted.

•  The Alyssa system (Shen et al. 2007): a SVM classifier trained on the MPQA corpus, English NTCIR data and rules based on the subjec:vity lexicon

•  Varma et al., 2008: query analysis to detect the polarity of the ques:on using defined rules.

•  The QUANTA system (Li et al, 2008): OQA by detec:ng the opinion holder, the object and the polarity of the opinion. It uses a seman:c labeler based on PropBank and manually defined paierns. Sen:ment classifica:on: extract and classify the opinion words. Answer retrieval: score the retrieved snippets depending on the presence of topic and opinion words and only choose as answer the top ranking results

English Monolingual Subtask

•  20 topics •  For each topic:

–  a ques:on & short query were given –  Expected Polarity & Period of :me

•  Each topic: –  set of documents split into sentences –  opinion units

•  for polarity, •  opinion target •  source task

•  We submiied 3 runs of the OpAL system for: opinionated, relevance and polarity judgement tasks

English Monolingual subtask Judging sentence opinionatedness

•  The system has to assign YES –opinionated-‐/NO-‐factoid-‐ to each sentence

•  We employed

–  system run 1

–  system runs 2 & 3 •  Rule based but with different resources

•  Opinionated sentence: at least 2 opinion words or one opinion word preceeded by a modifier

English Monolingual subtask Judging sentence opinionatedness

•  1st approach: opinion word from:

–  General Inquirer, –  Micro WordNet Opinion and

–  Opinion Finder •  2nd approach: opinion word from:

–  General Inquirer –  Micro WordNet Opinion

English Monolingual subtask Determining sentence relevance

•  The system had to output for each sentence an assessmant of relevance

•  3 strategies:

•  1st JIRS (Java Informa:on Retrieval System) to find relevant snippets JIRS retrieves passages based on the ques:on structures (not keywords)

•  2nd Faceted Search Wikipedia and LSA, to find words related to the topic –find concepts-‐

•  We match the query words to a category in Wikipedia and two consecu:ve words, 3, 4 …the concepts found are the topic components.

•  For each topic component we employ LSA to the first 20 documents (retrieved by Yahoo) to determine the most related words (LSA we employ Infomap)

•  We expand the query using words similar to the topic

•  3rd we judge a part for the topic relevance, the temporal appropiateness

•  We employ TERSEO •  We filter the sentences obtained

Polarity and topic-‐polarity classifica:on for judging sentence answerness

•  Required the system to assign a value of POS, NEG or NEU to each of the sentences in the documents provided.

•  Polarity of the sentences: we passed each sentence through an opinion mining system employing SVM ML over the NTCIR 7 MOAT corpus, the MPQA corpus and Emo5Blog.

•  Each sentence is preprocessed using Minipar

•  System training: –  the part of speech (POS) –  opinionatedness/intensity -‐ if the word is annotated as opinion word, its polarity, i.e. 1 and -‐1 –  syntac:c relatedness with other opinion word – if it is directly dependent of an opinion word or

modifier (0 or 1), plus the polarity/intensity and emo:on of this word (0 for all the components otherwise).

Polarity and topic-‐polarity classifica:on for judging sentence answerness

•  The difference between the submiied runs consisted in the lexicons used to determine whether a word was opinionated or not.

•  1st run: General Inquirer, MicroWordNet and the Opinion Finder opinion resources.,

•  2nd: aside from these three sources, the “emo:on trigger” resource (Balahur and Montoyo, 2008).

English-‐Chinese cross-‐lingual subtask

•  Systems have to output, for each of the twenty topics and their corresponding ques:ons (in a language), the list of sentences containing answers (in another language).

•  The topics and ques:ons are given in English;

•  the output of the system contains the sentences in set of documents in Tradi:onal Chinese which contain an answer to the given topics.

•  3 runs of OpAL

•  To tokenize the Chinese texts using LingPipe

•  We applied a technique known as “triangula:on” to obtain opinion and subjec:vity resources for Chinese to obtain resources for different languages, star:ng from correct parallel resources in 2 ini:al languages.

•  This technique requires the existence of two correct parallel resources in two different languages to obtain correct resources for a third language.

•  We have previously translated and cleaned the General Inquirer, MicroWordNet and Opinion Finder lexicons for Spanish.

•  The “emo:on triggers” resource is available for English and Spanish-‐-‐ translated into Chinese (Google translator)

•  We performed the intersec:on of the obtained transla:ons –the corresponding words translated in the same manner (from English as well as from Spanish).

•  We removed words that we translated differently from English and Spanish.

•  We mapped each of these resources to four classes, depending on the score they are assigned in the original resource – of “high posi:ve”, “posi:ve”, “high nega:ve” and “nega:ve” and we give each word a corresponding value (4, 1, -‐4 and 1), respec:vely.

•  We translated the topic words determined in English using LSA.

•  For each of the sentence, we compute a score, given by the sum of the values of the opinion words that are matched in it.

•  Condi:ons to be considered an answer: –  to contain at least one topic word

–  the polarity determined corresponds to the required polarity, as given in the topic descrip:on.

•  1st: General Inquirer and MicroWordNet •  2nd :we added the “emo:on trigger resource”

•  3rd : only the Opinion Finder lexicon.

Evalua:on & Discussion

System RunID P R F

OPAL1 17.99 45.16 25.73

OPAL2 19.44 44 26.97

OPAL3 19.44 44 26.97

Results of system runs for relevance

System RunID P R F

OPAL1 82.05 47.83 60.43

OPAL2 82.61 5.16 9.71

OPAL3 76.32 3.94 7.49

Results of system runs for opinionated

System RunID P R F

OPAL1 38.13 12.82 19.19

OPAL2 50.93 12.26 19.76

Results of system runs for the cross-lingual task – agreed measures, Traditional Chinese

System RunID P R F

OPAL1 3.54 56.23 6.34

OPAL2 3.35 42.75 5.78

OPAL3 3.42 72.13 6.32

Results of system runs for polarity

System RunID P R F

OPAL1 14.62 60.47 21.36

OPAL2 14.64 49.73 19.57

OPAL3 15.02 77.68 23.55

Results of system runs for the cross-lingual task – non-agreed measures, Traditional Chinese

•  Extensive filtering for topic and the temporal restric:ons increases the system precision, but drama:c drop in the recall.

•  Simpler methods in the cross-‐lingual task yielded beier results, the OpAL cross-‐lingual run 3 obtaining the highest F score for the non-‐agreed measures and ranking second according to the agreed measures.

•  The LSA-‐based method to determine topic-‐related words is not enough to perform this task.

•  The terms obtained by employing this method are correct and useful –  they should be expanded using language models, to beier account for the language

variability.

•  Systems performing finer tasks, (temporal expression, anaphora resolu:on), are not mature enough and led to lower performances of the system and drama:c loss in recall.

• 

Thank you for your aien:on

The OPAL System at NTCIR 8 MOAT - RUA

Transcript of The OPAL System at NTCIR 8 MOAT - RUA

The OPAL System at NTCIR 8 MOAT - RUA

Documents

Transcript of The OPAL System at NTCIR 8 MOAT - RUA

PDF created with pdfFactory trial version · 3mm 4.5mm 6mm 8mm 10mm 12mm 15mm 20mm 25mm 30mm clear/opal clear/opal clear/opal clear clear/opal clear/opal clear/opal clear/opal clear/opal

Moat Homes Ltd - housingcare.org · Moat Homes Ltd Enquiries to: Moat Homes Ltd. Moat Homes Ltd housing and care homes. Ordered by: County. 72 results. View results online. East Sussex

Overview of the NTCIR-9 INTENT Taskresearch.nii.ac.jp/ntcir/workshop/OnlineProceedings9/...Overview of the NTCIR-9 INTENT Task Ruihua Song† Min Zhang‡ Tetsuya Sakai† Makoto P.

HITS’ Graph-based System at the NTCIR-9 Cross …research.nii.ac.jp/.../NTCIR/04-NTCIR9-CROSSLINK-FahrniA.pdfHITS’ Graph-based System at the NTCIR-9 Cross-lingual Link Discovery

Overview of the NTCIR-13: MedWeb Taskresearch.nii.ac.jp/ntcir/workshop/OnlineProceedings13/pdf/ntcir/01... · Medical natural language processing, Twitter, Social media, Shared task,

Overview of NTCIR-9 1CLICKresearch.nii.ac.jp/ntcir/workshop/OnlineProceedings9/NTCIR/01-NTCI… · The PMO is used for normalising our evaluation metric. URL is a “supporting document”

Oracle Buys Moat · –Moat will add attention analytics to Oracle Data loud’s suite of targeting and measurement solutions •About Moat –Moat is the fastest-growing digital

Decision Tree Method for the NTCIR-13 ECA Taskresearch.nii.ac.jp/ntcir/workshop/OnlineProceedings13/pdf/ntcir/03-NTCIR13-ECA-LiX.pdfDecision Tree Method for the NTCIR-13 ECA Task Xiangju

NTCIR-15 Conference

Information Extraction based Approach for the NTCIR-9 ...research.nii.ac.jp/ntcir/workshop/Online... · Information Extraction based Approach for the NTCIR-9 1CLICK Task ABSTRACT

Overview of the NTCIR-7 ACLIA Tasks: Advanced Cross ...research.nii.ac.jp/ntcir/workshop/OnlineProceedings7/pdf/...Proceedings of NTCIR-7 Workshop Meeting, December 16 19, 2008, Tokyo,

Microsoft Research Asia at the NTCIR-9 Intent Taskresearch.nii.ac.jp/ntcir/workshop/OnlineProceedings9/NTCIR/04... · Table2: 15 POS features usedinthe SVM training Numberof NT VV

PKUTM Experiments in NTCIR 8 MOAT Taskresearch.nii.ac.jp/.../NTCIR/03-NTCIR8-MOAT-WangC_slides.pdfPKUTM Experiments in NTCIR‐8 MOAT Task Author: *Chenfeng Wang*, Tengfei Ma , Liqiang

Overview of the NTCIR-12 MobileClick-2 Taskresearch.nii.ac.jp/ntcir/workshop/OnlineProceedings12/pdf/ntcir/... · Overview of the NTCIR-12 MobileClick-2 Task Makoto P. Kato Kyoto

Overview of the Seventh NTCIR Workshopresearch.nii.ac.jp/ntcir/workshop/OnlineProceedings7/pdf/... · 2010-06-29 · proceedings of the seventh NTCIR Workshop. Keywords: evaluation,

Companies With Moat

NTCIR-12 MathIR Task Overviewresearch.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...NTCIR-12 MathIR Task Overview Richard Zanibbi Rochester Institute of Technology rlaz@cs.rit.edu

Proceedings of the Third NTCIR Workshopresearch.nii.ac.jp/ntcir/workshop/Online... · Proceedings of the Third NTCIR Workshop 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0,0 0,1

Overview of the Second NTCIR Workshopresearch.nii.ac.jp/ntcir/workshop/OnlineProceedings2/ovview-kando.pdfOverview of the Second NTCIR Workshop National Institute of Informatics (NII),

THIR2 at the NTCIR-13 Lifelog-2 Task: Bridging Technology ...research.nii.ac.jp/ntcir/workshop/OnlineProceedings13/pdf/ntcir/03... · Bridging Technology and Psychology through the

PKUTM Experiments in NTCIR 8 MOAT Taskresearch.nii.ac.jp/.../NTCIR/03-NTCIR8-MOAT-WangC_slides.pdfPKUTM Experiments in NTCIR‐8 MOAT Task Author: Chenfeng Wang, Tengfei Ma , Liqiang