
SEMINAR: RECENT ADVANCES IN PARSING TECHNOLOGY

Parser Evaluation Approaches

NATURE OF PARSER EVALUATION

• Return the accurate syntactic structure of a sentence. But in which representation?
• Robustness of parsing.
• Speed.
• Applicability across frameworks.
• Evaluation based on different sources: e.g., evaluation is too forgiving when training and test data come from the same source.

PARSER EVALUATION

Intrinsic Evaluation:
• Test parser accuracy independently, as a "stand-alone" system.
• Test parser output against Treebank annotations.
• BUT: high accuracy in intrinsic evaluation does not guarantee domain portability.

Extrinsic Evaluation:
• Test the accuracy of the parser by evaluating its impact on a specific NLP task (Mollá & Hutchinson 2003).
• Accuracy across frameworks and tasks.

PARSER EVALUATION

Intrinsic Evaluation:
• Penn Treebank training & parser testing.
• PARSEVAL metrics: phrase-structure bracketings, labeled precision (LP) and labeled recall (LR) (sketched in code below).
• LAS/UAS for dependency parsing.

Extrinsic Evaluation:
• NLU / human-computer interaction systems.
• IE systems (PETE).
• PPI.
• And more . . .
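To make the intrinsic metrics concrete, here is a minimal sketch of PARSEVAL bracketing precision/recall and LAS/UAS. This is illustrative code, not from the slides or the papers; the bracket and head encodings are assumptions made for the example.

    # Minimal sketch of intrinsic parser metrics (illustrative only).
    # A bracket is a (label, start, end) tuple; dependencies are given as
    # one (head_index, dep_label) pair per token, gold and predicted aligned.

    from collections import Counter

    def parseval(gold_brackets, pred_brackets):
        """Labeled bracketing precision/recall/F1, counted as multisets."""
        gold, pred = Counter(gold_brackets), Counter(pred_brackets)
        matched = sum((gold & pred).values())
        p = matched / sum(pred.values()) if pred else 0.0
        r = matched / sum(gold.values()) if gold else 0.0
        f = 2 * p * r / (p + r) if p + r else 0.0
        return p, r, f

    def attachment_scores(gold, pred):
        """UAS counts correct heads; LAS also requires the correct label."""
        uas = sum(g[0] == q[0] for g, q in zip(gold, pred)) / len(gold)
        las = sum(g == q for g, q in zip(gold, pred)) / len(gold)
        return uas, las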

TASK-ORIENTED EVALUATION OF SYNTACTIC PARSERS & REPRESENTATIONS

Miyao, Sætre, Sagae, Matsuzaki, Tsujii (2008), Proceedings of ACL

PARSER EVALUATION ACROSS FRAMEWORKS

Parsing accuracy cannot be evaluated on an equal footing across systems, due to:
• Multiple parsers.
• Multiple grammatical frameworks.
• Different output representations: phrase-structure trees, dependency graphs, predicate-argument relations.
• Training and testing on the same source, e.g. the WSJ.

(Diagram: how should dependency parsing and PS parsing outputs be evaluated against each other?)

TASK-ORIENTED APPROACH TO PARSING EVALUATION: GOAL

• Evaluate different syntactic parsers and their representations using a different method.
• Measure accuracy through an NLP task: PPI (protein-protein interaction) extraction.

(Pipeline: the eight parsers — MST, KSDEP, NO-RERANK, RERANK, BERKELEY, STANFORD, ENJU, ENJU-GENIA — produce outputs; the outputs are converted between representations and encoded as statistical features for an ML classifier in the PPI extraction task. A toy sketch of this pipeline follows.)
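As a hedged sketch of this pipeline (the feature design and the learner below are simplifications invented for illustration; the paper's actual features and models are richer), a parser's dependency output can be reduced to path features between the two protein mentions and fed to an off-the-shelf classifier:

    # Illustrative PPI pipeline sketch: dependency-path features -> classifier.
    # The features and the learner are assumptions made for this example,
    # not the exact setup used in the paper.

    from sklearn.feature_extraction import DictVectorizer
    from sklearn.linear_model import LogisticRegression
    from sklearn.pipeline import make_pipeline

    def path_features(dep_path, lemmas):
        """dep_path: dependency labels on the shortest path between the two
        protein mentions; lemmas: words on that path (both parser-derived)."""
        feats = {"path=" + "/".join(dep_path): 1}
        for w in lemmas:
            feats["lemma=" + w] = 1
        return feats

    # Toy training pairs: does the sentence assert an interaction?
    X = [path_features(["nsubj", "dobj"], ["bind"]),
         path_features(["nmod", "conj"], ["express"])]
    y = [1, 0]  # 1 = interacting pair, 0 = no interaction

    clf = make_pipeline(DictVectorizer(), LogisticRegression())
    clf.fit(X, y)
    print(clf.predict([path_features(["nsubj", "dobj"], ["bind"])]))  # -> [1]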

WHAT IS PPI? I

• Automatically detecting interactions between proteins.
• Extracting relevant information from biomedical papers.
• Developed as an IE task.

Multiple techniques are employed for PPI; the task is a typical setting for testing the effectiveness of dependency parsing.

WHAT IS PPI? II

Example extracted interaction pairs:
(A) <IL-8, CXCR1>
(B) <RBP, TTR>

PARSERS & THEIR FRAMEWORKS

Dependency Parsing:
• MST: projective dependency parsing.
• KSDEP: probabilistic shift-reduce parsing.

Phrase-Structure Parsing:
• NO-RERANK: Charniak's (2000) lexicalized PCFG parser.
• RERANK: receives candidate parses from NO-RERANK and selects the most likely result.
• BERKELEY: unlexicalized PCFG parser with latent annotations.
• STANFORD: unlexicalized parser.

Deep Parsing:
• Predicate-argument structures reflecting syntactic/semantic relations among words, encoding deeper relations.
• ENJU: HPSG parser, with a grammar extracted from the Penn Treebank.
• ENJU-GENIA: ENJU adapted to biomedical texts (GENIA).

CONVERSION SCHEMES

Convert each default parse output into the other possible representations:
• CoNLL: dependency tree format; easy constituent-to-dependency conversion.
• PTB: phrase-structure tree output.
• HD: dependency trees with syntactic heads.
• SD: Stanford Dependency format.
• PAS: default output of ENJU & ENJU-GENIA.

CONVERSION SCHEMES

• 4 representations for the PSR parsers.
• 5 representations for the deep parsers.
(A CoNLL-style format sketch follows.)
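For concreteness, here is a hedged sketch of what the CoNLL-style exchange format looks like; the column set is simplified for the example, and the real CoNLL-X format has more fields.

    # Sketch: emit a dependency tree in simplified CoNLL-style columns
    # (ID, FORM, POS, HEAD, DEPREL). The real CoNLL-X format has more columns.

    def to_conll(tokens):
        """tokens: list of (form, pos, head_id, deprel); head_id 0 = root."""
        rows = []
        for i, (form, pos, head, rel) in enumerate(tokens, start=1):
            rows.append(f"{i}\t{form}\t{pos}\t{head}\t{rel}")
        return "\n".join(rows)

    print(to_conll([("IBM", "NNP", 2, "SBJ"),
                    ("bought", "VBD", 0, "ROOT"),
                    ("the", "DT", 4, "NMOD"),
                    ("company", "NN", 2, "OBJ")]))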

DOMAIN PORTABILITY

All versions of the parsers are run twice:
• WSJ (39,832 sentences): the original training source.
• GENIA (8,127 sentences): a Penn Treebank-style corpus of biomedical texts.

The parsers are retrained with GENIA to illustrate domain portability and the accuracy improvements from domain adaptation.

EXPERIMENTS

AImed corpus: 225 biomedical paper abstracts.

EVALUATION RESULTS

WSJ-trained parsers achieve roughly the same level of accuracy.

EVALUATION RESULTS

• Dependency parsers are the fastest of all.
• Deep parsers are in between in speed.

DISCUSSION

FORMALISM-INDEPENDENT PARSER EVALUATION WITH CCG & DEPBANK

Clark & Curran (2007), Proceedings of ACL

DEPBANK

• Dependency bank, consisting of predicate-argument relations.
• Annotated to cover a wide selection of grammatical features.
• Produced semi-automatically as a by-product of the XLE system.

Briscoe & Carroll's (2006) reannotated DepBank:
• Reannotation with simpler GRs (grammatical relations).
• The original DepBank sentences are kept the same.

GOAL OF THE PAPER

• Perform an evaluation of the CCG parser outside of CCGbank:
• Evaluation on DepBank.
• Conversion of CCG dependencies to DepBank GRs.
• Measuring the difficulty and effectiveness of the conversion.
• Comparison of the CCG parser against the RASP parser.

CCG PARSER

Predicate-argument dependencies in terms of CCG lexical categories.

"IBM bought the company"
<bought, (S\NP1)/NP2, 2, company, ->
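The five fields of this tuple are the head word, its lexical category with indexed argument slots, the slot being filled, the filler word, and a long-range-dependency flag. A minimal data-structure sketch follows; the field names are my own, not the parser's.

    # Sketch of the 5-field CCG predicate-argument dependency
    # (field names assumed for illustration).

    from dataclasses import dataclass

    @dataclass(frozen=True)
    class CCGDep:
        head: str         # word carrying the lexical category, e.g. "bought"
        category: str     # CCG category with indexed slots, e.g. (S\NP1)/NP2
        slot: int         # which argument slot this dependency fills
        arg: str          # word filling the slot, e.g. "company"
        long_range: bool  # "-" on the slide = a local, not long-range, dependency

    dep = CCGDep("bought", r"(S\NP1)/NP2", 2, "company", False)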

MAPPING OF GRS TO CCG DEPENDENCIES

Measuring the difficulty of transforming one formalism into the other.
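A hedged sketch of what such a mapping looks like in practice: a lookup from (category, slot) pairs to GR labels. The two entries below are standard textbook examples; the paper's real table covers many more categories and requires context-dependent post-processing.

    # Sketch: mapping CCG (category, slot) pairs to DepBank-style GRs.
    # This table is a tiny illustrative fragment, not the paper's full mapping.

    GR_MAP = {
        (r"(S\NP1)/NP2", 1): "ncsubj",  # slot 1 of a transitive verb -> subject
        (r"(S\NP1)/NP2", 2): "dobj",    # slot 2 -> direct object
    }

    def to_gr(head, category, slot, arg):
        """Return a (gr_label, head, arg) triple, or None when the pair
        needs the context-dependent post-processing described in the paper."""
        label = GR_MAP.get((category, slot))
        return (label, head, arg) if label else None

    print(to_gr("bought", r"(S\NP1)/NP2", 2, "company"))
    # -> ('dobj', 'bought', 'company')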

MAPPING OF GRS TO CCG DEPENDENCIES

2nd Step:
• Post-processing of the output, by comparing the CCG derivations corresponding to the DepBank outputs.
• Forcing the parser to produce gold-standard derivations.
• Comparing the resulting GRs with the DepBank outputs and measuring precision & recall.

Precision: 72.23%   Recall: 79.56%   F-score: 77.6%

This shows the difference between the schemes: there is still a long way to go to a perfect conversion.
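The comparison itself is plain set arithmetic over GR triples, with F as the harmonic mean of precision and recall, just as in the earlier metric sketch; the GR encoding below is an assumption for the example.

    # Sketch: precision/recall/F over sets of (label, head, dependent) GRs.

    def gr_prf(gold, pred):
        gold, pred = set(gold), set(pred)
        tp = len(gold & pred)
        p = tp / len(pred) if pred else 0.0
        r = tp / len(gold) if gold else 0.0
        return p, r, (2 * p * r / (p + r) if p + r else 0.0)

    gold = {("ncsubj", "bought", "IBM"), ("dobj", "bought", "company")}
    pred = {("dobj", "bought", "company"), ("ncsubj", "bought", "company")}
    print(gr_prf(gold, pred))  # -> (0.5, 0.5, 0.5)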

EVALUATION WITH RASP PARSER