Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997...

80
Relations and Realization in Syntax and Parsing Columbia University June 15, 2010 Reut Tsarfaty The Department of Linguistics and Philology Uppsala University

Transcript of Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997...

Page 1: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Relations and Realizationin Syntax and Parsing

Columbia UniversityJune 15, 2010

Reut TsarfatyThe Department of Linguistics and Philology

Uppsala University

Page 2: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Statistical Parsing from a Typological Perspective

Statistical ParsingHow can we learn statistical parsing models from data?I What are the units of generalization?I How can we exploit evidence in the data?

A Typological PerspectiveStatistical modeling in the face of cross-linguistic variationI Which probabilistic models for which languages?I Statistical modeling for parsing rich morphosyntax

The HypothesisDifferent languages different realization different modeling

Page 3: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Statistical Parsing from a Typological Perspective

Statistical ParsingHow can we learn statistical parsing models from data?I What are the units of generalization?I How can we exploit evidence in the data?

A Typological PerspectiveStatistical modeling in the face of cross-linguistic variationI Which probabilistic models for which languages?I Statistical modeling for parsing rich morphosyntax

The HypothesisDifferent languages different realization different modeling

Page 4: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Statistical Parsing

”This is easy”

Page 5: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Statistical Parsing

”This is easy”

Page 6: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Statistical Parsing

”This is easy”

Page 7: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Constituency-Based Statistical Parsing

S

NP-SBJ

PRP

“This”

VP-PRD

VBD

“is”

ADJP

ADJ

“easy”

Page 8: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Constituency-Based Supervised Statistical Parsing

S

NP-SBJ

PRP

“This”

VP-PRD

VBD

“is”

ADJP

ADJ

“easy”

Model Study F-Score

Treebank Charniak 75Grammar 1996

Head- Collins 88.6Driven 1997

Discriminative Collins 89.7Reranking 2000

Discriminative- Johnson &Reranking Charniak 2005 91.0

Self- McClosky 92.1Training 2006

State- Petrov et al 90.1Splits 2007

Forest Liang Huang 91.7Reranking 2008

Page 9: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Constituency-Based Supervised Statistical Parsing

And what about this?

And this?

And this?

And? ...

Language Parser F-Score

Rafferty & 79.2German Manning 2008

Collins 79.3Czech et al. 1999

Levy & 78.8Chinese Manning 2003

Maamouri, Bies & 78.1Arabic Kulick 2008

Tsarfaty & 74.4Hebrew Sima’an 2007

Page 10: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

So What Is Going On?

Often ConsideredI Corpora Size

E.g., For Chinese (Bikel & Chiang 2000)

I Annotation IdiosyncrasiesE.g., For Arabic (Maamouri, Bies & Kulick 2008, 2009)

I Evaluation MattersE.g., For German (Rehiben & van Genabith 2007, Kubler 2008)

In This TalkI Modeling StrategyI Language Type

Page 11: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

So What Is Going On?

Often ConsideredI Corpora Size

E.g., For Chinese (Bikel & Chiang 2000)

I Annotation IdiosyncrasiesE.g., For Arabic (Maamouri, Bies & Kulick 2008, 2009)

I Evaluation MattersE.g., For German (Rehiben & van Genabith 2007, Kubler 2008)

In This TalkI Modeling StrategyI Language Type

Page 12: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Modeling Strategies

S

NP

NNP

“John”

VP

VB

“likes”

NP

NNP

“Mary”

Page 13: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Modeling Strategies

S

NPNP

NNPNNP

“John”

VPVP

VBVB

“likes”

NPNP

NNPNNP

“Mary”

Page 14: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Modeling Strategies

S

NPNP

NNPNNP

“John”

VPVP

VBVB

“likes”

NPNP

NNPNNP

“Mary”

P(NP VP|S) 1P(NNP|NP) 1P(VB NP|VP) 1

P(“John”|NNP) 0.5P(“likes”|VB) 1P(“Mary”|NNP) 0.5

⇒P(“John likes Mary”)=P(NP VP|S)×...×P(“Mary”|NNP)= 0.25

Page 15: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Modeling Strategies

S

NPNP

NNPNNP

“John”

VPVP

VBVB

“likes”

NPNP

NNPNNP

“Mary”

P(NP VP|S) 1P(NNP|NP) 1P(VB NP|VP) 1

P(“John”|NNP) 0.5P(“likes”|VB) 1P(“Mary”|NNP) 0.5

⇒P(“John likes Mary”)=P(NP VP|S)×...×P(“Mary”|NNP)= 0.25

Page 16: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Modeling Strategies

S

NPNP

NNPNNP

“Mary”

VPVP

VBVB

“likes”

NPNP

NNPNNP

“John”

P(NP VP|S) 1P(NNP|NP) 1P(VBD NP|VP) 1

P(“John”|NNP) 0.5P(“likes”|VB) 1P(“Mary”|NNP) 0.5

⇒P(“Mary likes John”)=P(NP VP|S)×...×P(“Mary”|NNP)= 0.25

Page 17: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Modeling Strategies

S

NP

PRP.NOM

“He”

VP

VB

“likes”

NP

PRP.ACC

“her”

P(NP VP|S) 1P(PRP.NOM|NP) 0.5P(PRP.ACC|NP) 0.5P(VB NP|VP) 1

P(”He”|PRP.NOM) 1P(”likes”|VB) 1P(”her”|PRP.ACC) 1

Page 18: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Modeling Strategies

S

NP

PRP.ACC

“Her”

VP

VB

“likes”

NP

PRP.NOM

“he”

P(NP VP|S) 1P(PRP.NOM|NP) 0.5P(PRP.ACC|NP) 0.5P(VB NP|VP) 1

P(”He”|PRP.NOM) 1P(”likes”|VB) 1P(”her”|PRP.ACC) 1

P(“Her likes he”)=P(NP VP|S)×...×P(“her”|PRP.ACC)= 0.25

Page 19: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Example 1: Parent Encoding (Johnson 1998)

S

NP@S

PRP.NOM

“He”

VP@S

VB

“likes”

NP@VP

PRP.ACC

“her”

P(NP@S VP@S|S) 1P(PRP.NOM |NP@S) 1P(PRP.ACC |NP@VP) 1P(VB NP@VP|VP@S) 1

P(”He”|PRP.NOM) 1P(”likes”|VP) 1P(”her”|PRP.ACC) 1

Page 20: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Example 1: Parent Encoding (Johnson 1998)

S

NP@S

PRP.NOM

“He”

VP@S

VB

“likes”

NP@VP

PRP.ACC

“her”

P(NP@S VP@S|S) 1P(PRP.NOM |NP@S) 1P(PRP.ACC |NP@VP) 1P(VB NP@VP|VP@S) 1

P(”He”|PRP.NOM) 1P(”likes”|VP) 1P(”her”|PRP.ACC) 1

Page 21: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Example 2: Head-Driven Processes (Collins 1999)

S

NP

PRP.NOM

“He”

VP

VB

“likes”

NP

PRP.ACC

“her”

P(NP VP|S) 1P(PRP.NOM|NP) 0.5P(PRP.ACC|NP) 0.5P(VB NP|VP) 1

P(”He”|PRP.NOM) 1P(”likes”|VB) 1P(”her”|PRP.ACC) 1

Page 22: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Head-Driven Processes (Collins 1999)

S

<VB>

L,∆L1 , <VB>

PRP.NOM

He

H,∆0,<VB>

VP

<VB>

H,∆0,V <VB>

VB

likes

R,∆R1 ,V <VB>

PRP.ACC

her

⇒P(<VB>|S) 1P(L∆L1 ,H∆0| <VB>,S) 1P(PRP.NOM|L,∆L1 ,<VB>,S) 1P(VP|H,∆0,<VB>,S) 1P(<VB>|VP) 1P(PRP.ACC|R,∆R1 ,<VB>,S) 1P(VB|H,∆0,<VB>,S) 1

P(”He”|PRP.NOM) 1P(”likes”|VB) 1P(”her”|PRP.ACC) 1

Page 23: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Head-Driven Processes (Collins 1999)

S

<VB>

L,∆L1 , <VB>

PRP.NOM

He

H,∆0,<VB>

VP

<VB>

H,∆0,V <VB>

VB

likes

R,∆R1 ,V <VB>

PRP.ACC

her

⇒P(<VB>|S) 1P(L∆L1 ,H∆0| <VB>,S) 1P(PRP.NOM|L,∆L1 ,<VB>,S) 1P(VP|H,∆0,<VB>,S) 1P(<VB>|VP) 1P(PRP.ACC|R,∆R1 ,<VB>,S) 1P(VB|H,∆0,<VB>,S) 1

P(”He”|PRP.NOM) 1P(”likes”|VB) 1P(”her”|PRP.ACC) 1

Think: X-Bar Syntax!Works amazingly well for English

Page 24: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Head-Driven Processes (Collins 1999)

S

<VB>

L,∆L1 , <VB>

PRP.NOM

He

H,∆0,<VB>

VP

<VB>

H,∆0,V <VB>

VB

likes

R,∆R1 ,V <VB>

PRP.ACC

her

⇒P(<VB>|S) 1P(L∆L1 ,H∆0| <VB>,S) 1P(PRP.NOM|L,∆L1 ,<VB>,S) 1P(VP|H,∆0,<VB>,S) 1P(<VB>|VP) 1P(PRP.ACC|R,∆R1 ,<VB>,S) 1P(VB|H,∆0,<VB>,S) 1

P(”He”|PRP.NOM) 1P(”likes”|VB) 1P(”her”|PRP.ACC) 1

Think: X-Bar Syntax!Works amazingly well for English

Page 25: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Modeling Strategies

The Setup

I We are given a treebank and (often) a formal deviceI We can learn different models reflecting different theories

The Question

I How can we learn a model that captures the best theory,as it is reflected in the treebank data?

Page 26: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Data

Typological Dimensions of Variation

Basic Word-Order Typology(Greenberg 1966, Mithun 1992)

Page 27: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Data

Typological Dimensions of Variation

Basic Word-Order Typology(Greenberg 1966, Mithun 1992)

Page 28: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Data

Basic Word-Order Typology

Word-Order TypeThe order in which a Subject, a Verb and an Object appear in acanonical, neutral, unmarked sentence (SVO, VSO, VOS, etc)(Greenberg 1963)

Word-Order FreenessThe order is pragmatically determined (Mithun 1992)

RIGID ———————————— FREE

Vietnames ————————— Warlpiri

Page 29: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Data

Basic Word-Order Typology

Word-Order TypeThe order in which a Subject, a Verb and an Object appear in acanonical, neutral, unmarked sentence (SVO, VSO, VOS, etc)(Greenberg 1963)

Word-Order FreenessThe order is pragmatically determined (Mithun 1992)

RIGID ———————————— FREE

Vietnames ————————— Warlpiri

Page 30: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Data

Typological Dimensions of Variation

Basic Word-Order Typology(Greenberg 1966, Mithun 1992)

Morphological Typology(Sapir 1921, Greenberg 1954)

Page 31: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Data

Morphological Typology

Morphological SynthesisMorpheme-to-word ratio:

ISOLATING ————————– POLYSYNTHETIC

Vietnamese ———————————— Yu’pic

Morphological FusionEase of segmentation:

AGGLUTINATIVE ————————– FUSIONAL

Turkish ——————————————– Latin

Page 32: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Data

Morphological Typology

Morphological SynthesisMorpheme-to-word ratio:

ISOLATING ————————– POLYSYNTHETIC

Vietnamese ———————————— Yu’pic

Morphological FusionEase of segmentation:

AGGLUTINATIVE ————————– FUSIONAL

Turkish ——————————————– Latin

Page 33: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Data

Typological Dimensions of Variation

Basic Word-Order Typology(Greenberg 1966, Mithun 1992)

Morphological Typology(Sapir 1921, Greenberg 1954)

Nonconfigurationality(Hale 1983, Austin and Bresnan 1996)

Page 34: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Nonconfigurationality as Misalignment

Predicate-Argument Relations

‘SBJ’ did ‘PRD’ to ‘OBJ’

Syntactic Configuration

S

NP

PRP.NOM

“He”

VP

VB

“likes”

NP

NN.ACC

‘her”

Page 35: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Nonconfigurationality as Misalignment

Predicate-Argument Relations

‘SBJ’ did ‘PRD’ to ‘OBJ’

Syntactic Configuration

S

NP

PRP.NOM

“He”

VP

VB

“likes”

NP

NN.ACC

‘her”

Page 36: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Nonconfigurationality as Misalignment

Predicate-Argument Relations

‘SBJ’ did ‘PRD’ to ‘OBJ’

Syntactic Configuration

S

NP

PRP.NOM

“He”

VP

VB

“likes”

NP

NN.ACC

‘her”

Page 37: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Nonconfigurationality as Misalignment

Predicate-Argument

‘SBJ’ did ‘PRD’ to ‘OBJ’

Syntactic Configuration

S

NP

PRP.NOM

“He”

VP

VB

“likes”

NP

NN.ACC

‘her”

Page 38: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Understanding Nonconfigurationality

Word-Order in Modern Hebrew

(1) a. daniDani

natangave

et hamatanaACC the-present

ledinato-Dina

“Dani gave the present to Dina” (SVO)

b. et hamatanaACC the-present

natangave

daniDani

ledinato-Dina

“Dani gave the present to Dina” (OVS)

c. natangave

daniDani

et hamatanaACC the-present

ledinato-Dina

“Dani gave the present to Dina” (VSO)

d. ledinato-dina

natangave

daniDani

et hamatanaACC the-present

“Dani gave the present to Dina” (VSO)

Page 39: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Argument Marking in Modern Hebrew (1:1)

Case-Assigning Prepositions

(2) a. daniDani

natangave

etACC

hamatanaDEF-present

ledinaDAT-Dina

b. etACC

hamatanaDEF-present

natangave

daniDani

ledinaDAT-Dina

c. natangave

daniDani

etACC

hamatanaDEF-present

ledinaDAT-Dina

d. ledinaDAT-dina

natangave

daniDani

etACC

hamatanaDEF-present

Page 40: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Argument Marking in Modern Hebrew (1:many)

Differential Object-Marking

(3) a. daniDani

natangave

etACC

hamatanaDEF-present

ledinato-Dina

b. etACC

hamatanaDEF-present

natangave

daniDani

ledinato-Dina

c. natangave

daniDani

etACC

hamatanaDEF-present

ledinato-Dina

d. ledinato-dina

natangave

daniDani

etACC

hamatanaDEF-present

Page 41: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Argument Marking in Modern Hebrew (1:many)

Feature Spreading (Danon, 2007)

(4) a. daniDani

natangave

[et[ACC

matnatpresent

yomday

hahuledet]DEF-birth]

ledinato-Dina

b. [et[ACC

matnatpresent

yomday

hahuledet]DEF-birth]

natangave

daniDani

ledinato-Dina

c. natangave

daniDani

[et[ACC

matnatpresent

yomday

hahuledet]DEF-birth]

ledinato-Dina

d. ledinato-dina

natangave

daniDani

[et[ACC

matnatpresent

yomday

hahuledet]DEF-birth]

Page 42: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Argument Marking in Modern Hebrew (1:m)

Agreement

(5) a. daniDani.MS

natangave.3MS

etACC

hamatanaDEF-present

ledinaDAT-Dina

b. etACC

hamatanaDEF-present

natangave.3MS

daniDani.MS

ledinaDAT-Dina

c. natangave.MS

daniDani.3MS

etACC

hamatanaDEF-present

ledinaDAT-Dina

d. ledinaDAT-dina

natangave.3MS

daniDani.MS

etACC

hamatanaDEF-present

Page 43: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Argument Marking Modern Hebrew (many:1)

Clitics and Null Anaphors

(6) a. daniDani.MS

natangave.3MS

etACC

hamatanaDEF-present

ledinaDAT-Dina

“Dani gave the present to Dina”b. natati

gave.1SetACC

hamatanaDEF-present

ledinaDAT-Dina

“I gave the present to Dina”c. natatiha

gave.1S.ACC.3FSledinaDAT-Dina

“I gave it to Dina”

Page 44: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Language Types and Modeling Strategies

Recap:

CONFIGURATIONAL ————— NONCONFIGURATIONAL

1:1 ————————————————————— m:nVietnamese > English > German > Hebrew > Warlpiri

I Realization is the mapping of functions to formsI Different Languages show different realization strategiesI Different realization strategies may require different models

Question:How can we model generally complex form-function mappings?

Page 45: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Language Types and Modeling Strategies

Recap:

CONFIGURATIONAL ————— NONCONFIGURATIONAL

1:1 ————————————————————— m:nVietnamese > English > German > Hebrew > Warlpiri

I Realization is the mapping of functions to formsI Different Languages show different realization strategiesI Different realization strategies may require different models

Question:How can we model generally complex form-function mappings?

Page 46: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Modeling Morphology

Morpheme-Based Morphology (Bloomfield, 1933)

‘kids’

‘kid’KID

‘s’plural

, ‘oxen’

‘ox’OX

‘en’plural

, ‘men’

‘m..n’MAN

eplural

, ‘sheep’

‘sheep’SHEEP

∅plural

Morphological Exponence

I Simple Exponence (e.g., ‘s’ in ‘cats’)I Cumulative Exponence (e.g., ‘s’ in ‘eats’)I Extended Exponence (e.g., ‘i’,‘ren’ in ‘children’)

Page 47: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Modeling Morphology

Morpheme-Based Morphology (Bloomfield, 1933)

‘kids’

‘kid’KID

‘s’plural

, ‘oxen’

‘ox’OX

‘en’plural

, ‘men’

‘m..n’MAN

eplural

, ‘sheep’

‘sheep’SHEEP

∅plural

Morphological Exponence

I Simple Exponence (e.g., ‘s’ in ‘cats’)I Cumulative Exponence (e.g., ‘s’ in ‘eats’)I Extended Exponence (e.g., ‘i’,‘ren’ in ‘children’)

Page 48: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Modeling Morphology

Morpheme-Based Morphology (Bloomfield, 1933)

‘kids’

‘kid’KID

‘s’plural

, ‘oxen’

‘ox’OX

‘en’plural

, ‘men’

‘m..n’MAN

eplural

, ‘sheep’

‘sheep’SHEEP

∅plural

Morphological Exponence

I Simple Exponence (e.g., ‘s’ in ‘cats’)I Cumulative Exponence (e.g., ‘s’ in ‘eats’)I Extended Exponence (e.g., ‘i’,‘ren’ in ‘children’)

Page 49: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Modeling Morphology: Primitives and Processes

LEXICAL vs. INFERENTIAL Approaches

I LEXICAL:morphemes are primary, properties stored in the lexicon

I INFERENTIAL:properties are primary, forms are computed

INCREMENTAL vs. REALIZATIONAL Approaches

I INCREMENTAL:morphemes/properties are accumulated incrementally(“monotonic” rules)

I REALIZATIONAL:property-bundles are pre-condition for rule application(“spell-out” rules)

Page 50: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Modeling Morphology: A Taxonomy

LEXICAL INFERENTIAL

INCREMENTAL Item & Arrangement Item & Processes(Bloomfield 1933) (Hocket 1954)

(Lieber 1992) (Steele 1995)

REALIZATIONAL Distributed Morphology (Extended) Word & Paradigm(Halle and Marantz 1993) (Matthews 1972), (Anderson 1992)

Lexical Phonology (Stump 2001), (Blevins 2006)

Table: A Taxonomy of Models for Morphology (Stump 2001)

Page 51: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Word-and-Paradigm Morphology

Paradigmatic Organization

/EAT/ 1Sing 2Sing 3Sing 1Pl 2Pl 3Pl

Past 1SingPast 2SingPast 3SingPast 1PlPast 2PlPast 3PlPastPresent 1SingPres 2SingPres 3SingPres 1PlPres 2PlPres 3PlPresPerfect 1SingPerf 2SingPerf 3SingPerf 1PlPerf 2PlPerf 3PlPerf

Realization Rules

/EAT/+1SingPast

‘ate’

, /EAT/+3SingPast

‘ate’

, /EAT/+1SingPres

‘eats’

, /EAT/+3SingPres

‘eat’

Page 52: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Proposal (I): “Lifting” the Terminology

Morphological Exponence

: Properties Words

I Simple (e.g., PL ‘kids’)I Cumulative (e.g. 3PER+SING ‘eats’)I Distributed/Extended (e.g. PL ‘children’)

Morphosyntactic Exponence

: Relations Configurations

I Simple (e.g., SBJ nominative )I Cumulative (e.g., SBJ,PRD,OBJ clitics)I Distributed/Extended (e.g., OBJ DOM, FS)

Page 53: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Proposal (I): “Lifting” the Terminology

Morphological Exponence : Properties Words

I Simple (e.g., PL ‘kids’)I Cumulative (e.g. 3PER+SING ‘eats’)I Distributed/Extended (e.g. PL ‘children’)

Morphosyntactic Exponence : Relations Configurations

I Simple (e.g., SBJ nominative )I Cumulative (e.g., SBJ,PRD,OBJ clitics)I Distributed/Extended (e.g., OBJ DOM, FS)

Page 54: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Proposal (II): Modeling Principles

CONFIGURATIONAL vs. RELATIONAL Approaches

I CONFIGURATIONAL:configurations are primary, relations are derived

I RELATIONAL:relations are primary, configurations are derived

INCREMENTAL vs. REALIZATIONAL Approaches

I INCREMENTAL:Syntactic rules are monotonic(incrementally accumulate relations)

I REALIZATIONAL:Syntactic rules define spellout(relations as precondition to realization)

Page 55: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Proposal (III): A Taxonomy of Parsing Frameworks

CONFIGURATIONAL RELATIONAL

INCREMENTAL Head-Driven Parsing Dependency Parsing

REALIZATIONAL Stochastic TAG, CCG Relational-Realizational

Table: A Taxonomy of Statistical Parsing Frameworks (Tsarfaty 2010)

Page 56: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Proposal (IV): Relational-Realizational Modeling

S〈PRED〉 FEATS Affirmative Interrogative Imperative

ARG-ST

intransitive Saffirm+{SBJ,PRD} Sinter+{SBJ,PRD} Simper+{SBJ,PRD}transitive Saffirm+{SBJ,PRD,OBJ} Sinter+{SBJ,PRD,OBJ} Simper+{SBJ,PRD,OBJ}ditransitive Saffirm+{SBJ,PRD,OBJ,COM} Sinter+{SBJ,PRD,OBJ,COM} Simper+{SBJ,PRD,OBJ,COM}

Figure: Paradigmatic Organization

Saffirm+{SBJ,PRD,OBJ,COM}

〈NPnomDaniDani

,VB

natangave

,NPdef.acc

et hamatanaACC-the-present

,NPdatledinato-Dina

Saffirm+{SBJ,PRD,OBJ,COM}

〈NPdef.acc

et hamatanaACC-the-present

,VB

natangave

,NPnomDaniDani

,NPdatledinato-Dina

Figure: Form-Function Separation

Page 57: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Model

Relational-Realizational (RR) Parsing(Tsarfaty, Sima’an and Scha 2008, 2009)

I Separate Form and FunctionI First Generate Grammatical RelationsI Then Spell-out (Morpho)Syntactic Realization

I Separate Means of RealizationI First Generate ConfigurationI Then Morphosyntactic Representation

Page 58: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Model

Relational-Realizational (RR) Parsing(Tsarfaty, Sima’an and Scha 2008, 2009)

I Separate Form and FunctionI First Generate Grammatical RelationsI Then Spell-out (Morpho)Syntactic Realization

I Separate Means of RealizationI First Generate ConfigurationI Then Morphosyntactic Representation

Page 59: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Tsarfaty et al. (2008, 2009)

Relational-Realizational (RR) Parsing

S

NP-SBJ

daniDani

VB-PRD

natangave

ADVP

etmolyesterday

NP+Def+Acc-OBJ

et hamatanaAcc Def-present

PP-COM

ledinato Dina

Page 60: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Tsarfaty et al. (2008, 2009)

Relational-Realizational (RR) Parsing

S

{SBJ,PRD,OBJ,COM}@S

NP

daniDani

VB

natangave

ADVP

etmolyesterday

NP+Def+Acc

et hamatanaAcc Def-present

PP

ledinato Dina

Page 61: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Tsarfaty et al. (2008, 2009)

Relational-Realizational (RR) Parsing

S

{SBJ,PRD,OBJ,COM}@S

SBJ@S

NP

daniDani

PRD@S

VB

natangave

PRD:OBJ@S

ADVP

etmolyesterday

OBJ@S

NP+Def+Acc

et hamatanaAcc Def-present

COM@S

PP

ledinato Dina

Page 62: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Tsarfaty et al. (2008, 2009)

Relational-Realizational (RR) Parsing

S

{PRD,SBJ,OBJ,COM}@S

SBJ@S

NP

daniDani

PRD@S

VB

natangave

PRD:OBJ@S

ADVP

etmolyesterday

OBJ@S

NP+Def+Acc

et hamatanaAcc Def-present

COM@S

PP-COM

ledinato Dina

S

{PRD,SBJ,OBJ,COM}@S

OBJ@S

NP+Def+Acc

et hamatanaAcc Def-present

PRD@S

VB

natangave

PRD:OBJ@S

ADVP

etmolyesterday

SBJ@S

NP

daniDani

COM@S

PP-COM

ledinato Dina

Page 63: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Model ParametersProjection:

P

{gri}ni=1@P

Configuration:

{gri}ni=1@P

gr1@P gr1 : gr2@P ... grn@P

Realization:

gr1@P

C1

gr1 : gr2@P

..C1:2i ..

... grn@P

Cn

Page 64: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

The Probabilistic Model

The RR Probabilities:

PRR(r) =Projection Pp({gri}n

i=1|P) ×Configuration Pc(〈gr0 : gr1,g1, . . .〉|{gri}n

i=1,P) ×Realization

∏ni=1 Pr1(Ci |gri ,P) ×

Pr2(〈C01, ...,C0m0〉|gr0 : gr1,P) ×∏n

i=1 Pr2(〈Ci1, ...,Cimi〉|gri : gri+1,P)

The RR Parser:

π∗ = argmaxπP(π) = argmaxπ∏

r∈π PRR(r)

Page 65: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Application I: Parsing Modern Hebrew

Page 66: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

A Taxonomy of PCFG-based Parsers

CONFIGURATIONAL RELATIONAL

INCREMENTAL Head-Driven Parsing

Dependency Parsing

(Collins 1999)

(Collins 1999 enhanced)

REALIZATIONAL

Flattened Trees

Relational-Realizational

(Johnson 1998)

(Tsarfaty et al. 2009)

Table: A Taxonomy of PCFG-Based Parsing Frameworks

Page 67: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

A Taxonomy of PCFG-based Parsers

CONFIGURATIONAL RELATIONAL

INCREMENTAL Head-Driven Parsing Dependency Parsing(Collins 1999) (Collins 1999 enhanced)

REALIZATIONAL Flattened Trees Relational-Realizational(Johnson 1998) (Tsarfaty et al. 2009)

Table: A Taxonomy of PCFG-Based Parsing Frameworks

Page 68: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Case Study: Differential Object-MarkingDataThe Modern Hebrew Treebank v2, head annotated.6500 sentences, 500/5500/500 dev/train/test split

ModelsI Grammatical Functions: PRD, SBJ, OBJ, COM, CNJI Morphological Splits: PoS/Def/AccI Conditioning Context: Horizontal/Vertical

EstimationRelative Frequency + Simple Unknown Words Smoothing

ParsingExhaustive Viterbi Parsing (using BitPar, Schmid 2004)

EvaluationPARSEVAL (i) Overall, and (ii) Per Category Evaluation

Page 69: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Overall Results

74.66/74.35 73.52/74.84 76.32/76.51(7385) (21399) (13618)

Page 70: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Overall Results

74.66/74.35 73.52/74.84 76.32/76.51(7385) (21399) (13618)

Page 71: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

A Relational-Incremental Model

73.52/74.84 76.32/76.51(21399) (13618)

Page 72: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

A Relational-Incremental Model

73.52/74.84 72.84/74.62 76.32/76.51(21399) (16460) (13618)

Page 73: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Results Per Category

NP 77.39 / 74.32 77.94 / 73.75 78.96 / 76.11PP 71.78 / 71.14 71.83 / 69.24 74.4 / 72.02

SBAR 55.73 / 59.71 53.79 / 57.49 57.97 / 61.67ADVP 71.37 / 77.01 72.52 / 73.56 73.57 / 77.59

ADJP 79.37 / 78.96 78.47 / 77.14 78.69 / 78.18S 73.25 / 79.07 71.07 / 76.49 72.37 / 78.33

SQ 36.00 / 32.14 30.77 / 14.29 55.56 / 17.86PREDP 36.31 / 39.63 44.74 / 39.63 44.51 / 46.95

Page 74: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Take Home

CONFIGURATIONAL RELATIONAL

INCREMENTAL Head-Driven Parsing Dependency Parsing

REALIZATIONAL Stochastic TAG, CCG Relational-Realizational

Table: A Taxonomy of Statistical Parsing Frameworks (Tsarfaty 2010)

Page 75: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Application II: Probabilistic Universal GrammarBasic Word-Order Parameter:P(< configuration >|{SBJ,PRD,OBJ}@S)

Probability Configuration tri- bi-

0.2% OBJ � SUBJ PRD OSV OV0.2% PRD OBJ SBJ � VOS VO0.2% � PRD OBJ � SBJ � VOS VO0.2 % PRD SBJ � OBJ � VSO VO0.4 % � PRD � SBJ � OBJ � VSO VO0.6 % OBJ � PRD SBJ � OVS OV0.8 % OBJ PRD � SBJ � OVS OV1 % � PRD � SBJ OBJ � VSO VO1.3% SBJ � PRD OBJ � SVO VO1.7% � PRD OBJ SBJ � VOS VO1.7% � SBJ PRD � OBJ � SVO VO3% OBJ PRD SBJ � OVS OV3.7% � PRD SBJ � OBJ � VSO VO4.1% SBJ � PRD � OBJ � SVO VO6.5% � SBJ PRD OBJ � SVO VO10.3% SBJ � PDR OBJ � SVO VO12.3% � PRD SBJ OBJ � VSO VO15.6% SBJ PRD � OBJ � SVO VO35.3% SBJ PRD OBJ � SVO VO

Page 76: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Application II: Probabilistic Universal Grammar

Differential Object-Marking Parameter:P(< morphosyntactic representation >| OBJ@S)

Probability Realization

5.8% NP.DEF.ACC〈PRP〉@S6.5% NP.DEF.ACC〈NNT 〉@S6.7% NP.DEF.ACC〈NN.DEF 〉@S7.4% NP.DEF.ACC〈NNP〉@S8.8% NP〈NNT 〉@S14.7% NP.DEF.ACC〈NN〉@S43.5% NP.〈NN〉@S

Page 77: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Towards Computational Typology?

Can we Use the RR parameters to...

I Quantify Intra-Language Variation?I Quantify Cross-Linguistic Variation?I Learn Parameters Settings from Data?I Quantify Nonconfigurationality?

Page 78: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

RRRecap

I Languages are different! Modeling strategies should accommodate differences

I Nonconfigurational languages are not configurational! Modeling strategies should account for misalignments

I Modeling Morphology vary in underlying assumptions Inferential-Realizational approaches model m:n mapping

I Modeling Morphosyntax meets similar considerations Relational-Realizational modeling allows for misalignments

Page 79: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

Thank You!

Questions?

Page 80: Relations and Realization in Syntax and Parsing · 2017. 4. 6. · Head- Collins 88.6 Driven 1997 Discriminative Collins 89.7 Reranking 2000 Discriminative- Johnson & Reranking Charniak

PRRR

Let’s Try it for Different Languages!

For more InformationRelational-Realizational ParsingReut Tsarfaty, University of AmsterdamPhD Manuscript, 2010