1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger...
-
Upload
savannah-taylor -
Category
Documents
-
view
216 -
download
3
Transcript of 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger...
![Page 1: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/1.jpg)
1
Developing a German grammar for analysis and generation using OpenCCG
Ciprian GerstenbergerUniversity of Saarland
IGK Colloquium January 13th 2005
![Page 2: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/2.jpg)
2
Outline
1. NLP environments: a comparison
2. The choice: OpenCCG
3. The formalism: MMCCG
4. The German grammar
5. Future work
![Page 3: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/3.jpg)
3
Dialogue systems
building dialogue systems → linguistic resources
linguistic resources → tools for developing and maintaining
wide range of different NLP environments
⇒ Which is the most appropriate environment for our purposes?
![Page 4: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/4.jpg)
4
NLP environments for dialogue systems
General requirements
both for analysis and generation multi-lingual easy domain reconfigurability
Requirements for NLG
realization of contextually sensitive utterances linguistically motivated control over flexible sentence realization
![Page 5: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/5.jpg)
5
NLP environments for dialogue systems
Technical requirements
freely available well documented offering support when needed freely available resources (for German) efficient platform independent
![Page 6: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/6.jpg)
6
NLP environments
KPML (Lisp): Systemic-Functional Grammar (SFG) OpenCCG (Java): Multi-Modal Combinatory Categorial Grammar
(MMCCG) Babel (Prolog): Head-Driven Phrase Structure Grammar (HPSG) LKB (Lisp): Head-Driven Phrase Structure Grammar (HPSG) XLE (C): Lexical Functional Grammar (LFG) XTAG (Lisp): Tree Adjoning Grammar (TAG) XDG (Oz): Topological Dependency Grammar (TDG)
![Page 7: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/7.jpg)
7
NLP environments: Babel
Babel-System (S. Müller)
implementing HPSG Prolog only analysis, no generation multi-lingual (?) resources for German: grammar with good coverage freely available documentation support (?)
![Page 8: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/8.jpg)
8
NLP environments: LKB
LKB
implementing HPSG Lisp multi-lingual both analysis and generation but: resources for German not usable for generation freely available documentation support (?)
![Page 9: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/9.jpg)
9
NLP environments: XTAG
XTAG
implementing TAG Lisp both analysis and generation multi-lingual resources for German (DFKI ?) freely available documentation support (?)
![Page 10: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/10.jpg)
10
NLP environments: XDG
XDG
implementing TDG Oz only analysis (generation as dependency parsing using TAGs) multi-lingual (?) resources for German (toy grammars) freely available documentation (?) support
![Page 11: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/11.jpg)
11
NLP environments: KPML
KOMET-Penman Multilingual Linguistic resource development
implementing Systemic-Functional Grammar (SFG) Lisp multi-lingual flexible generation good sentence realization control only for generation, no parsing resources for German: grammar with good coverage freely available documentation and support
![Page 12: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/12.jpg)
12
NLP environments: XLE
Xerox Linguistic Environment
implementing LFG C and Tcl/Tk multi-lingual both analysis and generation resources for German (not freely available) documentation support not freely available
![Page 13: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/13.jpg)
13
NLP environments: OpenCCG
OpenCCG
implementing Multi-Modal Combinatory Categorial Grammar (MMCCG)
open source Java-based NLP library both analysis and generation multi-lingual no resources for German, but grammars for English freely available documentation support
![Page 14: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/14.jpg)
14
NLP environments: The Choice
OpenCCG
Java-based NLP library → platform independent analysis and generation → uniform grammar resources multi-lingual → extendable used and in use in several other projects: FLIGHTS, COMIC, COSY supporting output format for TTS (e.g. APML) optimized sentence realization flexible generation sentence realization control
![Page 15: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/15.jpg)
15
Basic formalism: CCG
Combinatory Categorial Grammar
lexicalized grammar formalism
lexical items are assigned syntactic categories
combinatory rules
![Page 16: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/16.jpg)
16
MMCCG
Multi-Modal Combinatory Categorial Grammar
refining CCG by introducing means of controlling the application of combinatory rules
specifying modes on category forming operators (slashes)
making application of rules dependent on the slash mode
four basic modes governing different levels of associativity and permutativity
![Page 17: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/17.jpg)
17
Example
Der Hund sieht die Katze.
![Page 18: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/18.jpg)
18
Example (cont.)
Der Hund sieht die Katze.
![Page 19: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/19.jpg)
19
Developing a German Grammar
joint work with Magdalena Wolska (DIALOG Project)
Desiderata uniform resources for analysis and generation covering all phenomena in our domains achieve more generality of the grammar than wrt
phenomena encountered in our (relatively small) corpora
![Page 20: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/20.jpg)
20
Phenomena
Some phenomena in German
agreement position of the finite verb Topological Fields: controlling the Vorfeld complex sentences ambiguity controlling sentence realization
![Page 21: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/21.jpg)
21
Lexical forms
![Page 22: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/22.jpg)
22
Agreement
![Page 23: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/23.jpg)
23
Agreement (cont.)
![Page 24: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/24.jpg)
24
Agreement/Complex sentences
![Page 25: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/25.jpg)
25
Clause types
Verb-initial clauses
• yes/no questions:Soll ich die den Titel zu der Liste hinzufügen?
• alternative questions:Möchtest Du Mozart oder Bach hören?
imperatives:Wähle das Album „Californication“ von den Red Hot Chili Peppers!
![Page 26: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/26.jpg)
26
Clause types (cont.)
Verb-second clauses
main declarative: Der Titel wurde hinzugefügt.
• wh-question:Welcher Künstler spielt „Missunderstood“?
![Page 27: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/27.jpg)
27
Clause types (cont.)
Verb-final clauses
subordinate clause:Wenn Sie möchten, kann ich „We Just Can´t Get Enough CCG“ abspielen.
relative clause:Ich nehme aus den ersten vier Alben, die du hast, jeweils den ersten Song.
• complement clause:Ich glaube, daß das Album „Dangerously In Love“ heißt.
![Page 28: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/28.jpg)
28
Topological Fields
Controlling the Vorfeld occupation using flags
![Page 29: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/29.jpg)
29
Topological Fields (cont.)
Controlling the Vorfeld occupation using flags
![Page 30: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/30.jpg)
30
Analysis: Ambiguities
Der Hund von dem traurigen Mann den ich sah rennt.
![Page 31: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/31.jpg)
31
Analysis: Ambiguities (cont.)
Das Kind rennt wenn der Hund rennt weil die Katze rennt.
![Page 32: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/32.jpg)
32
Generation
Sentence realization without control
![Page 33: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/33.jpg)
33
Generation (cont.)
Sentence realization with control: fronted subject
![Page 34: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/34.jpg)
34
Generation (cont.)
Sentence realization with control: fronted object
![Page 35: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/35.jpg)
35
Future Work (1)
extending the grammar wrt the two domain currently modelled (MP3 and maths tutorial)
(AP, NP, sentence, etc.) coordination complex NP (e.g. postmodifications) control and raising verbs particle verbs (Ich spiele den Song ab vs. Ich möchte den
Song abspielen) Topological Fields: scrambling in the Mittelfeld
![Page 36: 1 Developing a German grammar for analysis and generation using OpenCCG Ciprian Gerstenberger University of Saarland IGK Colloquium January 13th 2005.](https://reader036.fdocuments.us/reader036/viewer/2022070305/551430005503466d1a8b4f94/html5/thumbnails/36.jpg)
36
Future Work (2)
analysis: coping with partial input, ill-formed utterances generation: realizing elliptical output using a dynamic morphological module development of an ontology