Automated Translation of Multi-word Expressions Application in English-Latvian SMT - poster booster
-
Upload
matiss-rikters -
Category
Technology
-
view
59 -
download
1
Transcript of Automated Translation of Multi-word Expressions Application in English-Latvian SMT - poster booster
![Page 1: Automated Translation of Multi-word Expressions Application in English-Latvian SMT - poster booster](https://reader036.fdocuments.us/reader036/viewer/2022083104/58808bbb1a28ab35718b6acd/html5/thumbnails/1.jpg)
Automated Translation of Multi-word Expressions:
Application in English-Latvian SMT
Prof. Inguna Skadiņa1 and Matīss Rikters2
1,2University of Latvia, 19 Raina Blvd., Riga, Latvia1Institute of Mathematics and Computer Science, 29 Raina Blvd., Riga, Latvia
2nd PARSEME Training SchoolLa Rochelle, France
June 27, 2016
![Page 2: Automated Translation of Multi-word Expressions Application in English-Latvian SMT - poster booster](https://reader036.fdocuments.us/reader036/viewer/2022083104/58808bbb1a28ab35718b6acd/html5/thumbnails/2.jpg)
General schema of experiments
![Page 3: Automated Translation of Multi-word Expressions Application in English-Latvian SMT - poster booster](https://reader036.fdocuments.us/reader036/viewer/2022083104/58808bbb1a28ab35718b6acd/html5/thumbnails/3.jpg)
Data and Tools• JRC Acquis corpus(v. 3.0):• 1 472 367 parallel sentences as training data• 1134 random sentences as development data• 1599 random sentences as test data• 64 290 multiword expressions
• Tools:• Moses toolkit (Keohn et al., 2007) for training the MT system• MPAligner (Pinnis, 2014) for alignment of multiword expressions• SRILM (Stolcke et al., 2011) for training 5-gram language model• MERT (Och, 2003) for tunning the MT system