A Probabilistic Framework for Structure-based Alignment



A Probabilistic Framework for Structure-based Alignment

Kurohashi-lab M2
56430 Toshiaki Nakazawa


Outline

I. Introduction of Machine Translation
   i. What is Alignment?
   ii. Statistical Machine Translation (SMT)
   iii. Example-based Machine Translation (EBMT)
II. Baseline alignment method
III. A probabilistic framework for alignment
   i. Corresponding Pattern score (CP-score)
   ii. Integration of Maximum Entropy (ME)
IV. Experiments and results
V. Discussion and conclusion

I. Introduction of Machine Translation

Standard Way of Machine Translation

[Diagram: Parallel Corpus → Alignment → Resource; Input → Translation → Output]

Parallel corpus: text written in two different languages whose content is almost the same.
Alignment: finding the correspondences between two parallel sentences (at the word level, phrase level, etc.).

The performance of alignment affects the accuracy of translation.

Statistical Machine Translation (SMT)

• Learns translation models statistically from a parallel corpus
• Does not use any linguistic resources
• Small translation unit (= "word")
  – Recently, the number of studies handling bigger units (= a couple of words, or a "phrase") is increasing
• Requires a large parallel corpus for highly accurate translation


Basic Method for SMT

Translate by maximizing the probability:

\hat{E} = \operatorname*{argmax}_E P(E \mid J) = \operatorname*{argmax}_E P(E)\, P(J \mid E)

P(E): language model
P(J \mid E): translation model, learned from a parallel corpus (usually with an unsupervised learning algorithm), e.g. the IBM Models [Brown et al., 93]
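To make the decision rule above concrete, here is a minimal sketch (an illustration only, not the paper's system): the candidate translations and the two log-probability scorers are assumed to be supplied by the caller.

```python
import math

# Noisy-channel decoding sketch: pick the E that maximizes P(E) * P(J | E),
# i.e. the sum of the two log-probabilities. `lm_logprob` and `tm_logprob`
# are hypothetical scorer callbacks; a real SMT decoder searches a huge
# hypothesis space instead of scoring a fixed candidate list.
def noisy_channel_decode(j_sentence, candidates, lm_logprob, tm_logprob):
    best_e, best_score = None, -math.inf
    for e in candidates:
        score = lm_logprob(e) + tm_logprob(j_sentence, e)  # log P(E) + log P(J|E)
        if score > best_score:
            best_e, best_score = e, score
    return best_e
```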


Overview of EBMT

[Diagram: Parallel Corpus → Alignment (using advanced NLP technologies) → Translation Memory Database (TMDB); Input → Translation → Output]

Example-based Machine Translation (EBMT)

• Divides the input sentence into a few parts
• Finds similar expressions (examples) in the parallel corpus for each part
• Combines the examples to generate the output translation
• Uses any available linguistic resources as much as possible
• A larger translation unit (a larger example) is better (see the toy sketch below)
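As a rough illustration of this flow only: in the toy sketch below the "example database" is just a phrase dictionary, matching is greedy longest-match over flat strings, and the retrieved fragments are simply concatenated, whereas the actual system matches and combines dependency-tree examples. The example phrases are made up.

```python
# Toy EBMT sketch: split the input into the longest phrases found in the
# example database and combine their stored translations.
def ebmt_translate(words, example_db):
    output, i = [], 0
    while i < len(words):
        for j in range(len(words), i, -1):  # prefer the largest example
            phrase = " ".join(words[i:j])
            if phrase in example_db:
                output.append(example_db[phrase])
                i = j
                break
        else:
            output.append(words[i])  # no example found; pass the word through
            i += 1
    return " ".join(output)

# Hypothetical example database for illustration only.
example_db = {"thank you very much": "どうもありがとうございます", "thank you": "ありがとう"}
print(ebmt_translate("thank you very much".split(), example_db))
```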


Flow of EBMT


SMT vs. EBMT

SMT
  Good points: works reasonably even for language pairs that lack sufficient NLP resources.
  Bad points: not easy to achieve high performance; weak when the two languages differ widely.

EBMT
  Good points: actively utilizes any kind of NLP resource; high performance.
  Bad points: the algorithm is usually heuristic; modification is necessary for each language pair.

We introduce a probabilistic framework for structure-based alignment.

II. Baseline alignment method


Alignment

[Figure: Japanese and English dependency trees for the example sentence pair below, with corresponding nodes linked]

1. Transformation into dependency structure

J: 交差点で、突然あの車が飛び出して来たのです。
E: The car came at me from the side at the intersection.

J: JUMAN/KNP, E: Charniak's nlparser → dependency trees


Alignment


1. Transformation into dependency structure
2. Detection of word(s) correspondences
   • Bilingual dictionaries
   • Transliteration detection
     ローズワイン → rosuwain ⇔ rose wine (similarity: 0.78)
     新宿 → shinjuku ⇔ shinjuku (similarity: 1.0)
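A minimal sketch of the transliteration-detection step, assuming the katakana word has already been romanized (e.g. ローズワイン → rosuwain) and using a simple string-similarity ratio; the measure actually used here may differ, so the printed values are only illustrative.

```python
from difflib import SequenceMatcher

# Compare a romanized Japanese word against an English candidate; spaces and
# case on the English side are ignored before measuring string similarity.
def transliteration_similarity(romanized_ja, english):
    return SequenceMatcher(None, romanized_ja.lower(),
                           english.replace(" ", "").lower()).ratio()

print(transliteration_similarity("rosuwain", "rose wine"))  # high, but < 1.0
print(transliteration_similarity("shinjuku", "shinjuku"))   # 1.0
```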


Alignment


1. Transformation into dependency structure
2. Detection of word(s) correspondences
3. Disambiguation of correspondences


Disambiguation

[Figure: dependency trees for the pair "日本で保険会社に対して保険請求の申し立てが可能ですよ" ⇔ "You will have to file an insurance claim with the insurance office in Japan"; 保険 ⇔ insurance appears twice on each side, so its correspondence is ambiguous]

An ambiguous correspondence candidate is scored by its proximity to the unambiguous correspondences in both trees:

C_unamb → C_amb : 1/(distance in the J tree) + 1/(distance in the E tree)

e.g. 1/2 + 1/1 in the figure
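A minimal sketch of this scoring, assuming each dependency tree is given as a child → parent dictionary, "distance" means the number of edges between two nodes of the same tree, and a correspondence is a (Japanese node, English node) pair. This illustrates the formula above; it is not the original implementation.

```python
# Distance between two nodes of one tree, given child -> parent links.
# The two nodes are assumed to be distinct, so the distance is at least 1
# and 1/distance is well defined.
def tree_distance(a, b, parent):
    path_a = [a]
    while path_a[-1] in parent:          # climb from a to the root
        path_a.append(parent[path_a[-1]])
    depth_from_a = {node: i for i, node in enumerate(path_a)}
    steps, node = 0, b
    while node not in depth_from_a:      # climb from b until the paths meet
        node = parent[node]
        steps += 1
    return steps + depth_from_a[node]

# Score of an ambiguous correspondence c_amb = (J node, E node) relative to
# an unambiguous correspondence c_unamb, as defined above.
def disambiguation_score(c_amb, c_unamb, j_parent, e_parent):
    d_j = tree_distance(c_amb[0], c_unamb[0], j_parent)
    d_e = tree_distance(c_amb[1], c_unamb[1], e_parent)
    return 1.0 / d_j + 1.0 / d_e
```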


Alignment


1. Transformation into dependency structure
2. Detection of word(s) correspondences
3. Disambiguation of correspondences
4. Handling of remaining phrases


Alignment


1. Transformation into dependency structure
2. Detection of word(s) correspondences
3. Disambiguation of correspondences
4. Handling of remaining phrases
5. Registration to the translation example database

III. A probabilistic framework for alignment: Corresponding Pattern score (CP-score)


Corresponding Pattern (CP)


[Figure: corresponding patterns in an aligned example, shown as tuples such as (1, 2, 1, 1), (0, 1, 0, 1), (0, 2, 0, 1), (0, 2), (0, 1), (1, 2), (1, 1)]


CP-score

• Assign a score to each CP (= CP-score)
• Calculation of the CP-score
  – Count the frequency of each CP, using the parallel corpus aligned by the baseline alignment method
  – Divide the frequency by the total frequency of all CPs (so the CP-score is a probability of occurrence)
• Alignment Score (AS) by CP-score:

AS = \sum_{i=1}^{M-1} \sum_{j=i+1}^{M} \text{CP-score}(i, j)

(the sum runs over all pairs (i, j) of correspondences in the alignment)
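A minimal sketch of this computation, assuming each corresponding pattern can be represented as a hashable value and that a hypothetical extract_cp() helper derives the pattern for a pair of correspondences; the pattern representation itself follows the paper and is not reproduced here.

```python
from collections import Counter

# CP-scores: relative frequencies of corresponding patterns counted over the
# corpus that was aligned by the baseline method.
def train_cp_scores(observed_patterns):
    counts = Counter(observed_patterns)
    total = sum(counts.values())
    return {cp: c / total for cp, c in counts.items()}

# Alignment Score (AS): sum of CP-scores over all pairs of correspondences.
# `extract_cp` is a hypothetical helper returning the pattern of a pair.
def alignment_score(correspondences, cp_scores, extract_cp):
    score = 0.0
    for i in range(len(correspondences) - 1):
        for j in range(i + 1, len(correspondences)):
            cp = extract_cp(correspondences[i], correspondences[j])
            score += cp_scores.get(cp, 0.0)
    return score
```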


Alignment Disambiguation by AS

Adopt the alignment with the highest AS.

III. A probabilistic framework for alignment: Integration of Maximum Entropy (ME)



Maximum Entropy (ME)

The principle of maximum entropy:
– a method for analyzing the available information in order to determine a unique epistemic probability distribution (from Wikipedia)

Alignment probability with ME [Och et al., 02]:

\Pr(A \mid S, T) = \frac{\exp\left[\sum_{m=1}^{M} \lambda_m h_m(A, S, T)\right]}{\sum_{A'} \exp\left[\sum_{m=1}^{M} \lambda_m h_m(A', S, T)\right]}

S: source sentence
T: target sentence
A: alignment
h_m: feature function
\lambda_m: model parameter
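A minimal sketch of this log-linear model, assuming the candidate alignments, feature functions, and weights are supplied by the caller; the real model uses the feature functions listed on the following slide with weights trained on the annotated data.

```python
import math

# Log-linear (maximum entropy) alignment model: score each candidate
# alignment A with a weighted sum of feature functions h_m(A, S, T) and
# normalize with a softmax over all candidates.
def alignment_probabilities(candidates, feature_functions, weights, S, T):
    scores = [sum(w * h(A, S, T) for w, h in zip(weights, feature_functions))
              for A in candidates]
    z = sum(math.exp(s) for s in scores)
    return [math.exp(s) / z for s in scores]
```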


Feature Functions

1. Alignment Score (AS)
2. Parse score (Japanese and English)
3. Depth pattern score (DP-score)
4. Probability of lexicon (Japanese and English)
5. Coverage of the correspondences (Japanese and English)
6. Average size of the correspondences (Japanese and English)

IV. Experiments and results


Experiments

• Selected 500 moderately long sentences from the BTEC corpus of the IWSLT 2005 training data set
• Manually annotated phrase-to-phrase alignments
• Conducted 5-fold cross-validation
  – 400 sentences for training and 100 for testing
• Calculated the F-measure:

F = \frac{2PR}{P + R}

P: precision
R: recall
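For reference, a one-line check of the metric; the precision and recall values below are made up for illustration, not taken from the experiments.

```python
# F-measure: harmonic mean of precision P and recall R.
def f_measure(p, r):
    return 2 * p * r / (p + r)

print(f_measure(0.70, 0.60))  # ≈ 0.646
```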


Results (F-measure)

Method      | All sentences       | All sentences       | Ambiguous sentences
            | w/ function words   | w/o function words  | w/ function words
Baseline    | 63.86               | 65.14               | 60.43
+CP-score   | 64.21               | 65.54               | 61.60
+ME         | 64.58               | 66.03               | 63.00
GIZA++      | 22.14               | 52.85               | 23.78


Discussion

• Clauses are not considered
  – Correspondences in the same clause of the source sentence are likely to be in the same clause of the target sentence
• Sentence complexity
  – The proposed method works effectively for long and complex sentences
• Preciseness of the dictionary
  – Erroneous correspondences from the dictionary have a bad effect on the alignment


Conclusion

• Proposed a probabilistic framework to improve structure-based alignment
• Proposed a new criterion, the CP-score, for evaluating alignments
• Integrated the ME model into the alignment approach


Future Work

• Refine the CP and the CP-score
  – Consider clauses
• Select the feature functions
• Test our method on other corpora
  – Longer and more complex sentences