Jointly Identifying Temporal Relations with Markov Logic Katsumasa Yoshikawa †, Sebastian Riedel...

Jointly Identifying Temporal Relations with Markov Logic

Katsumasa Yoshikawa†, Sebastian Riedel‡, Masayuki Asahara†, Yuji Matsumoto†

†Nara Institute of Science and Technology, Japan‡ University of Massachusetts, Amherst

ACL-IJCNLP2-7 August, 2009 Suntec Singapore

2

Outline

Background and Motivation Related work of temporal relation

identification Proposed global approach with Markov

Logic Experimental setup and highlighted data Summary and future work

3

Background and Motivation Temporal Relation Identification (temporal ordering)

Identifying temporal orders of events and time expressions in a document

introduction

PresentPast Future

Document Creation Time(August 2009)

became

BEFORE

2003

Essential work for document understanding

With the introduction of the TimeBank corpus (Pustejovsky et al., 2003), machine learning approaches to temporal ordering became possible.

4

Outline

Background and Motivation Related work of temporal relation

identification Proposed global approach with Markov


5

EVENT / TIME

AFTER

IAFTER

ENDS

DURING

BEGUN_BY

SIMULTANEOUS

BEGINS

ENDED_BY

IBEFORE

BEFORE

>

mi

oi

f

d

si

=

s

c

fi

o

m

<

after

met-by

overlapped-by

finishes

during

started-by

equal

startscontains

finished-by

overlapsmeets

before

TimeML(11 Labels)

Allen’s(13 Labels)

INCLUDES

Allen‘s Temporal Logic [Allen 1983] TimeML and TimeBank [Pustejovsky et al. 2003]

We regard temporal ordering as a classification task With TimeML, the TimeBank corpus was created

6

TempEval (SemEval 2007 Task 15)

Temporal Relation Identification in SemEval 2007 Shared Task (TempEval)

Six temporal relation labels Main Label (BEFORE, AFTER ， OVERLAP) Sub-Label (BEFORE-OR-OVERLAP, OVERLA

P-OR-AFTER, VAGUE) TempEval includes three types of tasks (A,

B, and C)

7

introduction

DCT(August 2009)

became

2003

OVERLAP

Task A of TempEval

Temporal relations between events and time expressions that occur within the same sentence

Past FuturePresent


8

introduction

DCT(August 2009)

became

2003

Task B of TempEval

Temporal relations between events and the Document Creation Time (DCT)

BEFORE BEFORE

Past FuturePresent


9

BEFOREcreated

DCT(August 2009)

became

2003

Task C of TempEval

Temporal relations between the main events of adjacent sentences

Past FuturePresent

The TimeBank corpus was created (Pustejovsky et al., 2003). As a result, machine learning approaches to temporal ordering became possible.

10

Issues of the TempEval Participants Local approaches with machine learning are empl

oyed by many participants in TempEval Considering only a single relation at a time Local approach cannot take into account the other

relations

A global approach can be useful in that case

EVENT 1 EVENT 2BEFORE (Task C)AFTER ? (Task C)

DCT

EVENT 1

BEFORE (Task B)

EVENT 2

AFTER (Task B)DCT

11

A global approach can be useful in that case

Issues of the TempEval Participants Local approaches with machine learning are empl

oyed by many participants in TempEval Considering only a single relation at a time Local approach cannot take into account the other

relations

EVENT 1 EVENT 2BEFORE (Task C)

DCT

BEFORE (Task B) AFTER (Task B)

12

Outline

Background and Motivation Related work and task reviews of temporal

relation identification Proposed global approach with Markov


13

Overview of Our Global Approach

Ensure consistency among the multiple relations with hard and soft constraints based on the transition rules

Jointly identify the three types of relations in TempEval Learning one global model for the three tasks

Global approach with Markov Logic

14

Markov Logic[Richardson and Domingos, 2006]

A Statistical Relational Learning framework An expressive template language of Marko

v Networks Not only hard but also soft constraints A Markov Logic Network (MLN) is a set of p

airs (φ, w) where φ is a formula in first-order logic w is a real number weight

Higher weight stronger constraint

15

※ e1 and e2 are events

An Example of Markov Logic Networks hasPastTense(a) : indicates that an event a has past tense beforeDCT(a) : indicates that an event a happens before the DCT before(a,b) : indicates that an event a happens before another

event b

ID Weight function Weigh value Ground formula

(A1)　　 wa(e1) 　　 3.1 hasPastTense(e1) beforeDCT(e1)⇒

(A2) 　　 wa(e2) 　　 -0.9 hasPastTense(e2) beforeDCT(e2)⇒

(B1) wb(e1,e2) 1.7 beforeDCT(e1) ^ ¬ beforeDCT(e2) before(e1, e2)⇒

hasPastTense(e1)

beforeDCT(e1)

wa(e1)

beforeDCT(e2)

wb(e1,e2)

before (e1,e2) hasPastTense(e2)

wa(e2)grounding

16

Global Feature Representation (Predicate Definition)

relE2T(e, t, r) : the relation r between an event e and a time expression t relDCT(e, r) : the relation r between an event a and the DCT relE2E(e1, e2, r) : the relation r between two events e1 and e2 relT2T(t1, t2, r) : the relation r between two time expressions t1 and t2 dctOrder(t, r) : the relation r between a time expression t and the DCT

EVENT (e1)

DCT

TIME (t1)

EVENT (e2)

TIME (t2)

relE2E(C)

relDCT(B)

relE2T(A)

dctOrder dctOrder

relT2T

relDCT(B)

relE2T(A)

17

We jointly solve the three tasks of TempEval We use global features named Joint formulae A joint formula is based on a transition rule

EVENT (e1)

DCT

EVENT(e2)

BEFORE AFTER

BEFORE

BEFORE & AFTER BEFORE⇒

EVENT (e2)

DCT

EVENT(e1)

AFTER

BEFOREBEFORE

BEFORE & AFTER BEFORE⇒

If e1 happens before DCT and e2 happens after DCT => then e1 is before e2

If e1 happens before DCT and e1 happens after e2, => then e2 happens before DCT

Global Feature Representation (Transition Rules)

B→C C→B

18

Global Feature Representation (Templates of the all Joint Formulae)

Tasks Joint Formula (first-order logic)

A→B dctOrder(t1,r) & relE2T(e1, t1, r1) relDCT(e1,r2)⇒

B→A dctOrder(t1,r) & relDCT(e1,r1) relE2T(e1, t1, r2)⇒

B →C relDCT(e1, r1) & relDCT(e2, r2) relE2E(e1, e2, r3)⇒

C→B relDCT(e2, r1) & relE2E(e1,e2, r2) relDCT(e1, r3)⇒

A→C relE2T(e1,t1,r1) & relT2T(t1,t2,r2) & relE2T(e2,t2,r3) relE2E(e1,e2,r4)⇒

C→A relE2T(e2,t2,r2) & relT2T(t1,t2,r1) & relE2E(e1,e2,r3) relE2T(e1,t1,r4)⇒

They are developed with events, time expressions and relations

19

Global Feature Representation (Templates of the all Joint Formulae)

Tasks Joint Formula (first-order logic)

A→B dctOrder(t1,r) & relE2T(e1, t1, r1) relDCT(e1,r2)⇒

B→A dctOrder(t1,r) & relDCT(e1,r1) relE2T(e1, t1, r2)⇒

B →C relDCT(e1, BEFORE) & relDCT(e2, AFTER) relE2E(⇒ e1, e2, BEFORE)

C→B relDCT(e1, BEFORE) & relE2E(e1,e2, AFTER) relDCT(⇒ e2, BEFORE)

A→C relE2T(e1,t1,r1) & relT2T(t1,t2,r2) & relE2T(e2,t2,r3) relE2E(e1,e2,r4)⇒

C→A relE2T(e2,t2,r2) & relT2T(t1,t2,r1) & relE2E(e1,e2,r3) relE2T(e1,t1,r4)⇒

They are developed with events, time expressions and relations

20

Outline

Background and Motivation Related work and task reviews of temporal



21

Experimental Setup

Use a MLN Engine “Markov thebeast” Weight learning : MIRA Inference : Cutting Plane Inference

(base solver: ILP) [Riedel, 2008] Employ the local features referred to the ea

rly work in TempEval [SemEval, 2007] Select joint formulae as global features Use the same data and evaluation scheme

s of TempEval

22

Comparison of Local and Global

Local Global

Task A 0.613 0.662 (+0.049)

Task B 0.789 0.799 (+0.010)

Task C 0.533 0.552 (+0.019)

All 0.667 0.689 (+0.022)

Results with 10-fold cross validation on training data

Over all tasks, Global is better than Local On Task A, Global model outperformed Local one.

ρ< 0.01 (McNemar’s test, 2-tailed)

※All scores denote F1-value

23

Results with the other systems on test data (F1-value)

Comparison to State-of-the-art

Outperformed the others on Tasks A and C Always performed better than the best pure m

achine-learning based system (CU-TMP[Bethard and Martin, 2007])

Other Systems

TempEval Best

TempEval Average CU-TMP

Task A 0.62 0.56 0.61

Task B 0.80 0.74 0.75

Task C 0.55 0.51 0.54

Our Systems

Local Global

0.62 0.65

0.74 0.76

0.53 0.57※All scores denote F1-value

24

Outline

Background and motivation Related work and task reviews of temporal



25

Summary

We proposed a global framework with Markov Logic for Temporal Relation Identification

Our global model with joint formulae successfully improved the performances of the identifications

Our approach reported the competitive results among all participants in TempEval

26

Future Work

Issues inherent to the task and the dataset Low inter annotator agreement Low transitive connectivity Small size

Semi-supervised approaches ease some issues

TRAIN DEV TEST TOTAL

Task A 1359 131 169 1659

Task B 2330 227 331 2888

Task C 1597 147 258 2002

Numbers of labeled relations for all tasks and datasets

Jointly Identifying Temporal Relations with Markov Logic Katsumasa Yoshikawa †, Sebastian Riedel...

Documents

Transcript of Jointly Identifying Temporal Relations with Markov Logic Katsumasa Yoshikawa †, Sebastian Riedel...