Recent Advances in Distantly Supervised Relation Extractionwilliam/papers/Part1_Distant... ·...

Recent Advances in Distantly Supervised Relation Extraction

William WangDepartment of Computer Science

University of California, Santa Barbara

Joint work with Jiawei Wu, Lei Li, Pengda Qin, Weiran Xu.

CIPS Summer School 2018

Beijing, China

Agenda

• Motivation

• Challenges in Semi-Supervised Learning • Reinforced Co-Training (Wu et al., NAACL 18)• Reinforced Distant Supervision Relation Extraction

(Qin et al., ACL 18a)

• DSGAN (Qin et al., ACL 18b)• Conclusions

Motivation

• Most of the existing successful stories of deep learning are still based on supervised learning.

• For example, object recognition, machine translation, text classification.

• However, in many applications, it is not realistic to obtain large amount of labeled data.

• We need to leverage unlabeled data.

A Classic Example of Semi-Supervised Learning

• Co-Training (Blum and Mitchell, 1998)

Challenges• The two classifiers in co-training have to be

independent.• Choosing highly-confident self-labeled examples

could be suboptimal.• Sampling bias shift is common.

Our Approach: Reinforced SSL

• Assumption: not all the unlabeled data are useful.

• Idea: performance-driven semi-supervised learning that learns an unlabeled data selection policy with RL, instead of using random sampling.

• 1. Partition the unlabeled data space• 2. Train a RL agent to select useful unlabeled data• 3. Reward: change in accuracy on the validation set

Reinforcement Learning

Environment

𝑎𝑡𝑠$%&

𝑟$%&

𝑠$𝑟$

Agent Environment

DeepQ-Network UnlabeledData

Reinforced Co-Training(Wu et al., NAACL 2018)

Deep Q-Learning

Experiment 1: Clickbait Detection

Experiment 2: Generic Text Classification

Conclusion

• We proposed a novel RL framework for semi-supervised learning• Strong results in SSL text classification• Also showed effectiveness in relation extraction

Deep Reinforcement Learning for Distantly Supervised Relation Extraction

Outline

• Motivation• Algorithm• Experiments• Conclusion

Outline

Plain Text Corpus(Unstructured Info)

Classifier Entity-relation Triple(Structured Info)

Relation Type withLabeled Dataset

Relation Type withoutLabeled Dataset

Relation Extraction

“If two entities participate in a relation, any sentencethat contains those two entities might express thatrelation.” (Mintz, 2009)

Distant Supervision

Data(x): <Belgium, Nijlen>Label(y): /location/contains

Target Corpus(Unlabeled)

DS Label(y): /location/contains

Nijlen is a municipalitylocated in the Belgianprovince of Antwerp.

Distant Supervision

DS Data(x):

v Within-Sentence-Bag Level

v Entity-Pair Level

§ Hoffmann et al., ACL 2011.§ Surdean et al., ACL 2012.§ Zeng et al., ACL 2015. § Li et al., ACL 2016.

§ None

Wrong Labeling

§ Place_of_Death

i. Some New York city mayors – William O’Dwyer, Vincent R. Impellitteri and Abraham Beame – were born abroad.

ii. Plenty of local officials have, too, including two New York city mayors, James J. Walker, in 1932, and William O’Dwyer, in 1950.

Wrong Labeling

v Entity-Pair Level

v Most of entity pairs only have several sentences

1 Sentence55%2 Sentence

Other4%

Wrong Labeling

v Lots of entity pairs have repetitive sentences

Outline

Sentence-Level Indicator

Entity-Pair Level Wrong Labeling Problem

Learn a Policy to Denoise theTraining Data

General Purpose and Offline Process

Without Supervised Information

Requirements

Negative set

Positive set

Negative set

Positive setFalse Positive Policy Based Agent

𝑅𝑒𝑤𝑎𝑟𝑑

𝐴𝑐𝑡𝑖𝑜𝑛

Classifier𝑇𝑟𝑎𝑖𝑛

False Positive

DS Dataset Cleaned Dataset

Overview

v State

v Action

v Reward§ ???

Deep Reinforcement Learning

§ Sentence vector§ The average vector of previous removed sentences

§ Remove & retain

v One relation type has an agent

v Sentence-level

v Split into training set and validation set

§ Positive: Distantly-supervised positive sentences§ Negative: Randomly sampled

RL Agent Train

TrainRelationClassifier

RelationClassifier

𝐹&34&

𝐹&3× +𝓡3 + ×(−𝓡3)

Noisydataset𝑃$=>3

Cleaneddataset

Removedpart

𝓡3 = 𝛼(𝐹&3 - 𝐹&34& )

RL Agent

Epoch i-1

EpochiNoisy

dataset𝑃$=>3

+𝑁$=>3

𝑁$=>3 +

Positive Set Negative Set

§ Accurate

§ Steady

§ Fast

Reward

False Positive

§ Obvious

Positive Set Negative Set

RelationClassifier

PreTrain RelationClassifier

Calculate

Epoch 𝑖

Reward

False Positive

Positive Negative

Outline

v Dataset: SemEval-2010 Task 8v True Positive: Cause-Effectv False Positive: Other

Evaluation on a Synthetic NoiseDataset

v True Positive + False Positive: 1331 samples

0 10 20 30 40 50 60 70 80 90 100

F1Score

200 FPs in 1331 Samples

(198/388)

(197/339)

(195/308)(180/279) (179/260)

False PositiveRemoved Part

0 10 20 30 40 50 60 70 80 90 100

F1Score

0 FPs in 1331 samples

(0/258)

(0/150)

(0/121)

(0/59)(0/32)

v CNN+ONE, PCNN+ONE§ Distant supervision for relation extraction via piecewise

convolutional neural networks. (Zeng et al., 2016)

v CNN+ATT, PCNN+ATT§ Neural relation extraction with selective attention over

instances. (Lin et al., 2016)

Distant Supervisionon NYT Freebase Dataset

Distant Supervision

0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4

CNN-basedCNN+ONE

CNN+ONE_RL

CNN+ATT

CNN+ATT_RL

Distant Supervision

0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4

PCNN-based

PCNN+ONE

PCNN+ONE_RL

PCNN+ATT

PCNN+ATT_RL

Outline

vWe propose a deep reinforcement learningmethod for robust distant supervision relationExtraction.

v Our method is model-agnostic.

v Our method boost the performance of recentlyproposed neural relation extractors.

Conclusion

DSGAN: Adversarial Learning for DenoisingDistantly Supervised Relation Extraction

(Qin et al., ACL 2018b)

DS data space

DS Positive Data

DS Negative Data

Distant Supervision Data Distribution

DS data space

DS False PositiveData

DS True Positive Data

DS Negative Data

The Decision Boundaryof DS Data

The DesiredDecision Boundary

Data Distribution

Adversarial Training

True Positive

False Positive

Noisy Positive Set

Generator

True Positive

False Positive

Label1

Label0

Label1

DSGAN (Qin et al., ACL 2018b)

Epoch 𝐢

𝐁𝐚𝐠𝐤4𝟏

𝐁𝐚𝐠𝐤

𝐁𝐚𝐠𝐤%𝟏

DSPositiveDataset

𝑠&𝑠I𝑠J

𝑝& = 0.57

𝑝I = 0.02

𝑝J = 0.83

𝑝T = 0.26

𝑝V = 0.90

Sampling

𝐥𝐚𝐛𝐞𝐥 = 𝟏

𝐥𝐚𝐛𝐞𝐥 = 𝟎D

𝒓𝒆𝒘𝒂𝒓𝒅

DS positivedataset

𝐥𝐚𝐛𝐞𝐥 = 𝟏

𝐥𝐚𝐛𝐞𝐥 = 𝟎

Pre-training

𝑝K= 0.7

High-confidencesamples

Low-confidencesamples

Generator Discriminator

DS negativedataset

v Sentence-Level Noise Reduction

v Training Without Supervised Information

v Model-Agnostic

Characteristics

Distant Supervision Relation Extraction

0 0.1 0.2 0.3 0.4

CNN-basedCNN+ONECNN+ONE_GANsCNN+ATTCNN+ATT_GANs

Distant Supervision Relation Extraction

0 0.1 0.2 0.3 0.4

PCNN-basedPCNN+ONEPCNN+ONE_GANsPCNN+ATTPCNN+ATT_GANs

Conclusion

• We introduce Reinforced Co-Training, a new approach that combines reinforcement learning and semi-supervised learning.• We show that in weakly-supervised relation

extraction, reinforcement learning can be utilized to de-noise the training signals.• Adversarial learning serves as a joint learning

framework, and it can also be applied to de-noising distantly supervised IE data.

Thanks!

http://nlp.cs.ucsb.edu

Recent Advances in Distantly Supervised Relation Extractionwilliam/papers/Part1_Distant... ·...

Documents

Transcript of Recent Advances in Distantly Supervised Relation Extractionwilliam/papers/Part1_Distant... ·...

Seed Selection for Distantly Supervised Web-Based Relation Extraction

Populating Web Scale Knowledge Graphs using Distantly ... · a deep learning based technology for relation extraction that can be trained by a distantly supervised approach. In addition

DESCGEN: A Distantly Supervised Datasetfor Generating ...

DDMagazine march 2011 - “DISTANTLY”

Progressive Combinatorial Algorithm for Multiple Structural Alignments:Application to Distantly Related Proteins Maria Elena Ochagavia and Shoshana Wodak.

Paul Beame University of Washington

1 CSE 417: Algorithms and Computational Complexity Winter 2001 Lecture 14 Instructor: Paul Beame.

Robust Distant Supervision Relation Extraction via Deep ... · Some New York city mayors –William O’Dwyer, Vincent R. Impellitteri and Abraham Beame – were born abroad. ii.

1 Time-space tradeoff lower bounds for non-uniform computation Paul Beame University of Washington 4 July 2000.

Dakota Humphries (Project Lead) Thomas Impellitteri (Tech Lead) Daryl McGhee II (Design Lead) Keith Rosier (Asset Lead)

CERES: Distantly Supervised Relation Extraction from the Semi … · 2019-07-12 · CERES: Distantly Supervised Relation Extraction from the Semi-Structured Web Colin Lockard University

Improving Distantly-Supervised Neural Relation …...Improving Distantly-Supervised Neural Relation Extraction using Side Information Shikhar Vashishth, Rishabh Joshi, Sai S. Prayaga,

S KEW IN P ARALLEL Q UERY P ROCESSING Paraschos Koutris Paul Beame Dan Suciu University of Washington PODS 2014.

hw1 scan - University of Washington · Title: hw1_scan.pdf Author: beame Created Date: 9/28/2012 4:30:57 PM

Beame User Manual Electronic · USER MANUAL. Title: Beame User Manual Electronic Created Date: 6/18/2019 12:18:15 PM

Arsenic Treatment Technologies Christopher A. Impellitteri USEPA/ORD/WSWRD/WQMB Water Technologies for Rural Texas Tuesday, December 2, 2003.

Review of Basic Computational Complexityreif/courses/complectures/Beame/root.pdfMichael Sipser, Introduction to the Theory of Computation. Christos Papadimitrion, Computational Complexity

Performing Bayesian Inference by Weighted Model Counting Tian Sang, Paul Beame, and Henry Kautz Department of Computer Science & Engineering University.

1 Paul Beame University of Washington Phase Transitions in Proof Complexity and Satisfiability Search Dimitris Achlioptas Michael Molloy Microsoft Research.

· g.go l.go 3 PTAS 6.800,' Z goo. e AL BEN SEGMENTS BAD IMPELLITTERI Oder JOSHUA bewiesen, REDAK¶ONS.CUECK 01. ROB …