Effective Semantics for Engineering NLP Systemsexobrain.kr/images/(2-1)Effective Semantics...

Exobrain Symposium, 2018

Effective Semantics for

Engineering NLP

Systems

Andre Freitas

Outline

• Motivation

• From Text to Knowledge Graphs (KGs)

• Knowledge Graphs & Distributional Semantics

– (A marriage made in heaven?)

• Using KGs for building AI applications

• Explainability

• Take-away Message

Coping with the long tail of data

variety frequency of use

# of entities and attributes

relational NoSQL

schema-less unstructured

Consumption (Querying, Software)

Formalization

Semiotic breakdown

Exponential growth of

vocabulary size

“On our best behaviour”

“We need to return to our roots in Knowledge

Representation and Reasoning for language and from

language.”

Levesque, 2013

“We should not treat English text as a monolithic

source of information. Instead, we should carefully

study how simple knowledge bases might be used to

make sense of the simple language needed to build

slightly more complex knowledge bases, and so on.”

Explainable AI

“The right to explanation”

“… such processing should be

subject to suitable safeguards,

… to obtain an explanation of

the decision …”

Building Knowledge Graphs

Open Information Extraction

• Extracting unstructured facts from text.

• TextRunner [Banko et al., IJCAI ’07], WOE [Wu & Weld,

ACL ‘10].

• ReVerb [Fader et al., EMNLP ‘11].

• OLLIE [Mausam et al., EMNLP ‘12].

• OpenIE [Mausam et al., IJCAI ‘16].

• Graphene [Niklaus et al, COLING 17].

Graphene

• Captures contextual relations.

• Extends the default Open IE representation in

order to capture inter-proposition relationships.

• Increase the informativeness and

expressiveness of the extracted tuples.

Niklaus et al., A Sentence Simplification System for Improving Relation Extraction,

COLING (2017)

Simplification/

Transformation Stage

Rhetorical Relations

Input Document

Transformation Stage

Relation Extraction

Asian stocks fell anew and the yen rose to session highs

in the afternoon as worries about North Korea simmered,

after a senior Pyongyang official said the U.S. is

becoming ``more vicious and more aggressive'' under

President Donald Trump .

Asian stocks fell anew

The yen rose to session highs in the afternoon

spatial

attribution

Worries simmered about North Korea

The U.S. is becoming becoming `` more vicious and more aggressive '' under Donald Trump

A senior Pyongyang official said

background

Precision:

Recall:

Improving Open Relation Extraction

using Clausal and Phrasal

Disembedding, Under Review, (2017)

What to expect

(Wikipedia & Newswire)

https://github.com/Lambda-3/Graphene

Niklaus et al., A Sentence Simplification System for Improving Relation Extraction,

COLING (2017)

Software: Extracting Knowledge

Graphs from Text

Argumentation Structures

Stab & Gurevych, Parsing Argumentation Structures in Persuasive

Essays, 2016.

Argumentative Discourse

Unit Classification

Stab & Gurevych, Parsing Argumentation Structures in Persuasive

Essays, 2016.

Argumentation Schemes

Argument Mining Approaches

What to expect?

F1-score: 0.74

Stab & Gurevych, Parsing

Argumentation Structures in

Persuasive Essays, 2016.

Emerging perspectives

• The evolution of parsing and classification methods in NLP

is inducing a new lightweight semantic representation.

• This representation dialogues with elements from logics,

linguistics and the Semantic/Linked Data Web (especially

• However, they relax the semantic constraints of previous

models (which were operating under assumptions for

deductive reasoning or databases).

• Knowledge graphs as lexical semantic models operating

under a semantic best-effort mode (canonical identifiers

when possible, otherwise, words).

• Possibly closer to the surface form of the text.

• Priority is on segmenting, categorizing and when

possible, integrating.

• A representation (data model) convenient for AI

engineering.

Categorization

A fact (main clause):

* Can be a taxonomic fact.

term, URI term, URI term, URI

instance,

class,

triple

type, property,

schema property

instance,

class,

triple

Categorization

A fact with a context:

s0 p0 o0

reification

• subordination

(modality, temporality,

spatiality, RSTs)

• fact probability

• polarity

Categorization

Coordinated facts:

s0 p0 o0

s1 p1 o1

p2 e.g.

• coordination

• RSTs

• ADU

See all ›

18 References

Nam et al., SRDF: A Novel Lexical Knowledge Graph for Whole Sentence

Knowledge Extraction, LDK 2017.

https://github.com/Lambda-3/Graphene/blob/master/wiki/RDFNL-Format.md

Knowledge Graphs &

Distributional Semantics

(A marriage made in heaven?)

• Computational models that build contextual

semantic representations from text.

• Semantic context is represented by a vector.

• Statistical analysis of the linguistic contexts of a

• Semantic similarity/relatedness as the core

operation over the model.

Distributional Semantic /

Word Vector Models

Distributional Semantics as

Commonsense Knowledge

Commonsense is here

Semantic Approximation is

Semantic Model with

low acquisition effort

Context Weighting Measures

Kiela & Clark, 2014

Similarity Measures

… and of course, Glove and W2V

Distributional-Relational

Networks

Distributional Relational Networks, AAAI Symposium (2013).

A Compositional-Distributional Semantic Model for Searching Complex Entity

Categories, ACL *SEM (2016)

Barack Obama

Sonia Sotomayor

nominated

First Supreme Court Justice of Hispanic descent

LSA, ESA, W2V, GLOVE, …

s0 p0 o0

Compositionality of Complex

Nominals

Barack Obama

Sonia Sotomayor

nominated

On Complex Nominals

Building on Word Vector

Space Models

• But how can we represent the meaning of longer phrases?

• By mapping them into the same vector space!

the country of my birth

the place where I was born

Mixture vs Function

Compositional-distributional

model for paraphrases

A Compositional-Distributional Semantic Model for

Searching Complex Entity Categories, *SEM (2016)

Software: Indra

• Semantic approximation server

• Multi-lingual (12 languages)

• Multi-domain

• Different compositional models

https://github.com/Lambda-3/indra

Semantic Relatedness for All (Languages): A Comparative Analysis

of Multilingual Semantic Relatedness using Machine Translation,

EKAW, (2016).

Recursive vs recurrent

neural networks

Segmented Spaces vs

Unified Space

s0 p0 o0

• Assumes is <s,p,o> naturally

irreconcilable.

• Inherent dimensional reduction

mechanism.

• Facilitates the specialization of

embedding-based approximations.

• Easier to compute identity.

• Requires complex and high-

dimensional tensorial model.

How to access Distributional-

Knowledge Graphs efficiently?

s0 p0 o0

Inverted index

sharding

disk access

optimization

Multiple Randomized

K-d Tree Algorithm

The Priority Search

K-Means Tree algorithm

Database + IR

planning

Cardinality

Indexing

Skyline

Bitmap

indexes

Structured Queries Approximation Queries

How to access Distributional-

Knowledge Graphs efficiently?

s0 p0 o0

Database + IR

Structured Queries Approximation Queries

Software: StarGraph

• Distributional Knowledge Graph Database.

• Word embedding Database.

https://github.com/Lambda-3/Stargraph

Freitas et al., Natural Language Queries over Heterogeneous

Linked Data Graphs: A Distributional-Compositional Semantics

Approach, 2014.

• Graph-based data models + Distributional Semantic Models

(Word embeddings) have complementary semantic values.

• Graph-based Data Models:

– Facilitates querying, integration and reasoning.

• Distributional Semantic Models:

– Supports semantic approximation, coping with vocabulary variation.

• AI systems require access to comprehensive background

knowledge for semantic interpretation tasks.

• Inheriting from Information Retrieval and Databases:

– General Indexing schemes,

– Particular Indexing schemes,

• Spatial, temporal, topological, probabilistic, causal, …

– Query planning,

– Data compression,

– Distribution,

– … even supporting hardware strategies.

• One size of embedding does not fit all: Operate with

multiple distributional + compositional models for different

data model types (I, C, P), different domains and different

languages.

• Inheriting from Information Retrieval and Databases:

– Indexing schemes,

– Query planning,

– Data compression,

– Query distribution,

– even supporting hardware.

Effective Semantic Parsing

for Large KBs

The Vocabulary Problem

Barack Obama

Sonia Sotomayor

nominated

The Vocabulary Problem

Barack Obama

Sonia Sotomayor

nominated

Latino origins

selected

Judge High

Last US president

“On our best behaviour”

“It is not enough to build knowledge bases without paying

closer attention to the demands arising from their use.”

Levesque, 2013

“We should explore more thoroughly the space of

computations between fact retrieval and full

automated logical reasoning.”

Minimizing the Semantic Entropy

for the Semantic Matching

Definition of a semantic pivot: first query term to be resolved in the database.

• Maximizes the reduction of the semantic configuration space.

• Less prone to more complex synonymic expressions and abstraction-level differences.

• Semantic pivot serves as interpretation context for the remaining alignments.

• Heuristics for ambiguity.

Distributional Inverted Index

Distributional-

Relational Model

Reference

Commonsense

corpora

Core semantic approximation

& composition operations

Semantic Parser

Query Plan

Scalable semantic

parsing

Learn to Rank

Question Answers

𝒒 = 𝒕Γ𝟎, … , 𝒕Γ

t h0 t m10

Γ= {𝑰, 𝑷, 𝑪, 𝑽} …

lexical specificity # of senses lexical category

… …

- Vector neighborhood density

- Semantic differential

𝒒 = 𝒕Γ𝟎, … , 𝒕Γ

t h0 t m10

Γ= {𝑰, 𝑷, 𝑪, 𝑽} …

… …

𝒒 = 𝒕Γ𝟎, … , 𝒕Γ

t h0 t m10

Γ= {𝑰, 𝑷, 𝑪, 𝑽} …

… …

Δ𝑠𝑟

Δ𝑟

Semantic pivoting

- Distributional compositionality

𝒒 = 𝒕Γ𝟎, … , 𝒕Γ

t h0 t m10

Γ= {𝑰, 𝑷, 𝑪, 𝑽} …

… …

t h0 t m10

o t h0 t m10

t m10 =

… …… …

What to expect (@ QALD1)

F1-Score: 0.72

MRR: 0.5

Freitas & Curry, Natural Language Queries over Heterogeneous

Linked Data Graphs, IUI (2014).

• Schema-agnostic / distributional

database.

http://stargraph.net

Explainability

T: IBM cleared $18.2 billion in the first quarter.

H: IBM’s revenue in the first quarter was $18.2 billion.

Entailment?

- To clear is to yield as a net profit

- A net profit is an excess of revenues

over outlays in a given period of time

Recognizing and Justifying Text Entailment

through Distributional Navigation on Definition

Graphs, AAAI (2018).

Explainable AI

Explainable Findings

From Tensor Inferences Back to KGs

Explainable Findings

From Tensor Inferences Back to KGs

Entity Linking

Integration

Co-reference Resolution

KG Completion

Natural Language Inference

Named Entity Recognition

Semantic Parsing

KG Construction

Inference

Distributional Semantics

Server

Query By Example

spatial

temporal

probabilistic

causal

Indexes

NL Generation

NL Query

Answers

Explanations

Open IE

Taxonomy Extraction

Arg. Classif.

Definition Extraction

Take-away Message

• The evolution of methods, tools and the availability of data in NLP

creates the demand for a knowledge representation method to

support complex AI systems.

• A relaxed version of RDF* (SRDF, RDF-NL) can provide this

answer.

– Establishes a dialogue with a standard (with existing data).

– Inherits optimization aspects from Databases.

• Word-embeddings (DSMs) + compositional models + RDF*.

• Moving beyond facts and taxonomies: rhetorical structures,

arguments, polarity, stories.

Take-away Message

• Syntactical and lexical features can go a long way for

structuring text.

– Context-preserving

• KGs can support explainable AI:

– Meeting point between extraction, reasoning and querying.

• Scalable Querying: Inherit infrastructures from DB and IR.

Take-away Message

Opportunities:

• ML orchestrated pipelines with:

– Richer discourse-representation models.

– Explicit semantic representations (centered on KGs).

– Different compositional/distributional models (beyond

W2V & Glove)

• KGs and impact on explainability.

Papers & Software

http://andrefreitas.org/

Effective Semantics for Engineering NLP Systemsexobrain.kr/images/(2-1)Effective Semantics...

Documents

Transcript of Effective Semantics for Engineering NLP Systemsexobrain.kr/images/(2-1)Effective Semantics...

Computational Semantics - Shane · Computational Semantics LING 571 — Deep Processing for NLP October 28, 2019 1. Announcements HW5: your grammar should use rules and features that

EFFECTIVE NEGOTIATION SKILLS USING NLP SBL Effective... · EFFECTIVE NEGOTIATION SKILLS USING NLP October 31 - November 1, 2012 | 9.00am - 5.00pm | Concorde Hotel Shah Alam CONTENTS

CSE391 – 2005 NLP 1 Natural Language Processing Machine Translation Predicate argument structures Syntactic parses Lexical Semantics Probabilistic Parsing.

Session 4: The Semantics of Verbs L113 Word Meaning and ... · L113 Word Meaning and Discourse Understanding Session 4: The Semantics of Verbs ... NLP methods for nding verb similarities

Introduction to Semantics and Pragmatics. LING 2000 - 2006 NLP 2 NLP tends to focus on: Syntax – Grammars, parsers, parse trees, dependency structures.

Lexical Semantics & Word Sense Disambiguation Ling571 Deep Processing Techniques for NLP February 16, 2011.

Knowledge extraction in Web media: at the frontier of NLP, Machine Learning and Semantics

[PPT]LING 180 Intro to Computer Speech and Language …kathy/NLP/ClassSlides/Slides09/... · Web viewLexical Semantics The meanings of individual words Formal Semantics (or Compositional

Statistical NLP Spring 2010 Lecture 21: Compositional Semantics Dan Klein – UC Berkeley Includes slides from Luke Zettlemoyer.

Answer Extraction: Redundancy & Semantics Ling573 NLP Systems and Applications May 24, 2011.

Semantics and Pragmatics of NLP Lexical Semantics: Polysemy · Semantics and Pragmatics of NLP Lexical Semantics: Polysemy Alex Lascarides School of Informatics University of Edinburgh

Effective Semantics for Engineering NLP Systemsucrel.lancs.ac.uk/crs/attachments/UCRELCRS-2018-05... · Semantic Relatedness for All (Languages): A Comparative Analysis of Multilingual

Semantics and Pragmatics of NLP Lexical Semantics: Machine ... › ... › spnlp › lectureslides › 10.slides.pdf · Machine Learning What is Logical Metonymy? Semantic type of

Using semantics and NLP in experimental protocols

Social Networks and Multimedia Semantics · 2008-09-11 · • Thesis: Social Networks and the Semantic Web – Researcher at Yahoo! Research Barcelona • NLP and semantics • Semantic

Effective Semantics for Engineering NLP Systems211.109.9.33/images/(2-1)Effective Semantics for Engineering NLP... · • Knowledge graphs as lexical semantic models operating under

Lecture 6: Vector Semantics and Word EmbeddingsCS447 Natural Language Processing (J. Hockenmaier) Two ways NLP uses context for semantics Distributional similarities (vector-space

Truth-Conditional Semantics Statistical NLPklein/cs288/sp10/slides... · 2010-04-12 · 1 Statistical NLP Spring 2010 Lecture 20: Compositional Semantics Dan Klein –UC Berkeley

NLP - University of Michiganweb.eecs.umich.edu/~radev/coursera-slides/nlpintro_co3...Introduction to NLP Other Levels of Linguistic Analysis Semantics • Semantics – Lexical semantics

CS460/626 : Natural Language Processing/Speech, NLP and the Web (Lecture 37– Semantics; Universal Networking Language) Pushpak Bhattacharyya CSE Dept.,