Ensemble Solutions for Link-Prediction in Knowledge Graphs

Denis Krompaß1,2 and Volker Tresp1,2

1 Department of Computer Science, Ludwig Maximilian University
2 Corporate Technology, Siemens AG

12.09.2015

Outline

1. Knowledge Graphs, what are they and what are they good for?

2. Representation Learning in Knowledge Graphs
– State of the Art Latent Variable Models
– Integrating Prior Knowledge about Relation-Types

3. Analyzing the Complementary “Potential” of State of the Art Representation Learning Algorithms

Knowledge Graphs

Store facts about the world as relations between entities.

Entities are no longer just strings but real-world objects with attributes, taxonomic information and relations to other objects, e.g. (AlbertEinstein, bornIn, Ulm).

Provide a machine with semantic information for:
• Search engines
• Information retrieval
• Word-sense disambiguation
• …

Prominent examples:
• Google Knowledge Graph
• IBM Watson
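As a minimal sketch of the triple notation used on this slide, a knowledge graph can be held as a set of (subject, relation-type, object) tuples. Only (AlbertEinstein, bornIn, Ulm) comes from the slide; the other facts and the helper names `build_index` and `objects_of` are illustrative assumptions.

```python
from collections import defaultdict

# Toy facts; only the first triple appears on the slide, the rest are assumed.
triples = {
    ("AlbertEinstein", "bornIn", "Ulm"),
    ("AlbertEinstein", "type", "Person"),
    ("Ulm", "type", "City"),
}


def build_index(triples):
    """Group (relation, object) pairs by subject for fast lookup."""
    index = defaultdict(set)
    for subj, rel, obj in triples:
        index[subj].add((rel, obj))
    return index


def objects_of(index, subj, rel):
    """Return all objects o such that (subj, rel, o) is a stored fact."""
    return sorted(obj for r, obj in index.get(subj, ()) if r == rel)


index = build_index(triples)
print(objects_of(index, "AlbertEinstein", "bornIn"))
```

Link-prediction then asks which triples that are missing from this set are likely to be true.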

Learning in Knowledge Graphs

[Figure: Knowledge Graph Triples; Latent Variable Model; embedding vectors for AlbertEinstein and bornIn; Similarities]

1. Link-Prediction
2. Link-based Clustering
3. Disambiguation

Latent representations (or embeddings) for Entities and Relation-Types that disentangle complex relationships observed in the data (semantics).

State of the Art Latent Variable Models

1. RESCAL
– Third-Order Tensor Factorization Method
– Least-Squares Cost Function

2. TransE
– Distance-based Method
– Ranking Cost Function

3. Google Knowledge Vault
– Multi-way Neural Network (mwNN)
– Logistic Cost Function

Problem:
• Large Knowledge Graphs contain millions of entities and thousands of relation-types
• Low-dimensional representations have to be learned
• Try to find ways to increase prediction quality under this constraint
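The three model families above score a triple (s, p, o) in different ways. The sketch below shows the commonly cited forms of these scoring functions in plain Python. It is an illustration under assumptions, not the authors' implementation: the embedding values, the weight matrices `W` and `w_out`, and the choice of d=2 are made up, and training (the cost functions) is not shown.

```python
import math


def rescal_score(a_s, R_p, a_o):
    """RESCAL-style bilinear score a_s^T R_p a_o (higher = more plausible)."""
    d = len(a_s)
    return sum(a_s[i] * R_p[i][j] * a_o[j] for i in range(d) for j in range(d))


def transe_score(e_s, r_p, e_o):
    """TransE-style score: negative L2 distance between e_s + r_p and e_o."""
    return -math.sqrt(sum((s + r - o) ** 2 for s, r, o in zip(e_s, r_p, e_o)))


def mwnn_score(e_s, r_p, e_o, W, w_out):
    """mwNN-style score: one tanh hidden layer on the concatenated
    embeddings, followed by a logistic output unit."""
    x = e_s + r_p + e_o  # list concatenation, length 3 * d
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in W]
    z = sum(w * h for w, h in zip(w_out, hidden))
    return 1.0 / (1.0 + math.exp(-z))


# Assumed toy embeddings with d = 2.
einstein, ulm = [0.2, -0.9], [0.1, 0.2]
born_in_vec = [-0.1, 1.1]                # TransE relation vector
born_in_mat = [[1.0, 0.0], [0.0, 1.0]]   # RESCAL relation matrix
W = [[0.1] * 6, [-0.2] * 6]              # mwNN hidden weights (2 units)
w_out = [0.5, -0.5]                      # mwNN output weights

print(rescal_score(einstein, born_in_mat, ulm))
print(transe_score(einstein, born_in_vec, ulm))
print(mwnn_score(einstein, born_in_vec, ulm, W, w_out))
```

Note that the three outputs live on different scales (unbounded, non-positive, and in (0, 1)), which is why the raw scores cannot be averaged directly.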

Prior Knowledge about Relation-Types

Denis Krompaß, Stephan Baier and Volker Tresp. Type-Constrained Representation Learning in Knowledge Graphs. 14th International Semantic Web Conference (ISWC), 2015

Domain and Range Constraints for Relation-Types

• Type-Constraints (from the schema)
• Local closed-world assumption (from the data)
• Used for integration in model training and for the prediction of new triples
• Latent variable models: RESCAL, TransE, Google Knowledge Vault neural network, each with low-dimensional embeddings
• Link-Prediction improvement:
– +77% (Freebase*)
– +40% (YAGO2*)
– +54% (DBpedia-Music*)

*Results on large samples from these knowledge graphs
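The idea of restricting predictions by domain and range can be sketched as follows. This is a simplified illustration, not the procedure from the cited ISWC paper: under the local closed-world assumption the domain and range of a relation-type are approximated here by the entities observed as its subjects and objects, and the function names and toy facts are assumptions.

```python
def local_closed_world(triples):
    """Approximate domain/range per relation-type from observed triples:
    domain = entities seen as subject, range = entities seen as object."""
    domain, range_ = {}, {}
    for s, p, o in triples:
        domain.setdefault(p, set()).add(s)
        range_.setdefault(p, set()).add(o)
    return domain, range_


def candidate_objects(subject, relation, domain, range_, known):
    """Return unobserved objects worth scoring for (subject, relation, ?):
    only entities in the relation's range, and only if the subject lies
    in the relation's domain."""
    if subject not in domain.get(relation, set()):
        return []
    return sorted(
        o for o in range_.get(relation, set())
        if (subject, relation, o) not in known
    )


# Assumed toy facts for illustration.
known = {
    ("AlbertEinstein", "bornIn", "Ulm"),
    ("MaxPlanck", "bornIn", "Kiel"),
    ("AlbertEinstein", "field", "Physics"),
}
domain, range_ = local_closed_world(known)
print(candidate_objects("AlbertEinstein", "bornIn", domain, range_, known))
print(candidate_objects("Ulm", "bornIn", domain, range_, known))
```

With schema-based Type-Constraints, the same `candidate_objects` filter would apply, with `domain` and `range_` filled from the class membership declared in the schema instead of from the data.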

Complementary Prediction?

• State of the art models differ in many aspects: diverse predictors

• Analysis of the degree to which the models are complementary
– Combine through arithmetic mean
– Use Platt scaling for mapping the different outputs to probabilities

• 70% Training Set
• 10% Validation Set (Hyperparameter Tuning + Platt Scaling)
• 20% Test Set
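The combination step can be sketched as below: each model's raw scores are mapped to probabilities with a sigmoid fitted on validation data, and the calibrated probabilities are averaged. This is a simplified variant under assumptions: the sigmoid is fitted by plain gradient descent on the log loss, without the target smoothing of Platt's original method, and the validation scores, labels, and function names are made up for illustration.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_platt(scores, labels, lr=0.1, epochs=2000):
    """Fit p = sigmoid(a * score + b) by gradient descent on the log loss."""
    a, b = 1.0, 0.0
    n = len(scores)
    for _ in range(epochs):
        grad_a = grad_b = 0.0
        for s, y in zip(scores, labels):
            err = sigmoid(a * s + b) - y
            grad_a += err * s
            grad_b += err
        a -= lr * grad_a / n
        b -= lr * grad_b / n
    return a, b


def ensemble_probability(raw_scores, calibrators):
    """Calibrate each model's raw score, then take the arithmetic mean."""
    probs = [sigmoid(a * s + b) for s, (a, b) in zip(raw_scores, calibrators)]
    return sum(probs) / len(probs)


# Assumed validation scores for two models on the same four triples.
labels = [1, 1, 0, 0]
transe_val = [-0.2, -0.5, -2.0, -3.1]  # negative distances
rescal_val = [0.9, 0.4, 0.1, -0.3]     # bilinear scores
calibrators = [fit_platt(transe_val, labels), fit_platt(rescal_val, labels)]

# Ensemble probability for a test triple with raw scores from both models.
print(ensemble_probability([-0.4, 0.7], calibrators))
```

Fitting the calibrators on the validation set, as on the slide, keeps the test set untouched for the final comparison.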

Results

1. The ensemble always has much better link-prediction quality

2. TransE and mwNN complement each other best

3. RESCAL provides complementary predictions only in the case of the Freebase dataset

4. For the local closed-world assumption, very similar observations could be made

Summary

• Models are complementary to each other
– This applies especially when low-dimensional embeddings are used (d=10)
– Ensemble with d=10 is comparable to the best single predictor with d=100
– Up to more than 10% improvement on top of the improvements achieved when Type-Constraints or the Local closed-world assumption are exploited

Questions ?

http://www.dbs.ifi.lmu.de/~krompass/

Denis.Krompass@siemens.com
