
About Link prediction on Knowledge graph using TuckER

by Haoyu and Joss

March 2020


Content

Based on TuckER: Tensor Factorization for Knowledge Graph Completion (Balazevic et al., 2019)

What is a knowledge graph?

Link prediction

Previous Models

TuckER

Summary of the different models

Advantages of TuckER

Comparison of different algorithms

Implementation and Experiments

Conclusion


What is a knowledge graph (KG)?

Directed, labeled, graph-structured database that stores relations

Vertices: entities (objects, persons, situations, events, etc.)

Edges: relations

Based on real-world facts

Figure: An example of knowledge graph


What is a knowledge graph?

Figure: What the corresponding dataset looks like


What is a knowledge graph? Why do we care about link prediction?

Figure: Another example of knowledge graph


Examples of well-known knowledge graph datasets

FB15k (Freebase): 362 million facts, 4% are not symmetric, but suffers from test set leakage [1]

FB15k-237: created from FB15k by removing the inverse of many relations that are present in the training set

WN18 (WordNet) [2]: 60% are not symmetric, suffers from test set leakage [1]

WN18RR: subset of WN18 created by removing the inverse of many relations that are present in the training set

[1] Inverse relations from the training set were present in the test set.

[2] This dataset is often categorized as a semantic graph.
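The test set leakage mentioned above can be checked mechanically. The following is a minimal sketch, assuming triples are stored as (subject, relation, object) tuples; the helper name `leaked_test_triples` and the toy triples are hypothetical and are not taken from FB15k or WN18. A test triple is flagged when the same entity pair appears in reverse order in the training set, under any relation.

```python
def leaked_test_triples(train, test):
    """Return test triples (s, r, o) whose reversed pair (o, ?, s) occurs in train."""
    # Index every ordered entity pair seen in training, ignoring the relation label.
    train_pairs = {(s, o) for s, _, o in train}
    return [(s, r, o) for s, r, o in test if (o, s) in train_pairs]


# Hypothetical toy data, for illustration only.
train = [("paris", "capital_of", "france"), ("alice", "parent_of", "bob")]
test = [("france", "has_capital", "paris"), ("bob", "lives_in", "paris")]

leaked = leaked_test_triples(train, test)
print(leaked)
```

A model can answer such flagged test triples by memorizing the reversed training pair, which is why FB15k-237 and WN18RR remove inverse relations.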


What is a knowledge graph? Why do we care about link prediction?

We focus on the open-world assumption (OWA): a fact that is missing from the graph is unknown, not false.

The objective is to complete the graph (find out the unknown). Several tasks arise under the OWA:

Link prediction

Entity resolution: which nodes refer to the same real-world entity?

Link-based clustering: grouping entities based on the similarity of their links


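Link prediction

Also known as knowledge graph completion

Given a knowledge graph $G$, we let:

$\mathcal{E}$ denote the set of entities

$\mathcal{R}$ denote the set of relations

$(s, r, o) \in \mathcal{E} \times \mathcal{R} \times \mathcal{E}$ be a tuple (subject, relation, object)

$\zeta$ denote the set of tuples that are true in a world

$r \in \mathcal{R}$ is called symmetric if $(a, r, b) \in \zeta \Leftrightarrow (b, r, a) \in \zeta$

An embedding is a function $f_s : \mathcal{E} \to V$, $f_r : \mathcal{R} \to V$, or $f_o : \mathcal{E} \to V$, where $V$ is a vector space

A tensor factorization model is a tuple $(\mathcal{E}, \mathcal{R}, f_s, f_r, f_o, \phi)$, where $\phi : \mathcal{E} \times \mathcal{R} \times \mathcal{E} \to \mathbb{R}$ is the scoring function

Notation: $v[i]$ is the $i$-th entry of $v$.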
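The definition of a symmetric relation translates directly into a membership test on the set of true tuples. This is a small sketch, assuming the set of true tuples is available as a Python set; the function name `is_symmetric` and the toy facts are hypothetical.

```python
def is_symmetric(relation, true_triples):
    """r is symmetric iff (a, r, b) is true exactly when (b, r, a) is true."""
    # Collect the ordered entity pairs connected by this relation.
    pairs = {(s, o) for s, r, o in true_triples if r == relation}
    return all((o, s) in pairs for s, o in pairs)


# Hypothetical toy set of true tuples.
zeta = {
    ("a", "married_to", "b"),
    ("b", "married_to", "a"),
    ("a", "parent_of", "c"),
}

print(is_symmetric("married_to", zeta), is_symmetric("parent_of", zeta))
```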


Models

Linear models: CP, RESCAL, DistMult, ComplEx, SimplE

Non-linear models: ConvE, HypER


RESCAL (Nickel et al., 2011)

$\phi(s, r, o) = e_s^\top W_r e_o$
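The RESCAL score is a bilinear form with one full matrix per relation. A minimal NumPy sketch follows, with randomly initialized embeddings and hypothetical toy sizes; the names `E`, `W` and `rescal_score` are chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n_entities, n_relations, d = 4, 2, 3  # hypothetical toy sizes

E = rng.normal(size=(n_entities, d))      # one embedding vector per entity
W = rng.normal(size=(n_relations, d, d))  # one full d x d matrix per relation


def rescal_score(s, r, o):
    """Bilinear score: e_s^T W_r e_o."""
    return E[s] @ W[r] @ E[o]


print(rescal_score(0, 1, 2))
```

Each relation costs d * d parameters here, which is the scaling issue that DistMult addresses.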


DistMult (Yang et al., 2015)

Special case of RESCAL in which each relation matrix $W_r$ is diagonal

Scales to large knowledge graphs, at the cost of learning less expressive features
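A short NumPy sketch of the diagonal special case, with hypothetical random embeddings. It checks that the DistMult score equals the RESCAL form with a diagonal matrix, and that the score is unchanged when subject and object are swapped, which is one concrete sense in which the model is less expressive.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 3  # hypothetical embedding dimension

e_s, e_o = rng.normal(size=d), rng.normal(size=d)
w_r = rng.normal(size=d)  # DistMult keeps one vector (the diagonal) per relation


def distmult_score(e_s, w_r, e_o):
    """Trilinear product: sum_k e_s[k] * w_r[k] * e_o[k]."""
    return float(np.sum(e_s * w_r * e_o))


# Same value as RESCAL with W_r = diag(w_r).
rescal_form = float(e_s @ np.diag(w_r) @ e_o)

forward = distmult_score(e_s, w_r, e_o)
backward = distmult_score(e_o, w_r, e_s)  # subject and object swapped
print(forward, rescal_form, backward)
```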


SimplE (Kazemi and Poole, 2018)

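$\phi(e_i, r, e_j) = \frac{1}{2}\left(\langle h_{e_i}, v_r, t_{e_j} \rangle + \langle h_{e_j}, v_{r^{-1}}, t_{e_i} \rangle\right)$

Fully expressive for embedding dimension $\min(|\mathcal{E}||\mathcal{R}|, \gamma + 1)$, where $\gamma$ is the number of true facts:

For the $|\mathcal{E}||\mathcal{R}|$ bound: set $h_{e_i}[n] = 1$ if $n \bmod |\mathcal{E}| = i$ and $0$ otherwise; set $v_{r_j}[n] = 1$ if $\lfloor n / |\mathcal{E}| \rfloor = j$ and $0$ otherwise; and for each $(e_i, r_j, e_k) \in \zeta$ set $t_{e_k}[j|\mathcal{E}| + i] = 1$

For the $\gamma + 1$ bound: induction on $\gamma$; the case $\gamma = 0$ is clear; inductive step...

Encoding background knowledge: the symmetry $(e_i, r, e_j) \in \zeta \Leftrightarrow (e_j, r, e_i) \in \zeta$ can be encoded by tying $v_{r^{-1}}$ and $v_r$, since $(e_i, r, e_j) \in \zeta \Rightarrow \langle h_{e_i}, v_r, t_{e_j} \rangle > 0$ and $\langle h_{e_j}, v_{r^{-1}}, t_{e_i} \rangle > 0 \Rightarrow \langle h_{e_j}, v_r, t_{e_i} \rangle > 0$ and $\langle h_{e_i}, v_{r^{-1}}, t_{e_j} \rangle > 0$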
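The construction for the |E||R| bound can be checked numerically on a toy graph. This is a sketch under stated assumptions: indices are 0-based, the position test for relation vectors uses integer division, the inverse-relation vectors are set to zero (an assumption made here so that only the forward term contributes), and the toy set `zeta` is hypothetical.

```python
import numpy as np

n_e, n_r = 3, 2                          # hypothetical numbers of entities, relations
zeta = {(0, 0, 1), (1, 0, 2), (2, 1, 0)}  # hypothetical true triples (i, j, k)

dim = n_e * n_r
idx = np.arange(dim)

# h_{e_i}[n] = 1 iff n mod |E| == i
H = np.array([idx % n_e == i for i in range(n_e)], dtype=float)
# v_{r_j}[n] = 1 iff n div |E| == j
V = np.array([idx // n_e == j for j in range(n_r)], dtype=float)
# t_{e_k}[j|E| + i] = 1 for every true triple (e_i, r_j, e_k)
T = np.zeros((n_e, dim))
for i, j, k in zeta:
    T[k, j * n_e + i] = 1.0
# Assumption for this sketch: inverse-relation vectors are all zero.
V_inv = np.zeros_like(V)


def simple_score(i, j, k):
    """0.5 * (<h_i, v_j, t_k> + <h_k, v_j^{-1}, t_i>)."""
    forward = np.sum(H[i] * V[j] * T[k])
    backward = np.sum(H[k] * V_inv[j] * T[i])
    return 0.5 * (forward + backward)


predicted = {
    (i, j, k)
    for i in range(n_e) for j in range(n_r) for k in range(n_e)
    if simple_score(i, j, k) > 0
}
print(predicted == zeta)
```

The product of `H[i]` and `V[j]` has a single nonzero entry at position j|E| + i, so the score is positive exactly when the tail vector was set at that position.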


TuckER

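Linear model based on the Tucker decomposition

One entity embedding matrix $E \in \mathbb{R}^{n_e \times d_e}$

One relation embedding matrix $R \in \mathbb{R}^{n_r \times d_r}$

One core tensor $\mathcal{W} \in \mathbb{R}^{d_e \times d_r \times d_e}$

Scoring function: $\phi(s, r, o) = \mathcal{W} \times_1 e_s \times_2 w_r \times_3 e_o = e_s^\top (\mathcal{W} \times_2 w_r)\, e_o$

1-N scoring: takes one $(s, r)$ pair and scores it against all entities

Bernoulli negative log-likelihood loss function:

$L = -\frac{1}{n_e} \sum_{i=1}^{n_e} \left( y^{(i)} \log p^{(i)} + (1 - y^{(i)}) \log(1 - p^{(i)}) \right)$

where $p^{(i)}$ is the predicted probability and $y^{(i)}$ the binary label for candidate entity $i$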
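The scoring function, 1-N scoring and the loss can be sketched in a few lines of NumPy. This is an illustration with random parameters and hypothetical toy sizes, not the authors' implementation; it assumes probabilities are obtained by applying a logistic sigmoid to the scores, and the label vector is made up.

```python
import numpy as np

rng = np.random.default_rng(0)
n_e, n_r, d_e, d_r = 5, 3, 4, 2  # hypothetical toy sizes

E = rng.normal(size=(n_e, d_e))       # entity embedding matrix
R = rng.normal(size=(n_r, d_r))       # relation embedding matrix
W = rng.normal(size=(d_e, d_r, d_e))  # core tensor


def tucker_score(s, r, o):
    """Contract the core tensor with e_s, w_r and e_o along modes 1, 2, 3."""
    return np.einsum("ijk,i,j,k->", W, E[s], R[r], E[o])


def score_all_objects(s, r):
    """1-N scoring: score the pair (s, r) against every entity at once."""
    W_r = np.einsum("ijk,j->ik", W, R[r])  # mode-2 product, shape (d_e, d_e)
    return E[s] @ W_r @ E.T                # shape (n_e,)


def bernoulli_nll(scores, labels):
    """Mean binary cross-entropy over the n_e candidate entities."""
    p = 1.0 / (1.0 + np.exp(-scores))  # assumed logistic sigmoid
    return -np.mean(labels * np.log(p) + (1 - labels) * np.log(1 - p))


scores = score_all_objects(0, 1)
labels = np.zeros(n_e)
labels[[2, 4]] = 1.0  # hypothetical: entities 2 and 4 are true objects of (0, 1)
loss = bernoulli_nll(scores, labels)
print(scores, loss)
```

Computing the matrix `W_r` once per (s, r) pair and multiplying by all entity embeddings is what makes 1-N scoring cheaper than scoring each triple separately.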


Summary of the different models

Figure: Models Summary


Summary of the different models

Figure: Models viewed as TuckER

Figure: Models viewed as SimplE


Advantages of TuckER

TuckER is fully expressive

One entity embedding matrix

Parameter sharing is encoded in the core tensor $\mathcal{W}$, which:

Encodes the interaction between the entities and relations

Leads to fewer parameters

Induces asymmetry

Induces multi-task learning
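The effect of sharing one core tensor on the parameter count can be made concrete with a little arithmetic. The sketch below compares TuckER with RESCAL, which stores one full matrix per relation; the function names and the sizes are hypothetical and do not correspond to any benchmark dataset.

```python
def tucker_param_count(n_e, n_r, d_e, d_r):
    """Entity matrix + relation matrix + shared core tensor."""
    return n_e * d_e + n_r * d_r + d_e * d_r * d_e


def rescal_param_count(n_e, n_r, d):
    """Entity matrix + one full d x d matrix per relation."""
    return n_e * d + n_r * d * d


# Hypothetical sizes, for illustration only.
tucker = tucker_param_count(n_e=1000, n_r=100, d_e=50, d_r=10)
rescal = rescal_param_count(n_e=1000, n_r=100, d=50)
print(tucker, rescal)  # 76000 300000

# Adding relations only adds d_r parameters each; the core tensor is unchanged.
extra = tucker_param_count(1000, 200, 50, 10) - tucker
print(extra)  # 1000
```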


TuckER is fully expressive

Theorem: TuckER is fully expressive

Proof:

Let $e_s, e_o \in \mathbb{R}^{n_e}$ be the one-hot encodings of subject $s$ and object $o$ respectively, and $w_r \in \mathbb{R}^{n_r}$ the one-hot encoding of relation $r$.

Let the core tensor $\mathcal{W} \in \mathbb{R}^{n_e \times n_r \times n_e}$ be $1$ at position $(s, r, o)$ if $(s, r, o)$ holds, and $-1$ otherwise.

Then the tensor product exactly separates the true triples from the rest:

If $(s, r, o)$ holds, $\phi(s, r, o) = \mathcal{W} \times_1 e_s \times_2 w_r \times_3 e_o = 1$

If not, $\phi(s, r, o) = -1$
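The construction in the proof can be verified directly on a small example. This NumPy sketch uses a hypothetical toy set of true triples and one-hot embeddings (identity matrices), and checks that the score is 1 for every true triple and -1 for every other triple.

```python
import numpy as np

n_e, n_r = 3, 2                          # hypothetical toy sizes
zeta = {(0, 0, 1), (1, 1, 2), (2, 0, 0)}  # hypothetical true triples (s, r, o)

# Core tensor: +1 at the positions of true triples, -1 everywhere else.
W = -np.ones((n_e, n_r, n_e))
for s, r, o in zeta:
    W[s, r, o] = 1.0

E = np.eye(n_e)  # row s is the one-hot encoding of entity s
R = np.eye(n_r)  # row r is the one-hot encoding of relation r


def phi(s, r, o):
    """Mode-1, mode-2 and mode-3 products of W with the one-hot vectors."""
    return np.einsum("ijk,i,j,k->", W, E[s], R[r], E[o])


correct = all(
    phi(s, r, o) == (1.0 if (s, r, o) in zeta else -1.0)
    for s in range(n_e) for r in range(n_r) for o in range(n_e)
)
print(correct)
```

With one-hot vectors the contraction simply reads out the entry of the core tensor at position (s, r, o), which is why the construction works.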


Influence of Parameter Sharing


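Comparison of different algorithms

Idea: for each test triple, rank the true entity among all candidate entities by score

Common evaluation metrics

Mean Reciprocal Rank (MRR): average of the inverse rank of the true entity

Hits@k, with k a small integer: fraction of test triples whose true entity is ranked in the top k (e.g., Hits@1 is equivalent to accuracy; Hits@$n_e$ always gives 100%)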
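Both metrics follow from the rank of the true entity in each query. The sketch below is plain Python with hypothetical scores for three queries over four candidate entities; it omits the filtering of other known true triples that is commonly applied before ranking.

```python
def rank_of_truth(scores, true_index):
    """1-based rank of the true entity when candidates are sorted by descending score."""
    true_score = scores[true_index]
    return 1 + sum(1 for s in scores if s > true_score)


def mean_reciprocal_rank(ranks):
    return sum(1.0 / r for r in ranks) / len(ranks)


def hits_at_k(ranks, k):
    return sum(1 for r in ranks if r <= k) / len(ranks)


# Hypothetical (scores, index of the true entity) pairs.
queries = [
    ([0.1, 0.9, 0.3, 0.2], 1),  # truth has the highest score
    ([0.8, 0.1, 0.5, 0.2], 2),  # one candidate scores higher
    ([0.4, 0.3, 0.2, 0.1], 3),  # all other candidates score higher
]

ranks = [rank_of_truth(scores, truth) for scores, truth in queries]
mrr = mean_reciprocal_rank(ranks)
print(ranks, mrr, hits_at_k(ranks, 1), hits_at_k(ranks, 4))
```

Here the ranks are 1, 2 and 4, so Hits@1 counts only the first query, while Hits@4 (k equal to the number of entities) counts all of them.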


Implementation and Experiments

Figure: Best performing hyper-parameter values for TuckER

Figure: Best performing hyper-parameter values for ComplEx and SimplE on FB15k-237

Figure: MRR vs embedding size

Comparison of the algorithms

Figure: Comparison of the models


Conclusion

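TuckER outperforms previous state-of-the-art models

The number of parameters grows linearly with the number of entities and relations (for fixed embedding dimensions)

Previous linear models (RESCAL, DistMult, ComplEx, SimplE) are special cases of TuckER

How to incorporate background knowledge?

Linear models rock!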


References

Ivan Balazevic, Carl Allen, and Timothy M. Hospedales. TuckER: Tensor Factorization for Knowledge Graph Completion. CoRR, abs/1901.09590, 2019. URL http://arxiv.org/abs/1901.09590

Seyed Mehran Kazemi and David Poole. SimplE Embedding for Link Prediction in Knowledge Graphs. In Advances in Neural Information Processing Systems, 2018.

Maximilian Nickel, Volker Tresp, and Hans-Peter Kriegel. A Three-Way Model for Collective Learning on Multi-Relational Data. In International Conference on Machine Learning, 2011.

Theo Trouillon, Johannes Welbl, Sebastian Riedel, Eric Gaussier, and Guillaume Bouchard. Complex Embeddings for Simple Link Prediction. In International Conference on Machine Learning, 2016.
