A Study of the Similarities of Entity Embeddings Learned from Different Aspects of a Knowledge Base for Item Recommendations

Guangyuan Piao and John G. Breslin
Insight Centre for Data Analytics, Data Science Institute,
National University of Ireland, Galway, Ireland
{guangyuan.piao}@insight-centre.org, {john.breslin}@nuigalway.ie
Abstract. The recent development of deep learning approaches provides a convenient way to learn entity embeddings from different aspects, such as texts and a homogeneous or heterogeneous graph, encoded in a knowledge base such as DBpedia. However, it is unclear to what extent domain-specific entity embeddings learned from different aspects of a knowledge base reflect their similarities, and the potential of leveraging those similarities for item recommendations in a specific domain has not been explored. In this work, we investigate domain-specific entity embeddings learned from different aspects of DBpedia with state-of-the-art embedding approaches, and the recommendation performance based on the similarities of these embeddings. The experimental results on two real-world datasets show that recommender systems based on the similarities of entity embeddings learned from a homogeneous graph via the dbo:wikiPageWikiLink property provide the best performance compared to the ones learned from other aspects.
Keywords: Deep Learning, Semantic Similarity, Knowledge Base, Entity Embeddings, Recommender Systems, Knowledge Graph
1 Introduction

Knowledge bases (KBs) such as DBpedia [12] and Wikidata [29] have received great attention in the past few years due to their embedded knowledge, which is useful for a wide range of tasks including recommender systems [3]. For example, Linked Open Data-enabled recommender systems (LODRS) aim to utilize the background knowledge about items (entities) from linked datasets such as DBpedia for improving the quality of recommendations [6,7]. However, most previous studies on LODRS view a KB as a heterogeneous knowledge graph (KG) based on the domain-specific entities and properties defined in an ontology (e.g., the DBpedia ontology). Taking DBpedia as an example, the heterogeneous KG can be seen as one aspect of a knowledge base, and a KB can contain several aspects of knowledge with respect to entities (see Figure 1), such as:
!"#$$%&&'()*$+,-
.%/"0'1"2/3045#6-
!%07%*'8*23%9-
wikiPageWikiLink
wikiPageWikiLink
:32*$3';"<$"9&6=-
homogeneous graph
thumbnail
visual knowledge
!"#$$%&&-*&-3->?@>-A#&&*32-B$346-4"+%/=C
/03+3-)*$+-B3&%/-"2-D5%-2"9%$-!"#$$%&&-
abstracts
textual knowledge
830*3';"<5%92*6"93-
starring
!%07%*'8*23%9-
writer
heterogeneous graph
Fig. 1: Knowledge about the movie entity Soulless (film) from different
aspects of DBpedia.
– Textual knowledge: This denotes textual knowledge about entities, e.g., the abstracts of movies via the dbo:abstracts property (the prefix dbo denotes http://dbpedia.org/ontology/).
– Knowledge from a homogeneous graph: This denotes the knowledge inherited from Wikipedia (https://www.wikipedia.org) based on the dbo:wikiPageWikiLink property, which provides a set of entities connected via this single property.
– Knowledge from a heterogeneous graph: This knowledge is provided by the heterogeneous graph, which consists of domain-specific entities and other nodes connected to those entities via different properties defined in the ontology of a KB; it has been widely used for extracting background knowledge about items (entities) for LODRS.
– Visual knowledge: This denotes visual information about entities, e.g., the thumbnails of movies via the dbo:thumbnail property.
Recently, a great number of studies have been proposed to learn entity embeddings in a KG with deep learning approaches, either for the KG completion task [4] or for other classification or recommendation tasks that use those low-dimensional representations of entities as features [26, 27]. While related work reveals several insights regarding the entity embeddings learned from the heterogeneous graph of a KB, there exists little research on understanding the similarities between entity embeddings learned from other aspects of KBs. There have been many semantic similarity/distance measures designed for measuring the similarity/distance between entities in the same domain in linked datasets such as DBpedia, for various purposes such as item recommendations in a cold start. This preliminary work can be seen as being in the same direction as those studies, but with a focus on investigating the similarities between entity embeddings learned with embedding approaches based on deep learning or factorization models with domain knowledge.
In this preliminary work, we aim to investigate the semantic similarities of entity embeddings learned from different aspects of a KB, and evaluate them in the context of item recommendations in the music and book domains. Specifically, we focus on textual knowledge and knowledge from a homogeneous or heterogeneous graph, using dedicated embedding approaches including deep learning techniques. Deep learning approaches have proved effective at learning latent representations of various forms of data, such as images, texts, as well as nodes in networks. Therefore, we use Doc2Vec [10] and Node2Vec [8] to learn the entity embeddings based on the textual knowledge and the knowledge from a homogeneous graph, and use an embedding model for knowledge graph completion to learn the entity embeddings based on the heterogeneous graph of a KB. We use DBpedia as our knowledge base in this work. In particular, we are interested in investigating the following research questions, with results in Section 5:
– How do entity embeddings learned from different aspects of a KB reflect the similarities between items (entities) in a specific domain in the context of item recommendations in a cold start?
– Do entity embeddings learned from different aspects complement each other?
To the best of our knowledge, this is the first work on investigating the semantic similarities between entity embeddings learned from different aspects of a KB, and exploring their usage in the context of recommender systems. A shorter version of this paper was published in the CEUR proceedings of the 1st Workshop on Deep Learning for Knowledge Graphs and Semantic Technologies [23].
2 Related Work
Here we review related work on linked data similarity/distance measures for measuring the similarity/distance between two entities in a specific domain for recommendation purposes, and on approaches exploring entity embeddings for item recommendations.
2.1 Linked Data Similarity/Distance Measures
LDSD [19,20] is one of the first approaches for measuring the linked data semantic distance between entities in a linked dataset such as DBpedia. Leal et al. [11] proposed a similarity measure based on a notion of proximity, which measures how connected two entities are (e.g., based on the number of paths between them) rather than how distant they are. Piao et al. [21] revised LDSD in order to satisfy some fundamental axioms of a distance-based similarity measure, and further improved it with different normalization strategies [22]. More recently, Alfarhood et al. [1] considered additional resources beyond those one or two hops away in LDSD, and the same authors also proposed applying link differentiation strategies for measuring the linked data semantic distance between two entities in DBpedia [2]. In contrast to the aforementioned approaches, Meymandpour et al. [14] proposed an information content-based semantic similarity measure for the Linked Open Data cloud, which can consider multiple linked datasets when measuring similarity. In this work, we are interested in the similarities of entity embeddings learned from different aspects of a knowledge base, and compare those similarities with one of the semantic similarity/distance measures [22].
2.2 Exploring Entity Embeddings for Item Recommendations
Recently, entity embeddings learned from a knowledge graph using deep learning approaches have been used for item recommendations. In [27], the authors proposed RDF2Vec, which runs random walks on a heterogeneous RDF (https://www.w3.org/RDF/) graph in DBpedia, and then applies Word2Vec [15,16] techniques by treating the sequences of triples as sentences. The learned entity embeddings, based on the whole KG, were then used to find the k-nearest neighbors of items, and those neighbors were used as side information in factorization machines [25] for providing item recommendations. In contrast to [27], which uses the whole KG for learning entity embeddings, we learn domain-specific entity embeddings from different aspects of a KB. The entity embeddings learned from the whole KG might reflect the relatedness of entities instead of their similarities, as they are learned by incorporating all properties and nodes from other domains. However, related entities are not always similar; e.g., a musical artist and his/her spouse are related but not similar.

Zhang et al. [30] proposed collaborative knowledge base embedding, which jointly learns the latent representations in collaborative filtering for item recommendations as well as the ones for a knowledge base. However, those entity embeddings were used as features, and the similarities between them were not investigated. Palumbo et al. [18] used domain-specific triples from DBpedia for learning entity embeddings with Node2Vec for item recommendations. In order to use Node2Vec with the heterogeneous graph based on domain-specific properties, the authors applied Node2Vec to each graph consisting of all triples based on a single property. Afterwards, those property-specific similarity scores were used as features in a learning-to-rank framework with the training dataset. In contrast, we are interested in the entity embeddings learned from the heterogeneous graph and the similarities between those embeddings.
3 Learning Entity Embeddings from Different Aspects of DBpedia

In this section, we discuss three state-of-the-art embedding/vectorization approaches that we adopted for learning entity embeddings based on different aspects of knowledge from DBpedia.
3.1 Entity Embeddings with Textual Knowledge

Doc2Vec [10], which is inspired by Word2Vec [15, 16], was devised for learning embeddings for larger blocks of text such as documents or sentences. The model uses a document vector and contextual word vectors to predict the next word, which is a multi-class classification task. Figure 2 shows the Doc2Vec model, where each document/paragraph is mapped to a latent vector, which is a column in a document/paragraph matrix D, and each word has its own embedding, which is a column in a word embedding matrix W. As we can see from the figure, the document vector and word vectors are concatenated to predict the next word in a context. The document and word vectors are learned by optimizing the classification error over a given set of documents. For example, with a window size of 8, the model predicts the 8th word based on the document vector and the 7 contextual word vectors. We used the gensim [24] implementation of Doc2Vec for our experiment.

Fig. 2: The Doc2Vec model for learning document/paragraph vectors [10].
In our experiment, each abstract of an entity in a specific domain, provided by the dbo:abstracts property, is treated as a document, and the set of all abstracts in a domain is used for learning entity embeddings with the Doc2Vec model. The window size is set to 8 in the same way as [10].
3.2 Entity Embeddings with A Homogeneous Graph
Node2Vec
[8], which is also inspired by
Word2Vec
[15], aims to learn latent
representations of nodes for a homogeneous network. It extends the Skip-gram
architecture (see Figure 3) to networks, and optimizes the (log) probability of
observing a network neighborhood for each node. To apply the Skip-gram model
for networks,
Node2Vec
first executes random walks based on a defined searching
strategy, and the sequence of nodes obtained via the search is used for the
Skip-gram model. We used the author’s implementation4for our experiment.
In our study, we treat the graph consisting of all items in a specific domain and the other nodes connected to those items via the dbo:wikiPageWikiLink property as the homogeneous graph from DBpedia, and apply Node2Vec to learn the entity embeddings based on this homogeneous graph.
Parameters. We chose smaller values for some parameters compared to the settings in the original paper, as the great number of dbo:wikiPageWikiLink relationships makes training the model expensive and time-consuming. Our settings for the main hyperparameters of Node2Vec are as follows (a simplified sketch of the walk-and-train procedure is given after the list):

– walk_length=10: the length of the walk for each node.
– num_walks=10: the number of walks per node.
– p=q=1: p and q denote the return and in-out hyperparameters for random walks, respectively.
– window_size=5: the context size for optimization.
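As a rough illustration (not the authors' implementation), the sketch below runs uniform random walks, which is what the p = q = 1 setting reduces Node2Vec's biased walks to, and feeds the node sequences to a Skip-gram model; the graph edges are hypothetical placeholders for the actual dbo:wikiPageWikiLink graph:

```python
# Sketch of Node2Vec with p = q = 1: uniform random walks over the
# homogeneous wikiPageWikiLink graph, then Skip-gram over the walks.
import random
import networkx as nx
from gensim.models import Word2Vec

graph = nx.Graph()  # placeholder edges; real edges come from DBpedia
graph.add_edges_from([
    ("dbr:Soulless_(film)", "dbr:Danila_Kozlovsky"),
    ("dbr:Soulless_(film)", "dbr:Russia"),
    ("dbr:Danila_Kozlovsky", "dbr:Russia"),
])

def random_walk(g, start, walk_length=10):
    """Uniform random walk; with p = q = 1 the Node2Vec bias vanishes."""
    walk = [start]
    for _ in range(walk_length - 1):
        neighbors = list(g.neighbors(walk[-1]))
        if not neighbors:
            break
        walk.append(random.choice(neighbors))
    return walk

# num_walks = 10 walks of length walk_length = 10 from every node.
walks = [random_walk(graph, n) for n in graph.nodes() for _ in range(10)]

# Skip-gram (sg=1) with window size 5 and 100-dimensional vectors.
model = Word2Vec(walks, vector_size=100, window=5, min_count=1, sg=1)
embedding = model.wv["dbr:Soulless_(film)"]
```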
3.3 Entity Embeddings with a Heterogeneous Graph

TransE [4] is a translation-based model for knowledge graph completion, which learns the embeddings of entities and their relationships. In short, TransE learns those embeddings so as to satisfy E(s) + E(p) ≈ E(o) for a valid triple (s, p, o) in a knowledge base, where E(x) denotes x's embedding.
Although TransE has been used for learning entity embeddings for KG completion by considering all triples in a KG, for item recommendations in a specific domain most previous studies extract the domain-specific DBpedia graph, which consists of all entities in that domain and the incoming or outgoing nodes connected via domain-specific properties [19, 22]. Therefore, to learn domain-specific entity embeddings, we extract all triples for the entities/items in that domain with the relevant properties. Consistent with previous work [22], we used the top-15 properties for each domain in order to obtain all triples for the subjects in that domain. Table 1 shows the properties we used to extract domain knowledge about items for our experiment in Section 4.
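To make the translation objective E(s) + E(p) ≈ E(o) concrete, here is a minimal PyTorch sketch of a TransE training step with a margin-based loss; this is an illustrative re-implementation rather than the exact code used in our experiments, and all indices are placeholders:

```python
# Sketch of TransE: embeddings are trained so that E(s) + E(p) ≈ E(o)
# for valid triples, using a margin loss against corrupted triples.
import torch
import torch.nn as nn

class TransE(nn.Module):
    def __init__(self, n_entities, n_relations, dim=100):
        super().__init__()
        self.ent = nn.Embedding(n_entities, dim)
        self.rel = nn.Embedding(n_relations, dim)

    def score(self, s, p, o):
        # Smaller ||E(s) + E(p) - E(o)|| means a more plausible triple.
        return torch.norm(self.ent(s) + self.rel(p) - self.ent(o), dim=-1)

model = TransE(n_entities=1000, n_relations=15)  # e.g., top-15 properties
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# One step: a valid triple (s, p, o) and a corrupted triple (s, p, o').
s, p, o, o_neg = (torch.tensor([i]) for i in (0, 3, 7, 42))
margin = 1.0
loss = torch.clamp(
    margin + model.score(s, p, o) - model.score(s, p, o_neg), min=0
).mean()
optimizer.zero_grad()
loss.backward()
optimizer.step()
```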
The dimensionality of entity embeddings is set to 100 for all three approaches, and the trained embeddings are available at https://github.com/parklize/DL4KGS. For our experiment, we used the HDT [5] dump of the DBpedia 2016-04 version, which is available at http://www.rdfhdt.org/datasets/.
4 Experiment Setup

We evaluate the similarities of entity embeddings learned from different aspects of DBpedia in the context of cold-start scenarios in recommender systems, where the top-N items are recommended based on the cosine similarities between those entity embeddings.
Music                          Book
dct:subject                    dct:subject
dbo:genre                      dbo:author
dbo:associatedBand             dbo:publisher
dbo:associatedMusicalArtist    dbo:literaryGenre
dbo:instrument                 dbo:mediaType
dbo:recordLabel                dbo:subsequentWork
dbo:occupation                 dbo:previousWork
dbo:hometown                   dbo:country
dbo:bandMember                 dbo:series
dbo:formerBandMember           dbo:nonFictionSubject
dbo:currentMember              dbo:coverArtist
dbo:influencedBy               dbo:illustrator
dbo:pastMember                 dbo:genre
dbo:associatedAct              dbo:translator
dbo:influenced                 dbo:recordLabel

Table 1: The top-15 domain-specific properties used for extracting valid triples from DBpedia and for training TransE.
4.1 Datasets

We use two real-world datasets in the music and book domains for our experiment. The first dataset is a last.fm dataset from [17], which consists of 232 musical artists and the top-10 similar artists for each of the 232 artists obtained from last.fm. Those top-10 similar artists provided by last.fm for each artist are used as the ground truth. The second dataset is the dbbook dataset (http://challenges.2014.eswc-conferences.org/index.php/RecSys#DBbook_dataset) in the book domain, which consists of 6,181 users and 6,733 items that have been rated by at least one user. We randomly selected 300 users who have liked at least 10 books for our experiment. For each user, we randomly chose one item and recommended items similar to the chosen one based on their similarities; the other books liked by each user, except the chosen one, are used as the ground truth. For both datasets, all items in each dataset are considered as candidate items for recommendations.
To learn domain-specific entity embeddings, we extracted background knowledge from DBpedia for all entities/items in the music and book domains. The subjects in the music domain are the entities that have rdf:type dbo:MusicalArtist or dbo:Band, and the subjects in the book domain are the ones that have rdf:type dbo:Book. After obtaining all subjects, we further obtain their abstracts, the connected nodes (entities and categories) via the dbo:wikiPageWikiLink property, and the connected nodes via the properties listed in Table 1.
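For illustration, such subjects could be collected with a query like the one below against DBpedia's public SPARQL endpoint using SPARQLWrapper; this is a sketch only, since our experiment instead ran over the local HDT dump:

```python
# Sketch: collecting music-domain subjects (dbo:MusicalArtist or dbo:Band)
# from the public DBpedia SPARQL endpoint.
from SPARQLWrapper import SPARQLWrapper, JSON

sparql = SPARQLWrapper("http://dbpedia.org/sparql")
sparql.setQuery("""
    PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
    PREFIX dbo: <http://dbpedia.org/ontology/>
    SELECT DISTINCT ?s WHERE {
        { ?s rdf:type dbo:MusicalArtist } UNION { ?s rdf:type dbo:Band }
    } LIMIT 1000
""")
sparql.setReturnFormat(JSON)
results = sparql.query().convert()
subjects = [b["s"]["value"] for b in results["results"]["bindings"]]
```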
Table 2 shows the details of the domain knowledge with respect to the music and book domains.

                     Music      Book
# subjects           171,812    76,639
# abstracts          131,622    70,654
# wikiPageWikiLinks  5,480,222  2,340,146
# triples            1,481,335  316,969

Table 2: The statistics of background knowledge about items from different aspects of DBpedia.
4.2 Evaluation Metrics

The recommendation performance is evaluated by the evaluation metrics below (a sketch of their computation follows the list):

– P@N: Precision at rank N is the proportion of the top-N recommendations that are relevant to the user:

  P@N = |{relevant items @N}| / N

– R@N: Recall at rank N represents the mean probability that relevant items are successfully retrieved within the top-N recommendations:

  R@N = |{relevant items @N}| / |{relevant items}|

– nDCG@N: nDCG (Normalized Discounted Cumulative Gain) takes into account the rank positions of the relevant items, and can be computed as:

  nDCG@N = \frac{1}{IDCG@N} \sum_{k=1}^{N} \frac{2^{\hat{r}_{uk}} - 1}{\log_2(k+1)}

  where \hat{r}_{uk} is the relevance score of the item at position k with respect to user u in the top-N recommendations, and the normalization factor IDCG@N denotes the score obtained by an ideal top-N ranking.
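The sketch below implements these metrics for a single user's ranked list, assuming binary relevance scores (r̂ ∈ {0, 1}), which is our setting since an item is either in the ground truth or not:

```python
import math

def precision_at_n(recommended, relevant, n):
    """P@N: fraction of the top-N recommendations that are relevant."""
    return sum(1 for item in recommended[:n] if item in relevant) / n

def recall_at_n(recommended, relevant, n):
    """R@N: fraction of all relevant items retrieved in the top N."""
    return sum(1 for item in recommended[:n] if item in relevant) / len(relevant)

def ndcg_at_n(recommended, relevant, n):
    """nDCG@N with binary relevance scores."""
    # Position k is 1-based in the formula, hence log2(k + 1) = log2(i + 2).
    dcg = sum(
        (2 ** (1 if item in relevant else 0) - 1) / math.log2(i + 2)
        for i, item in enumerate(recommended[:n])
    )
    # IDCG@N: an ideal ranking places all relevant items at the top.
    idcg = sum(1 / math.log2(i + 2) for i in range(min(len(relevant), n)))
    return dcg / idcg if idcg > 0 else 0.0
```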
We used the paired t-test to test for statistical significance, with the significance level set to 0.05.
4.3 Compared Methods

We compare the similarity measures below to evaluate the similarities of item embeddings based on different aspects of DBpedia (a sketch of the cosine-based methods follows the list):

– Resim [22]: This is a semantic distance/similarity measure for LOD datasets such as DBpedia, which measures the similarity based on the direct and indirect properties between two entities. We use the implementation from our previous work (https://github.com/parklize/resim) for our experiment.
– Cos(Vtk:Doc2Vec): This method uses the cosine similarity measure for the entity embeddings learned from the textual knowledge of entities from DBpedia using Doc2Vec.
– Cos(Vhmk:Node2Vec): This method uses the cosine similarity measure for the entity embeddings learned from the homogeneous graph knowledge of entities from DBpedia using Node2Vec.
– Cos(Vhtk:TransE): This method uses the cosine similarity measure for the entity embeddings learned from the heterogeneous graph knowledge of entities from DBpedia using TransE.
– Cos([Vx, Vy]): This method uses the cosine similarity measure for concatenated entity embeddings learned from several aspects. For example, Cos([Vhtk:TransE, Vtk:Doc2Vec]) denotes the cosine similarity over the concatenation of the TransE and Doc2Vec embeddings, and Cos([all]) denotes the concatenation of all embedding approaches.
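The cosine-based methods reduce to the small sketch below; the randomly generated vectors are placeholders standing in for the trained embeddings:

```python
import numpy as np

def cosine(u, v):
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

# Placeholder per-aspect embeddings for two items.
v_transe = {"dbr:Item_A": np.random.rand(100), "dbr:Item_B": np.random.rand(100)}
v_doc2vec = {"dbr:Item_A": np.random.rand(100), "dbr:Item_B": np.random.rand(100)}

# Cos([Vhtk:TransE, Vtk:Doc2Vec]): cosine over concatenated embeddings.
concat = {e: np.concatenate([v_transe[e], v_doc2vec[e]]) for e in v_transe}
similarity = cosine(concat["dbr:Item_A"], concat["dbr:Item_B"])

# Top-N recommendation: rank all candidates by similarity to a seed item.
def top_n(seed, vectors, n=10):
    scores = {e: cosine(vectors[seed], v) for e, v in vectors.items() if e != seed}
    return sorted(scores, key=scores.get, reverse=True)[:n]
```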
5 Results

Figures 4 and 5 show the nDCG@N results and the precision-recall curves of item recommendations based on the similarities of different entity embeddings in the music and book domains. Overall, we observe that the recommendations based on the entity embeddings with Node2Vec provide the best performance, followed by the ones with TransE and Doc2Vec.

In both datasets, the results using the embeddings learned with Node2Vec significantly outperform the ones learned with TransE and Doc2Vec, which shows that the great amount of information provided by dbo:wikiPageWikiLink reflects the similarities between entities better than other aspects of DBpedia. We also observe that combining the embeddings based on TransE and Doc2Vec improves the recommendation performance significantly compared to using the embeddings learned from TransE or Doc2Vec alone. However, combining all embeddings learned from the three different aspects does not provide further improvement in the recommendation performance. Also, concatenating the Node2Vec embeddings with other embeddings does not provide better performance compared to using the Node2Vec embeddings alone, and those results are omitted from Figures 4 and 5 for clarity.
In the last.fm dataset, we observe significant improvements of Node2Vec and Cos([all]) over Resim: for example, the recommendation performance is improved by 25.4% and 11.1% with Node2Vec and Cos([all]), respectively, compared to using Resim. In contrast, there is no statistically significant difference between the recommendation performance using those embeddings and using Resim in the dbbook dataset. This might be due to the relatively small number of subjects in the book domain and their related aspects for training those embeddings.
Fig. 4: The performance of item recommendations on the last.fm dataset with all methods compared: (a) nDCG@N; (b) the P@N (y-axis) vs. R@N (x-axis) curve for N = 1, 5, 10, 20.
Fig. 5: The performance of item recommendations on the dbbook dataset with all methods compared: (a) nDCG@N; (b) the P@N (y-axis) vs. R@N (x-axis) curve for N = 1, 5, 10, 20.
6 Conclusions and Future Work

In this paper, we investigated the embeddings learned from three different aspects of DBpedia using state-of-the-art deep learning and embedding-based approaches, and the recommendation performance based on the similarities captured by those embeddings on two real-world datasets. The preliminary results indicate that the entity embeddings learned from the homogeneous graph powered by the dbo:wikiPageWikiLink property provide the best performance in the context of item recommendations compared to the ones learned from other aspects of DBpedia. We further explored potential synergies from combining the embeddings learned from different aspects: concatenating the embeddings learned from textual knowledge (using Doc2Vec) and from the heterogeneous graph (using TransE) significantly improves the performance. This preliminary study can be seen as a first step towards investigating the similarity between entity embeddings learned from different aspects of a knowledge base for item recommendations, and it also poses many research questions for future work.
First, although we used state-of-the-art approaches for learning entity embeddings from different aspects, there are many other alternatives, such as Tweet2Vec [28] for learning entity embeddings based on their abstracts, and ETransR [13] for learning embeddings based on the heterogeneous graph. A further investigation of other deep learning and embedding-based approaches for learning entity embeddings from different aspects of a knowledge base is required.
Secondly, how to choose domain-specific triples out of all triples in the knowledge base remains an open question. Using triples extracted with domain-specific properties might leave too few triples for the embedding-based approaches to learn good entity embeddings. In contrast, using the whole heterogeneous graph might lead to general entity embeddings that tend to capture relatedness instead of similarity. Further research is needed to confirm this hypothesis, and a recent approach such as [9] for extracting domain-specific subgraphs can be explored further for extracting domain-specific triples for training the entity embeddings in a given domain.
Finally, the results of this study showed that concatenating all embeddings does not further improve the performance; these results suggest that more research is needed on how to combine entity embeddings learned from different aspects of a knowledge base.
Acknowledgments. This publication has emanated from research conducted with the financial support of Science Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289 (Insight Centre for Data Analytics).
References

1. Alfarhood, S., Labille, K., Gauch, S.: PLDSD: Propagated Linked Data Semantic Distance. In: 2017 IEEE 26th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE). pp. 278–283 (2017)
2. Alfarhood, S., Gauch, S., Labille, K.: Employing Link Differentiation in Linked Data Semantic Distance. In: Różewski, P., Lange, C. (eds.) Knowledge Engineering and Semantic Web. pp. 175–191. Springer International Publishing, Cham (2017)
3. Blomqvist, E.: The use of Semantic Web technologies for decision support – a survey. Semantic Web 5(3), 177–201 (2014), http://dx.doi.org/10.3233/SW-2012-0084
4. Bordes, A., Usunier, N., Garcia-Duran, A., Weston, J., Yakhnenko, O.: Translating Embeddings for Modeling Multi-relational Data. In: Advances in Neural Information Processing Systems. pp. 2787–2795 (2013)
5. Fernández, J.D., Martínez-Prieto, M.A., Gutiérrez, C., Polleres, A., Arias, M.: Binary RDF representation for publication and exchange (HDT). Web Semantics: Science, Services and Agents on the World Wide Web 19, 22–41 (2013), http://www.sciencedirect.com/science/article/pii/S1570826813000036
6. Figueroa, C., Vagliano, I., Rodríguez Rocha, O., Morisio, M.: A systematic literature review of Linked Data-based recommender systems. Concurrency and Computation: Practice and Experience (2015)
7. de Gemmis, M., Lops, P., Musto, C., Narducci, F., Semeraro, G.: Semantics-Aware Content-Based Recommender Systems. In: Recommender Systems Handbook. pp. 119–159. Springer US, Boston, MA (2015)
8. Grover, A., Leskovec, J.: node2vec: Scalable Feature Learning for Networks. CoRR abs/1607.00653 (2016), http://arxiv.org/abs/1607.00653
9. Lalithsena, S., Kapanipathi, P., Sheth, A.: Harnessing Relationships for Domain-specific Subgraph Extraction: A Recommendation Use Case. In: IEEE International Conference on Big Data. Washington D.C. (2016)
10. Le, Q.V., Mikolov, T.: Distributed Representations of Sentences and Documents. CoRR abs/1405.4053 (2014), http://arxiv.org/abs/1405.4053
11. Leal, J.P., Rodrigues, V., Queirós, R.: Computing semantic relatedness using DBpedia (2012)
12. Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P.N., Hellmann, S., Morsey, M., van Kleef, P., Auer, S.: DBpedia – A Large-scale, Multilingual Knowledge Base Extracted from Wikipedia. Semantic Web Journal (2013)
13. Lin, H., Liu, Y., Wang, W., Yue, Y., Lin, Z.: Learning Entity and Relation Embeddings for Knowledge Resolution. In: Procedia Computer Science. vol. 108, pp. 345–354 (2017)
14. Meymandpour, R., Davis, J.G.: A semantic similarity measure for linked data: An information content-based approach. Knowledge-Based Systems 109, 276–293 (2016)
15. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient Estimation of Word Representations in Vector Space. CoRR abs/1301.3781 (2013), http://arxiv.org/abs/1301.3781
16. Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems. pp. 3111–3119 (2013)
17. Oramas, S., Sordo, M., Espinosa-Anke, L., Serra, X.: A Semantic-based Approach for Artist Similarity. In: ISMIR. pp. 100–106 (2015)
18. Palumbo, E., Rizzo, G., Troncy, R.: Entity2Rec: Learning User-Item Relatedness from Knowledge Graphs for Top-N Item Recommendation. In: Proceedings of the Eleventh ACM Conference on Recommender Systems. pp. 32–36. RecSys '17, ACM, New York, NY, USA (2017), http://doi.acm.org/10.1145/3109859.3109889
19. Passant, A.: dbrec: Music Recommendations Using DBpedia. In: ISWC 2010. pp. 209–224. Springer (2010)
20. Passant, A.: Measuring semantic distance on linking data and using it for resources recommendations. In: Proceedings of the AAAI Spring Symposium: Linked Data Meets Artificial Intelligence. vol. 77, pp. 93–98 (2010)
21. Piao, G., Showkat Ara, S., Breslin, J.G.: Computing the Semantic Similarity of Resources in DBpedia for Recommendation Purposes. In: Semantic Technology. pp. 1–16. Springer International Publishing (2015)
22. Piao, G., Breslin, J.G.: Measuring semantic distance for linked open data-enabled recommender systems. In: Proceedings of the 31st Annual ACM Symposium on Applied Computing. pp. 315–320. ACM, Pisa, Italy (2016)
23. Piao, G., Breslin, J.G.: A Study of the Similarities of Entity Embeddings Learned from Different Aspects of a Knowledge Base for Item Recommendations. In: 1st Workshop on Deep Learning for Knowledge Graphs and Semantic Technologies at the 15th Extended Semantic Web Conference. CEUR-WS.org (2018)
24. Řehůřek, R., Sojka, P.: Software Framework for Topic Modelling with Large Corpora. In: Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks. pp. 45–50. ELRA, Valletta, Malta (May 2010)
25. Rendle, S.: Factorization Machines with libFM. ACM Transactions on Intelligent Systems and Technology 3(3), 57:1–57:22 (May 2012)
26. Ristoski, P., Paulheim, H.: RDF2Vec: RDF Graph Embeddings for Data Mining. In: The Semantic Web – ISWC 2016. pp. 498–514. Springer International Publishing, Cham (2016)
27. Ristoski, P., Rosati, J., Di Noia, T., De Leone, R., Paulheim, H.: RDF2Vec: RDF Graph Embeddings and Their Applications. Semantic Web Journal (2018), http://www.semantic-web-journal.net/system/files/swj1495.pdf
28. Vosoughi, S., Vijayaraghavan, P., Roy, D.: Tweet2Vec: Learning Tweet Embeddings Using Character-level CNN-LSTM Encoder-Decoder. In: Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval. pp. 1041–1044. SIGIR '16, ACM, New York, NY, USA (2016), http://doi.acm.org/10.1145/2911451.2914762
29. Vrandečić, D., Krötzsch, M.: Wikidata: A Free Collaborative Knowledgebase. Communications of the ACM 57(10), 78–85 (2014)
30. Zhang, F., Yuan, N.J., Lian, D., Xie, X., Ma, W.Y.: Collaborative Knowledge Base Embedding for Recommender Systems. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. pp. 353–362. KDD '16, ACM, New York, NY, USA (2016)