Making Use of Affective Features from Media Content Metadata for
Better Movie Recommendation Making
John Kalung Leung1, Igor Griva2 and William G. Kennedy3a
1Computational and Data Sciences Department, Computational Sciences and Informatics, College of Science, George Mason University, 4400 University Drive, Fairfax, Virginia 22030, USA
2Department of Mathematical Sciences, MS3F2, Exploratory Hall 4114, George Mason University, 4400 University Drive, Fairfax, Virginia 22030, USA
3Center for Social Complexity, Computational and Data Sciences Department, College of Science, George Mason University, 4400 University Drive, Fairfax, Virginia 22030, USA
{jleung2, igriva, wkennedy}@gmu.edu
Keywords: context-aware, emotion mining, affective computing, recommender systems, movie recommendations, deep
learning
Abstract: Our goal in this paper is to investigate the causality in the decision making of movie recommendations, from a Recommender perspective, through the behavior of users' affective moods. We illustrate a method of assigning emotional tags to a movie by auto-detecting the affective attributes in the movie overview. We apply a text-based Emotion Detection and Recognition model, trained on the short text of tweets, and then transfer the model's learning to detect the implicit affective features of a movie from the movie overview. We vectorize the affective movie tags through embedding to represent the mood of the movie. We vectorize a user's emotional features by averaging the vectors of all the movies the user has watched; incorporating the user's average rating over those watched movies yields a weighted vector. We apply distance metrics over these vectors to enhance the movie recommendation making of a Recommender. We demonstrate our work through an SVD-based Collaborative Filtering (SVD-CF) Recommender. We found an improved 60% support accuracy in the enhanced top-5 recommendation computed by the active test user's distance metrics, versus 40% support accuracy in the top-5 recommendation list generated by the SVD-CF Recommender.
1 Introduction
Movie recommendations come from different sources. The more traditional ways to obtain a movie recommendation are by word of mouth from moviegoers who have watched the movie, from elite movie critics who wrote their opinions of the film, or through news media, publications, and advertisements. Since the dawn of the Internet era in the last century, we have relied on machine automation to make movie recommendations using various Recommender methodologies (Bobadilla et al., 2013), (Zhang et al., 2011), (Scheel et al., 2012), and (Kompan and Bielikova, 2014). More recently, we have applied more advanced Deep Learning (DL) techniques in Recommenders (Zhang et al., 2017). Decades on, recommendation making remains an active field of research (Jannach et al., 2010).
a https://orcid.org/0000-0001-9238-1215
Regardless of the effort we have invested in Recommender research, there always seem to be more ways to make improvements in the field (Beel et al., 2015). In this paper, we include primary human emotions as an aspect of making movie recommendations through a Recommender (Canales and Martínez-Barco, 2014).
Emotion affects human experience and influences our daily activities at all levels of the decision-making process. When a user ponders a list of recommended items such as songs, books, movies, products, or services, his affective state of preference influences his decision on which recommended item to consume. Emotion plays a role in our decision-making process in preference selection (Naqvi et al., 2006). However, up to now, Information Retrieval (IR) and Recommender Systems (RS) have given little attention to including human emotion as a source of user context (Ho and Tagmouti, 2006). Our goal is to make affective awareness a component of movie recommendations for users. The challenge is that no film database or movie dataset in the public domain contains any explicit textual human emotional tag in its metadata. However, film metadata fields such as the plot, overview, storyline, script, watcher reviews, and critic reviews contain excellent subjective data that describes the general mood of a movie. We can apply Machine Learning (ML) techniques to identify and extract affective features implicitly from the film metadata and leverage the film's emotional characteristics when making movie recommendations to users.
1.1 Motivation
We envision that the affective elements of a film are represented by a low-dimension continuous emotion embedding vector denoted as the movie's emotional vector (mvec). Because a film is unique and mutually exclusive from its peers, its mvec is, in theory, unique as well. Some film databases, such as MovieLens, track users' movie-watching history and feedback (Harper and Konstan, 2016). Using the movie-watching records of a user, we can formulate a low-dimension continuous emotion embedding vector for the user, denoted as the user emotional vector (uvec). We obtain a user's uvec embedding value by taking the average over all the watched movies' mvecs, together with the average rating the user has given to all the movies he has watched. Note that a uvec may not be unique. It merely represents the current measure of a user's affective preference for the films he has enjoyed. The difference between mvec and uvec is that the mvec of a movie is static, with an enduring value throughout its lifetime, whereas the uvec is dynamic, with its value changing as the user watches and rates movies. The advantage of using the dynamic nature of uvec in making a recommendation is that we take the most up-to-date user affective preference into account in the user's decision-making process every time. As the user's emotional preference changes, so will the movie Recommender adjust its recommendation making process accordingly. We may be the first party to leverage the dynamic nature of uvec over mvec to enhance the movie recommendation making process of a Recommender.
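As a concrete illustration of the vector construction above, the following is a minimal sketch in Python; the array values, the 0.5-5.0 rating scale, and the variable names are illustrative assumptions, not values from our datasets.

```python
import numpy as np

# Hypothetical inputs: one 7-dimensional mvec per watched movie, in the
# order (neutral, joy, sadness, hate, anger, disgust, surprise), plus the
# user's rating for each of those movies on a 0.5-5.0 scale.
watched_mvecs = np.array([
    [0.084, 0.059, 0.090, 0.233, 0.202, 0.197, 0.135],  # e.g., The Godfather (1972)
    [0.160, 0.300, 0.100, 0.100, 0.080, 0.060, 0.200],  # another watched movie
])
ratings = np.array([5.0, 3.5])

# uvec: the plain average of all watched movies' mvecs.
uvec = watched_mvecs.mean(axis=0)

# Weighted uvec: scale the averaged uvec by the user's average
# normalized rating over the watched movies.
weighted_uvec = uvec * (ratings.mean() / 5.0)
```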
Also, we can leverage mvec to analyze the emotional features of a movie. The continuous embedding nature of the mvec represents the range and strength of moods in a film. In this study, we track six primary human affective features: "joy", "sadness", "hate", "anger", "disgust", and "surprise". We added "neutral" as the seventh affective feature for convenience in our computing tasks. We normalize the affective features when we compute the mvec for a film, so all affective features in an mvec add up to one (1). We can therefore interpret each affective feature of a film as a percentage in decimal form. For example, the Internet Movie Database (IMDb), a popular online movie information database, has rated "The Godfather (1972)" as the top movie of all time (IMDb, 2020). Our emotion detector classified the movie with the affective class "hate"; the mvec for the movie is depicted in Table 1.
Table 1: Affect values of movie: "The Godfather (1972)".
mood      value
neutral   0.0840931
joy       0.059261046
sadness   0.08991193
hate      0.23262443
anger     0.20177138
disgust   0.19720455
surprise  0.13513364
Glancing through Table 1, we perform the affective analysis for the movie in the following way. The mvec affective attribute "hate" stands as the dominant feature, followed by "anger", "disgust", "surprise", "sadness", "neutral", and, closing out the list, "joy". With this distribution of mvec affective feature values, we can describe "The Godfather (1972)" as a film full of hate. It is an angry movie. The content is disgusting due to the violent nature of the film. However, the story of the movie is full of surprises, yet it is a sad movie. None can say the movie celebrates happiness. By reciting the mvec attributes, we explain the movie. By the same token, we can use the uvec of a specific user who watched "The Godfather (1972)" to explain how well the user enjoyed the movie. Thus, we can leverage mvec and uvec as an explanation tool in the Recommender's recommendation making process.
2 Related Work
Detecting primary human emotion expression in text is a relatively new area of research in Natural Language Processing (NLP). A common approach to identifying the general thought, feeling, or sense in writing is to classify the contextual polarity orientation (positive, neutral, or negative) of opinionated text through polarity Sentiment Analysis (SA) (Wilson et al., 2005), and (Maas et al., 2011). When applying fine-grained Sentiment Analysis (Fink et al., 2011), researchers can identify the intensity level of the polarity as a multi-class single-label classification problem (e.g., very positive, positive, neutral, negative, and very negative) (Bhowmick et al., 2009). However, to determine the mental or emotional state or composure (i.e., mood) in subjective text, Emotional Analysis (EA) is better suited to the task (Tripathi et al., 2016). Here the researcher wants to know whether the feeling of the writing under examination falls into one of the primary human emotions or moods. The study of primary human emotional expressions started in the era of Aristotle, around the 4th century BC (Konstan and Konstan, 2006). However, not until Charles Darwin revisited the investigation of human emotional expression in the 19th century was the field propelled toward its present stage of modern psychology research (Ekman, 2006). Paul Ekman and colleagues in the 1970s developed the Facial Action Coding System (FACS) to carry out a series of studies on facial expressions that identified six primary universal human emotions: happiness, sadness, disgust, fear, surprise, and anger (Ekman, 1999). Ekman later added contempt as the seventh primary human emotion on his list (Ekman et al., 2013). Robert Plutchik invented the Wheel of Emotions, which advocated eight primary emotions: anger, anticipation, joy, trust, fear, surprise, sadness, and disgust. Added to the primary eight emotions are secondary and complementary emotions, for a total of 32 emotions depicted on the initial Wheel of Emotions (Plutchik, 2001). More recent research by Glasgow University in 2014 argued that certain pairs of emotions, such as fear and surprise, elicit similar facial muscle responses, as do disgust and anger. That study broke the raw human emotions down to four fundamental emotions: happiness, sadness, fear/surprise, and disgust/anger (Tayib and Jamaludin, 2016).
In our study, we focus our emotion detection work on Ekman's six primary human emotions. We follow other emotion detection researchers' footsteps in this decision, for many have based their work on Ekman's six primary human emotions (Canales and Martínez-Barco, 2014). Also, we can make use of WordNet-Affect, a linguistic resource for the lexical representation of affective knowledge in affective computing on human interaction, covering attention, emotions, motivation, pleasure, and entertainment (Valitutti et al., 2004). It is worth noting that emotional expression research usually aims at detecting and recognizing emotion types from human facial expressions and vocal intonation (De Silva et al., 1997). Our Emotion Detection and Recognition (EDR) study, however, focuses on the mood of text expression instead. The question remains how much of an emotion we can convey through writing.
3 Methodology
Emotion Detection and Recognition (EDR) on text is a text classification problem, one of the favorite Natural Language Processing and Supervised Machine Learning tasks (Danisman and Alpkocak, 2008). It is a Supervised Machine Learning task because text classification requires a labeled dataset containing both text documents and associated labels for training the classifier (Medhat et al., 2014). In the absence of a movie metadata dataset with explicit emotion labels, we build an affective text aware model in two steps: first, from a readily available domain, using tweets data from the Twitter database; next, we feed the movie text metadata, such as storylines and overviews, to the EDR model built from the tweets' affective tags to classify the affective labels for the movie.
3.1 Affective Computing and Machine
Learning Modeling
Researchers apply Machine Learning (ML) methodologies to solve two classes of problems: regression and classification (Stone and Veloso, 2000). For example, predicting whether the stock market index will move up or down tomorrow, and by how much, is a regression problem, whereas stating the polarity or fine-grained sentiment of stock market traders regarding tomorrow's market performance is a classification problem. In ML classification, polarity SA is a multi-class single-label classification problem (Nakov et al., 2019). However, human emotion is usually expressed as a combination of affective moods, making such a classification task a multi-class multi-label classification problem (Li and Ren, 2012). To simplify our study, we treat the movie metadata text-based emotion detection as a multi-class single-label classification problem instead. Our text-based EDR model's final step outputs seven nodes, each representing the probability distribution of one affective feature. We then take the argmax (Gould et al., 2016) over the probability distribution of the candidate nodes as the predicted mood value, as depicted in equation 1.
\operatorname{argmax}_{x \in D} f(x) = \{\, x \mid f(x) \geq f(y),\ \forall y \in D \,\}   (1)

The argmax function returns the set of data points x in D at which f(x) attains the largest value of the function, if such points exist.
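In practice, the prediction step of equation 1 reduces to selecting the class node holding the largest probability. A minimal sketch, assuming a hypothetical softmax output:

```python
import numpy as np

moods = ["neutral", "joy", "sadness", "hate", "anger", "disgust", "surprise"]

# Hypothetical softmax output of the EDR model's seven class nodes.
probs = np.array([0.084, 0.059, 0.090, 0.233, 0.202, 0.197, 0.135])

predicted_mood = moods[int(np.argmax(probs))]  # "hate" for this distribution
```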
3.2 Data Preparation
The challenge for our study in emotion detection from movie text-based metadata is to obtain a large enough movie metadata set with mood labels. No such dataset is readily available. We therefore need to build the required dataset by deriving it from four different sources. For the movie rating datasets, we obtained the datasets from the MovieLens datasets stored in the GroupLens repository (Harper and Konstan, 2016). We scraped The Movie Database (TMDb) (TMDb, 2018) for movie overviews and other metadata. We derived our emotional word sense set as contextual emotion word synonyms from WordNet (Miller, 1995). Finally, we scraped the Twitter database for tweets with keyword tags that matched our contextual emotion word synonyms (Marres and Weltevrede, 2013). MovieLens contains a "links" file that provides cross-reference links between MovieLens' movie_id and TMDb's tmdb_id. We connect the MovieLens and TMDb datasets through the "links" file.
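A sketch of this cross-reference join, assuming pandas and a hypothetical file name for the scraped TMDb overviews:

```python
import pandas as pd

# links.csv ships with each MovieLens dataset and maps movieId to
# tmdbId (and imdbId); the file paths here are assumptions.
links = pd.read_csv("ml-latest-small/links.csv")       # movieId, imdbId, tmdbId
ratings = pd.read_csv("ml-latest-small/ratings.csv")   # userId, movieId, rating, timestamp
overviews = pd.read_csv("tmdb_overviews.csv")          # hypothetical scrape output: tmdbId, overview

# Join MovieLens ratings to TMDb overviews through the links file.
merged = (ratings.merge(links[["movieId", "tmdbId"]], on="movieId")
                 .merge(overviews, on="tmdbId"))
```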
3.2.1 Extracting Emotion Synonyms from WordNet-Affect Emotion Lists
WordNet developed an affective knowledge linguistic resource known as WordNet-Affect for lexical representation (Strapparava et al., 2004). WordNet-Affect selects and tags a subset of synsets that convey the emotional meaning of a word in WordNet. The WordNet-Affect emotion lists contain lists of concepts extracted from WordNet-Affect: synsets for the six emotions of interest, namely anger, disgust, hate, joy, sadness, and surprise, stored in a compressed file, "WordNetAffectEmotionLists.tar.gz" (Poria et al., 2012). We downloaded the ".gz" file and uncompressed it into six emotion text files. The alternative is to download the already uncompressed six emotion files from GitHub at https://github.com/robert-jm/twit-ranker/tree/master/dictionaries/WordNetAffectEmotionLists. Each emotion file contains two columns of information: the synsets and their synonyms. Here, the synonym set of each synset corresponds to an emotion class and is stored in the corresponding emotion text file. We extracted the synonym column from each emotion text file, removed duplicate synonyms, sorted the cleansed synonyms, and stored the result in comma-separated values (CSV) format in the corresponding emotion synonym file. Table 2 below contains the statistics of the six emotion synonym files after the data cleansing task.
Table 2: Synonym statistics of the six emotions.
mood count synonym list
anger 255 ”abhor”,...,”wrothful”
disgust 53 ”abhorrent”,...,”yucky”
hate 147 ”affright”,...,”unsure”
joy 400 ”admirable”,...,”zestfulness”
sadness 202 ”aggrieve”,...,”wretched”
surprise 71 ”admiration”,...,”wondrously”
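The extraction and cleansing steps above could look like the following sketch; the exact two-column layout of the emotion files (a synset id followed by its synonyms) is an assumption based on the description above.

```python
import csv
from pathlib import Path

# Each WordNet-Affect emotion file (e.g., anger.txt) is assumed to hold a
# synset id followed by that synset's synonyms on each line.
for path in Path("WordNetAffectEmotionLists").glob("*.txt"):
    synonyms = set()
    for line in path.read_text().splitlines():
        parts = line.split()          # [synset, synonym, synonym, ...]
        synonyms.update(parts[1:])    # keep only the synonym columns, deduplicated
    out_name = path.stem + "_syn.csv"
    with open(out_name, "w", newline="") as f:
        csv.writer(f).writerows([[s] for s in sorted(synonyms)])
```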
3.2.2 Extracting tweets from Twitter database
There are many types of tweets on Twitter, a popular social network and microblogging platform. In this study, we only work with regular tweets: short messages of 140 characters or less posted on Twitter. Almost every user's tweets are extractable and available to the public. Each tweet is searchable by keyword. We wrote a simple Python script to extract tweets through Twitter's API (Makice, 2009). We treat each synonym in an emotion's corresponding file as a tweet keyword. By looping through all the synonyms in Twitter's search-by-keyword API, we extract all the tweets with each keyword and store them in a CSV file. The alternative is to extract tweets and store them in a JSON file, as illustrated in (Makice, 2009). For example, if an emotion synonym belongs to the anger concept, we store the retrieved tweets in the anger_raw.csv file. As depicted in Table 2, the anger emotion's corresponding file, anger_syn.txt, has 255 synonyms, so we store all tweets retrieved from the corresponding keywords in anger_raw.csv.
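A sketch of the keyword-driven extraction loop; the tweepy client, its search interface, and the placeholder credentials are assumptions here, as our actual script is not reproduced, and Twitter's API terms and rate limits apply.

```python
import csv
import tweepy

# Placeholder credentials; tweepy is one possible client library.
auth = tweepy.OAuthHandler("API_KEY", "API_SECRET")
api = tweepy.API(auth, wait_on_rate_limit=True)

def scrape_mood(mood, synonyms, limit_per_keyword=200):
    """Search tweets for every synonym of a mood and store them in <mood>_raw.csv."""
    with open(f"{mood}_raw.csv", "w", newline="") as f:
        writer = csv.writer(f)
        for keyword in synonyms:
            # Loop over Twitter's search-by-keyword API, one synonym at a time.
            for tweet in tweepy.Cursor(api.search_tweets, q=keyword,
                                       lang="en").items(limit_per_keyword):
                writer.writerow([mood, keyword, tweet.text])
```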
We performed text pre-processing cleansing steps, which involved removing any punctuation and HTML tags, limiting the set of special characters to those such as ",", ".", or "#", removing stopwords and duplicates, stemming the word phrases, and storing the cleansed text. Our effort yielded the affective feature record counts illustrated in Table 3.
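A sketch of such a cleansing pipeline, assuming NLTK's English stopword list and Porter stemmer (the specific stemmer and the character whitelist are assumptions):

```python
import re
from nltk.corpus import stopwords      # requires nltk.download("stopwords")
from nltk.stem import PorterStemmer

stemmer = PorterStemmer()
stops = set(stopwords.words("english"))

def cleanse(text):
    """Strip HTML tags and punctuation, drop stopwords, and stem the rest."""
    text = re.sub(r"<[^>]+>", " ", text)          # remove HTML tags
    text = re.sub(r"[^a-zA-Z#,.\s]", " ", text)   # keep letters and a few special chars
    tokens = [w for w in text.lower().split() if w not in stops]
    return " ".join(stemmer.stem(w) for w in tokens)
```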
Table 3: Mood datasets gathered from Twitter.
mood class                        record size
neutral                           19108
joy                               138019
sadness                           60381
hate                              38651
anger                             17830
disgust                           19887
surprise                          15002
all 7 mood classes (unbalanced)   308878
each balanced mood                15000
each balanced mood (train)        12000
each balanced mood (test)         3000
Our mood dataset extracted from Twitter is an unbalanced dataset, with the "joy" emotion class occupying slightly over a third of the dataset. The "sadness" emotion class is half the size of "joy", while "hate" is half that of "sadness". These three emotion classes, when combined, represent 77% of the emotion dataset. The distribution of the other four emotions ranges from 15,002 to 19,887 records, more or less evenly distributed. If we apply the dataset directly to Machine Learning modeling without adjusting for the imbalanced classes, we will skew our results toward the dominant mood types. We decided to balance the mood dataset by subsampling each affective attribute's dataset down to 15,000 records. We further split each affective dataset into a training dataset with 80% of the samples (12,000) and a test dataset with 20% of the samples (3,000).
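A sketch of the balancing and splitting steps, assuming the tweets are held in a pandas DataFrame with hypothetical column names:

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# tweets_df is assumed to hold columns ["text", "mood"] for all 308,878 records.
balanced = (tweets_df.groupby("mood", group_keys=False)
                     .apply(lambda g: g.sample(n=15000, random_state=42)))

# 80/20 split as described above: 12,000 training and 3,000 test samples per mood.
train_df, test_df = train_test_split(
    balanced, test_size=0.2, stratify=balanced["mood"], random_state=42)
```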
Using a brute-force method, we scrape the TMDb database for movie metadata, particularly the movie overview or storyline, which contains the subjective movie description whose mood we can classify. We can query the TMDb database by tmdb_id, a unique movie identifier assigned to each movie. The tmdb_id starts from 1 and counts upward; however, the sequence of tmdb_ids may contain gaps between consecutive numbers. Our effort yields 452,102 records after cleansing the raw data we scraped from TMDb.
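A sketch of the brute-force sweep over tmdb_ids using TMDb's public v3 API; the API key, the id range, and the request pacing are placeholders.

```python
import time
import requests

API_KEY = "YOUR_TMDB_KEY"   # assumption: a valid TMDb v3 API key

def fetch_overview(tmdb_id):
    """Return (tmdb_id, title, overview), or None when the id gap yields a 404."""
    url = f"https://api.themoviedb.org/3/movie/{tmdb_id}"
    resp = requests.get(url, params={"api_key": API_KEY})
    if resp.status_code != 200:       # gaps in the tmdb_id sequence return 404
        return None
    data = resp.json()
    return (tmdb_id, data.get("title"), data.get("overview"))

rows = []
for tmdb_id in range(1, 1000):        # the sweep runs from id 1 upward
    row = fetch_overview(tmdb_id)
    if row and row[2]:                # keep only movies with a non-empty overview
        rows.append(row)
    time.sleep(0.05)                  # stay polite to the API
```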
3.3 Emotion Modeling
Inspired by the recent advances in Natural Language Processing (NLP) for text classification described by (Sosa, 2017), we develop our text-based EDR model by combining Long Short-Term Memory (LSTM), a variant of the Recurrent Neural Network (RNN), with the Conv-1D layer of a Convolutional Neural Network (CNN). We build our model similarly to the method used in (Liu, 2020). Our model architecture consists of two halves. The first half is an RNN LSTM-CNN Conv-1D architecture, as described in (Sosa, 2017), in which text input is processed by an LSTM architecture before follow-up processing by a CNN Conv-1D architecture. The second half of the model reverses the processing order into a CNN Conv-1D-RNN LSTM architecture: input is first processed by a CNN Conv-1D architecture before being fed to an RNN LSTM architecture. The two halves of the architecture then combine to feed data into a max-pooling layer of the CNN, whose pooling operation selects the dominant feature in each filter's regional feature map. Next, the data passes into a flattening layer to convert it into a one-dimensional array before being fed to a fully connected dense layer. The dense layer's output feeds a set of nodes equal in number to the classes the architecture aims to classify. Each output node holds the output distribution value of its class. In the final step, a softmax activation function examines and activates the appropriate class node accordingly.
We use bi-directional RNN LSTM and CNN Conv1D architectures to build our model. In the first half of the model, the RNN LSTM-CNN Conv-1D phase, we use two bi-directional LSTMs for the RNN LSTM architecture and seven Conv1D units for the CNN architecture. In the second half of the model, the CNN-LSTM phase, we apply seven pairs of Conv1D units for the CNN architecture and two bi-directional LSTMs for the LSTM architecture. Following the idea illustrated in (Kim, 2014), we prepared two identical input layers of an embedding matrix constructed from a pre-trained GloVe embedding matrix similar to (Pennington et al., 2014). We build the two input layers of the embedding matrix with one of the input embedding layers set to "trainable" while the other is not, i.e., "frozen". During the processing of the first half of the model, the RNN LSTM-CNN Conv-1D phase, the "trainable" input layer feeds one of the bi-directional LSTM architectures while the "frozen" input layer feeds the other. Similarly, when processing the second half of the model, the CNN Conv-1D-RNN LSTM phase, a "trainable" input layer and a "frozen" input layer occupy each pair of the Conv1D units. We obtain 55.6% accuracy in classifying the emotion classes of the tweets' balanced dataset, as depicted in Table 4, and the confusion matrix in Figure 1 depicts the performance of the seven-emotion classifier. The classification result is acceptable for our purpose, since our goal is not to build the best emotion text classifier but a usable one to classify the emotion classes of movie overviews.
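A condensed sketch of the two-branch architecture in Keras follows. The layer sizes, sequence length, vocabulary size, and single Conv1D per branch are illustrative simplifications (the full model uses seven Conv1D units and two bi-directional LSTMs per half), and the random matrix stands in for the real GloVe weights.

```python
import numpy as np
from tensorflow.keras import Input, Model, layers

MAX_LEN, VOCAB, DIM, NUM_CLASSES = 100, 20000, 100, 7

# Stand-in for the real pre-trained GloVe matrix of shape (VOCAB, DIM).
glove_weights = np.random.normal(size=(VOCAB, DIM))

def embedding_input(trainable):
    """One GloVe-initialized input branch; 'trainable' toggles frozen vs. tuned."""
    inp = Input(shape=(MAX_LEN,))
    emb = layers.Embedding(VOCAB, DIM, weights=[glove_weights],
                           trainable=trainable)(inp)
    return inp, emb

inp_a, emb_a = embedding_input(trainable=True)    # "trainable" embedding input
inp_b, emb_b = embedding_input(trainable=False)   # "frozen" embedding input

# First half: bi-directional LSTM followed by Conv1D.
x1 = layers.Bidirectional(layers.LSTM(64, return_sequences=True))(emb_a)
x1 = layers.Conv1D(64, 5, activation="relu")(x1)

# Second half: Conv1D followed by bi-directional LSTM.
x2 = layers.Conv1D(64, 5, activation="relu")(emb_b)
x2 = layers.Bidirectional(layers.LSTM(64, return_sequences=True))(x2)

# Combine the halves, max-pool over time, and classify with a softmax layer.
merged = layers.concatenate([x1, x2])
pooled = layers.GlobalMaxPooling1D()(merged)
dense = layers.Dense(128, activation="relu")(pooled)
out = layers.Dense(NUM_CLASSES, activation="softmax")(dense)

model = Model(inputs=[inp_a, inp_b], outputs=out)
model.compile(optimizer="adam", loss="categorical_crossentropy",
              metrics=["accuracy"])
```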
Table 4: EDR performance on balanced mood tweets dataset.
              precision  recall  f1-score  support
neutral       0.47       0.77    0.59      2992
joy           0.63       0.53    0.58      3030
sadness       0.64       0.44    0.52      3034
hate          0.64       0.51    0.57      2933
anger         0.62       0.68    0.65      2984
disgust       0.44       0.45    0.44      2987
surprise      0.55       0.51    0.53      3040
accuracy                         0.56      21000
macro avg     0.57       0.56    0.55      21000
weighted avg  0.57       0.56    0.55      21000
Figure 1: Confusion matrix of the seven-emotion balanced dataset.
For comparison purposes, we include the EDR performance results on the unbalanced mood tweets data listed in Table 5. We also depict the confusion matrix of the seven-emotion unbalanced tweets dataset in Figure 2.
Figure 2: Confusion matrix of the seven-emotion unbalanced dataset.
Although the result shows much better performance metrics compared with its counterpart, we decided not to use the EDR model trained on the unbalanced dataset, because that dataset heavily favors "joy", "sadness", and "hate", with F1-scores of 78%, 61%, and 58%, respectively. Among the other four moods, "anger" stands at an F1-score of 56%, while "disgust", "surprise", and "neutral" are at 30%, 36%, and 41%, respectively. Besides, when evaluating performance results involving unbalanced data, the macro-average F1-score is the more suitable metric to gauge. Here, the macro-average F1-scores for the balanced and unbalanced datasets are 55% versus 51%, respectively.
Table 5: EDR performance on unbalanced mood tweets dataset.
              precision  recall  f1-score  support
neutral       0.34       0.50    0.41      3758
joy           0.73       0.83    0.78      27719
sadness       0.57       0.66    0.61      12126
hate          0.68       0.51    0.58      7710
anger         0.74       0.46    0.56      3518
disgust       0.49       0.22    0.30      3972
surprise      0.70       0.24    0.36      2972
accuracy                         0.65      61775
macro avg     0.61       0.49    0.51      61775
weighted avg  0.65       0.65    0.64      61775
Let us revisit the example we gave in the mood analysis of the top movie of all time as rated by IMDb, "The Godfather (1972)". We built another version of our emotion detection classifier model using the unbalanced mood dataset. This classifier also classified "hate" as the most dominant mood feature for the movie; the mvec for the film produced by the model built from the unbalanced dataset is depicted in Table 6. That mvec ranks the movie's mood attributes in descending order as "hate", "anger", "joy", "sadness", "disgust", "neutral", and "surprise". Compare this with Table 1, which shows the mood attributes in descending order as "hate", "anger", "disgust", "surprise", "sadness", "neutral", and "joy". Anyone who has watched "The Godfather (1972)" would probably favor the mood analysis produced by the emotion detection classifier model built on the balanced mood dataset.
Table 6: Affect values of movie: "The Godfather (1972)" derived from unbalanced mood dataset.
mood      value
neutral   0.04276474
joy       0.16501102
sadness   0.076094896
hate      0.4305178
anger     0.1993026
disgust   0.053966276
surprise  0.03234269
3.4 Emotion Prediction
We build a seven-class text-based emotion predictor for movie overviews from the seven-emotion tweet classifier model. We run the predictor through all 452,102 overviews scraped from the TMDb database to generate a TMDb movie overview dataset with emotion labels. As mentioned before, MovieLens datasets come in different sizes. We work with the following MovieLens datasets: ml-1m, which contains about one million movie ratings; the ml-20m dataset, with 20 million ratings; the ml-latest-small dataset, with about one hundred thousand ratings from 610 users; the ml-latest-full dataset, which holds 27 million ratings; and the recently released ml-25m dataset, with 25 million ratings. (Note that the name of a MovieLens dataset conveys the number of ratings, movies, users, and tags contained in the dataset.) Table 7 depicts the number of ratings, users, and movies each of the MovieLens datasets contains. Each of the depicted MovieLens datasets provides a links file to cross-reference between MovieLens and two other movie databases, TMDb and IMDb, through movie_id, tmdb_id, and imdb_id. MovieLens maintains a small number of data fields, but users can link to the TMDb and IMDb databases via the links file to access other metadata that MovieLens lacks.
Table 7: MovieLens datasets.
dataset          ratings  users   movies
ml-1m            1M       6000    4000
ml-20m           20M      138000  27000
ml-25m           25M      162000  62000
ml-latest-small  100K     600     9000
ml-latest-full   27M      280K    58000
The ml-latest-full dataset maintains the largest number of movies in the MovieLens dataset collection. However, the ml-latest datasets change over time and are not appropriate for reporting research results. We use the ml-latest-small and ml-latest-full datasets for proof of concept and prototyping, not research reporting work. The MovieLens 1M, 20M, and 25M datasets are stable benchmark datasets, which we will use for research reporting work.
Although we scraped 452,102 movie overviews from TMDb, after merging with MovieLens we only make use of about one-eighth of the overviews we collected. Shown in Table 8 is the number of movie overviews each MovieLens dataset can extract from TMDb after performing the data cleaning task.
Table 8: Number of overviews in MovieLens extracted from TMDb.
dataset number of overviews
ml-1m 1M
ml-20m 26603
ml-25m 25M
ml-latest-small 9625
ml-latest-full 56314
4 Implementation
4.1 Uvec and Mvec
For our study, we need a movie Recommender System to evaluate the performance of uvec and mvec. We envision uvec and mvec playing a role at the tail end of making movie recommendations, i.e., during the stage of the top-N movie recommendation making process. Any movie Recommender can serve to support the evaluation of uvec and mvec. In our case, we develop a Collaborative Filtering-based movie Recommender System (CFRS) based on the SVD algorithm. We then add functions to support uvec and mvec operations in enhancing the making of movie recommendations. We start by adding functions to the SVD-CFRS to support the loading, storing, extraction, and manipulation operations of uvec and mvec. The added uvec and mvec support functions do not interfere with any normal RS operation, including the ordinary movie recommendation making process. We let our SVD-CFRS run in its usual way, serving routine recommendation requests with no uvec or mvec involvement in the movie recommendation making just yet. Before we start to evaluate uvec and mvec, we prepare each user's uvec by computing the average of all the mvecs of the movies the user has watched. We deploy the MovieLens ml-latest-small dataset as the test set and randomly pick a user, user_id 400, as the active test user. Our active test user's uvec, depicted in Table 9, represents the overall average of the mvecs of the 43 movies user_id 400 has watched. We ask the Recommender to make movie recommendations for user_id 400 in a business-as-usual way. The Recommender furnishes a top-N (where N is 20) movie recommendation list for user_id 400, as depicted in Table 11.
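A sketch of such an SVD-CF top-N generator; we use the Surprise library here as a stand-in, since our actual implementation is not reproduced, and the file path is an assumption.

```python
import pandas as pd
from surprise import SVD, Dataset, Reader

# ml-latest-small ratings; the file path is an assumption.
ratings = pd.read_csv("ml-latest-small/ratings.csv")
data = Dataset.load_from_df(ratings[["userId", "movieId", "rating"]],
                            Reader(rating_scale=(0.5, 5.0)))
trainset = data.build_full_trainset()
algo = SVD().fit(trainset)

def top_n(user_id, n=20):
    """Rank every movie the user has not rated by its predicted rating."""
    seen = set(ratings.loc[ratings.userId == user_id, "movieId"])
    candidates = set(ratings.movieId) - seen
    preds = [(mid, algo.predict(user_id, mid).est) for mid in candidates]
    return sorted(preds, key=lambda p: p[1], reverse=True)[:n]

recommendations = top_n(400)   # active test user user_id 400
```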
Using the uvec of user_id 400, we compute the pairwise similarity between user_id 400's uvec and each recommended movie's mvec on the top-N list. We sort the pairwise distance metrics top-N list computed using uvec in descending order to match the presentation ordering of the Recommender's top-N list. We apply five different distance metrics for computing the uvec's pairwise distance metrics top-N list. We depict the comparative top-N results in Table 12. The five distance metrics we employ in the comparison are the Euclidean distance, Manhattan distance, Minkowski distance, Cosine similarity, and Pearson correlation, with their formulas illustrated in equations 2 through 7.
Euclidean distance:
d(x, y) = \sqrt{\sum_{i=1}^{n} (x_i - y_i)^2}   (2)

Manhattan distance:
d(x, y) = \|x - y\|_1 = \sum_{i=1}^{n} |x_i - y_i|   (3)

Minkowski distance:
d(x, y) = \left( \sum_{i=1}^{n} |x_i - y_i|^p \right)^{1/p}   (4)

Inner product:
\mathrm{Inner}(x, y) = \sum_{i} x_i y_i = \langle x, y \rangle   (5)

Cosine similarity:
\mathrm{CosSim}(x, y) = \frac{\sum_i x_i y_i}{\sqrt{\sum_i x_i^2}\,\sqrt{\sum_i y_i^2}} = \frac{\langle x, y \rangle}{\|x\|\,\|y\|}   (6)

Pearson correlation:
\mathrm{PearCorr}(x, y) = \frac{\sum_i (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\sum_i (x_i - \bar{x})^2}\,\sqrt{\sum_i (y_i - \bar{y})^2}} = \frac{\langle x - \bar{x},\, y - \bar{y} \rangle}{\|x - \bar{x}\|\,\|y - \bar{y}\|} = \mathrm{CosSim}(x - \bar{x},\, y - \bar{y})   (7)

where x and y are the vectors x = (x_1, x_2, \cdots, x_n) and y = (y_1, y_2, \cdots, y_n).
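A sketch of scoring the top-N list's mvecs against a uvec with the five metrics of equations 2 through 7, using SciPy; note that the Minkowski order p is not fixed above, so p=3 below is an assumption.

```python
from scipy.spatial import distance
from scipy.stats import pearsonr

def rank_by_metric(uvec, mvecs):
    """Score each recommended movie's mvec against the user's uvec
    with the five metrics of equations 2 through 7."""
    scores = {}
    for mid, mvec in mvecs.items():   # mvecs: {movie_id: 7-dim vector}
        scores[mid] = {
            "euclidean": distance.euclidean(uvec, mvec),
            "manhattan": distance.cityblock(uvec, mvec),
            "minkowski": distance.minkowski(uvec, mvec, p=3),  # p is an assumption
            "cosine":    1.0 - distance.cosine(uvec, mvec),    # similarity form
            "pearson":   pearsonr(uvec, mvec)[0],
        }
    return scores
```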
5 Evaluation
5.1 Findings
Table 9 depicts the average mood value uvec of the active test user user_id 400, while Table 10 shows the weighted average mood value uvec of the active test user.
Table 9: The average mood value (uvec) of user_id 400.
mood      value
neutral   0.16352993
joy       0.08873525
sadness   0.12708998
hate      0.20331840
anger     0.11933819
disgust   0.15881287
surprise  0.13917538

Table 10: The weighted average mood value (uvec) of user_id 400.
mood      value
neutral   0.14755723
joy       0.08006808
sadness   0.11467653
hate      0.18345939
anger     0.10768190
disgust   0.1433009
surprise  0.125581509
Table 11 depicts the top-N movie recommendation list generated by our SVD-CF Recommender for the active test user user_id 400.
Table 11: Top-N recommendation list generated by the SVD-CF recommender for user_id 400.
no.  mid    title
0    527    Schindler's List (1993)
1    5952   Lord of the Rings: The Two Towers, The (2002)
2    2858   American Beauty (1999)
3    2329   American History X (1998)
4    2028   Saving Private Ryan (1998)
5    1089   Reservoir Dogs (1992)
6    110    Braveheart (1995)
7    1291   Indiana Jones and the Last Crusade (1989)
8    4226   Memento (2000)
9    91529  Dark Knight Rises, The (2012)
10   68157  Inglourious Basterds (2009)
11   6016   City of God (Cidade de Deus) (2002)
12   589    Terminator 2: Judgment Day (1991)
13   1704   Good Will Hunting (1997)
14   1200   Aliens (1986)
15   1214   Alien (1979)
16   1      Toy Story (1995)
17   99114  Django Unchained (2012)
18   7361   Eternal Sunshine of the Spotless Mind (2004)
19   1136   Monty Python and the Holy Grail (1975)
Table 12 depicts the comparison of the top-N recommendation lists generated by the five distance metrics for user_id 400.
Table 12: Comparison of the top-N recommendation lists generated by the five distance metrics for user_id 400.
no.  mid    Euc    Man    Min    Cos    Pear
0    527    1291   1291   1291   91529  110
1    5952   4226   5952   4226   68157  2329
2    2858   5952   4226   7361   527    91529
3    2329   7361   1089   5952   2329   527
4    2028   1089   7361   1089   2858   2028
5    1089   1200   2028   1200   6016   68157
6    110    2028   589    1704   110    1214
7    1291   1704   1704   2028   1      1
8    4226   589    1200   589    99114  1200
9    91529  1214   1214   1214   1136   1089
10   68157  99114  1      1136   1214   6016
11   6016   110    1136   99114  2028   2858
12   589    1136   110    110    589    99114
13   1704   1      2329   1      1200   1136
14   1200   6016   527    2858   1704   1704
15   1214   2329   99114  6016   1089   589
16   1      527    6016   527    7361   5952
17   99114  2858   91529  2329   5952   7361
18   7361   68157  2858   68157  4226   1291
19   1136   91529  68157  91529  1291   4226

Table 13 depicts the comparison of the top-N recommendation lists generated by the five distance metrics for the active test user user_id 400 using the weighted mood values for the uvec. To get the average weighted uvec for the active test user user_id 400, we multiply the averaged user_id 400 uvec by the average normalized rating value of the watched movies. We depict the average weighted uvec of user_id 400 in Table 10. Using the weighted uvec of user_id 400, we compute the five weighted distance metrics depicted in Table 13. Interestingly, the top-N recommendation lists generated through Cosine similarity and Pearson correlation are identical whether we use the weighted or non-weighted uvec. However, the other three metrics differ slightly in their top-N rankings between the weighted and non-weighted uvec. Cosine similarity appears to produce an inverted ranking of its top-N list when compared to the Euclidean, Manhattan, and Minkowski distance metrics.
Table 13: Comparison of the top-N recommendation lists generated by the five distance metrics for user_id 400 using the weighted uvec.
no. mid wEuc wMan wMin wCos wPear
0 527 1291 1291 1291 91529 110
1 5952 4226 5952 4226 68157 2329
2 2858 5952 7361 1200 527 91529
3 2329 7361 4226 1089 2329 527
4 2028 1089 1089 5952 2858 2028
5 1089 2028 589 7361 6016 68157
6 110 1200 2028 2028 110 1214
7 1291 1704 1704 1704 1 1
8 4226 589 1214 589 99114 1200
9 91529 1214 1200 1214 1136 1089
10 68157 110 1136 110 1214 6016
11 6016 99114 1 99114 2028 2858
12 589 1 110 1 589 99114
13 1704 1136 99114 1136 1200 1136
14 1200 2329 2329 527 1704 1704
15 1214 527 527 2329 1089 589
16 1 6016 6016 6016 7361 5952
17 99114 2858 91529 68157 5952 7361
18 7361 91529 2858 2858 4226 1291
19 1136 68157 68157 91529 1291 4226
5.2 Comparison of Top-N made by
Distance Metrics
We combined the following MovieLens rating datasets into one dataset, merging the ml-1m, ml-20m, ml-25m, and ml-latest-full datasets while excluding ml-latest-small. We extracted all data points of user_id 400, the active test user under examination, from the combined rating dataset. We removed duplicated data points, based on movie_id and timestamp, from the combined rating dataset that matched the data points of user_id 400. The combined rating dataset serves here as the movies to be watched by the active test user user_id 400 on some future timeline. We employed ml-latest-small as the input dataset for our SVD-CF Recommender to get the top-N recommendation list for our active test user user_id 400. The SVD-CF Recommender is never aware of the combined user_id 400 data points except those contained in the ml-latest-small dataset. Once we excluded all the duplicated data points found in ml-latest-small from the combined rating dataset, we had 209 user_id 400 data points for validation work. We extracted the data points from the combined dataset that matched the top-N recommendations generated by the Recommender, as depicted in Table 14. We put the extracted data points into two groups: data points found in the top-N and those not found in the top-N. Eight data points in the combined dataset matched; sorted by timestamp in ascending order, they reflect the order in which the active user watched the movies. That is a good sign because it shows the SVD-CF Recommender works at a 40% support rate: eight out of twenty recommendations coincided with the preferences of the active test user. If we draw a cutoff at the top-5 across the comparison of the top-N recommendation lists, the following four movie_id data points in the combined dataset match: 1291, 4226, 7361, and 2028. Cosine takes fifth place; it only hits the 4th spot of its top-5 list. Pearson takes fourth place, hitting the second and fifth spots of its top-5 list. Manhattan takes third place, hitting spots 1, 3, and 5. Euclidean takes second place, hitting spots 1, 2, and 4. Minkowski wins the round by hitting the top 1, 2, and 3 spots.
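The top-5 support-accuracy tallies above can be recomputed mechanically; a sketch, with the matched ids and top-5 rankings transcribed from Tables 11, 12, and 14:

```python
# The eight movie ids found both in the top-N list and in the combined
# "future" dataset (Table 14, left column).
matched = {1, 2329, 1291, 589, 1704, 2028, 4226, 7361}

# Top-5 of each ranking, transcribed from Tables 11 and 12.
top5 = {
    "svd":       [527, 5952, 2858, 2329, 2028],
    "euclidean": [1291, 4226, 5952, 7361, 1089],
    "manhattan": [1291, 5952, 4226, 1089, 7361],
    "minkowski": [1291, 4226, 7361, 5952, 1089],
    "cosine":    [91529, 68157, 527, 2329, 2858],
    "pearson":   [110, 2329, 91529, 527, 2028],
}

for name, movies in top5.items():
    hits = sum(m in matched for m in movies)
    print(f"{name}: {hits}/5 = {hits / 5:.0%} support accuracy")
# Minkowski scores 3/5 = 60%, versus the SVD-CF baseline's 2/5 = 40%.
```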
Table 14: Common data points found in the top-N recommendation list and in the active test user user_id 400's combined dataset.
no.  movie_id in top-N  movie_id not in top-N
1    1                  110
2    2329               527
3    1291               1089
4    589                1136
5    1704               1200
6    2028               1214
7    4226               2858
8    7361               5952
9                       6016
10                      68157
11                      91529
12                      99114

6 Future Work
We plan to extend our affective computing study by building an Emotion Aware Recommender using the emotion label tags obtained from movie overviews through the Tweets Affective Classifier. We also plan to make use of the affective features in users' emotion profiles to enhance a Group Recommender in group formation, group dynamics, and group decision making. In our study, we would also like to find better metrics to measure the performance of affective computing.
7 Conclusion
In this paper, we illustrate a strategy to generate affective features for movies via transfer learning techniques, utilizing an Emotion Detection and Recognition (EDR) model classifier from a different domain. We developed the EDR model to detect and recognize seven emotional features in tweets through affective tags stored in the Twitter database. We then transferred the learning of the EDR model from classifying the emotional features of tweets to predicting the moods of a movie through the movie description in the movie overview. We scraped the TMDb database for movie overviews and metadata. Through the EDR program, we generated the emotional features, mvec, for each movie collected from TMDb. We gathered movie datasets containing rating information from the MovieLens repository. We used the rating dataset of MovieLens to build an SVD-CF Recommender. We added functions to support uvec and mvec in enhancing the Recommender's generation of top-N movie recommendations. We randomly picked an active user, user_id 400, as the test candidate. We generated a top-N recommendation list, with N set to 20, through the SVD-CF Recommender. We computed the uvec for user_id 400 by averaging all the mvecs of the movies the test candidate has watched. We calculated the Euclidean, Manhattan, Minkowski, Cosine, and Pearson distance metrics and compared the five distance metrics' rankings against the Recommender's top-N. We found the Minkowski distance metric performed the best, at 60% support accuracy versus the 40% accuracy made by the Recommender: a 20-percentage-point improvement in top-N movie recommendations. We also make the following observations:
- The text-based NLP EDR modeling technique works and can be applied to solve real-world problems wherever an abundance of subjective writing is available.
- The text-based EDR model is transferable from one domain to another; all it requires is that the target text be in subjective writing form.
REFERENCES
Beel, J., Gipp, B., Langer, S., and Breitinger, C. (2015).
Research-paper recommender systems: a literature
survey. International Journal on Digital Libraries,
pages 1–34.
Bhowmick, P. K., Basu, A., Mitra, P., and Prasad, A. (2009).
Multi-label text classification approach for sentence
level news emotion analysis. In International Con-
ference on Pattern Recognition and Machine Intelli-
gence, pages 261–266. Springer.
Bobadilla, J., Ortega, F., Hernando, A., and Gutiérrez, A.
(2013). Recommender systems survey. Knowledge-
Based Systems, 46:109–132.
Canales, L. and Martínez-Barco, P. (2014). Emotion de-
tection from text: A survey. In Proceedings of the
Workshop on Natural Language Processing in the 5th
Information Systems Research Working Days (JISIC),
pages 37–43.
Danisman, T. and Alpkocak, A. (2008). Feeler: Emotion
classification of text using vector space model. In
AISB 2008 Convention Communication, Interaction
and Social Intelligence, volume 1, page 53.
De Silva, L. C., Miyasato, T., and Nakatsu, R. (1997). Fa-
cial emotion recognition using multi-modal informa-
tion. In Proceedings of ICICS, 1997 International
Conference on Information, Communications and Sig-
nal Processing. Theme: Trends in Information Sys-
tems Engineering and Wireless Multimedia Commu-
nications (Cat., volume 1, pages 397–401. IEEE.
Ekman, P. (1999). Basic emotions. Handbook of cognition
and emotion, 98(45-60):16.
Ekman, P. (2006). Darwin and facial expression: A century
of research in review. Ishk.
Ekman, P., Friesen, W. V., and Ellsworth, P. (2013). Emo-
tion in the human face: Guidelines for research and
an integration of findings, volume 11. Elsevier.
Fink, C. R., Chou, D. S., Kopecky, J. J., and Llorens, A. J.
(2011). Coarse- and fine-grained sentiment analysis of
social media text. Johns hopkins apl technical digest,
30(1):22–30.
Gould, S., Fernando, B., Cherian, A., Anderson, P., Cruz,
R. S., and Guo, E. (2016). On differentiating pa-
rameterized argmin and argmax problems with ap-
plication to bi-level optimization. arXiv preprint
arXiv:1607.05447.
Harper, F. M. and Konstan, J. A. (2016). The movielens
datasets: History and context. ACM Transactions on
Interactive Intelligent Systems (TiiS), 5(4):19.
Ho, Ai Thanh, M. I. L. and Tagmouti, Y. (2006). E-mrs: Emotion-based movie recommender system. In Proceedings of IADIS e-Commerce Conference. USA: University of Washington Bothell, pages 1–8.
IMDb (2020). Top 100 greatest movies of all
time (the ultimate list). Published online at
http://www.imdb.com/list/ls055592025/. Accessed:
2020-05-17 16:40:27.
Jannach, D., Zanker, M., Felfernig, A., and Friedrich,
G. (2010). Recommender systems: an introduction.
Cambridge University Press.
Kim, Y. (2014). Convolutional neural networks for sentence
classification. arXiv preprint arXiv:1408.5882.
Kompan, M. and Bielikova, M. (2014). Group recommen-
dations: survey and perspectives. Computing and In-
formatics, 33(2):446–476.
Konstan, D. and Konstan, D. (2006). The emotions of the
ancient Greeks: Studies in Aristotle and classical lit-
erature, volume 5. University of Toronto Press.
Li, J. and Ren, F. (2012). Emotion recognition of we-
blog sentences based on an ensemble algorithm of
multi-label classification and word emotions. IEEJ
Transactions on Electronics, Information and Sys-
tems, 132(8):1362–1375.
Liu, T. (2020). Multi-class emotion classification for short
texts. Published online at https://tlkh.github.io. Ac-
cessed: 2020-05-18 16:11:30.
Maas, A. L., Daly, R. E., Pham, P. T., Huang, D., Ng, A. Y.,
and Potts, C. (2011). Learning word vectors for sen-
timent analysis. In Proceedings of the 49th Annual
Meeting of the Association for Computational Lin-
guistics: Human Language Technologies, pages 142–
150, Portland, Oregon, USA. Association for Compu-
tational Linguistics.
Makice, K. (2009). Twitter API: Up and running: Learn how to build applications with the Twitter API. O'Reilly Media, Inc.
Marres, N. and Weltevrede, E. (2013). Scraping the social?
issues in live social research. Journal of cultural econ-
omy, 6(3):313–335.
Medhat, W., Hassan, A., and Korashy, H. (2014). Sentiment
analysis algorithms and applications: A survey. Ain
Shams engineering journal, 5(4):1093–1113.
Miller, G. A. (1995). Wordnet: a lexical database for en-
glish. Communications of the ACM, 38(11):39–41.
Nakov, P., Ritter, A., Rosenthal, S., Sebastiani, F., and Stoy-
anov, V. (2019). Semeval-2016 task 4: Sentiment
analysis in twitter. arXiv preprint arXiv:1912.01973.
Naqvi, N., Shiv, B., and Bechara, A. (2006). The role
of emotion in decision making: A cognitive neuro-
science perspective. Current directions in psychologi-
cal science, 15(5):260–264.
Pennington, J., Socher, R., and Manning, C. D. (2014).
Glove: Global vectors for word representation. In
Proceedings of the 2014 conference on empirical
methods in natural language processing (EMNLP),
pages 1532–1543.
Plutchik, R. (2001). The nature of emotions: Human emo-
tions have deep evolutionary roots, a fact that may ex-
plain their complexity and provide tools for clinical
practice. American scientist, 89(4):344–350.
Poria, S., Gelbukh, A., Cambria, E., Yang, P., Hussain,
A., and Durrani, T. (2012). Merging senticnet and
wordnet-affect emotion lists for sentiment analysis. In
2012 IEEE 11th International Conference on Signal
Processing, volume 2, pages 1251–1255. IEEE.
Scheel, C., Castellanos, A., Lee, T., and De Luca, E. W.
(2012). The reason why: A survey of explanations
for recommender systems. In International Work-
shop on Adaptive Multimedia Retrieval, pages 67–84.
Springer.
Sosa, P. M. (2017). Twitter sentiment analysis using com-
bined lstm-cnn models. Eprint Arxiv.
Stone, P. and Veloso, M. (2000). Multiagent systems: A sur-
vey from a machine learning perspective. Autonomous
Robots, 8(3):345–383.
Strapparava, C., Valitutti, A., et al. (2004). Wordnet affect:
an affective extension of wordnet. In Lrec, volume 4,
page 40. Citeseer.
Tayib, S. and Jamaludin, Z. (2016). An algorithm to de-
fine emotions based on facial gestures as automated
input in survey instrument. Advanced Science Letters,
22(10):2889–2893.
TMDb (2018). Tmdb about. Published online
at www.themoviedb.org. Accessed: 2018-05-11
17:29:37.
Tripathi, V., Joshi, A., and Bhattacharyya, P. (2016). Emo-
tion analysis from text: A survey. Center for Indian
Language Technology Surveys.
Valitutti, A., Strapparava, C., and Stock, O. (2004). Devel-
oping affective lexical resources. PsychNology Jour-
nal, 2(1):61–83.
Wilson, T., Wiebe, J., and Hoffmann, P. (2005). Recogniz-
ing contextual polarity in phrase-level sentiment anal-
ysis. In Proceedings of human language technology
conference and conference on empirical methods in
natural language processing, pages 347–354.
Zhang, S., Yao, L., and Sun, A. (2017). Deep learning based
recommender system: A survey and new perspectives.
arXiv preprint arXiv:1707.07435.
Zhang, Z.-K., Zhou, T., and Zhang, Y.-C. (2011). Tag-aware
recommender systems: a state-of-the-art survey. Jour-
nal of computer science and technology, 26(5):767.