Conference PaperPDF Available

Tweets Emotion Prediction by Using Fuzzy Logic System

Authors:

Figures

Content may be subject to copyright.
Abstract The social media world is growing day by day;
people are using social media platforms to express their
feelings. Leading companies are deploying those platforms in
measuring customer’s satisfaction with their products and
services. The huge amount of data produced by such platforms
can be analyzed to help those companies in improving their
businesses. This paper introduces a new contribution in the
field of emotion classification for Tweets by using Fuzzy Logic
intellection that classify each tweet to an emotion with different
degrees of intensity. We achieved that by developing two fuzzy
classification systems, the first one inspects the text and
referred to as (TCFL), while the second one inspects emojis
associated with tweet’s text and referred to as (ECFL). Our
approaches classify tweets into eight different emotion
categories (Joy, Sadness, Anger, Disgust, Trust, Fear, Surprise,
and Anticipation) with seven emotion degrees (Extremely
High, Very High, High, Medium, Low, Very Low, and
Extremely Low). After comparing the developed two systems
with human-based classification, TCFL outperformed ECFL
with 48.96% match as compared to 32.54% match for ECFL.
Keywords- Sentiment analysis, emotion detection, fuzzy logic,
multi-label classification, text classification,emoji classification,
Tweets.
I. INTRODUCTION
Over 3.2 billion people are using social media
platforms, 73% of marketers are using social media for
marketing purposes, and 54% of social media users are
using social media platforms to search products and services
and express their opinions about it [1]. This huge usage for
social media platforms is producing a tremendous amount of
unstructured data over the web, and in the last few years,
analyzing such data to extract useful information from it,
and use it to make crucial business decisions has become an
important issue. Big companies are investing their money to
manage and analyze the data they get about their business
from their social media accounts. The reason for this is that
people posts, comments, tweets, and other types of online
interactions can be used to explore the opinions about a
specific brand, what they think about a new product, what
people want, and the general trend in a specific topic. If they
explored such information, business owners can adjust their
business to increase customer satisfaction and achieve
higher profits.
Sentiment analysis is a natural language processing
(NLP) branch that constructs data mining systems to extract
the opinion from a text, and whether the text holds a positive
or negative opinion or feeling [2][3]. The polarity
classification here is concerned with classifying the
document, word, or sentence to positive, negative, or neutral
opinion or feeling. While the other type of sentiment
analysis is called the beyond polarity classification and it
aims at not only classifying the document to positive or
negative but also classify it to an emotion like sad, happy,
angry, and surprised [4][5].
Emotion recognition is a type of beyond polarity
classification that aims at uncovering the emotional
sentiment in a text and the feeling of a user when he/she
wrote a review, comment, or his opinion about something.
In order to detect the emotion, the text itself needs to be
analyzed in a certain way. Usually, the process starts with
breaking the text down to words and then using a specific
approach that processes those words based on their
sentiment in order to discover the final emotion of the
document or the sentence [6]. Most of the emotion detection
systems consider the text and make the use of lexicons that
pair the word with a sentiment and a weight that represent
the intensity of that emotion, and this is called text
classification [7][8][9].
Text classification is a data mining task that maps a
data instance (text) to one label from a set of possible labels
called classes. The built classification model analysis the
text features to decide the label of the data instance. Many
variations are possible here. The hard version of the text
classification maps the data instance to an explicit label.
While in the soft version, the instance could be assigned a
probability. In another variation, the same data instance
could be classified to more than one class at the same time,
and this is called multi-label classification [10]. The most
common techniques used to build the classification model
are decision trees (DT), Support vector machine (SVM),
Tweets Emotion Prediction by Using Fuzzy Logic
System
Yahya M. Tashtoush, Dana Abed Al Aziz Orabi
Department of Computer Science
Jordan University for Science and Technology
Irbid, Jordan
Email: yahya-t@just.edu.jo,daorabi16@cit.just.edu.jo
2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS)
978-1-7281-2946-4/19/$31.00 ©2019 IEEE
83
naïve bayes (NB), and Bayesian classifiers [11][12].
Although most of the applications process the text only,
there has been an increased attention to the emojis, given
their importance in the sentiment analysis task, leading to
more reliable classification results [13][14].
In the multi-label classification, the same data instance
can be classified to two or more labels from the classes data
set. The motivation for this concept came from the medical
diagnosis and text classification problem where the same
text could cover more than one subject like social and
religion. And the same patient medical profile could be
classified to more than one disease together like blood
pressure and diabetes [15] [16]. Boosting, k-nearest
neighbor, decision trees, neural networks, and fuzzy logic
are classification methodologies that can be adjusted and
optimized to solve the multi-label classification problem
[17] [18] [19]. In our contribution we deployed the fuzzy
logic concept to develop the two multi-label classification
systems (TCFL and ECFL).
This paper will be completed as follows: section 2
explains the fuzzy logic concept in details, section 3
presents the problem statement and the study’s goal. section
4 demonstrates the related works, section 5 presents our
classification methodology, section 6 presents the
experimental results, section 7 concludes our contribution,
and section 8 points to future research directions.
II. FUZZY LOGIC
The word fuzzy means the thing that is not clear or
cloudy. In the real world, there is a lot of situations that we
cannot judge whether a specific thing is true or false,
absolute positive or absolute negative, because it is not
absolute for us. This was the reason for discovering the
fuzzy logic concept by Lotfi Zadeh back in the 60’s. He
stated that the modern computers with the language of 0’s
and 1’s cannot understand the natural language of humans
because it cannot be converted to an absolute 0’s and 1’s.
His new idea introduced the fuzzy logic as a way of
computing that is based on the degree of truth instead of an
absolute 0’s or 1’s as in the Boolean logic [20] [21].
A. Fuzzy Logic Methods
There are three types of fuzzy logic inference methods
which are Tsukamoto, Mamdani, and Sugeno. They all have
the three stages of the fuzzy logic framework which are
fuzzification, rule generation, and defuzzification which are
demonstrated in detail in the following section. But the core
difference between the fuzzy logic methods is the
methodology that is used to create the crisp output out of the
fuzzy input [22]. The Mamdani fuzzy inference system uses
the center of gravity algorithm in the defuzzification stage,
the Sugeno uses the weighted average methodology, and the
Tsukamoto uses the height method. The researchers use the
inference system that is suitable for the available input
values and the desired output from the system based on the
developed application [23][24][25].
B. Fuzzy Logic Architecture
Basically, the fuzzy logic system consists of four parts
which are the fuzzification module, knowledge base,
inference engine, and defuzzification module [26][27]. In
the fuzzification module, the input numerical values are
converted to linguistic values. Those input values are
mapped to linguistic values based on a membership
function. The fuzzy controller that consist of the knowledge
base and the inference engine performs the core steps of the
fuzzy logic framework [28]. The knowledge base consists of
a set of well-defined if-then fuzzy rules, and the inference
engine uses those rules for the assigning of the output
values. The inference engine is responsible for determining
which rules are applied based on the input values. Follows
that a step of rule combining to develop the control actions.
There are many methods to combine the rules such as the
maximum method and the bounded sum. In our study we
use the maximum method to combine the rules. The last step
in the fuzzy system is the defuzzification. In the
defuzzification module, the fuzzy sets generated from the
previous steps are converted to a crisp output. There is a lot
of methods to apply the defuzzification which are the center
of gravity, weighted average, left most, and right most
maximum [29][30][31].
C. Fuzzy Knowledge Base
The fuzzy knowledge base consists of a set of fuzzy
if-then rules that are used in the classification process.
Those rules should be generated and studied carefully
because the efficiency of the classification system depends
on the quality of the rules that resides in the fuzzy
knowledge base. There are several methodologies for the
fuzzy rule generation. The first one is the histogram
methodology where the input feature values are mapped to a
histogram in a different way for each class and then used as
membership function for the classification task. The other
type is the rule generation based on the mean and standard
deviation of the input values. In this approach, there is a rule
generated for each class based on the mean and standard
deviation of each attribute values in the dataset [32][33].
The last rule generation methodology is the simple
fuzzy grid where each attribute is partitioned to a fuzzy set.
After that the fuzzy grid is built and it consists of fuzzy
subspaces. The number of the subspaces is determined by
the number of the attributes and their fuzzy sets. Each
attribute is represented by a dimension in the grid and the
dimension is partitioned to the fuzzy sets. Each subspace
presents a meeting point between one fuzzy set from each
attribute. After that, for each one of the fuzzy subspaces, a
fuzzy rule is generated [34].
2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS)
84
D. Fuzzy Membership Functions
At the stages of fuzzy logic related to fuzzification and
defuzzification, the crisp input values need to be fuzzified
and converted to linguistic values, and then defuzzified and
converted back to a crisp output value. This is done using
the membership functions that present the degree of truth
based on the fuzzy logic and it is presented as a graphical
form. Each input value is mapped to one or two values
between 0 and 1. If the same input value got two linguistic
values, this means that it belongs to two fuzzy sets with
different degrees of membership and the summation of the
degrees should be equal to 1. There are different types of
membership functions which are the trapezodial, triangular
and Gaussian. For the fuzzification we used the triangular
form while for the defuzzification we used the trapezoidal
form [35] [36].
III. PROBLEM STATEMENT AND STUDY AIM
Usually, emotion classification systems classify the text
to an absolute class of emotion like sad, anger, happy. And
most of them focus on the words in the text only without
paying attention to emojis that appear in it and could add a
piece of very important sentiment information.
The aim of this study is to develop a classification
framework by using the fuzzy logic approach which helps
us in classifying the tweet to an emotion in addition to
detecting the degree of intensity of that emotion in the
tweet, with two different variations of the system. The first
system considers the text in a tweet to perform the
classification task, and through this thesis, we refer to this
system as textual classification by using fuzzy logic (TCFL)
system. The other one is based on emojis contained in a
tweet and we consider them only to perform the
classification task. We refer to this system as emojis
classification by using fuzzy logic (ECFL). In the end, we
answer the question: which one of the two, the text or emoji
is more representative of the true emotion in a tweet?
IV. LITIRATURE REVIEW
This section presents the latest scientist’s contributions
related to tweets sentiment analysis, Emojis based sentiment
analysis, and fuzzy logic classification.
A. Tweets Sentiment Analysis
Sharma et al. [37] developed a web application that
performs alive tweets sentiment analysis on a regional level.
Python Flask platform is used by the authors to render the
live web pages based on the user search request. After the
twitter API gathers the relative tweets based on the search
string, data features including text, user name, friends list,
and followers are extracted from the metadata. The location
of the user is extracted also from the metadata in case the
user provided it in the first place, otherwise, the location is
extracted based on the location signs mentioned in the text
like countries and cities names. Next, the python text blob
library is used to perform the sentiment analysis task, this
library assigns the text with a sentiment score that ranges
from -1 to 1 from the most negative to the most positive.
The last step is to create a .csv file that contains all the data
of the tweets along with their sentiment score. This file is
used by the Plotly library that generates a file and sends it to
the front end at the shape that allows the data to be plotted
on the world map using JavaScript. The final result is at the
shape of a plotted world map shaded based on the sentiment
of the tweets the user searched for. The degree of the
shading in a specific location represent the sentiment of the
tweets in that area. For example, dark shade could mean
positive sentiment and light shade could represent the
negative sentiment.
Jabreel et al. [38] proposed a new methodology for the
problem of multi-label classification, each multi-label
classification task should go through the stage of problem
transformation that transforms the multi-label classification
problem to multiple single labeling problems. There are
many methods to perform the problem transformation
operation such as the binary relative method. They proposed
a new method and called it the XY pair set. This algorithm
considers a multi-labeled data set and the set of all possible
labels and transforms it to a binary data set that pairs the
data instances of data along with the possible labels with a
value of 0 or 1 that represent whether the instance is labeled
with that class or not. Then they built a deep learning model
with 3 parts: The embedding module assign each word-label
with a weight. The attention model uses those weights to
define the strength of the relationship between each word
and the label. The second part is the encoding module that
uses a recurrent neural network to transform the previous
vector to vector of hidden status to prepare it for the
classification module that does the classification task using
the neural network. The proposed methodology achieves the
highest accuracy of 54% compared with the baseline that
uses the TCS and SVM.
Sharifirad et al. [39] worked on a project that detects
the emotion and the intensity degree of it in sexist tweets.
Their work is based on a previous study that categorized the
sexist subjects to three categories which are indirect
harassment, sexual harassment, and physical harassment.
They worked on four emotions anger, joy, sadness, and fear
with no, low, medium and high degree of intensity. For the
purpose of multi-label classification, they used the one-vs.-
rest (OVR), support vector machine (SVM), Naïve Bayes
(NB), k-nearest neighbor (KNN), Multi-layer perceptron
(MLP), long-short-term memory (LSTM), and
convolutional neural network (CNN) with the embedding
representations of the words W2V, Glove, and Fast text with
the Fast text achieving the highest classification accuracy
with all classes of emotions. The next step was to study the
intensity of each emotion under the 3 categories of sexist
speech. This was applied by the classification of data set
that is already classified to the 3 categories of harassment to
16 different classes that represent the emotion with a degree
of intensity to discover what emotions are held under each
2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS)
85
harassment category. The results showed that indirect
harassment is strongly related with high intensity of joy
feeling, the sexual harassment is strongly attached to anger,
joy and sadness feelings with a high degree of intensity,
while the physical harassment caries high degree of anger,
joy and sadness emotions.
B. Emojis Based Sentiment Analysis
Tashtoush et al. [40] built an automatic labeling system
that uses emojis to classify an Arabic twitter data set to one
of four emotions which are anger, joy, sadness, and disgust.
They labeled the most used emojis on twitter manually to
one of the four emotions. Then the weights for those emojis
are assigned based on the AFINN lexicon and if the weight
is not available, it is assigned the weight of its representative
word from the same lexicon. The tweet is labeled with the
emotion of the emoji that has the highest absolute sentiment
score in the tweet and the sentiment of the emoji is doubled
if it is mentioned more than on time. Then another portion
of the data is labeled manually by the authors. The entire
data set is preprocessed to build a classification model based
on the two classifications (manually and automatically) and
compare between them to see the effect of using emojis in
the classification process. They used three classifiers which
are the SVM, MNB, and RF. The automatic classification
that uses emojis outperformed the manual one in all of the
classifiers. In addition, the MNB achieved the highest
precision score of 0.757.
Chen et al. [41] studied the effect of engaging the emoji
in the sentiment classification process. The main idea was to
measure the performance of the classifiers when the input
data set enters the classifier with or without emojis. The
authors decided to consider 7 classes of emotions which are
sad, angry, happy, scared, thankful, surprised, and love.
Then they represented each emotion with a set of
representative keywords for the purpose of tweets
collection. If the tweet contains one of the keywords as a
hashtag it is included in the data set. Then for labeling the
data set, the hashtags in the tweet are studied and the tweet
is labeled with an emotion based on the last keyword
hashtag. For example, if the last hashtag is #joy, the tweet is
labeled with the happy emotion. For the purpose of
evaluation, they used the multinomial NB and SVM
classifiers. The MNB, in general, outperformed the SVM
accuracy. In the MNB, the classification accuracy when
using emojis is higher than the accuracy when eliminating
them. On the other hand, when using the SVM classifier, the
classification accuracy is slightly higher when eliminating
emojis than keeping them through the classification process.
C. Fuzzy Logic Classification
Bahreini et al. [42] introduced a fuzzy logic detection
system that detects the emotions based on live facial
gestures. This system detects the emotion from an image,
video or even a live camera recording. This methodology
used a database of facial expressions classified to emotions.
The proposed system studied the features in the data set of
the images to extract numerical features related to facial
dimensions to build the fuzzy rules based on them. The
fuzzy logic system study the features in the facial image,
transform the crisp values to fuzzy values, then using the
fuzzy rules, the emotion is detected. The proposed system
achieved an 83% accuracy level, stating that the system
performs better when working on the exhaustive emotions.
Liu et al. [43] introduced a fuzzy logic classification
system to classify social media posts and comments to one
of the hate speech categories which are sexual, religion,
color, and disability. The text is transformed into a
numerical vector using the bag of words methodology. After
that, the vector values are transformed to fuzzy values based
on the fuzzy membership function, then those values are
defuzzified to get the final class with its degrees by applying
the fuzzy rules. The other contribution of the authors is a
methodology to solve the problem of ambiguous instances
that gets the same membership degree by retraining those
instances and using the K-nearest neighbor to classify it
based on the similarity with the surrounding instances.
V. METHODOLOGY
This section presents our research methodology
including the dataset type, fuzzy logic classification and
defuzzification process.
A. Dataset
The dataset we obtained for our research is called
(Customer Support on Twitter), and it consists of 3 million
tweets and replies that reflects customer opinions and
problems about a group of famous brands products and
services. It is free and available on Kaggle [44]. For our
research purposes, we filtered the 3 million tweets and saved
the ones that contain multi-emoji to end up with 30233
tweets. We selected 120 emoji for the filtering process. We
selected emojis that represent a feeling or emotion and
excluded the ones that do not give a sentiment meaning.
We handled the 30233 tweets as follows: we separated
the text from emojis of the tweet. We generated two
separate files for the two systems (TCFL and ECFL), one
for the text only and the other contains emojis related to
each tweet. We handled each file in a different way that is
explained later in this paper.
Preprocessing steps included removing twitter handles,
hashtags, links, punctuations, numbers, special characters,
words less than two letters, and English stop words. We
handled repeated letters manually, and substituted
abbreviations and cursing words with a representative word.
B. Sentiment Lexicons
The sentiment lexicon is a special type of lexicons which
refers to dictionaries of words with a label or weight that
specifies their sentiment, whether a word gives a positive or
2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS)
86
negative sentiment, or a weight that specifies the sentiment
of it. For the textual based classification system (TCFL) we
used 6 NRC sentiment lexicons [45] [46] [47] [48], which
attaches the words with a sentiment label (positive or
negative) and score. While for the emojis based
classification system (ECFL), we used the emoji lexicon in
[40] and followed their methodology to give sentiment
labels and scores for the emojis not included in their study.
C. Base Class Identification
Each tweet first is classified to a base emotion (Anger,
Sadness, Joy, Fear, Disgust, Surprise, Anticipation, and
Trust), and then the tweet words in TCFL and the tweet
emojis in ECFL enters the fuzzy logic framework to specify
the degrees of the classification. In TCFL, the word with the
highest sentiment score decides the base class, if not found
in the emotion lexicon, we move to the less weighted word,
and so on. NRC emotion lexicons [49] [50] are used for this
step. For the ECFL we classified the 120 emojis manually to
one of the 8 main emotions and the first emoji in the tweet
decides the main base class based on the constructed
emotion-emojis lexicon.
D. Fuzzification
The crisp values (weights) of the words and emojis are
converted to linguistic values using two fuzzification
membership functions we built for our systems using the
triangular form. The TCFL fuzzification membership
function ranges from 0 to ∞, and ECFL fuzzification
membership function ranges from 0 to 5 based on the crisp
values ranges over the Low, Medium, and High fuzzy sets.
The same ranges are used for the negative sentiment but
with a negative sign, as the membership function works as
mirror for both. The sentiment weights (crisp values) for the
words series in the TCFL and the emojis series in the ECFL
are fuzzified to linguistic values.
E. Fuzzy Rule Generation and Combining
We used the simple fuzzy grid to generate simple if-then
fuzzy rules. Then after that, the output of the rules is
combined based on the maximum methodology. We
generated 84 fuzzy rules as in [51]. After the fuzzification
step is done, we start by taking the first two words/emojis,
with their linguistic assignments, construct all the possible
combinations between those assigned pairs, apply the
suitable fuzzy rule to get the output. If more than one pair
has the same fuzzy rule output, we consider the maximum
degree to combine them. The output of this step is the fuzzy
set output with a numerical membership degree.
F. Defuzzification
The last step in the fuzzy logic classification is the
defuzzification process. In this step a final crisp output is
computed from the result we get from the previous steps.
There is a wide range of methods that can be used to
calculate the crisp output, in our study we used the center of
gravity. And after we calculate this defuzzified value, we
use the defuzzification membership function to get the final
class. The equation to calculate the center of gravity is as
follows [52]:
X* = (Ai ∗ Xi)
𝑁
𝑖=1
Ai
𝑁
𝑖=1
(1)
where X* represents the crisp fuzzified value, the total area
of the fuzzy set is divided to sub-areas, and N reflects the
number of sub-areas covered by the result from the previous
step, Xi refers to the centroid of a specific subarea, and Ai
presents the area under each sub-area which is a fixed and
already specified value. After we multiply the area of each
sub-area covered with its centroid, they are summed and
divided by the summation of the total area covered, to get
the defuzzified crisp value. The crisp output is then mapped
to one or more of the following classes: Extremely Low,
Very Low, Low, Medium, High, Very High, and Extremely
High, with different degrees of truth based on the
defuzzification membership function we built or this step.
There are many shapes to build the membership functions
and for this step, we used the trapezoidal with arrange from
0 to 15.5. In another meaning we consider the first two
words/emojis then we apply the fuzzification, rule
generation and combining, and defuzzification steps, the
results of those are then taken with the third word/emoji and
the same steps are repeated and so on until there is no more
words/emojis in the tweet, which means we reached the
final classification of the tweet using the fuzzy logic under
the two system TCFL and ECFL.
For example, the tweet @Uber_Support I need help
with my Account, it gets disabled every time I try to login
and it's getting really frustrating! is classified to
medium sadness with degree 0.174 and high sadness with
degree 0.826 under TCFL system, and to high anger with
degree 0.625 and very high anger with degree 0.375 under
ECFL classification system. Finally, we considered the class
with the higher degree as the final classification result.
VI. EXPERIMENTAL RESULTS
In order to evaluate the reliability of the classification
results, we obtained using our both developed systems; they
need to be compared with the human-based classification of
the same dataset and then measuring the percentage of
similarity between the results from the human-based
classification and our systems classification. For this reason,
we used the RapidMiner studio in order to obtain a human-
based classification for our dataset using the clustering data
mining task.
B. Human-Based Classification
Clustering is a data mining task that splits the data set
under study to a set of groups based on the similarity
between the instances. The instances inside one cluster are
2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS)
87
similar to each other and the main goal of the clustering task
is to minimize the distance between the same cluster
instances and maximize it between the instances that belong
to different clusters [53]. There are various clustering
algorithms and in our study, we used the k-means clustering.
K-means clustering is an algorithm that splits the data
objects in the data set into a set of groups based on its
attributes. The k refers to the number of groups (clusters)
you want to split the data to. The main idea of the k-means
is that grouping the objects is done based on the idea of
minimizing the distance between the cluster centroid and the
objects of it [54].
We used the k-means in RapidMiner with k = 40, the k
in our study refers to the number of fuzzy emotion classes
we built our systems based on, such as high joy, low
sadness, high disgust, low disgust and so on, which means
each cluster represents a class of emotion. The number of
iterations we chose is 10. By the end of this step, our data
instances (tweets) are grouped to different 40 clusters based
on the similarity between them. The next step is to label
each cluster of the 40 based on the instances that belong to
it. We did this step manually. We took clusters one by one,
read most of the tweets that belong to it, and labeled it based
on human judgment to the most suitable emotion class from
the 40 classes. Which means all the tweets that belong to
that cluster is labeled with the class we chose for that
cluster. For this reason, they say that the main goal of k-
means is to do the classification. By the end of this step, we
managed to obtain a human-based classification for our data
set that can be used as a base to compare the performance of
our fuzzy classification systems with it.
C. Evaluation Methodology
We evaluated the performance of our systems TCFL
and ECFL by comparing their classification results with the
human-based classification in three different ways:
1. Exact Class Match
In this type of evaluation, the classification result of our
systems should match exactly (base class and degree) with
the human-based classification to considerate as a hit
(match) otherwise it is not being considered as a match
(miss).
2. Base Class Match
This is the simplest type of evaluation we applied. If the
base class from the fuzzy system classification result
matches the human-based classification base class, we
considered it as a match (hit) regardless of the degree of
membership. And if the base class does not match we
considered it as a mess.
3. Partial Class Match
The main idea here is that if the base class of the two
classifications we are comparing match but the degrees do
not match, it is a little bit unfair to consider it as an absolute
miss. It is fair to consider it as a partial match based on
certain criteria and by giving the match a percentage of
matching. To be clearer, the degree of very high and very
high have a matching percentage of 100%, very high and
high 80%, very high and medium have 60%, very high and
low have 40%, and very high and very low have 20%
matching percentage, and so on for the rest of degree pairs.
D. Final Results
Under the exact class match evaluation method, the
percentage of match between TCFL classification results
and the human-based classification is 48.96% and between
ECFL and the human-based classification, the matching
percentage is 32.54% under this type of evaluation that
requires a full exact match. We can see here that the text
based system accuracy outperforms the emoji based system
accuracy.
Under the base class match evaluation method, the
percentage of match between TCFL classification results
and the human-based classification is 56.84% and between
ECFL and the human-based classification, the matching
percentage is 43.49% under this type of evaluation that
requires the base class match only regardless the degree. We
can notice that, again, TCFL accuracy under this evaluation
type is higher than the accuracy of ECFL.
Under the partial class match evaluation method, the
percentage of match between TCFL classification results
and the human-based classification is 53.73% and between
ECFL and the human-based classification, the matching
percentage is 39.05% when considering the partial match
between the classes based on a matching percentage
between the degrees. For the third time, TCFL outperforms
ECFL accuracy. The results from all the evaluation
methodologies are shown in figure (1).
We can see that TCFL outperforms ECFL under any
type of evaluation. And if we looked at TCFL separately, it
is obvious that the highest accuracy percentage (percentage
of match) achieved when using base class match, follows it
the partial match and then comes the exact match
evaluation. The same thing for ECFL, if we looked at it
separately. This is logical because the exact match is the
strictest type of evaluation that requires a full match
regarding the base class and the degree to consider it as a
hit, which means the number of matching classification
pairs between the fuzzy system and the human-based system
will be less than the other two types of evaluation. On the
other hand, it is the most reliable type, because it can give
the most realistic view of the performance with no room for
doubt. Yet, if you are looking for more flexible and
inclusive systems, the other two types of evaluation, which
are the base class and the partial class match, are a good
choice giving a higher percentage of accuracy. In
conclusion, although both of our fuzzy systems (TCFL and
ECFL) give a good classification results, and by looking at
2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS)
88
the performance of both, we can state that the text
classification using the fuzzy logic (TCFL) system wins
over the emoji classification using the fuzzy logic (ECFL)
system in giving a better and more accurate indication of the
emotion that a specific tweet holds.
Fig. 1. TCFL vs. ECFL under all types of evaluation
VII. CONCLUSION AND FUTURE WORK
The main objective of our study was to establish
sentiment analysis systems that extrapolates the emotion a
specific text holds in its words and emojis by choosing the
multi-labeling fuzzy logic approach for the classification
task. In addition to finding an answer to the question: which
one, the words or emojis, gives a better indication of the
emotion a text holds? We developed two emotion fuzzy
classification systems, the text-based system (TCFL) and
emoji-based system (ECFL). Our evaluation results led us to
conclude that the text of a tweet is more reliable in detecting
the tweet real emotion than the emojis it contains.
In the future, we are planning to build a united fuzzy
logic emotion classification system based on the text and
emojis along with each other. In addition to this, we are
planning to expand the number of main emotions that the
systems cover, and increasing the size of the data set in
order to increase the reliability of the final results.
REFERENCES
[1] M. Mohsin, "10 Social Media Statistics You Need to
Know in 2019 [Infographic]," 7 March 2019.
[2] "Sentiment Analysis: Nearly Everything You Need to
Know | MonkeyLearn", MonkeyLearn, 2019.
[3] T. Young, D. Hazarika, S. Poria, and E. Cambria,
“Recent trends in deep learning based natural language
processing [Review Article],” IEEE Comput. Intell.
Mag., vol. 13, no. 3, pp. 5575, 2018.
[4] A. Collomb, C. Costea, D. Joyeux, O. Hasan, and L.
Brunie, “A Study and Comparison of Sentiment
Analysis Methods for Reputation Evaluation,” Res.
Rep. RR-LIRIS-2014-002, 2013.
[5] L. Luo et al., “Beyond polarity: Interpretable financial
Sentiment analysis with hierarchical query-driven
attention,” IJCAI Int. Jt. Conf. Artif. Intell., vol. 2018-
July, pp. 42444250, 2018.
[6] B. Kratzwald, S. Ilić, M. Kraus, S. Feuerriegel, and H.
Prendinger, “Deep learning for affective computing:
Text-based emotion recognition in decision support,”
Decis. Support Syst., vol. 115, pp. 2435, 2018.
[7] G. Haralabopoulos and E. Simperl, “Crowdsourcing
for Beyond Polarity Sentiment Analysis A Pure
Emotion Lexicon,” 2017.
[8] Ayyoub, Essa, and Alsmadi, “Lexicon-based sentiment
analysis of Arabic tweets,” Int. J. Soc. Netw. Min. 2(2)
101, vol. X, 2015.
[9] R. T. Nakatsu and E. B. Grossman, “A Task-Fit Model
of Crowdsourcing : Finding the Right Crowdsourcing
Approach to Fit the Task,” J. Inf. Sci., pp. 111, 2014.
[10] C. C. Aggarwal and C. X. Zhai, Mining text data, vol.
9781461432234. 2013.
[11] C. N. Mahender and V. Korde, “Text
Classificationandclassifiers: a Survey,” Int. J. Artif.
Intell. Appl., vol. 3, no. 2, pp. 8599, 2012.
[12] H. Zhang and Z. Pan, “Cross-voting SVM method for
multiple vehicle classification in wireless sensor
networks,” Sensors (Switzerland), vol. 18, no. 9, 2018.
[13] "Emoji Science Facts, Statistics, And What Your
Emojis Say About You! Steemit", Steemit.com,
2019.
[14] B. Felbo, A. Mislove, A. Søgaard, I. Rahwan, and S.
Lehmann, “Using millions of emoji occurrences to
learn any-domain representations for detecting
sentiment, emotion and sarcasm,” 2017.
[15] G. Tsoumakas and I. Katakis, “Multi-Label
Classification: An Overview.”2007.
[16] S. Kanj, F. Abdallah, T. Denœux, and K. Tout,
“Editing training data for multi-label classification
with the k-nearest neighbor rule,” Pattern Anal. Appl.,
vol. 19, no. 1, pp. 145161, 2016.
[17] A. Joly, L. Wehenkel, and P. Geurts, “Gradient tree
boosting with random output projections for multi-
label classification and multi-output regression,” pp.
140, 2019.
[18] E. A. Tanaka, S. R. Nozawa, A. A. Macedo, and J. A.
Baranauskas, “A multi-label approach using binary
relevance and decision trees applied to functional
genomics,” J. Biomed. Inform., 2015.
[19] S. Kanj, F. Abdallah, T. Denœux, and K. Tout,
“Editing training data for multi-label classification
with the k-nearest neighbor rule,” Pattern Anal. Appl.,
vol. 19, no. 1, pp. 145161, 2016.
[20] “What is fuzzy logic? - Definition from
WhatIs.com”, SearchEnterpriseAI.
[21] R. Kay, "Sidebar: The history of fuzzy
logic", Computerworld, 2019.
[22] W. E. Sari, O. Wahyunggoro, and S. Fauziati, “A
comparative study on fuzzy Mamdani-Sugeno-
Tsukamoto for the childhood tuberculosis diagnosis,”
AIP Conf. Proc., vol. 1755, no. July 2016, 2016.
2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS)
89
[23] A. Saepullah and S. W. Romi, “Comparative Analysis
of Mamdani , Sugeno And Tsukamoto Method of
Fuzzy Inference System for Air Conditioner Energy
Saving,” J. Intell. Syst., vol. 1, no. 2, pp. 143147,
2015.
[24] O. Cordón, “A historical review of evolutionary
learning methods for Mamdani-type fuzzy rule-based
systems: Designing interpretable genetic fuzzy
systems,” Int. J. Approx. Reason., vol. 52, no. 6, pp.
894913, 2011.
[25] K. Ishibashi et al., “Simultaneous Bidirectional
Transceiver Logic,” IEEE Micro, vol. 19, no. 1, pp.
1419, 1999.
[26] D. Team, "What is Fuzzy Logic Systems in AI -
Architecture, Application - DataFlair", DataFlair.
[27] "Fuzzy Logic Tutorial: What is, Application &
Example", Guru99.com.
[28] "What is Fuzzy Logic System - Operation, Examples,
Advantages & Applications", ELECTRICAL
TECHNOLOGY.
[29] "Fuzzy Logic - How Does Fuzzy Logic Work:
Architecture and Applications", ElProCus - Electronic
Projects for Engineering Students.
[30] P. Fortemps and M. Roubens, “Ranking and
defuzzification methods based on area compensation,”
Fuzzy Sets Syst., vol. 82, no. 3, pp. 319330, 1996.
[31] T. A. Runkler, "Selection of appropriate
defuzzification methods using application specific
properties," in IEEE Transactions on Fuzzy Systems,
vol. 5, no. 1, pp. 72-79, Feb. 1997.
[32] H. Ishibuchi and T. Nakashima, “A study on
generating fuzzy classification rules using histograms,”
no. April, pp. 132140, 2002.
[33] O. Cordón, F. Herrera, and P. Villar, “Generating the
knowledge base of a fuzzy rule-based system by the
genetic learning of the data base,” IEEE Trans. Fuzzy
Syst., 2001.
[34] Y. H. Joo, “Fuzzy Systems Modeling : An
Introduction,” vol. 1, pp. 734–736, 2009.
[35] U. Kose, “Fundamentals of Fuzzy Logic with an Easy-
to-use , Interactive Fuzzy Control Application,” Int. J.
Mod. Eng. Res., vol. 2, no. 3, pp. 11981203, 2012.
[36] "Fuzzy Logic Membership
Function", www.tutorialspoint.com.
[37] N. Sharma, R. Pabreja, U. Yaqub, V. Atluri, S. A.
Chun, and J. Vaidya, “Web-based application for
sentiment analysis of live tweets,” pp. 12, 2018.
[38] M. Jabreel and A. Moreno, “A Deep Learning-Based
Approach for Multi-Label Emotion Classification in
Tweets,” Appl. Sci., vol. 9, no. 6, p. 1123, 2019.
[39] S. Sharifirad, B. Jafarpour, and S. Matwin, “How is
Your Mood When Writing Sexist tweets? Detecting
the Emotion Type and Intensity of Emotion Using
Natural Language Processing Techniques,” 2019.
[40] W. Hussien, M. Al-Ayyoub, Y. Tashtoush, and M. Al-
Kabi, “On the Use of Emojis to Train Emotion
Classifiers”, 2019.
[41] T. Lecompte and J. Chen, “Sentiment Analysis of
Tweets Including Emoji Data,Proc. - 2017 Int. Conf.
Comput. Sci. Comput. Intell. CSCI 2017, pp. 793798,
2018.
[42] K. Bahreini, W. van der Vegt, and W. Westera, “A
fuzzy logic approach to reliable real-time recognition
of facial emotions,” Multimed. Tools Appl., 2019.
[43] H. Liu, P. Burnap, W. Alorainy, and M. L. Williams,
“A Fuzzy Approach to Text Classification with Two-
Stage Training for Ambiguous Instances,” IEEE Trans.
Comput. Soc. Syst., vol. 6, no. 2, pp. 227240, 2019.
[44] "Customer Support on Twitter", Kaggle.com.
[45] S. Kiritchenko, X. Zhu, and S. M. Mohammad,
“Sentiment analysis of short informal texts,” J. Artif.
Intell. Res., vol. 50, pp. 723762, 2014.
[46] S. M. Mohammad, S. Kiritchenko, and X. Zhu, “NRC-
Canada: Building the State-of-the-Art in Sentiment
Analysis of Tweets,” 2013.
[47] X. Zhu, S. Kiritchenko, and S. Mohammad, “NRC-
Canada-2014: Recent Improvements in the Sentiment
Analysis of Tweets,” pp. 443–447, 2015.
[48] S. Kiritchenko, X. Zhu, C. Cherry, and S. Mohammad,
“NRC-Canada-2014: Detecting Aspects and Sentiment
in Customer Reviews,” pp. 437–442, 2015.
[49] S. M. Mohammad and P. D. Turney, “Crowdsourcing a
word-emotion association lexicon,” Comput. Intell.,
vol. 29, no. 3, pp. 436465, 2013.
[50] S. M. Mohammad and S. Kiritchenko, “Using
Hashtags to Capture Fine Emotion Categories from
Tweets - Mohammad - 2014 - Computational
Intelligence - Wiley Online Library,” Comput. Intell.,
vol. 31, no. 2, 2015.
[51] Tasneem Mallah, Advisor: Yahya Tashtoush, “Fuzzy
Logic System for Emotion Identification of Tweets,
Thesis, The CS department, Jordan University of
Science and Technology, 2018.
[52] E. Van Broekhoven and B. De Baets, “Fast and
accurate center of gravity defuzzification of fuzzy
system outputs defined on trapezoidal fuzzy
partitions,” Fuzzy Sets Syst., 2006.
[53] P. Berkhin, “A survey of clustering data mining
techniques,” in Grouping Multidimensional Data:
Recent Advances in Clustering, 2006.
[54] K. Teknomo, “K-Means Clustering Tutorial,”
Medicine (Baltimore), pp. 112, 2007.
2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS)
90
... The importance of building an emoji sentiment lexicon using the Arabic context was recognized after reading research that discussed how social media users use emojis. People use the same emoji in different meanings and contexts, according to the geographical location of the writers and traditions in that region [9] [10] [44] [45]. Also, some emojis are used a lot in some regions, while others are not. ...
... After analyzing 10,069 English app reviews with emoticons, they found a notable enhancement in polarity classification accuracy, which is detailed sentiment conveyed by emoticons at different levels of text efficiently, that it utilizes emoticons as a dependable signal of text sentiment, highlighting their importance in sentiment analysis. VOLUME 11, 2023 Tashtoush and Orabi [45] made a noteworthy contribution to tweet emotion classification by utilizing Fuzzy Logic to examine tweets according to different levels of emotional intensity. They developed two unique fuzzy classification systems: TCFL, which focuses on analyzing the text, and ECFL, which looks at the emojis linked to the text. ...
Article
Full-text available
Recently given the rise of types of social media networks, the analysis of sentiment and opinions in textual data has gained significant importance. However, sentiment analysis in informal Arabic text presents challenges due to morphological complexities and dialectal variances. This research aims to develop an Emoji Sentiment Lexicon (Emo-SL) tailored to Arabic-language tweets and demonstrate performance improvements by combining emoji-based features with machine learning (ML) for sentiment classification. We constructed the Emo-SL using a corpus of 58K Arabic tweets containing emojis, calculating sentiment scores for 222 frequently occurring emojis based on their distribution across positive and negative categories. Emoji weighting is integrated with text-based feature extraction using lexicons to train classifiers on an Arabic tweet dataset. ML models, including Support Vector Machines (SVM), Naive Bayes, Random Forests, and K-Nearest Neighbors (KNN) are evaluated after optimal preprocessing and normalization. The results show that adding Emo-SL derived emoji features to ML classifiers can significantly improve accuracy by 26.7% over just textual features. The emoji-aware integrated approach achieves 89% F1 score, outperforming the rule-based VADER sentiment analyzer. Additionally, analysis of n-gram impacts further confirms the value of fusing emoji and text semantics for Arabic sentiment classification. The Emo-SL lexicon provides an effective framework for extracting nuanced emotional insights from noisy micro-text, which demonstrates the potential of contextualized emoji understanding to advance multilingual sentiment analysis performance.
... Emotion ambiguity and a few emotion classes compared to human feelings are some difficulties distinguishing emotions [42]. The fuzzy approach represents a valuable method for dealing with imprecision, and it has been utilized in several studies to represent emotions and their intensity in audio, speech, and video [43]- [46], color image [4], [8], [27]- [29], [47], text [48]- [51], face [44], [45], [52], [53] and product [54], [55] emotion analysis. ...
Article
Full-text available
Art objects can evoke certain emotions. Color is a fundamental element of visual art and plays a significant role in how art is perceived. This paper introduces a novel approach to classifying emotions in art using Fuzzy Sets. We employ a fuzzy approach because it aligns well with human judgments’ imprecise and subjective nature. Extensive fuzzy colors (n=120) and a broad emotional spectrum (n=10) allow for a more human-consistent and context-aware exploration of emotions inherent in paintings. First, we introduce the fuzzy color representation model. Then, at the fuzzification stage, we process the Wiki Art Dataset of paintings tagged with emotions, extracting fuzzy dominant colors linked to specific emotions. This results in fuzzy color distributions for ten emotions. Finally, we convert them back to a crisp domain, obtaining a knowledge base of color-emotion associations in primary colors. Our findings reveal strong associations between specific emotions and colors; for instance, gratitude strongly correlates with green, brown, and orange. Other noteworthy associations include brown and anger, orange with shame, yellow with happiness, and gray with fear. Using these associations and Jaccard similarity, we can find the emotions in the arbitrary untagged image. We conducted a 2AFC experiment involving human subjects to evaluate the proposed method. The average hit rate of 0.77 indicates a significant correlation between the method’s predictions and human perception. The proposed method is simple to adapt to art painting retrieval systems. The study contributes to the theoretical understanding of color-emotion associations in art, offering valuable insights for various practical applications besides art, like marketing, design, and psychology.
... The last process is defuzzification, this process converts the fuzzy output into a firm value [14]. The defuzzification equation shown in Eq. (8) , in this process uses Weighted Average [15]. ...
Article
The problem with reporting civil servants activities with the current system is that each activity uses points where civil servants who have many activities can easily collect points in a relatively short time. As well as the difficulty of determining civil servants who are diligent and who are not. In this study, a program was created to determine the diligent and not diligent civil servants using the web-based Fuzzy Tsukamoto method with the Laravel framework. This study uses 3 indicators, namely the number of activities, the number of minutes of activity and the percentage of attendance. Of the 3 indicators, for the first stage, the fuzzification value is sought, then from the results of the fuzzification value, inference and defuzzification can be calculated. The results obtained are the classification of civil servants, namely diligent and not diligent. From this final score, the government can determine the level of discipline of civil servants and can provide disciplinary awards or punishments.
... Emotion ambiguity and a few emotion classes compared to human feelings are some difficulties distinguishing emotions [40]. The fuzzy approach represents a valuable method for dealing with imprecision, and it has been utilized in several research to represent emotions and their intensity in audio, speech, and video [41]- [44], color image [?], [4], [8], [27], [45], [46], text [47]- [50], face [42], [43], [51], [52] and product [53], [54] emotion analysis. ...
Preprint
Full-text available
Art objects can evoke certain emotions. Color is a fundamental element of visual art and plays a significant role in how art is perceived. This paper introduces a novel approach to classifying emotions in art using Fuzzy Sets. We employ a fuzzy approach because it aligns well with human judgments' imprecise and subjective nature. Extensive fuzzy colors (n=120) and a broad emotional spectrum (n=10) allow for a more human-consistent and context-aware exploration of emotions inherent in paintings. First, we introduce the fuzzy color representation model. Then, at the fuzzification stage, we process the Wiki Art Dataset of paintings tagged with emotions, extracting fuzzy dominant colors linked to specific emotions. This results in fuzzy color distributions for ten emotions. Finally, we convert them back to a crisp domain, obtaining a knowledge base of color-emotion associations in primary colors. Our findings reveal strong associations between specific emotions and colors; for instance, gratitude strongly correlates with green, brown, and orange. Other noteworthy associations include brown and anger, orange with shame, yellow with happiness, and gray with fear. Using these associations and Jaccard similarity, we can find the emotions in the arbitrary untagged image. We conducted a 2AFC experiment involving human subjects to evaluate the proposed method. The average hit rate of 0.77 indicates a significant correlation between the method's predictions and human perception. The proposed method is simple to adapt to art painting retrieval systems. The study contributes to the theoretical understanding of color-emotion associations in art, offering valuable insights for various practical applications besides art, like marketing, design, and psychology.
... The following sequential steps are usually followed when implementing fuzzy logic in a real application [35,36]: ...
Article
Full-text available
It is crucial to analyze opinions about the significant shift in education systems around the world, because of the widespread use of e-learning, to gain insight into the state of education today. A particular focus should be placed on the feedback from students regarding the profound changes they experience when using e-learning. In this paper, we propose a model that combines fuzzy logic with bidirectional long short-term memory (BiLSTM) for the sentiment analysis of students’ textual feedback on e-learning. We obtained this feedback from students’ tweets expressing their opinions about e-learning. There were some ambiguous characteristics in terms of the writing style and language used in the collected feedback. It was written informally and not in adherence to standardized Arabic language writing rules by using the Saudi dialects. The proposed model benefits from the capabilities of the deep neural network BiLSTM to learn and also from the ability of fuzzy logic to handle uncertainties. The proposed models were evaluated using the appropriate evaluation metrics: accuracy, F1-score, precision, and recall. The results showed the effectiveness of our proposed model and that it worked well for analyzing opinions obtained from Arabic texts written in Saudi dialects. The proposed model outperformed the compared models by obtaining an accuracy of 86% and an F1-score of 85%.
... In general, the most commonly used functions are trapezoidal, Gaussian, and triangular functions. The following sequential steps are usually followed when implementing fuzzy logic in a real application [33]: ...
Conference Paper
Analyzing opinions about the current significant shift in education systems around the world is very interesting, especially students’ feedback about the profound changes they face when using e-learning which is a different learning experience. In this paper, we propose a hybrid model that combines fuzzy logic with recurrent neural networks for analyzing students’ feedback related to e-learning. This feedback was obtained in the form of tweets posted by students expressing their opinions about e-learning. The collected feedback had ambiguous characteristics in terms of writing style and language used. The tweets were written informally and without writing standards, using the Saudi dialect. The proposed model benefits from the learning capabilities of the deep neural network and the ability of fuzzy logic to handle uncertainties. The experimental results showed our proposed model’s effectiveness and that it worked well with high performance for analyzing textual feedback written in the Saudi dialect. Furthermore, the comparative results confirmed that the proposed models outperform the baseline and relevant models.
... Defuzzification is the final process of converting a fuzzy output into a crisp value [31]. The equation applied at this stage is the Weighted Average as Equation 3 [32]. ...
Article
Full-text available
The idea of the Internet of Things (IoT) is utilized to increase the advantages of internet connectivity. As long as the electrical device is still connected to the internet, IoT can be utilized to operate it. This study applies the concept of IoT as a fan control and monitoring of the humidity and room temperature through a mobile-based application, so that the room can be controlled remotely using a smartphone before use. It is necessary to always keep the room comfortable to use, so that users can increase productivity and always avoid potential disease attacks when the temperature is unstable. In addition, the IoT concept also makes it easier to turn off the fan if at any time you forget to turn it off. This fan is integrated with the ESP32 microcontroller which has been equipped with a Wireless Fidelity (WiFi) module to access data changes in the Firebase Realtime Database. Room temperature and humidity are measured with a DHT22 sensor and processed using Tsukomto’s Fuzzy Inference System to produce the appropriate fan speed in the Duty Cycle unit. Tools and applications can work as predicted based on the findings of the research conducted. However, the impact of the fan on a closed room cannot cool the room temperature but the room temperature moves up with a change of 0.3 °C to 0.5 °C within 40-75 minutes. Therefore, in the next study it is recommended to use cooling devices such as Exhaust fans or Air Conditioners.
... There are hate speech studies that used Fuzzy Rule-Based [38,87], Fuzzy Multi-Task Learning [58], and Association Rule types [92]. ...
Article
Full-text available
Homophobic expressions are a form of insulting the sexual orientation or personality of people. Severe psychological traumas may occur in people who are exposed to this type of communication. It is important to develop automatic classification systems based on language models to examine social media content and distinguish homophobic discourse.
Article
Twitter is one of the micro-blogging platforms in social media that is being used by many users to voice their opinions and expressions. Thus, it acts as a rich source for analyzing effective texts. This work proposes to predict multi-class emotions from tweets. Tweets are short text with a restricted number of words, include emoticons and emojis to express the content in a picturized way, involve contractions and slang, and contain various components such as URLs, mentions, hashtags, and so on. Hence, a methodology that involves tweet-specific pre-processing strategies is necessitated. Further, it is realized that the exploitation of sentence embedding features about emotion identification in tweets is minimal. In this regard, the proposed methodology involves tweet collection, tweet pre-processing through natural language processing techniques to derive pre-processed tweet sentences and pre-processed tweet words, feature vector formulation through polarity of pre-processed tweet sentences and sentence embedding of pre-processed tweet sentences and word embedding of pre-processed tweet words derived through bidirectional encoder representation transformers, classification through deep neural networks, evaluation through train-test methodology and evolving of optimized weights for emotion identification from tweets with improved accuracy. The methodology is evaluated on a public dataset, Twitter Reviews for Emotion Analysis, available in the Kaggle repository using metrics namely accuracy, recall, precision, and f-measure. Various combinations of features have been provided as input to assess the best combination of features that yields the highest possible performance. Further, the parameters of the deep neural network namely dropout percentage are tuned to accomplish the best performance. As the dataset is imbalanced, the impact of class weights is analyzed to achieve better outcomes.
Chapter
This paper presents the SELFI framework which uses information from a range of indirect measures to reduce the burden on users of context-sensitive apps in the need to self-report their mental state. In this framework, we implement multiple combinations of facial emotion recognition tools (Amazon Rekognition, Google Vision, Microsoft Face), and feature reduction approaches to demonstrate the versatility of the framework in facial expression based emotion estimation. The evaluation of the framework involving 20 participants in a 28-week in-the-wild study reveals that the proposed framework can estimate emotion accurately using facial image (\(83\%\) and \(81\%\) macro-F1 for valence and arousal, respectively), with an average reduction of \(10\%\) self-report burden. Moreover, we propose a solution to detect the performance drop of the model developed by SELFI, during runtime without the use of ground truth emotion, and we achieve accuracy improvements of 14%.KeywordsExperience Sampling Method (ESM)Emotion predictionSelf-report burdenFacial image processing
Article
Full-text available
Currently, people use online social media such as Twitter or Facebook to share their emotions and thoughts. Detecting and analyzing the emotions expressed in social media content benefits many applications in commerce, public health, social welfare, etc. Most previous work on sentiment and emotion analysis has only focused on single-label classification and ignored the co-existence of multiple emotion labels in one instance. This paper describes the development of a novel deep learning-based system that addresses the multiple emotion classification problem in Twitter. We propose a novel method to transform it to a binary classification problem and exploit a deep learning approach to solve the transformed problem. Our system outperforms the state-of-the-art systems, achieving an accuracy score of 0.59 on the challenging SemEval2018 Task 1:E-cmulti-label emotion classification problem.
Article
Full-text available
This paper represents our newly developed software for emotion recognition from facial expressions. Besides allowing emotion recognition from image files and recorded video files, it uses webcam data to provide real-time, continuous, and unobtrusive facial emotional expressions. It uses FURIA algorithm for unordered fuzzy rule induction to offer timely and appropriate feedback based on learners’ facial expressions. The main objective of this study was first to validate the use of webcam data for a real-time and accurate analysis of facial expressions in e-learning environments. Second, transform these facial expressions to detected emotional states using the FURIA algorithm. We measured the performance of the software with ten participants, provided them with the same computer-based tasks, requested them a hundred times to mimic specific facial expressions, and recorded all sessions on video. We used the recorded video files to feed our newly developed software. We then used two experts’ opinions to annotate and rate participants’ recorded behaviours and to validate the software’s results. The software provides accurate and reliable results with the overall accuracy of 83.2%, which is comparable to the recognition by humans. This study will help to increase the quality of e-learning.
Article
Full-text available
Sentiment analysis is a very popular application area of text mining and machine learning. The popular methods include Support Vector Machine, Naive Bayes, Decision Trees and Deep Neural Networks. However, these methods generally belong to discriminative learning, which aims to distinguish one class from others with a clear-cut outcome, under the presence of ground truth. In the context of text classification, instances are naturally fuzzy (can be multi-labeled in some application areas) and thus are not considered clear-cut, especially given the fact that labels assigned to sentiment in text represent an agreed level of subjective opinion for multiple human annotators rather than indisputable ground truth. This has motivated researchers to develop fuzzy methods, which typically train classifiers through generative learning, i.e. a fuzzy classifier is used to measure the degree to which an instance belongs to each class. Traditional fuzzy methods typically involve generation of a single fuzzy classifier and employ a fixed rule of defuzzification outputting the class with the maximum membership degree. The use of a single fuzzy classifier with the above fixed rule of defuzzification is likely to get the classifier encountering the text ambiguity situation on sentiment data, i.e. an instance may obtain equal membership degrees to both the positive and negative classes. In this paper, we focus on cyberhate classification, since the spread of hate speech via social media can have disruptive impacts on social cohesion and lead to regional and community tensions. Automatic detection of cyberhate has thus become a priority research area. In particular, we propose a modified fuzzy approach with two stage training for dealing with text ambiguity and classifying four types of hate speech, namely: religion, race, disability and sexual orientation - and compare its performance with those popular methods as well as some existing fuzzy approaches, while the features are prepared through the Bag-of-Words and Word Embedding feature extraction methods alongside the correlation based feature subset selection method. The experimental results show that the proposed fuzzy method outperforms the other methods in most cases.
Article
Full-text available
A novel multi-class classification method named the voting-cross support vector machine (SVM) method was proposed in this study, for classifying vehicle targets in wireless sensor networks. The advantages and disadvantages of available methods were summarized, after a comparative analysis of commonly used multi-objective classification algorithms. To improve the classification accuracy of multi-class classification and ensure the low complexity of the algorithm for engineering implementation on wireless sensor network (WSN) nodes, a framework was proposed for cross-matching and voting on the category to which the vehicle belongs after combining the advantages of the directed acyclic graph SVM (DAGSVM) method and binary-tree SVM method. The SVM classifier was selected as the basis two-class classifier in the framework, after comparing the classification performance of several commonly used methods. We utilized datasets acquired from a real-world experiment to validate the proposed method. The calculated results demonstrated that the cross-voting SVM method could effectively increase the classification accuracy for the classification of multiple vehicle targets, with a limited increase in the algorithm complexity. The application of the cross-voting SVM method effectively improved the target classification accuracy (by approximately 7%), compared with the DAGSVM method and the binary-tree SVM method, whereas time consumption decreased by approximately 70% compared to the DAGSVM method.
Conference Paper
Full-text available
Sentiment analysis has played a significant role in financial applications in recent years. The informational and emotive aspects of news texts may affect the prices, volatilities, volume of trades, and even potential risks of financial subjects. Previous studies in this field mainly focused on identifying polarity~(e.g. positive or negative). However, as financial decisions broadly require justifications, only plausible polarity cannot provide enough evidence during the decision making processes of humanity. Hence an explainable solution is in urgent demand. In this paper, we present an interpretable neural net framework for financial sentiment analysis. First, we design a hierarchical model to learn the representation of a document from multiple granularities. In addition, we propose a query-driven attention mechanism to satisfy the unique characteristics of financial documents. With the domain specified questions provided by the financial analysts, we can discover different spotlights for queries from different aspects. We conduct extensive experiments on a real-world dataset. The results demonstrate that our framework can learn better representation of the document and unearth meaningful clues on replying different users? preferences. It also outperforms the state-of-the-art methods on sentiment prediction of financial documents.
Conference Paper
Full-text available
Sentiment analysis has gained a lot of importance and popularity during past few years. As an increasing number of people use social media to express their opinions, the ability to perform sentiment analysis in-order to predict opinions and attitudes has also increased. In this paper, we present our web-based application, that allows for visualization of current sentiments associated with a keyword (hashtag, phrase or word) of Twitter messages by plotting them on a map. This allows users to not only measure the sentiment but also map it's intensity in terms of geography. The primary motivation behind building this application is to provide a single automated platform, which serves as a complete end-to-end system for sentiment analysis of Twitter messages along with their visualization.
Article
Full-text available
Emotions widely affect the decision-making of humans and, hence, affective computing takes emotional states into account with the goal of tailoring decision support to individuals. However, the accurate recognition of emotions within narrative materials presents a challenging undertaking due to the complexity and ambiguity of language. Even though deep learning has evolved as the state-of-the-art in various tasks from text mining, its benefits with regard to affective computing are not yet understood. We thus propose the following innovations: (1) we adapt recurrent neural networks from the field of deep learning to affective computing. (2) We extend these networks for predicting the score of different affective dimensions. (3) We implement transfer learning for pre-training word embeddings. Analyzing the results, we find that deep learning consistently outperforms traditional machine learning with improvements of up to 21% in F1-score when labeling emotions and 6% in forecast errors when rating the intensity of emotions. Altogether, the findings have considerable implications for the use of affective computing in providing decision support.
Article
Deep learning methods employ multiple processing layers to learn hierarchical representations of data, and have produced state-of-the-art results in many domains. Recently, a variety of model designs and methods have blossomed in the context of natural language processing (NLP). In this paper, we review significant deep learning related models and methods that have been employed for numerous NLP tasks and provide a walk-through of their evolution. We also summarize, compare and contrast the various models and put forward a detailed understanding of the past, present and future of deep learning in NLP.