ArticlePDF Available

A Context-Adaptive Ranking Model for Effective Information Retrieval System

Authors:

Abstract and Figures

When using Information Retrieval (IR) systems, users often present search queries made of ad-hoc keywords. It is then up to information retrieval systems (IRS) to obtain a precise representation of user's information need, and the context of the information. Context-aware ranking techniques have been constantly used over the past years to improve user interaction in their search activities for improved relevance of retrieved documents. Though, there have been major advances in context-adaptive systems, there is still a lack of technique that models and implements context-adaptive application. The paper addresses this problem using DROPT technique. The DROPT technique ranks individual user information needs according to relevance weights. Our proposed predictive document ranking model is computed as measures of individual user search in their domain of knowledge. The context of a query determines retrieved information relevance. Thus, relevant context aspects should be incorporated in a way that supports the knowledge domain representing users' interests. We demonstrate the ranking task using metric measures and ANOVA, and argue that it can help an IRS adapted to a user's interaction behaviour, using context to improve the IR effectiveness.
Content may be subject to copyright.
International Journal of Information Science 2018, 8(1): 1-12
DOI: 10.5923/j.ijis.20180801.01
A Context-Adaptive Ranking Model for Effective
Information Retrieval System
Kehinde Agbele*, Eniafe Ayetiran, Olusola Babalola
Department of Mathematics and Computer Science, Elizade University, Ilara-Mokin, Nigeria
Abstract When using Information Retrieval (IR) systems, users often present search queries made of ad-hoc keywords. It
is then up to information retrieval systems (IRS) to obtain a precise representation of user’s information need, and the context
of the information. Context-aware ranking techniques have been constantly used over the past years to improve user
interaction in their search activities for improved relevance of retrieved documents. Though, there have been major advances
in context-adaptive systems, there is still a lack of technique that models and implements context-adaptive application. The
paper addresses this problem using DROPT technique. The DROPT technique ranks individual user information needs
according to relevance weights. Our proposed predictive document ranking model is computed as measures of individual user
search in their domain of knowledge. The context of a query determines retrieved information relevance. Thus, relevant
context aspects should be incorporated in a way that supports the knowledge domain representing users’ interests. We
demonstrate the ranking task using metric measures and ANOVA, and argue that it can help an IRS adapted to a user's
interaction behaviour, using context to improve the IR effectiveness.
Keywords Context-awareness, Information retrieval, DROPT technique, Information relevance
1. Introduction
Recent years have witnessed ever-growing amount of
online information. The development of the World Wide
Web (WWW) led to increase in the volume and diversity of
accessible information. The question that now arises is how
access to this information can be effectively supported.
Users require the assistance of tools aimed to locate
documents that satisfy their specific needs. Information
retrieval (IR) concerns searching documents for information
that meet a user need. Traditionally, document
representations are expressed by extracting meaningful
keywords (index terms) from the documents in the form of
a cross-reference lookup. When the user sends a search
request, a representation of his/her information need will
also be expressed in the same manner. Then the user query
and the representation of the document will be matched
according to specific matching conditions. Results are
presented to the user in a form of a ranked list that contains
the most relevant documents. Most of the documents that
are retrieved however are irrelevant to the user because
search engines cannot determine the user context. Diverse
IR models have been developed for this purpose.
* Corresponding author:
Kehinde.agbele@elizadeuniversity.edu.ng (Kehinde Agbele)
Published online at http://journal.sapub.org/ijis
Copyright © 2018 The Author(s). Published by Scientific & Academic Publishing
This work is licensed under the Creative Commons Attribution International
License (CC BY). http://creativecommons.org/licenses/by/4.0/
Ideally, the relevance of documents should be defined
based on user context. Thus, the problem of ranking of
retrieved documents should be based on user context and
preferences. Relevance is a standard measure utilized in IR
to evaluate effectiveness of an IR system based on the
documents retrieved. The concept of relevance, however, is
one that is subjective and influenced by diverse factors. To
this end, user perception and user knowledge level are
factors that influence the relevance of a retrieved document.
Therefore, there has been a paradigm shift from a view of
relevance as simple term matching between query and
document, to a view of relevance as a cognitive and
dynamic process involving interaction between the
information user and the information source. It is important
for IR systems to obtain accurate representations of
users‘ information needs and context of information need.
Hence, search knowledge encompasses a wide variety of
aspects of the search, such as the interaction mode by users.
A context refers to the environment around a user that
reflects or affects the user's search goal. Web search
personalization is the process that allows a search engine to
adapt the search results to user's specific goal by integrating
user's context information beyond the query provided. The
goal of context information is to determine what a user is
trying to accomplish. We propose a solution to this problem
to quantify the context of retrieved information. The
technique aims to avoid the drawback of manually scanning
through and selecting from a long list of documents. We
also apply context-awareness to reformulate queries in
2 Kehinde Agbele et al.: A Context-Adaptive Ranking Model for Effective Information Retrieval System
order to improve the predicted relevance of retrieved
documents.
The rest of the paper is organised as follows: Section 2
presents the background and related work. Section 3
describes the context-adaptive IRS model. Sections 4
describes the DROPT technique while Section 5 describes
the experimental design. Sections 6 and 7 present the results
of the experiments. Section 8 presents the statistical
analysis results and discussions. Section 9 concludes the
paper.
2. Background and Related Work
One of the key drivers and developments towards creating
personalized solutions that support context-adaptive systems
has been the results from research work in personalization
systems. The main indication derived from these results
showed that it was very difficult to create generic
personalization solutions, without in general having a large
knowledge about the particular problem being solved. These
seemed to result in either a very specialized or a rather
generic solution that provided very limited personalization
capabilities. In order to address some of the limitations of
classic personalization systems, researchers have looked to
the new emerging area defined by the so-called
context-aware applications and systems (Abowd et. al., 1997
and Brown et. al., 2007).
The term context and context-awareness, denotes a
general class of systems that can sense a continuously
changing physical environment and provide relevant
services to users on this basis Dey, (20011). The definitions
of context are varied, from the surrounding objects within an
image, to the physical location of the system's user. The
definition and treatment of context varies significantly
depending on the application of study (Edmonds, 1999).
Context in information retrieval has also a wide meaning,
going from surrounding elements in an XML retrieval
application (Arvola et. al., 2005), recent selected items or
purchases on proactive information systems (Billsus et. al.,
2005), broadcast news text for query-less systems (Hezinger
et al., 2003), recently accessed documents (Bauer and Leake,
2001), visited Web pages (Sugiyama et al., 2004), past
queries and clickthrough data (Bharat 2003; Dou et. al., 2007;
Sugiyama et. al., 2004; Shen et. al., 2005), text surrounding a
query (Finkelstein et. al., 2001), text highlighted by a user
(Finkelstein et. al., 2001), recently accessed documents
(Bauer and Leake, 2001)etc.
Context-aware systems can be classified by 1) the concept
the system has for context, 2) how the context is acquired, 3)
how the context information is represented and 4) how the
context representation is used to adapt the system. One of the
most important parts of any context-aware system is the
context acquisition. Note that this is conceptually different to
profile learning techniques, context acquisition aims to
discover the short-term interests (or local interests) of the
user (Dou et. al., 2007; Sugiyama et. al., 2004; Shen et al;
2005), where the short-term profile information is usually
disposed once the user's session is ended. On the other hand,
user profile learning techniques do cause a much great
impact on the overall performance of the retrieval system, as
the mined preferences are intended to be part of the user
profile during multiple sessions.
One simple solution for context acquisition is the
application of explicit feedback techniques, like relevance
feedback (Rocchio and Salton, 1971 and Salton and
Buckley, 1988). Relevance feedback builds up a context
representation through an explicit interaction with the user.
In a relevance feedback session: 1) The user makes a query.
2) The IR system launches the query and shows the result set
of documents. 3) The user selects the results that considers
relevant from the top n documents of the result set. 4) The IR
system obtains information from the relevant documents,
operates with the query and returns to 2). Relevance
feedback has been proven to improve the retrieval
performance. However, the effectiveness of relevance
feedback is considered to be limited in real systems,
basically because users are often reluctant to provide such
information [Sugiyama et al., 2004], this information is
needed by the system in every search session, asking for a
greater effort from the user than explicit feedback techniques
in personalization. For this reason, implicit feedback is
widely chosen among context-aware retrieval systems (Kelly
and Teevan, 2002; Shen et al., 2005; White and Kelly, 2006).
Based on this fundamental definition, various authors
(Emmanouilidis et. al, 2013; Jara et. al, 2013; Noh et. al,
2012 and Xu and Deng 2012) focus on different aspects of
context-awareness, including modelling interactions
between users and IR systems nature, and how to modelling
context. The research reported in Nyongesa and
Maleki-Dizaji (2006) showed that based on preferences of
users, genetic algorithms (GA) could be applied to improve
the search rresults. Similarly, the work reported in Koorangi
and Zamanifar (2007) proposed improvement of internet
engines using multi-agent systems. In this work, a
meta-search engine gives a user documents based on an
initial query while a feedback mechanism returns to the
meta-search engine the user’s suggestions about retrieved
documents.
In Allan (2002), contextual information retrieval (CIR) is
defined as: "combine search technologies and knowledge
about query and user context into a single framework in
order to provide the most appropriate answer for user's
information needs". CIR intends to optimize the retrieval
accuracy by involving two related steps: appropriately
defining the context of user information needs, commonly
called search context, and then adapting the search by taking
it into account in the information selection process.
Several studies have addressed context specification
within and across application domains (Jara et. al, 2013;
Dinh and Tamine 2012; Kebler et. al, 2009; Goker and
Myrhaug, 2008; Vieira et. al, 2007). Device, user, task,
document and spatio-temporal are the five context specific
dimensions that have been explored in context-based
information retrieval literature (Emmanouilidis et. al, 2013;
International Journal of Information Science 2018, 8(1): 1-12 3
Dinh and Tamine 2012; Li et. al, 2011; Asfari et. el, 2009;
Mylonas et. al, 2008; Anand and Mobasher, 2007; Maeco et.
al, 2013; Lukowic et. al, 2011; Zhou et. al, 2012).
In Shen et. al., (2005) proposed a ranking technique for
multi-search projections on the Web for results aggregation
model based on query words, search results, and search
history to achieve user’s intention. To this end the Web can
offer a rich context of information which can be expressed
through the relevancy of document contents. In Shivaswamy
and Joachims (2011) proposed a model for online learning
that is specifically adequate for user feedback. The
experiment conducted shown retrieval effectiveness for web
search ranking. In the context of web search ranking, these
techniques aim at finding the best ordering function over the
returned documents is important. The authors argue that,
regression on labels may be adequate and, indeed,
competitive in the case of large numbers of retrievals. To
make the web more interesting, there is need to develop a
good and efficient ranking algorithm to deliver more suitable
results for users.
Agbele [2014] developed and coined the acronym DROPT
(Document Ranking OPTimization) to name a new adaptive
algorithm that provides a limited number of ranked
documents in response to a given query. The author argue
that, it can improve the ranking mechanism for the search
results in an attempt to adapt the retrieval environment of the
users and amount of relevant context-aware information
according to each user’s request. The DROPT measure must
be self-learning that can automatically adjust its search
structure to a user’s query behaviour. The DROPT technique
is employed in this paper to improve the retrieval
effectiveness based on the user interaction behaviour as
depicted in Figure 1.
3. Context-Adaptive for IR System
Context-adaptive IR requires an adaptation of the
processed information with respect to the individual users. It
depends on the user’s personal context-adaptive whether a
user blog article is worth reading with respect to the user’s
expectations and abilities. We are thus looking for a
workflow to enable how users can judge context changes for
adaptive retrieval based on the user profile. One major
problem of most current IR system is that they provide
uniform access and retrieval of IR results to all users
specially based on the query terms users entered to the
system.
To address these issues we propose a context-adaptive IR
model based on document preferences as search context to
rank individual users results effectively and the behaviour
that individual user has engaged in during the matching tasks.
The idea of context-adaptive is to predict relevant ranked
documents according to relevance weights. This
demonstrates a search context from search engine by
observing and analysing user behaviour (i.e. keyword
matching based querying frequency). The workflow of the
design and evaluation of this proposed context-adaptive IR
model is shown in Figure 1(see Appendix A). We generate
two user predictive models about document ranking: 1) a
predictive user model of the relevance of document content;
2) a predictive user model of ranking for currently retrieved
documents. We believe this model (Table 1) can enhance
individual user’s system retrieval performance greatly.
Table 1. Predictive document ranking model (PDRM) for user model
preference
Description of document
ranking model
Document
content
context
Can model predict
documents
relevance?
Predicted to adapt current
retrieved documents for
ranking tasks.
Relevant
Yes
Predicted to perform initial
queries reformulation but
ignored if found to be
irrelevant later.
Irrelevant
Not yet
The predictive user model generated data analysis by
individual users knowledge domain, while interacting with
the search engine in which ranking of retrieved document
has been controlled independently. By analyzing the
statistical associations between measures of user behaviour
and their judgments of document relevance, we create a
predictive user model of document relevance by assigning a
numerical weight to each retrieved document and ranking of
retrieved document, we can get a predictive user model of
current search context (relevant or irrelevant). Ranking of
retrieved documents could influence user’s context because a
user indicates documents that are relevant and otherwise
according to relevance weights. The problem at hand is thus
to find IR mechanism that allows for adaptive context-aware
IR. Agbele (2014), developed a Document Ranking
OPTimization (DROPT) technique and is employed in this
present paper to enable context-adaptive IR as illustrated in
Figure 1.
The purpose of predicting document ranking for IR system
in this paper is to adapt retrieved documents to individual
users during their search context, rather than after they finish
the entire document ranking tasks. So, the measures of user
behaviour context, which can be immediately noticed is
based on calculating the weight of keywords in the document
index vectors, calculated as a function of the frequency of a
keyword across a document should be the main sources to
predict ranking of retrieved documents according to
relevance weights. The work reported in Li and Belkin (2008)
identified task type in human information behaviour as
contextual factors to influence the way users search for
information. We apply context-awareness in this paper as a
technique to reformulate original user’s queries in order to
improve the predicted relevance of retrieved documents.
Also by reformulating a query we could not only increase the
number of relevant documents but also rank the candidate
documents. Therefore, user context is any relevant
information that can be used to characterize the situation of a
4 Kehinde Agbele et al.: A Context-Adaptive Ranking Model for Effective Information Retrieval System
user, such as where the user is, whom the user is with and
what resources are available to the user.
Before the current retrieved document is predicted from
individual users’ behaviours context, the predictive user
model of document relevance is calculated as measures of
individual user search in their domain of knowledge; once
the retrieved document is predicted from the model, and then
the system can activate predictive model of document
relevance for ranking task. This demonstrates how the
predicted relevance documents can be used to assist users
reformulate their initial queries to better understand users’
current information needs by user preferences. To adapt
search results means to explicitly make use of the user
preferences to tailor search results in their knowledge
domain. The next section describes the DROPT technique.
4. DROPT Technique
This section describes the document ranking technique for
context-aware IR known as a document ranking optimization
(DROPT) according to information relevance. A document
ranking technique is an algorithm that tries to match
documents in the corpus to the user, and then ranks the
retrieved documents by listing the most relevant documents
to the user at the top of the ranking. Unfortunately, despite
the exposure of individual users to domain of Web retrieval
and online documentation systems with document ranking
features; it rarely addresses the information relevance of
ranked output as core issue.
4.1. Parameters Used for Ranking Principles
In this sub-section we study the problem of ranking of
retrieved documents. For example, we desire to rank a set of
scientific articles such that those related to the
query ’information retrieval’ are retrieved first. The basic
assumption we make is that such a ranking can be obtained
by a weighting function
)( idftfw
which conveys to us
how relevant document d is for query q. The document
ranking will be done by taking a weighted average of all
determined parameters. Table 2 depicts the summary of
notations.
Table 2. Summary of ranking notations
Parameters Name
Description
indexed document
j
q
i-th query vector
( , )qd
document-query pair
()w D Q
convolution matrix
()w tf idf
weighting function
tf
term frequency
idf
index term frequency
max( )
i i j
Val t
maximum relevance weight value added
to matrix G
 
/0
i
D d if val
documents sorted in ascending order of
relevance value
 
0,1V
relevance numerical weight values
normalization interval
ij n l
Gg


query vector defined as a matrix G
2
1
1
1
ni
ij
iw
l
weighted root mean square (RMS) to
determine the overall relevance fitness of
all documents with respect to a given
query
n
number of queries for self-learning
N
size of the corpus
ij
W
Weights of terms in the document vectors
4.2. Formalization of Mathematical Model Definitions
This optimization of IR is obtained by ranking the
documents according to a relevance numerical weight value
()w tf idf
which is obtained from the weighting function
w in descending order. Then we wish to return a relevance
numerical weight subset
i
v
of
v
such that for each
i
dD
, we optimize the following weighting function:
()w tf idf
(1)
According to equation (1), a DROPT measure for
documents retrieved from a corpus is developed with respect
to document index keywords and the query vectors. This
mathematical model definition is based on calculating the
weight (wij) of keywords in the document index vector,
calculated as a function of the frequency of a keyword
j
k
across a document
i
d
.
The DROPT technique is based on IR result rankings,
where a ranking R consists of an ordered set of ranks. Each
rank consists of a relevance numerical weight value
 
1,0
V
where v represents the relevance numerical weights of the
retrieved documents. Each rank is assigned an ascending
rank number n, such that:
 
 
12
1, , 2, ,..., , n
R v v n v


(2)
Where
n
vvv ...
21
Our technique, DROPT is composed of six steps.
Step 1: Initialization of Parameters
(a) Let a query vector, Q, be defined as:
 
1 2 3
, , ,... l
Q q q q q
(3)
where,
  ,  being a term string with a weight of 1.
(b) Let the indexed document corpus be represented by
the matrix:
International Journal of Information Science 2018, 8(1): 1-12 5
(4)
where,     being an index string, with
weight  .
(c) We compute the convolution matrix W = DQ by a
simple multiplication of the document vectors and the
query vectors representing:
W = DQ = (5)
  is equal string ignore case   , where
 are query vectors,  are document vectors,  are
weights of terms in the document vectors, and  are
weights of terms in the query vectors, while n is the number
of retrieved documents that are indexed by at least one
keyword in the query vector. The matrix W gives a numeric
measure with no context information.
Step 2: Search String Processing
The comparison of the issued query term against the
document representation is called the query process. The
matching process results are a list of potentially relevant
context information. Individual users will scrutinize this
document list in search of the information they needs.
Step 3: Calculate Relevance Weight
Retrieved documents that are more relevant are ranked
ahead of other documents that are less relevant. It is
important to find relevance numerical weights of the
retrieved documents and provide a ranked list to the user
according to their information requests.
(a) Based on equation (1), the relevance weight is
obtained according to document content.
(b) Subsequently we calculate the average mean weight
using the weighted root mean squares (RMS) to
determine the overall fitness value of retrieved
documents with respect to a given query calculated
as:
2
1
11
nlij
j
i
ww
l
(6)
where,
w
is the average relevance mean weight of each
retrieved document, n is the number of keywords terms
occurrences in each retrieved document, l is the total size of
the keywords in the corpus, and wij are the sum weights of
terms of the document vectors.
Step 4: User Feedback about Retrieved Documents
User feedback about retrieved documents is based on
overall relevance weights to construct a personalized
user profiling of interests. We can achieve this when a user
indicates the documents that are relevant or otherwise, from
the designated databases context.
(a) The overall relevance judgment is given by:
ij nl
Gg


(7)
where,    and 1 i n, 1 j l and G is
a query vector with a small-operator defined as a matrix,
 are weights of terms of the document vectors, and 
are queries vectors. Any numerical weight component of
matrix G greater than the average mean weight, (6)
will be retained to add to a matrix T given by:
(8)
where,
(b) Based on matrix T (8) we calculate relevance
numerical weight values, for all set of documents D,
which are the largest weighting values for each
corresponding vector given by:
max{ },1
i ij
Val t i n
i j l
 

(9)
(c) Thus, any document whose value
i
val
was higher
than the overall average relevance weight would be
predicted as a relevant document; any document with
a lower value would be predicted as irrelevant
document (9). Thus average relevance mean value
within the normalization interval    is
computed for each document given by:
        (10)
Step 5: Relevance Judgment
The individual user is asked to judge contextual factor (e.g.
information relevance) influence on ranking given a certain
contextual dimension (numerical weight is relevant or not).
(a) If the ranked document is relevant to user information
needs, the user finishes his/her query search context,
then GO to Step 4 according to the user’s document
preference.
(b) Otherwise, the user continues to search the document
databases by reformulating the query or stop querying
the designated database until relevant documents are
ranked. GO to Step 6.
Step 6: Update Term Weight and Keywords Set
The keyword term set n provided by the ranked documents
NjNNN
j
j
dddd
dddd
dddd
D
321
2232221
1131211
nl
w
n
w
n
w
n
w
l
wwww
l
wwww
321
2232221
1131211
 
lnij
tT
ljni
gift
gifgt
ijij
ijijij
1,1
,0
,
6 Kehinde Agbele et al.: A Context-Adaptive Ranking Model for Effective Information Retrieval System
and the relevance numerical weight values will be updated
by user feedback.
(a) Any new query term not belonging to n will be added
and a new column of relevance weight value will be
computed and expanded for ranked documents
routinely.
(b) If any ranked document di is retrieved by the users,
the corresponding relevance weight values with
respect to the query keywords will be increased
by (11). The default of β is set to increase the
corresponding relevance numerical weight values.
 
ij ij
ww
(11)
where,
         and   
We coined the acronym DROPT to name our adaptive
algorithm that provides a limited number of ranked
documents in response to a given query. Also it can improve
the ranking mechanism for the search results in an attempt to
adapt the retrieval environment of the users and amount of
relevant context information according to each user’s request.
Finally, the DROPT measure must be self-learning that can
automatically adjust its search structure to a user’s query
behaviour.
5. Experimental Design
The experiment was designed to study a new user’s
behaviour source i.e. ranking of retrieved documents that can
influence the information retrieval process. Though
considering user searching actions (i.e. clicking on a
document in a search result, printing a document, moving a
document into a folder, etc.) as sources for implicit relevance
of documents, the techniques presented in this paper is
different because it considers document ranking. From that
view, the techniques is interesting and innovative as it
emphasizes that the IR process is not just about matching
between documents and queries but relationships among
matching, user actions and user preferences in ranked
documents of retrieved results. The experiment was designed
and piloted using systems that allows interactive information
retrieval (IIR) experiments that log users ‘in different
browsers interactive search behavior. The system has a
search engine where tables are created for experimental
generated data from searching tasks. The systems were used
to determine the frequency of keyword matching-based
querying results to monitor the progress of the experiment.
They performs several information related tasks activities
such as searching, filtering, matching, displaying, and
learning information needs over time. This is concerned with
the reuse of the existing standards, approaches, and how to
incorporate them into the design of the IR system. During the
search, the participant interactions with the search engine
were logged via the system log in menu. In each search task,
the participants were asked to obtain the frequency of
keyword matching based querying across a document; that
were relevant to meet their information requests. The
behavioral measures we examine are the frequencies of the
user issued query (i.e. frequency of keyword matching based
querying) while interacting with the IR system.
We involved three system users (Master students) in the
area of Computer Science in the Department of Computer
Science to collect data through the WampServer search
engine back end prototype. The three study system user
participants were given 10 search tasks each in their domain
of knowledge. During the search context, the students’
interactions with the search engine back end prototype were
logged via the system log in menu with their "student
identification number". In each task, the students were asked
to obtain the frequency of keyword matching based querying
across a document that were relevant to meet their
information requests to achieve document ranking task based
on individual users preference, or ignore documents that
were found to be irrelevant. The user behavioural measures
we examine are the frequencies of the issued query. The
function of the frequency of the keyword across a document
from the document database collected is stored in the
WampServer site localhost database. WampServer is a
Windows Internet environment that allows user to create
Internet applications with Apache 2, PHP and a MySQL
database. PHP Myadmin allows user to manage easily our
databases. This measure was used to predict the ‘relevant”
documents marked ‘X” for document ranking model. To
evaluate the performance of the proposed technique, we
performed an experiment on small scale search of different
30 queries from the system users to validate the effectiveness
of the technique. Table 3 gives the statistics of the queries
considered in the experiment. The personalized predictive
ranking model identifies retrieved documents to individual
user from the domains according to his/her preferences.
6. Ranking Performance Results
With the intention of measure ranking performance, the
DROPT technique, according to Agbele (2014) for ranking
search results list was tuned by experimenting with the
prototype system for relevance judgment. In this paper, each
query produced a document based on the matching
conditions and the retrieval was repeated for 10 query
reformulations from the domain of system user experts. The
underlying philosophy of the relevance judgment rules for
user model judgment using the DROPT technique is to rank
those documents, which exceeded the overall weighted
fitness score that the system user judges to be relevant to
his/her information needs, and ignore those documents the
system users judge to be irrelevant (less preferred).
International Journal of Information Science 2018, 8(1): 1-12 7
Figure 2. Ranking performance graph results at the known relevant documents
7. Comparison of DROPT Technique
with TF-IDF Method
In this section, we present the results that show the
performance of our DROPT technique against a traditional
tf-idf method. We compared our ranking algorithms with
selected well-known baseline algorithms such as TF-IDF to
evaluate the performance of our ranking technique in
standard "Precision at position n" (P@n) measure. For the
information needs and document collection of the
experiment, relevance was assessed by different system
users in their domain of experts. They are knowledgeable in
their domain and were asked to judge the relevance of the
retrieved documents on a six level scales: (0=Harmful,
1=Bad, 2=Fair, 3=Good, 4=Excellent and 5=Perfect) with
respect to a given query. For comparison of results, we have
used P@n metrics Jarvelin and Kekalainen (2010). Precision
at n measures the relevancy of the top n results of the ranking
list with respect to a given query according to equation (12).
P@n=No. of relevant document in top n results / n (12)
P@n can only handle cases with binary judgment
“relevant” or “irrelevant” with respect to a given query at
rank n. To compute P@n, 30 queries were judged in these six
levels by users.
The test process involved using the 30 queries provided by
the system users. The measure (P@n) is used for the
evaluation. Naturally, this is computed for each query, and
then takes the average dimension (n) for all queries. Fig. 3
shows the comparison of the DROPT algorithm with other
algorithms in the P@n measure. As the figure shows, our
adaptive algorithm outperforms TF-IDF model. The DROPT
algorithm achieves a 28% in P@n compared to TF-IDF. The
empirical results have been compared with the traditional
relevance feedback model. It shows that the precision value
of the DROPT ranking technique is comparatively higher for
all the query sets. This achievement resides in the
combination of context-based algorithms using user
preferences for query reformulations. In this regard, the
number of top n results showed to users depicts the relevancy
degree of the retrieved documents with respect to a given
query with rank n (judged by the system users).
Table 3. Precision Results from the 3 Domains of Expert for Ranking at
Known Relevant Documents
Document#
Queries
Relevant
Tf
Precision
Fitness Score
1
Q1
19
0.000
0.37
2
Q2
X
3
0.500
0.90
3
Q3
8
0.000
0.73
4
Q4
X
2
0.500
0.93
5
Q5
8
0.000
0.73
6
Q6
X
3
0.500
0.90
7
Q7
X
2
0.571
0.93
8
Q8
X
5
0.625
0.83
9
Q9
X
5
0.667
0.83
10
Q10
X
5
0.700
0.83
11
Q11
X
4
0.727
0.87
12
Q12
13
0.000
0.57
13
Q13
X
3
0.692
0.90
14
Q14
X
4
0.714
0.87
15
Q15
10
0.000
0.67
16
Q16
X
3
0.688
0.90
17
Q17
X
6
0.706
0.80
18
Q18
9
0.000
0.70
19
Q19
16
0.000
0.47
20
Q20
18
0.000
0.40
21
Q21
13
0.000
0.57
22
Q22
X
2
0.591
0.93
23
Q23
X
4
0.609
0.87
24
Q24
X
2
0.625
0.93
25
Q25
X
4
0.640
0.87
26
Q26
14
0.000
0.53
27
Q27
X
2
0.630
0.93
28
Q28
X
2
0.643
0.93
29
Q29
X
2
0.655
0.93
30
Q30
8
0.000
0.73
Average
0.631
0.75
8 Kehinde Agbele et al.: A Context-Adaptive Ranking Model for Effective Information Retrieval System
The corpora were manually built with minimal number of
documents for evaluation purposes. For easy evaluation and
scalability issues, we use our manually built corpus to
evaluate the effectiveness of our DROPT technique. The
empirical results have been compared with the traditional
relevance feedback model. In future, we intend to perform
100 queries reformulation and compared with other
well-known standards in TREC.
Figure 3. Ranking performance graph results at the known relevant
documents
8. Statistical Analysis and Discussion
Agbele et. al (2016) presented the DROPT algorithm
results and extended in this present paper by performing
statistical analysis using ANOVA on 30 queries.
Significance test interpretation was carried out in this
research study with the purpose of measuring the
effectiveness of IR system using interactive reinforcement
learning (user’s feedback and context-awareness) in
comparison to relevance feedback. The test was established
to reject the null hypothesis, H0 that there is difference
between the group means of Domain of system user
participants 1, 2, and 3. Rejecting H0 infers accepting the
alternative hypothesis; H1 with at least one of the means is
different from others in retrieval efficacy in order to improve
the system performance.
Since F-statistical table falls to the left of F-distribution
(5.19 > 4.74) under the acceptance region. Therefore we may
conclude at a 5% level of significance test that there is a
significant difference in the means of at least one group of
Domains 1, 2, and 3. This is because the values of ad-hoc
keywords matched against documents that were searched
independently across each of the domains of system user’s
participants and the corresponding values of occurrences of
issued query were obtained. The interpretation of this
statistical result demonstrates the improvement of
information retrieval efficacy through the attributes from the
user behaviour actions while interacting with the IR system.
Our results on the indexed ad-hoc keywords represent
domain of the system user’s three participants in an in-lab
experimental setting. The results demonstrate that combining
individual system user’s behavioral measures can improve
ranking prediction accuracy (according to relevance
weights), for documents ranking tasks, and however that
individual users ranking performed much better than
combining document rankings of the systems. This
accomplishes personalization of retrieved documents for
individual users as the focus of this paper. The retrieval
effectiveness is measured using well known metrics
Precision and Recall, at known relevant documents.
Definitions:
Let MSB depicts variance between the three domains
considered in this study.
Let MSW depicts variance within the three domains
considered in this study.
In order to evaluate both the means and standard
deviations of the keyword matching based querying
experiments, we construct hypothesis test based on the
values obtained across all issued queries after 30 generations
(10 search tasks from each participant domain) using
Analysis of Variance (ANOVA).
H0:
=
1 =
2 =
3 where 1, 2, and 3 are domains
considered in this study.
H1: At least one of the means is different from the others.
Figure 4. Showing values of 4.74 at F 0.05, 2, 4.74
It is noted that there are presently the value of K = 3
domains, that is, Domains 1, 2, and 3. Therefore, DOFN =
K-1 = 3-1 = 2. The sum total of data for all the three domains
depicted as 10 + 10 + 10 = 30.
Using the DOFD = N-K = 10-3 = 7 and α = 0.05 (the least
significant value). The critical value if F0.05, 2, 7 = 4.74
(determined using F-Distribution table).
We need to find: = mean of mean =
MSB = and MSW =
The mean of mean was determined as follows:
=
= 268+177+202 = 647/30 = 21.6
The mean for each domain are evaluated as follows:
Domain 1 = = 268/10 = 26.8
Domain 2 = = 177/10 = 17.7
Domain 3 = = 202/10 = 20.2
The variance for each domain is evaluated as follows:
F0.05, 2, 7 = 4.74
0.9
5
Rejection region
α = 0.05
F-Distribution
International Journal of Information Science 2018, 8(1): 1-12 9
Domain 1 = 228.9/10 = 22.89
Domain 2 = 154.5/10 = 15.45
Domain 3 = 200.01/10 = 20.01
Mean of mean = (268+177+202)/30 = 21.6
Also MSB = could be determined as follows:
M
SB
=
ni (x
Domain1
x)
2

ni (x
Domain2
x)
2
ni (x
Domain 3
x)
2
/ K
1
MSB = 10(26.8-21.6)2 + 10(17.7-21.6)2 + 10(20.2-21.6)2
/3-1 = 442.1/2 = 221.05
MSB = 221.05
Also, MSW =
MSW = (10-1) Domain1 + (10-1) Domain 2 + (10-1)
Domain 3 (10-1) / N-K
Domain 3/30-3= 9(9.94) +9(12.73) +9(10.45)/7=298.08/7
MSW = 42.58
Therefore, the test statistics is F = MSB/MSW
= 221.05/42.58 = 5.19
9. Conclusions
Using adaptive IR system, situations can be detected and
classified as contexts. Once the proposed system has
recognized in which context an interaction takes place, this
information can be used to change and adapt the behaviour of
IR applications and systems. One has to keep in mind that
users learn how to interact with the system, and that they
adapt their behaviour. So, it is crucial to develop
understandable context-aware IR system that adapts to the
users’ expectations. In line with this, well-designed
context-awareness is a great and powerful way to make
user-friendly and enjoyable IR applications.
User interactive behavior measures on relationships
among matching help understand how users interact on the
clicked documents in response to a given query, and they are
indicative of document relevance. Also, user interactive
behaviours measures during user actions help describe what
the user does between issuing one query and the next. User
interactive behaviours about user preferences help
understand how to acquire search results. This in turn could
improve the information retrieval effectiveness. The adapted
search results means to explicitly make use of the user
context to tailor search results.
Our results demonstrate a significant effect of document
ranking on predictive ranking model according to document
relevance. Document ranking not only affected the user
interactive behaviour as predictors of document relevance, it
also affected the relevance weights for each of the user
interactive behaviours to improve IR effectiveness. In
addition, when document information is available, the
ranking model gives better prediction of document relevance.
Therefore, we can conclude that it is important for adapted
IR systems to detect the context in which a search is
conducted, especially the document ranking, and then to
apply the user model to adapt search results to individual
users. Also document ranking influenced how users
interacted with search systems during search sessions. The
interpretation of the statistical results using ANOVA
demonstrates the improvement of information retrieval
effectiveness through the attributes.
A DROPT technique has been evaluated to reflect how
individual user judges the context changes in IR from the
user behaviour actions while interacting with the IR system
results ranking. Predictive user model of document ranking
were presented to adapt retrieved documents to individual
users during their search context, rather than after they finish
the entire ranking tasks.
ACKNOWLEDGEMENTS
The authors would like to thank Elizade University
Management for funding this research project.
10 Kehinde Agbele et al.: A Context-Adaptive Ranking Model for Effective Information Retrieval System
Appendix A
REFERENCES
[1] A. K. Dey, (2001): Understanding and Using Context.
Personal Ubiquitous Computing, vol. 5, no. 1, pp: 4-7.
[2] C. Emmanouilidis, R-A. Koutsiamanis, A. Tasidou, (2013):
Taxonomy of architecture, context-awareness, technologies
and applications. Journal of Network and Computer
Applications, Vol. 36, pp. 103-125.
[3] A. J. Jara, P. Lopez, D. Fernandez,. F. Cashlo, M. A. Zamora,
and F. Skarmeta, (2013): Mobile discovery: discovering and
interacting with the world through the Internet of things.
Journal of Personal and Ubiquitous Computing, Published
Springer-Verlag London.
[4] H. Y. Noh, J. H. Lee, K. S. Oh, and S. B. Cho, (2012):
Exploiting indoor location and mobile information for
context-awareness service. Journal of Information Processing
and Management,. Vol. 14, Issue 1, pp. 1-12.
[5] W. Xue, and H. Deng, (2012): Unstructured queries based on
mobile user context. International Journal of Pervasive
Computing and Communications, Vol. 8, Issue 4, pp.
368-394.
[6] H. O. Nyongesa, and S. Maleki-dizaji, (2006): User
modelling using evolutionary interactive reinforcement
learning. Inf Retrieval, vol. 9, no. 3, pp. 343-355.
DOI: 10.1007/s10791-006- 4536-3.
[7] M. Koorangi, K. Zamanifar, (2007): a Distributed Agent
Based Web Search using a Genetic Algorithm. International
Journal of Computer Science and Network Security, vol. 7, no.
1, pp. 65-76.
Estimate measures of
individual user’s behaviours
context in user matching tasks
Generate predictive
user models of
document ranking
Implement personalized adaptive IR algorithm in the
system
Evaluation of the IR
system
Observe individual user behaviour context
(keyword matching based querying)
Predictive user model of document ranking
Can it predict the
document ranking?
Perform initial queries
reformulation
Personalize retrieved
documents to individual
users
Rank retrieved documents based on individual user
preferences
Context-based
personalized IRS
Not yet
Yes
Predictive
user model
of the
relevance of
adaptive
document
documents
content
International Journal of Information Science 2018, 8(1): 1-12 11
[8] J. Allan, (2002): Challenges in information retrieval and
language modelling. Report of a workshop held at the
Centre for Intelligent Information Retrieval, University of
Massachusetts, Amherst, September 2002.
[9] D. Dinh, and L. Tamine, (2012): Towards a Context Sensitive
Approach to Searching Information based on Domain
Specific Knowledge Source. Web Semantics: Science
Services and Agents on the WWW, Vol. 12, pp. 41-52.
[10] C. Kebler, M. Raubal, and C. Wosniok, (2009): Semantic
Rules for Context-aware Geographical Information Retrieval.
In P. Barnaghi, editor, 4th European Conference on Smart
Sensing d Context, EuroSSC 2009, University of Surrey. Vol.
5741 of LNCS Springer, pp. 77-92.
[11] A. Goker, and H. Myrhaug, (2008): Evaluation of a mobile
information system in context. Information Process
Management, Vol. 44, no. 1, pp. 39-65.
[12] V. Vieira, P. Tedesco, A. C. Salgado, and P. Brézillon,
(2007): Investigating the specifics of contextual elements
management: the cemantika approach. Context, pp. 493-506.
[13] W. Li, D. Ganguly, G. J. F. Jones, (2011): Enhanced
Information Retrieval Using Domain-Specific Recommender
Models. In: Amati, G., Crestani, F. (eds.) ICTIR 2011. LNCS,
vol. 6931, pp. 201212.
[14] O. Asfari, B. L. Doan, Y. Bourda, J. P. Sansonnet, (2009):
Personalized access to information by query reformulation
based on the state of the current task and user profile, In: The
Third International Conference on Advances in Semantic
Processing, Malta.
[15] P. Mylonas, D. Vallet, P. Castells, M. Fernandez, Y. Avrithis,
(2008): Personalized information retrieval based on context
and ontological knowledge. Knowledge Engineering Review,
Vol. 23, No. 1, pp. 73-100.
[16] S. S. Anand, and B. Mobasher, (2007): Introduction to
intelligent techniques for web personalization. ACM
Transactions on Internet Technology, Vol.7, no 4, pp. 18.
[17] D. S. Marco, N. Vidhya, E. Churchill, E. (2013): In
Proceedings of the SIGHI Conference on Human Factors in
Computing Systems, pp. 2487-2496.
[18] P. Lukowicz, A. S. Pentland, and A. Ferscha, (2011): From
Context Awareness to Socially Aware Computing, IEEE
Pervasive Computing, Vol. 11, No. 1, pp. 32-41.
[19] D. Zhou, S. Lawless, V. Wade, (2012): Improving search via
personalized query expansion using social media. Inf. Retr.
pp. 125. Doi: 10.1007/s10791-012-9191-2.
[20] X. Shen, B. Tan, C. Zhai, (2005): Implicit user modelling for
personalized search. In: 14th ACM International Conference
on Information and Knowledge Management (CIKM 2005),
pp. 824831. ACM, Bremen.
[21] P. K. Shivaswamy, and T. Joachims, (2011): Online Learning
with Preference Feedback. In NIPS workshop on Choice
Models and Preference Learning.
[22] K. Agbele. (2014). “Context-Awareness for Adaptive
Information Retrieval Systems”, Unpublished PhD Thesis,
University of the Western Cape.
[23] Y. Li, and N. J. Belkin, (2008): A faceted approach to
conceptualizing tasks in information seeking. Information
Processing and Management, Vol. 44, No. 6, pp. 1822-1837.
[24] K. Jarvelin, and J. Kekalainen, (2010): IR evaluation methods
for retrieving highly relevant documents. Published in: Belkin,
N. J., Ingwersen, P. and Leong, M. K. (eds). In: Proceedings
of the 23rd Annual International ACM SIGIR Conference on
Research and Development in Information Retrieval. New
York, NY: ACM, pp. 41-48.
[25] K. K. Agbele, E. F. Ayetiran, K. D. Aruleba and D. O. Ekong,
"Algorithm for Information Retrieval optimization," 2016
IEEE 7th Annual Information Technology, Electronics and
Mobile Communication Conference (IEMCON), Vancouver,
BC, 2016, pp. 1-8. doi: 10.1109/IEMCON.2016.7746242.
[26] F. Liu, C. Yu, and W. Meng, (2014):”Personalized Web
Search for Improving Retrieval Effectiveness,” IEEE
Transactions on Knowledge and Data Engineering, vol. 16,
No.1, pp.28-40.
[27] K. Sugiyama, K. Hatano, and M. Yoshikawa, (2004):
Adaptive web search based on user profile constructed
without any effort from users. in Proceedings of the 13th
international conference on World Wide Web, (New York,
NY, USA, 2004), ACM, 675-684.
[28] P.J. Brown, J.D. Bovey, and X. Chen, (2007): Context-aware
applications: from the laboratory to the marketplace. Personal
Communications, IEEE], 4 (5). 58-64.
[29] G. D. Abowd, A.K. Dey, R. Orr, and J. Brotherton, (1997):
Context-awareness in wearable and ubiquitous computing. in
First International Symposium on Wearable Computers.
(ISWC 97), (Cambridge, MA, 1997), 179-180.
[30] T. Haveliwala, (2002): Topic-sensitive PageRank. in
Proceedings of the Eleventh International World Wide Web
Conference, (Honolulu, Hawaii, USA, 2002).
[31] K. Bharat, (2000): SearchPad: explicit capture of search
context to support Web search. in Proceedings of the 9th
international World Wide Web conference on Computer
networks: the international journal of computer and
telecommunications netowrking, (Amsterdam, The
Netherlands, 2000), North-Holland, 493-501.
[32] Z. Dou, R. Song, and J. Wen, (2007): A Large-scale
Evaluation and Analysis of Personalized Search
Strategies.pdf. in Proceedings of the 16th international World
Wide Web conference (WWW2007), (Banff, Alberta, Canada,
2007), 572-581.
[33] D. Billsus, D. Hilbert, and D. Maynes-Aminzade, (2005):
Improving proactive information systems. in IUI '05:
Proceedings of the 10th international conference on
Intelligent user interfaces, (San Diego, California, USA,
2005), ACM, 159-166.
[34] T. Bauer, and D. Leake, (2001): Real time user context
modeling for information retrieval agents. in CIKM '01:
Proceedings of the tenth international conference on
Information and knowledge management, (Atlante, Georgia,
USA, 2001), ACM, 568-570.
[35] M. Henzinger, B. Chang, B. Milch, and S. Brin, (2003):
Query-free news search. In WWW '03: Proceedings of the
12th international conference on World Wide Web, (Budapest,
Hungary, 2003), ACM, 1-10.
[36] White, R. and Kelly, D., A study on the effects of
personalization and task information on implicit feedback
12 Kehinde Agbele et al.: A Context-Adaptive Ranking Model for Effective Information Retrieval System
performance. in CIKM '06: Proceedings of the 15th ACM
international conference on Information and knowledge
management, (Arlington, Virginia, USA, 2006), ACM,
297-306.
[37] H. Kim, and P. Chan, (2003): Learning implicit user interest
hierarchy for context in personalization. in IUI '03:
Proceedings of the 8th international conference on Intelligent
user interfaces, (Miami, USA, 2003), ACM, 101-108.
[38] Finkelstein, L., Gabrilovich, E., Matias, Y., Rivlin, E., Solan,
Z., Wolfman, G. and Ruppin, E., Placing search in context:
the concept revisited. in World WideWeb, (2001), 406-414.
[39] J. Rocchio, and G. Salton: Relevance feedback in information
retrieval. Prentice-Hall, 1971.
[40] G. Salton, and C. Buckley, (1988): Improving retrieval
performance by relevance feedback. Journal of the American
Society for Information Science, 41 (4). 288-297.
[41] R. White, and D. Kelly, (2006): A study on the effects of
personalization and task information on implicit feedback
performance. in CIKM '06: Proceedings of the 15th ACM
international conference on Information and knowledge
management, (Arlington, Virginia, USA, 2006), ACM,
297-306.
[42] X. Shen, B. Tan, and C.Zhai, (2005): Implicit user modeling
for personalized search. in CIKM '05: Proceedings of the 14th
ACM international conference on Information and knowledge
management, (Bremen, Germany, 2005), ACM,824-831.
... In accordance with this set, the selection criteria for the found documents are already presented as Pareto-optimal. is feature is poorly reflected in the algorithmic support of the final stage of search and presentation of SERPs in DIRS [15,16]. ...
... In general, the above function takes the minimum zero value only in states that meet constraints (11)- (12) and represent a plan of assignments. Combining the function (16) with the function (17) or the function (18), we build the energy function in its finished form: ...
Article
Full-text available
The article is devoted to the problem of optimization of search request ranking algorithms in the digital information retrieval system. The algorithm of functioning of the neural network ranking unit based on Hopfield neural network is built. The ability to generate a ranked list of pages found as a result of the request in the digital information retrieval system can be provided by solving two problems of integer optimization: the problem of assignment of combinatorial sets of criteria for assessing the relevance of web page search and the problem of sorting of numbers—relevance values. The architecture of the neural network model based on the dynamic Hopfield neural network with binary output function designed for combinatorial optimization of the final list of documents found in the digital information retrieval system was synthesized. Promising variants of neural network models with binary output function of neurons for synthesis of the optimal evaluation plan with a combinatorial set of criteria by solving the problem of assignment were built. It has been proven that the built models differ in the rules for determining the coefficients of synaptic connections and external shifts; each of the created rules can be used independently or in different combinations with one another. In the course of analytical research, it was found that the optimization formulation of the problem of sorting of relevance values of search pages is identical to the problem of assignment of combinatorial groups of evaluation criteria provided that the elements of the performance matrix of the latter are defined as linear combinations of relevance values.
... The main reason behind the problem is speculating the documents based upon inadequate query terms at the database level and correspondingly ranking them. Although, there exist techniques that either contextually reformulates the user query [8][9][10] or contextually rank the documents using sentence embeddings [11][12][13], but no technique combined both aspects simultaneously. Thus, there is a need for a combined approach that can express the user's information needs at the level of query submission as well as document ranking. ...
Article
Full-text available
Due to the exponential increase in the information on the web, extracting relevant documents for users in a reasonable time becomes a cumbersome task. Also, when user feedback is scarce or unavailable, content-based approaches to extract and rank relevant documents are critical as they suffer from the problem of determining semantic similarity between texts of user queries and documents. Various sentence embedding models exist today that acquire deep semantic representations through training on a large corpus, with the goal of providing transfer learning to a broad range of natural language processing tasks such as document similarity, text summarization, text classification, sentiment analysis, etc. So, in this paper, a comparative analysis of six pre-trained sentence embedding techniques has been done to identify the best model suited for document ranking in IR systems. These are SentenceBERT, Universal Sentence Encoder, InferSent, ELMo, XLNet, and Doc2Vec. Four standard datasets CACM, CISI, ADI, and Medline are used to perform all the experiments. It is found that Universal Sentence Encoder and SentenceBERT outperform other techniques on all four datasets in terms of MAP, recall, F-measure, and NDCG. This comparative analysis offers a synthesis of existing work as a single point of entry for practitioners who seek to use pre-trained sentence embedding models for document ranking and for scholars who wish to undertake work in a similar domain. The work can be expanded in many directions in the future as various researchers can combine these strategies to build a hybrid document ranking system or query reformulation system in IR.
Article
This article concentrates on solving the $k$ -winners-take-all $(k$ WTA) problem with large-scale inputs in a distributed setting. We propose a multiagent system with a relatively simple structure, in which each agent is equipped with a 1-D system and interacts with others via binary consensus protocols. That is, only the signs of the relative state information between neighbors are required. By virtue of differential inclusion theory, we prove that the system converges from arbitrary initial states. In addition, we derive the convergence rate as $\mathcal{O}(1/t)$ . Furthermore, in comparison to the existing models, we introduce a novel comparison filter to eliminate the resolution ratio requirement on the input signal, that is, the difference between the $k$ th and $(k+1)$ th largest inputs must be larger than a positive threshold. As a result, the proposed distributed $k$ WTA model is capable of solving the $k$ WTA problem, even when more than two elements of the input signal share the same value. Finally, we validate the effectiveness of the theoretical results through two simulation examples.
Article
Full-text available
The study investigated the use of information retrieval system for library services by undergraduate students in Prof. Aghagbo Nwako he study investigated the use of information retrieval system for library services by undergraduate students in Prof. Aghagbo Nwako Library Awka. The purpose of the study are to determine: The library information retrieval system available for library services to undergraduate students, the use of information retrieval system for library services to undergraduate students, the constraints to the use of information retrieval system in provision of library services to undergraduate students and the ways to improve the use of information retrieval system in provision of library services to undergraduate students. Four research questions guided the study. The study employed descriptive survey research design. The population of the study comprised of 308 regular undergraduate students (100 to 400 Levels) of Library and Information Science, Nnamdi Azikiwe University, Awka. Simple random sampling technique was used for the study; therefore, 120 undergraduate students were sampled. The sample size of the study was 30 regular undergraduate students selected from each of the levels. A self-structured questionnaire was the instruments used for data collection. The data collected was analyzed using descriptive statistics. The findings of the study revealed among others that the undergraduates were aware of information retrieval system available in and use it for their academic purposes and the study also revealed that the students faced unreliable power supply, poor maintenance culture; low bandwidth of internet access, lack of awareness among others factors that hinder the utilization of library information retrieval system. The study concludes among others that information retrieval services are available for use by undergraduate students. But the undergraduate students do not utilize library information retrieval system to a fuller extent due to unreliable power supply, poor maintenance culture, low bandwidth of internet access are among others factors that hinder the utilization of library information retrieval system. The study recommends that high Internet connectivity should be provided in the Library, so as to encourage library users to patronize the library resources and services when they are made available. Also it was recommended that university management should provide adequate fund for maintenance and alternative source of power.
Chapter
People have realized the importance of finding and archiving information with the computer advents for thousands of years, and storing of large amount of information became possible. It is actually not related to the fetching of the documents, it informs the user on the whereabouts and existence of the documents. In this paper, hybrid model has been used in which the document is classified using the support vector machine (SVM) classifier, and after the condition is applied, if it is satisfied, the extraction of the matched paragraph and the sentence is responsible for the generation of relevant answer. The knowledge base gets updated if condition does not match, and new updated answer will be generated. Finally, the best answer is displayed after ranking by using the PSO optimization. Word2vector is applied for feature extraction. In this paper, comparison of RankSVM, RankPSO and RankHSVM + PSO for the implementation of IR ranking is considered. Here, first SVM is used as a classifier for dividing most relevant and non-relevant results, and afterward PSO is used for the optimization of the result means extraction of the best answer or document. Selection of appropriate parameters is difficult in case of simple SVM, but for the ranking of the answers it gives potential solutions. PSO is used for optimization which has global search capability and is easy to implement and thus to optimize the ranking of document retrieval. We propose the RankHSVM + PSO model to find the fitness function. This technique improves the performance of the system as comparative to other techniques. The result shows that the algorithm applied here improves the value of performance evaluation by 4–5%. TREC 2004 QA DATA dataset is used which contains my datasets. It has a question answering track since 1999. The task was defined in each track. Retrieval of true equivalent test collection for standard retrieval is an open problem. In a retrieval test collection, the unit that is judged the document has a unique identifier.
Chapter
Full-text available
The Covid-19, a pandemic situation, effects the economy of the whole world severely and is gaining much huge attention in the field of research currently across the globe. The Internet of things (IoT) technology is playing a great role for taking care of the patients by monitoring and controlling the symptoms and is very much essential for the developing countries, where monitoring of health of huge population has its own challenges. So the IoT and its amalgamation with mobile ad hoc network (MANET) acts as base of networks where devices send information among each other wirelessly thus also named as wireless mesh networks (WMN), in which various nodes are either stationary or allied with static position. Sensors and different other devices involved in e-healthcare sector used in WMN converse wirelessly and hence become the main gate to a numerous susceptibilities. The main aim of this research study is to evaluate the performance of reactive, secured and hybrid routing protocols for throughput as one of the important quality of service (QoS) parameters in absence as well as in presence of distributed denial of service (DDoS). The NS-2(network simulator) is used to simulate AODV (ad hoc on-demand vector, SAODV (secured AODV) and hybrid wireless mesh protocol) in scenario of changing nodes. The comparative analysis concludes the HWMP as most suitable protocol among the other two routing protocols with impact on throughput for handling DDoS attacks. This research study aids in providing implications to enhance existing protocols and alleviate the consequence of DDoS instigated by such attacks.
Chapter
Full-text available
Depression is presently one of society's main psychological disorders. An intensified public mental health concern has been prompted by recent experiences with the emergence of corona virus disease 2019 (COVID-19). At present, the emphasis of research on human emotional state representation has changed from basic emotions to a large number of emotions in continuous three-dimensional space owing to the complexity of describing and evaluating a vast number of emotions within a single framework. Significant considerations of 3D continuous valence, arousal and dominance space while overseeing mental health issues are important as they relate to the expression of emotion and behavioural reactions. The goal of this research is to design a machine learning regressor modal to estimate the continuous valence, arousal and dominance score which results from the process of emotional intelligence via text interpretation. In the pursuit of goal, EmoBank dataset, which contains text information as well as valence–arousal–dominance values and for validation ISEAR, a labelled corpus of categorical emotions datasets is used. We learn an embedding using three pre-trained word embeddings: word2vec, Doc2vec and BERT, and find that BERT significantly outperforms the result. In a future study, the regressor model will be adopted in depression detection by distributing the categorical negative emotions in terms of VAD.
Article
Full-text available
Large amount of unstructured designed information is difficult to deal with. Obtaining specific information is a hard mission and takes a lot of time. Information Retrieval System (IR) is a way to solve this kind of problem. The main reason for this work is to help internet and library database users to find the required information with high performance. Finding information on a particular word, subject, topic or article is useful when specific keywords such as title, author's name, ISBN/ISSN, year of publication for the object are known. The study is guided by two (2) objectives and adopted documentary research method in social research and the core aim is to provide a general understanding on information retrieval system in a digital era as well as specific process for its utilization.
Conference Paper
Full-text available
When using Information Retrieval (IR) systems, users often present search queries made of ad-hoc keywords. It is then up to the information retrieval systems (IRS) to obtain a precise representation of the user's information need and the context (preferences) of the information. To address this problem, we investigate optimization of IRS to individual information needs in order of relevance. The goal of this article is to develop algorithms that optimize the ranking of documents retrieved from IRS according to user search context. In particular, the ranking task that led the user to engage in information-seeking behaviour during search tasks. This article discusses and describes a Document Ranking Optimization (DROPT) algorithm for IR in an Internet-based or designated databases environment. Conversely, as the volume of information available online and in designated databases is growing continuously, ranking algorithms can play a major role in the context of search results. In this article, a DROPT technique for documents retrieved from a corpus is developed with respect to document index keywords and the query vectors. This is based on calculating the weight (w ij ) of keywords in the document index vector, calculated as a function of the frequency of a keyword k j across a document. The purpose of the DROPT technique is to reflect how human users can judge the context changes in IR result rankings according to information relevance. This article shows that it is possible for the DROPT technique to overcome some of the limitations of existing traditional (tƒ × idƒ) algorithms via adaptation. The empirical evaluation using metrics measures on the DROPT technique carried out through human user interaction shows improvement over the traditional relevance feedback technique to demonstrate improving IR effectiveness.
Article
Full-text available
In this paper, the problems of current web search engines are analyzed, and the need for a new design is justified. Some ideas on how to improve current web search engines are presented, and then an adaptive method for web meta-search engines with a multi-agent specially the mobile agents is presented to make search engines work more efficiently. In the method, the cooperation between stationary and mobile agents is used to make more efficiency. The meta-search engine gives the user needed documents based on the multi-stage mechanism. The merge of the results obtained from the search engines in the network is done in parallel. Using a reduction parallel algorithm, the efficiency of this method is increased. Furthermore, a feedback mechanism gives the meta-search engine the user's suggestions about the found documents, which leads to a new query using a genetic algorithm. In the new search stage, more relevant documents are given to the user. The practical experiments were performed in Aglets programming environment. The results achieved from these experiments confirm the efficiency and adaptability of the method.
Article
Purpose – Many mobile devices today are equipped with diversified sensors that enable the acquisition of rich user context (e.g. GPS location, phone activity) for application utilization. With the growing usage of mobile devices in daily life, the problem of conveniently and promptly searching a piece of content that a user has viewed on his/her device before becomes more and more crucial. This paper aims to propose a context‐based query processing framework called UCQP that supports unstructured queries for content search in a user's access history. Design/methodology/approach – Beyond the keywords related to the content properties, a context query in the framework is specified with freeform phrases that describe high‐level mobile contexts of the user at a previous time when the user viewed the searched content. Findings – Experimental results on a prototype system of the framework illustrate its good accuracy and small response time. Originality/value – To tolerate the incompleteness and inaccuracy in user query texts caused by fading human memory, the authors develop several semantic query parsers that are tailored for different types of contexts using natural language processing and information retrieval techniques. The authors further propose a similarity model to rank the multiple result contents of a query by comparing context entities specified in the query and historical context values associated with each result.
Article
Social tagging systems have gained increasing popularity as a method of annotating and categorizing a wide range of different web resources. Web search that utilizes social tagging data suffers from an extreme example of the vocabulary mismatch problem encountered in traditional information retrieval (IR). This is due to the personalized, unrestricted vocabulary that users choose to describe and tag each resource. Previous research has proposed the utilization of query expansion to deal with search in this rather complicated space. However, non-personalized approaches based on relevance feedback and personalized approaches based on co-occurrence statistics only showed limited improvements. This paper proposes a novel query expansion framework based on individual user profiles mined from the annotations and resources the user has marked. The underlying theory is to regularize the smoothness of word associations over a connected graph using a regularizer function on terms extracted from top-ranked documents. The intuition behind the model is the prior assumption of term consistency: the most appropriate expansion terms for a query are likely to be associated with, and influenced by terms extracted from the documents ranked highly for the initial query. The framework also simultaneously incorporates annotations and web documents through a Tag-Topic model in a latent graph. The experimental results suggest that the proposed personalized query expansion method can produce better results than both the classical non-personalized search approach and other personalized query expansion methods. Hence, the proposed approach significantly benefits personalized web search by leveraging users’ social media data.
Article
Portable devices are increasingly employed in a wide range of mobile guidance applications. Typical examples are guides in urban areas, museum guides, and exhibition space aids. The demand is for the delivery of context-specific services, wherein the context is typically identified by a combination of data related to location, time, user profile, device profile, network conditions and usage scenario. A context-aware mobile guide is intended to provide guidance services adjusted to the context of the received request. The adjustment may refer to tailoring the user interface to the perceived context, as well as delivering the right type of information to the right person at the right time and the right location. It may also refer to intermediary adaptation, as in the case of mobile multimedia transmission. This paper offers a taxonomy of mobile guides considering multiple criteria. The taxonomy considers several aspects of the mobile applications space, including context awareness, client architectures, mobile user interfaces, as well as offered functionalities, highlighting functional, architectural, technological, and implementation issues. Existing implementations are classified accordingly and a discussion of research issues and emerging trends is offered.
Article
As the volume and variety of information sources continues to grow, there is increasing difficulty with respect to obtaining information that accurately matches user information needs. A number of factors affect information retrieval effectiveness (the accuracy of matching user information needs against the retrieved information). First, users often do not present search queries in the form that optimally represents their information need. Second, the measure of a document’s relevance is often highly subjective between different users. Third, information sources might contain heterogeneous documents, in multiple formats and the representation of documents is not unified. This paper discusses an approach for improvement of information retrieval effectiveness from document databases. It is proposed that retrieval effectiveness can be improved by applying computational intelligence techniques for modelling information needs, through interactive reinforcement learning. The method combines qualitative (subjective) user relevance feedback with quantitative (algorithmic) measures of the relevance of retrieved documents. An information retrieval is developed whose retrieval effectiveness is evaluated using traditional precision and recall.