ArticlePDF Available

A Context-Adaptive Ranking Model for Effective Information Retrieval System

October 2018

October 2018

DOI:10.5923/j.ijis.20180801.01

Authors:

Norwegian University of Science and Technology

When using Information Retrieval (IR) systems, users often present search queries made of ad-hoc keywords. It is then up to information retrieval systems (IRS) to obtain a precise representation of user's information need, and the context of the information. Context-aware ranking techniques have been constantly used over the past years to improve user interaction in their search activities for improved relevance of retrieved documents. Though, there have been major advances in context-adaptive systems, there is still a lack of technique that models and implements context-adaptive application. The paper addresses this problem using DROPT technique. The DROPT technique ranks individual user information needs according to relevance weights. Our proposed predictive document ranking model is computed as measures of individual user search in their domain of knowledge. The context of a query determines retrieved information relevance. Thus, relevant context aspects should be incorporated in a way that supports the knowledge domain representing users' interests. We demonstrate the ranking task using metric measures and ANOVA, and argue that it can help an IRS adapted to a user's interaction behaviour, using context to improve the IR effectiveness.

Ranking performance graph results at the known relevant documents

…

Showing values of 4.74 at F 0.05, 2, 4.74 It is noted that there are presently the value of K = 3 domains, that is, Domains 1, 2, and 3. Therefore, DOF N = K-1 = 3-1 = 2. The sum total of data for all the three domains depicted as 10 + 10 + 10 = 30. Using the DOF D = N-K = 10-3 = 7 and α = 0.05 (the least significant value). The critical value if F 0.05 , 2, 7 = 4.74 (determined using F-Distribution table). We need to find: = mean of mean = ∑ M SB = ∑ and M SW = ∑ The mean of mean was determined as follows: = ∑ = 268+177+202 = 647/30 = 21.6 The mean for each domain are evaluated as follows: Domain 1 = ∑ = 268/10 = 26.8 Domain 2 = ∑ = 177/10 = 17.7 Domain 3 = ∑ = 202/10 = 20.2 The variance for each domain is evaluated as follows:

…

Figures - uploaded by Eniafe Festus Ayetiran

Content may be subject to copyright.

Content uploaded by Eniafe Festus Ayetiran

Content may be subject to copyright.

International Journal of Information Science 2018, 8(1): 1-12

DOI: 10.5923/j.ijis.20180801.01

A Context-Adaptive Ranking Model for Effective

Information Retrieval System

Kehinde Agbele*, Eniafe Ayetiran, Olusola Babalola

Department of Mathematics and Computer Science, Elizade University, Ilara-Mokin, Nigeria

Abstract When using Information Retrieval (IR) systems, users often present search queries made of ad-hoc keywords. It

is then up to information retrieval systems (IRS) to obtain a precise representation of user’s information need, and the context

of the information. Context-aware ranking techniques have been constantly used over the past years to improve user

interaction in their search activities for improved relevance of retrieved documents. Though, there have been major advances

in context-adaptive systems, there is still a lack of technique that models and implements context-adaptive application. The

paper addresses this problem using DROPT technique. The DROPT technique ranks individual user information needs

according to relevance weights. Our proposed predictive document ranking model is computed as measures of individual user

search in their domain of knowledge. The context of a query determines retrieved information relevance. Thus, relevant

context aspects should be incorporated in a way that supports the knowledge domain representing users’ interests. We

demonstrate the ranking task using metric measures and ANOVA, and argue that it can help an IRS adapted to a user's

interaction behaviour, using context to improve the IR effectiveness.

Keywords Context-awareness, Information retrieval, DROPT technique, Information relevance

1. Introduction

Recent years have witnessed ever-growing amount of

online information. The development of the World Wide

Web (WWW) led to increase in the volume and diversity of

accessible information. The question that now arises is how

access to this information can be effectively supported.

Users require the assistance of tools aimed to locate

documents that satisfy their specific needs. Information

retrieval (IR) concerns searching documents for information

that meet a user need. Traditionally, document

representations are expressed by extracting meaningful

keywords (index terms) from the documents in the form of

a cross-reference lookup. When the user sends a search

request, a representation of his/her information need will

also be expressed in the same manner. Then the user query

and the representation of the document will be matched

according to specific matching conditions. Results are

presented to the user in a form of a ranked list that contains

the most relevant documents. Most of the documents that

are retrieved however are irrelevant to the user because

search engines cannot determine the user context. Diverse

IR models have been developed for this purpose.

* Corresponding author:

Kehinde.agbele@elizadeuniversity.edu.ng (Kehinde Agbele)

Published online at http://journal.sapub.org/ijis

This work is licensed under the Creative Commons Attribution International

License (CC BY). http://creativecommons.org/licenses/by/4.0/

Ideally, the relevance of documents should be defined

based on user context. Thus, the problem of ranking of

retrieved documents should be based on user context and

preferences. Relevance is a standard measure utilized in IR

to evaluate effectiveness of an IR system based on the

documents retrieved. The concept of relevance, however, is

one that is subjective and influenced by diverse factors. To

this end, user perception and user knowledge level are

factors that influence the relevance of a retrieved document.

Therefore, there has been a paradigm shift from a view of

relevance as simple term matching between query and

document, to a view of relevance as a cognitive and

dynamic process involving interaction between the

information user and the information source. It is important

for IR systems to obtain accurate representations of

users‘ information needs and context of information need.

Hence, search knowledge encompasses a wide variety of

aspects of the search, such as the interaction mode by users.

A context refers to the environment around a user that

reflects or affects the user's search goal. Web search

personalization is the process that allows a search engine to

adapt the search results to user's specific goal by integrating

user's context information beyond the query provided. The

goal of context information is to determine what a user is

trying to accomplish. We propose a solution to this problem

to quantify the context of retrieved information. The

technique aims to avoid the drawback of manually scanning

through and selecting from a long list of documents. We

also apply context-awareness to reformulate queries in

2 Kehinde Agbele et al.: A Context-Adaptive Ranking Model for Effective Information Retrieval System

order to improve the predicted relevance of retrieved

documents.

The rest of the paper is organised as follows: Section 2

presents the background and related work. Section 3

describes the context-adaptive IRS model. Sections 4

describes the DROPT technique while Section 5 describes

the experimental design. Sections 6 and 7 present the results

of the experiments. Section 8 presents the statistical

analysis results and discussions. Section 9 concludes the

paper.

2. Background and Related Work

One of the key drivers and developments towards creating

personalized solutions that support context-adaptive systems

has been the results from research work in personalization

systems. The main indication derived from these results

showed that it was very difficult to create generic

personalization solutions, without in general having a large

knowledge about the particular problem being solved. These

seemed to result in either a very specialized or a rather

generic solution that provided very limited personalization

capabilities. In order to address some of the limitations of

classic personalization systems, researchers have looked to

the new emerging area defined by the so-called

context-aware applications and systems (Abowd et. al., 1997

and Brown et. al., 2007).

The term context and context-awareness, denotes a

general class of systems that can sense a continuously

changing physical environment and provide relevant

services to users on this basis Dey, (20011). The definitions

of context are varied, from the surrounding objects within an

image, to the physical location of the system's user. The

definition and treatment of context varies significantly

depending on the application of study (Edmonds, 1999).

Context in information retrieval has also a wide meaning,

going from surrounding elements in an XML retrieval

application (Arvola et. al., 2005), recent selected items or

purchases on proactive information systems (Billsus et. al.,

2005), broadcast news text for query-less systems (Hezinger

et al., 2003), recently accessed documents (Bauer and Leake,

2001), visited Web pages (Sugiyama et al., 2004), past

queries and clickthrough data (Bharat 2003; Dou et. al., 2007;

Sugiyama et. al., 2004; Shen et. al., 2005), text surrounding a

query (Finkelstein et. al., 2001), text highlighted by a user

(Finkelstein et. al., 2001), recently accessed documents

(Bauer and Leake, 2001)etc.

Context-aware systems can be classified by 1) the concept

the system has for context, 2) how the context is acquired, 3)

how the context information is represented and 4) how the

context representation is used to adapt the system. One of the

most important parts of any context-aware system is the

context acquisition. Note that this is conceptually different to

profile learning techniques, context acquisition aims to

discover the short-term interests (or local interests) of the

user (Dou et. al., 2007; Sugiyama et. al., 2004; Shen et al;

2005), where the short-term profile information is usually

disposed once the user's session is ended. On the other hand,

user profile learning techniques do cause a much great

impact on the overall performance of the retrieval system, as

the mined preferences are intended to be part of the user

profile during multiple sessions.

One simple solution for context acquisition is the

application of explicit feedback techniques, like relevance

feedback (Rocchio and Salton, 1971 and Salton and

Buckley, 1988). Relevance feedback builds up a context

representation through an explicit interaction with the user.

In a relevance feedback session: 1) The user makes a query.

2) The IR system launches the query and shows the result set

of documents. 3) The user selects the results that considers

relevant from the top n documents of the result set. 4) The IR

system obtains information from the relevant documents,

operates with the query and returns to 2). Relevance

feedback has been proven to improve the retrieval

performance. However, the effectiveness of relevance

feedback is considered to be limited in real systems,

basically because users are often reluctant to provide such

information [Sugiyama et al., 2004], this information is

needed by the system in every search session, asking for a

greater effort from the user than explicit feedback techniques

in personalization. For this reason, implicit feedback is

widely chosen among context-aware retrieval systems (Kelly

and Teevan, 2002; Shen et al., 2005; White and Kelly, 2006).

Based on this fundamental definition, various authors

(Emmanouilidis et. al, 2013; Jara et. al, 2013; Noh et. al,

2012 and Xu and Deng 2012) focus on different aspects of

context-awareness, including modelling interactions

between users and IR systems nature, and how to modelling

context. The research reported in Nyongesa and

Maleki-Dizaji (2006) showed that based on preferences of

users, genetic algorithms (GA) could be applied to improve

the search rresults. Similarly, the work reported in Koorangi

and Zamanifar (2007) proposed improvement of internet

engines using multi-agent systems. In this work, a

meta-search engine gives a user documents based on an

initial query while a feedback mechanism returns to the

meta-search engine the user’s suggestions about retrieved

documents.

In Allan (2002), contextual information retrieval (CIR) is

defined as: "combine search technologies and knowledge

about query and user context into a single framework in

order to provide the most appropriate answer for user's

information needs". CIR intends to optimize the retrieval

accuracy by involving two related steps: appropriately

defining the context of user information needs, commonly

called search context, and then adapting the search by taking

it into account in the information selection process.

Several studies have addressed context specification

within and across application domains (Jara et. al, 2013;

Dinh and Tamine 2012; Kebler et. al, 2009; Goker and

Myrhaug, 2008; Vieira et. al, 2007). Device, user, task,

document and spatio-temporal are the five context specific

dimensions that have been explored in context-based

information retrieval literature (Emmanouilidis et. al, 2013;

International Journal of Information Science 2018, 8(1): 1-12 3

Dinh and Tamine 2012; Li et. al, 2011; Asfari et. el, 2009;

Mylonas et. al, 2008; Anand and Mobasher, 2007; Maeco et.

al, 2013; Lukowic et. al, 2011; Zhou et. al, 2012).

In Shen et. al., (2005) proposed a ranking technique for

multi-search projections on the Web for results aggregation

model based on query words, search results, and search

history to achieve user’s intention. To this end the Web can

offer a rich context of information which can be expressed

through the relevancy of document contents. In Shivaswamy

and Joachims (2011) proposed a model for online learning

that is specifically adequate for user feedback. The

experiment conducted shown retrieval effectiveness for web

search ranking. In the context of web search ranking, these

techniques aim at finding the best ordering function over the

returned documents is important. The authors argue that,

regression on labels may be adequate and, indeed,

competitive in the case of large numbers of retrievals. To

make the web more interesting, there is need to develop a

good and efficient ranking algorithm to deliver more suitable

results for users.

Agbele [2014] developed and coined the acronym DROPT

(Document Ranking OPTimization) to name a new adaptive

algorithm that provides a limited number of ranked

documents in response to a given query. The author argue

that, it can improve the ranking mechanism for the search

results in an attempt to adapt the retrieval environment of the

users and amount of relevant context-aware information

according to each user’s request. The DROPT measure must

be self-learning that can automatically adjust its search

structure to a user’s query behaviour. The DROPT technique

is employed in this paper to improve the retrieval

effectiveness based on the user interaction behaviour as

depicted in Figure 1.

3. Context-Adaptive for IR System

Context-adaptive IR requires an adaptation of the

processed information with respect to the individual users. It

depends on the user’s personal context-adaptive whether a

user blog article is worth reading with respect to the user’s

expectations and abilities. We are thus looking for a

workflow to enable how users can judge context changes for

adaptive retrieval based on the user profile. One major

problem of most current IR system is that they provide

uniform access and retrieval of IR results to all users

specially based on the query terms users entered to the

system.

To address these issues we propose a context-adaptive IR

model based on document preferences as search context to

rank individual users results effectively and the behaviour

that individual user has engaged in during the matching tasks.

The idea of context-adaptive is to predict relevant ranked

documents according to relevance weights. This

demonstrates a search context from search engine by

observing and analysing user behaviour (i.e. keyword

matching based querying frequency). The workflow of the

design and evaluation of this proposed context-adaptive IR

model is shown in Figure 1(see Appendix A). We generate

two user predictive models about document ranking: 1) a

predictive user model of the relevance of document content;

2) a predictive user model of ranking for currently retrieved

documents. We believe this model (Table 1) can enhance

individual user’s system retrieval performance greatly.

Table 1. Predictive document ranking model (PDRM) for user model

preference

Description of document

ranking model

Document

content

context

Can model predict

documents

relevance?

Predicted to adapt current

retrieved documents for

ranking tasks.

Relevant

Yes

Predicted to perform initial

queries reformulation but

ignored if found to be

irrelevant later.

Irrelevant

Not yet

The predictive user model generated data analysis by

individual users knowledge domain, while interacting with

the search engine in which ranking of retrieved document

has been controlled independently. By analyzing the

statistical associations between measures of user behaviour

and their judgments of document relevance, we create a

predictive user model of document relevance by assigning a

numerical weight to each retrieved document and ranking of

retrieved document, we can get a predictive user model of

current search context (relevant or irrelevant). Ranking of

retrieved documents could influence user’s context because a

user indicates documents that are relevant and otherwise

according to relevance weights. The problem at hand is thus

to find IR mechanism that allows for adaptive context-aware

IR. Agbele (2014), developed a Document Ranking

OPTimization (DROPT) technique and is employed in this

present paper to enable context-adaptive IR as illustrated in

Figure 1.

The purpose of predicting document ranking for IR system

in this paper is to adapt retrieved documents to individual

users during their search context, rather than after they finish

the entire document ranking tasks. So, the measures of user

behaviour context, which can be immediately noticed is

based on calculating the weight of keywords in the document

index vectors, calculated as a function of the frequency of a

keyword across a document should be the main sources to

predict ranking of retrieved documents according to

relevance weights. The work reported in Li and Belkin (2008)

identified task type in human information behaviour as

contextual factors to influence the way users search for

information. We apply context-awareness in this paper as a

technique to reformulate original user’s queries in order to

improve the predicted relevance of retrieved documents.

Also by reformulating a query we could not only increase the

number of relevant documents but also rank the candidate

documents. Therefore, user context is any relevant

information that can be used to characterize the situation of a

4 Kehinde Agbele et al.: A Context-Adaptive Ranking Model for Effective Information Retrieval System

user, such as where the user is, whom the user is with and

what resources are available to the user.

Before the current retrieved document is predicted from

individual users’ behaviours context, the predictive user

model of document relevance is calculated as measures of

individual user search in their domain of knowledge; once

the retrieved document is predicted from the model, and then

the system can activate predictive model of document

relevance for ranking task. This demonstrates how the

predicted relevance documents can be used to assist users

reformulate their initial queries to better understand users’

current information needs by user preferences. To adapt

search results means to explicitly make use of the user

preferences to tailor search results in their knowledge

domain. The next section describes the DROPT technique.

4. DROPT Technique

This section describes the document ranking technique for

context-aware IR known as a document ranking optimization

(DROPT) according to information relevance. A document

ranking technique is an algorithm that tries to match

documents in the corpus to the user, and then ranks the

retrieved documents by listing the most relevant documents

to the user at the top of the ranking. Unfortunately, despite

the exposure of individual users to domain of Web retrieval

and online documentation systems with document ranking

features; it rarely addresses the information relevance of

ranked output as core issue.

4.1. Parameters Used for Ranking Principles

In this sub-section we study the problem of ranking of

retrieved documents. For example, we desire to rank a set of

scientific articles such that those related to the

query ’information retrieval’ are retrieved first. The basic

assumption we make is that such a ranking can be obtained

by a weighting function

)( idftfw

which conveys to us

how relevant document d is for query q. The document

ranking will be done by taking a weighted average of all

determined parameters. Table 2 depicts the summary of

notations.

Table 2. Summary of ranking notations

Parameters Name

Description

indexed document

i-th query vector

( , )qd

document-query pair

()w D Q

convolution matrix

()w tf idf

weighting function

term frequency

idf

index term frequency

max( )

i i j

Val t

maximum relevance weight value added

to matrix G

 

D d if val

documents sorted in ascending order of

relevance value

 

0,1V

relevance numerical weight values

normalization interval

ij n l

Gg





query vector defined as a matrix G









weighted root mean square (RMS) to

determine the overall relevance fitness of

all documents with respect to a given

query

number of queries for self-learning

size of the corpus

Weights of terms in the document vectors

4.2. Formalization of Mathematical Model Definitions

This optimization of IR is obtained by ranking the

documents according to a relevance numerical weight value

()w tf idf

which is obtained from the weighting function

w in descending order. Then we wish to return a relevance

numerical weight subset

such that for each

dD

, we optimize the following weighting function:

()w tf idf

(1)

According to equation (1), a DROPT measure for

documents retrieved from a corpus is developed with respect

to document index keywords and the query vectors. This

mathematical model definition is based on calculating the

weight (wij) of keywords in the document index vector,

calculated as a function of the frequency of a keyword

across a document

The DROPT technique is based on IR result rankings,

where a ranking R consists of an ordered set of ranks. Each

rank consists of a relevance numerical weight value

 

1,0



where v represents the relevance numerical weights of the

retrieved documents. Each rank is assigned an ascending

rank number n, such that:

   

 

1, , 2, ,..., , n

R v v n v





(2)

Where

vvv  ...

Our technique, DROPT is composed of six steps.

Step 1: Initialization of Parameters

(a) Let a query vector, Q, be defined as:

 

1 2 3

, , ,... l

Q q q q q

(3)

where,

  ,  being a term string with a weight of 1.

(b) Let the indexed document corpus be represented by

the matrix:

International Journal of Information Science 2018, 8(1): 1-12 5

(4)

where,     being an index string, with

weight  .

simple multiplication of the document vectors and the

query vectors representing:

W = DQ = (5)

    is equal string ignore case   , where

 are query vectors,  are document vectors,  are

weights of terms in the document vectors, and  are

weights of terms in the query vectors, while n is the number

of retrieved documents that are indexed by at least one

keyword in the query vector. The matrix W gives a numeric

measure with no context information.

Step 2: Search String Processing

The comparison of the issued query term against the

document representation is called the query process. The

matching process results are a list of potentially relevant

context information. Individual users will scrutinize this

document list in search of the information they needs.

Step 3: Calculate Relevance Weight

Retrieved documents that are more relevant are ranked

ahead of other documents that are less relevant. It is

important to find relevance numerical weights of the

retrieved documents and provide a ranked list to the user

according to their information requests.

(a) Based on equation (1), the relevance weight is

obtained according to document content.

(b) Subsequently we calculate the average mean weight

using the weighted root mean squares (RMS) to

determine the overall fitness value of retrieved

documents with respect to a given query calculated

as:

nlij

l







(6)

where,

is the average relevance mean weight of each

retrieved document, n is the number of keywords terms

occurrences in each retrieved document, l is the total size of

the keywords in the corpus, and wij are the sum weights of

terms of the document vectors.

Step 4: User Feedback about Retrieved Documents

User feedback about retrieved documents is based on

overall relevance weights  to construct a personalized

user profiling of interests. We can achieve this when a user

indicates the documents that are relevant or otherwise, from

the designated databases context.

(a) The overall relevance judgment is given by:

ij nl

Gg





(7)

where,    and 1 ≤ i ≤ n, 1 ≤ j ≤ l and G is

a query vector with a small-operator defined as a matrix,

 are weights of terms of the document vectors, and 

are queries vectors. Any numerical weight component of

matrix G greater than the average mean weight,   (6)

will be retained to add to a matrix T given by:

(8)

where,

(b) Based on matrix T (8) we calculate relevance

numerical weight values, for all set of documents D,

which are the largest weighting values for each

corresponding vector given by:

max{ },1

i ij

Val t i n

i j l

  



(9)

val

was higher

than the overall average relevance weight would be

predicted as a relevant document; any document with

a lower value would be predicted as irrelevant

document (9). Thus average relevance mean value

within the normalization interval    is

computed for each document given by:

        (10)

Step 5: Relevance Judgment

The individual user is asked to judge contextual factor (e.g.

information relevance) influence on ranking given a certain

contextual dimension (numerical weight is relevant or not).

(a) If the ranked document is relevant to user information

needs, the user finishes his/her query search context,

then GO to Step 4 according to the user’s document

preference.

(b) Otherwise, the user continues to search the document

databases by reformulating the query or stop querying

the designated database until relevant documents are

ranked. GO to Step 6.

Step 6: Update Term Weight and Keywords Set

The keyword term set n provided by the ranked documents















NjNNN

dddd







321

2232221

1131211













wwww







321

2232221

1131211

 

lnij

tT 



ljni

gift

gifgt

ijij

ijijij 













 1,1



6 Kehinde Agbele et al.: A Context-Adaptive Ranking Model for Effective Information Retrieval System

and the relevance numerical weight values will be updated

by user feedback.

(a) Any new query term not belonging to n will be added

and a new column of relevance weight value will be

computed and expanded for ranked documents

routinely.

(b) If any ranked document di is retrieved by the users,

the corresponding relevance weight values with

respect to the query keywords will be increased

by (11). The default of β is set to increase the

corresponding relevance numerical weight values.

 

ij ij





(11)

where,

         and   

We coined the acronym DROPT to name our adaptive

algorithm that provides a limited number of ranked

documents in response to a given query. Also it can improve

the ranking mechanism for the search results in an attempt to

adapt the retrieval environment of the users and amount of

relevant context information according to each user’s request.

Finally, the DROPT measure must be self-learning that can

automatically adjust its search structure to a user’s query

behaviour.

5. Experimental Design

The experiment was designed to study a new user’s

behaviour source i.e. ranking of retrieved documents that can

influence the information retrieval process. Though

considering user searching actions (i.e. clicking on a

document in a search result, printing a document, moving a

document into a folder, etc.) as sources for implicit relevance

of documents, the techniques presented in this paper is

different because it considers document ranking. From that

view, the techniques is interesting and innovative as it

emphasizes that the IR process is not just about matching

between documents and queries but relationships among

matching, user actions and user preferences in ranked

documents of retrieved results. The experiment was designed

and piloted using systems that allows interactive information

retrieval (IIR) experiments that log users ‘in different

browsers interactive search behavior. The system has a

search engine where tables are created for experimental

generated data from searching tasks. The systems were used

to determine the frequency of keyword matching-based

querying results to monitor the progress of the experiment.

They performs several information related tasks activities

such as searching, filtering, matching, displaying, and

learning information needs over time. This is concerned with

the reuse of the existing standards, approaches, and how to

incorporate them into the design of the IR system. During the

search, the participant interactions with the search engine

were logged via the system log in menu. In each search task,

the participants were asked to obtain the frequency of

keyword matching based querying across a document; that

were relevant to meet their information requests. The

behavioral measures we examine are the frequencies of the

user issued query (i.e. frequency of keyword matching based

querying) while interacting with the IR system.

We involved three system users (Master students) in the

area of Computer Science in the Department of Computer

Science to collect data through the WampServer search

engine back end prototype. The three study system user

participants were given 10 search tasks each in their domain

of knowledge. During the search context, the students’

interactions with the search engine back end prototype were

logged via the system log in menu with their "student

identification number". In each task, the students were asked

to obtain the frequency of keyword matching based querying

across a document that were relevant to meet their

information requests to achieve document ranking task based

on individual users’ preference, or ignore documents that

were found to be irrelevant. The user behavioural measures

we examine are the frequencies of the issued query. The

function of the frequency of the keyword across a document

from the document database collected is stored in the

WampServer site localhost database. WampServer is a

Windows Internet environment that allows user to create

Internet applications with Apache 2, PHP and a MySQL

database. PHP Myadmin allows user to manage easily our

databases. This measure was used to predict the ‘relevant”

documents marked ‘X” for document ranking model. To

evaluate the performance of the proposed technique, we

performed an experiment on small scale search of different

30 queries from the system users to validate the effectiveness

of the technique. Table 3 gives the statistics of the queries

considered in the experiment. The personalized predictive

ranking model identifies retrieved documents to individual

user from the domains according to his/her preferences.

6. Ranking Performance Results

With the intention of measure ranking performance, the

DROPT technique, according to Agbele (2014) for ranking

search results list was tuned by experimenting with the

prototype system for relevance judgment. In this paper, each

query produced a document based on the matching

conditions and the retrieval was repeated for 10 query

reformulations from the domain of system user experts. The

underlying philosophy of the relevance judgment rules for

user model judgment using the DROPT technique is to rank

those documents, which exceeded the overall weighted

fitness score that the system user judges to be relevant to

his/her information needs, and ignore those documents the

system users judge to be irrelevant (less preferred).

International Journal of Information Science 2018, 8(1): 1-12 7

Figure 2. Ranking performance graph results at the known relevant documents

7. Comparison of DROPT Technique

with TF-IDF Method

In this section, we present the results that show the

performance of our DROPT technique against a traditional

tf-idf method. We compared our ranking algorithms with

selected well-known baseline algorithms such as TF-IDF to

evaluate the performance of our ranking technique in

standard "Precision at position n" (P@n) measure. For the

information needs and document collection of the

experiment, relevance was assessed by different system

users in their domain of experts. They are knowledgeable in

their domain and were asked to judge the relevance of the

retrieved documents on a six level scales: (0=Harmful,

1=Bad, 2=Fair, 3=Good, 4=Excellent and 5=Perfect) with

respect to a given query. For comparison of results, we have

used P@n metrics Jarvelin and Kekalainen (2010). Precision

at n measures the relevancy of the top n results of the ranking

list with respect to a given query according to equation (12).

P@n=No. of relevant document in top n results / n… (12)

P@n can only handle cases with binary judgment

“relevant” or “irrelevant” with respect to a given query at

rank n. To compute P@n, 30 queries were judged in these six

levels by users.

The test process involved using the 30 queries provided by

the system users. The measure (P@n) is used for the

evaluation. Naturally, this is computed for each query, and

then takes the average dimension (n) for all queries. Fig. 3

shows the comparison of the DROPT algorithm with other

algorithms in the P@n measure. As the figure shows, our

adaptive algorithm outperforms TF-IDF model. The DROPT

algorithm achieves a 28% in P@n compared to TF-IDF. The

empirical results have been compared with the traditional

relevance feedback model. It shows that the precision value

of the DROPT ranking technique is comparatively higher for

all the query sets. This achievement resides in the

combination of context-based algorithms using user

preferences for query reformulations. In this regard, the

number of top n results showed to users depicts the relevancy

degree of the retrieved documents with respect to a given

query with rank n (judged by the system users).

Table 3. Precision Results from the 3 Domains of Expert for Ranking at

Known Relevant Documents

Document#

Queries

Relevant

Precision

Fitness Score

0.000

0.37

0.500

0.90

0.000

0.73

0.500

0.93

0.000

0.73

0.500

0.90

0.571

0.93

0.625

0.83

0.667

0.83

Q10

0.700

0.83

Q11

0.727

0.87

Q12

0.000

0.57

Q13

0.692

0.90

Q14

0.714

0.87

Q15

0.000

0.67

Q16

0.688

0.90

Q17

0.706

0.80

Q18

0.000

0.70

Q19

0.000

0.47

Q20

0.000

0.40

Q21

0.000

0.57

Q22

0.591

0.93

Q23

0.609

0.87

Q24

0.625

0.93

Q25

0.640

0.87

Q26

0.000

0.53

Q27

0.630

0.93

Q28

0.643

0.93

Q29

0.655

0.93

Q30

0.000

0.73

Average

0.631

0.75

8 Kehinde Agbele et al.: A Context-Adaptive Ranking Model for Effective Information Retrieval System

The corpora were manually built with minimal number of

documents for evaluation purposes. For easy evaluation and

scalability issues, we use our manually built corpus to

evaluate the effectiveness of our DROPT technique. The

empirical results have been compared with the traditional

relevance feedback model. In future, we intend to perform

100 queries reformulation and compared with other

well-known standards in TREC.

Figure 3. Ranking performance graph results at the known relevant

documents

8. Statistical Analysis and Discussion

Agbele et. al (2016) presented the DROPT algorithm

results and extended in this present paper by performing

statistical analysis using ANOVA on 30 queries.

Significance test interpretation was carried out in this

research study with the purpose of measuring the

effectiveness of IR system using interactive reinforcement

learning (user’s feedback and context-awareness) in

comparison to relevance feedback. The test was established

to reject the null hypothesis, H0 that there is difference

between the group means of Domain of system user

participants 1, 2, and 3. Rejecting H0 infers accepting the

alternative hypothesis; H1 with at least one of the means is

different from others in retrieval efficacy in order to improve

the system performance.

Since F-statistical table falls to the left of F-distribution

(5.19 > 4.74) under the acceptance region. Therefore we may

conclude at a 5% level of significance test that there is a

significant difference in the means of at least one group of

Domains 1, 2, and 3. This is because the values of ad-hoc

keywords matched against documents that were searched

independently across each of the domains of system user’s

participants and the corresponding values of occurrences of

issued query were obtained. The interpretation of this

statistical result demonstrates the improvement of

information retrieval efficacy through the attributes from the

user behaviour actions while interacting with the IR system.

Our results on the indexed ad-hoc keywords represent

domain of the system user’s three participants in an in-lab

experimental setting. The results demonstrate that combining

individual system user’s behavioral measures can improve

ranking prediction accuracy (according to relevance

weights), for documents ranking tasks, and however that

individual users ranking performed much better than

combining document rankings of the systems. This

accomplishes personalization of retrieved documents for

individual users as the focus of this paper. The retrieval

effectiveness is measured using well known metrics

Precision and Recall, at known relevant documents.

Definitions:

Let MSB depicts variance between the three domains

considered in this study.

Let MSW depicts variance within the three domains

considered in this study.

In order to evaluate both the means and standard

deviations of the keyword matching based querying

experiments, we construct hypothesis test based on the

values obtained across all issued queries after 30 generations

(10 search tasks from each participant domain) using

Analysis of Variance (ANOVA).

H0:



1 =



2 =



3 where 1, 2, and 3 are domains

considered in this study.

H1: At least one of the means is different from the others.

Figure 4. Showing values of 4.74 at F 0.05, 2, 4.74

It is noted that there are presently the value of K = 3

domains, that is, Domains 1, 2, and 3. Therefore, DOFN =

K-1 = 3-1 = 2. The sum total of data for all the three domains

depicted as 10 + 10 + 10 = 30.

Using the DOFD = N-K = 10-3 = 7 and α = 0.05 (the least

significant value). The critical value if F0.05, 2, 7 = 4.74

(determined using F-Distribution table).

We need to find: = mean of mean = ∑

MSB = ∑ and MSW = ∑

The mean of mean was determined as follows:

∑

= 268+177+202 = 647/30 = 21.6

The mean for each domain are evaluated as follows:

Domain 1 = ∑ = 268/10 = 26.8

Domain 2 = ∑ = 177/10 = 17.7

Domain 3 = ∑ = 202/10 = 20.2

The variance for each domain is evaluated as follows:

F0.05, 2, 7 = 4.74

0.9

Rejection region

α = 0.05

F-Distribution

International Journal of Information Science 2018, 8(1): 1-12 9

Domain 1 = 228.9/10 = 22.89

Domain 2 = 154.5/10 = 15.45

Domain 3 = 200.01/10 = 20.01

Mean of mean ∑ = (268+177+202)/30 = 21.6

Also MSB = ∑ could be determined as follows:



ni (x

Domain1







ni (x

Domain2







ni (x

Domain 3



/ K



MSB = 10(26.8-21.6)2 + 10(17.7-21.6)2 + 10(20.2-21.6)2

/3-1 = 442.1/2 = 221.05

MSB = 221.05

Also, MSW = ∑

MSW = (10-1) Domain1 + (10-1) Domain 2 + (10-1)

Domain 3 (10-1) / N-K

Domain 3/30-3= 9(9.94) +9(12.73) +9(10.45)/7=298.08/7

MSW = 42.58

Therefore, the test statistics is F = MSB/MSW

= 221.05/42.58 = 5.19

9. Conclusions

Using adaptive IR system, situations can be detected and

classified as contexts. Once the proposed system has

recognized in which context an interaction takes place, this

information can be used to change and adapt the behaviour of

IR applications and systems. One has to keep in mind that

users learn how to interact with the system, and that they

adapt their behaviour. So, it is crucial to develop

understandable context-aware IR system that adapts to the

users’ expectations. In line with this, well-designed

context-awareness is a great and powerful way to make

user-friendly and enjoyable IR applications.

User interactive behavior measures on relationships

among matching help understand how users interact on the

clicked documents in response to a given query, and they are

indicative of document relevance. Also, user interactive

behaviours measures during user actions help describe what

the user does between issuing one query and the next. User

interactive behaviours about user preferences help

understand how to acquire search results. This in turn could

improve the information retrieval effectiveness. The adapted

search results means to explicitly make use of the user

context to tailor search results.

Our results demonstrate a significant effect of document

ranking on predictive ranking model according to document

relevance. Document ranking not only affected the user

interactive behaviour as predictors of document relevance, it

also affected the relevance weights for each of the user

interactive behaviours to improve IR effectiveness. In

addition, when document information is available, the

ranking model gives better prediction of document relevance.

Therefore, we can conclude that it is important for adapted

IR systems to detect the context in which a search is

conducted, especially the document ranking, and then to

apply the user model to adapt search results to individual

users. Also document ranking influenced how users

interacted with search systems during search sessions. The

interpretation of the statistical results using ANOVA

demonstrates the improvement of information retrieval

effectiveness through the attributes.

A DROPT technique has been evaluated to reflect how

individual user judges the context changes in IR from the

user behaviour actions while interacting with the IR system

results ranking. Predictive user model of document ranking

were presented to adapt retrieved documents to individual

users during their search context, rather than after they finish

the entire ranking tasks.

ACKNOWLEDGEMENTS

The authors would like to thank Elizade University

Management for funding this research project.

10 Kehinde Agbele et al.: A Context-Adaptive Ranking Model for Effective Information Retrieval System

Appendix A

REFERENCES

[1] A. K. Dey, (2001): Understanding and Using Context.

Personal Ubiquitous Computing, vol. 5, no. 1, pp: 4-7.

[2] C. Emmanouilidis, R-A. Koutsiamanis, A. Tasidou, (2013):

Taxonomy of architecture, context-awareness, technologies

and applications. Journal of Network and Computer

Applications, Vol. 36, pp. 103-125.

[3] A. J. Jara, P. Lopez, D. Fernandez,. F. Cashlo, M. A. Zamora,

and F. Skarmeta, (2013): Mobile discovery: discovering and

interacting with the world through the Internet of things.

Journal of Personal and Ubiquitous Computing, Published

Springer-Verlag London.

[4] H. Y. Noh, J. H. Lee, K. S. Oh, and S. B. Cho, (2012):

Exploiting indoor location and mobile information for

context-awareness service. Journal of Information Processing

and Management,. Vol. 14, Issue 1, pp. 1-12.

[5] W. Xue, and H. Deng, (2012): Unstructured queries based on

mobile user context. International Journal of Pervasive

Computing and Communications, Vol. 8, Issue 4, pp.

368-394.

[6] H. O. Nyongesa, and S. Maleki-dizaji, (2006): User

modelling using evolutionary interactive reinforcement

learning. Inf Retrieval, vol. 9, no. 3, pp. 343-355.

DOI: 10.1007/s10791-006- 4536-3.

[7] M. Koorangi, K. Zamanifar, (2007): a Distributed Agent

Based Web Search using a Genetic Algorithm. International

Journal of Computer Science and Network Security, vol. 7, no.

1, pp. 65-76.

Estimate measures of

individual user’s behaviours

context in user matching tasks

Generate predictive

user models of

document ranking

Implement personalized adaptive IR algorithm in the

system

Evaluation of the IR

system

Observe individual user behaviour context

(keyword matching based querying)

Predictive user model of document ranking

Can it predict the

document ranking?

Perform initial queries

reformulation

Personalize retrieved

documents to individual

users

Rank retrieved documents based on individual user

preferences

Context-based

personalized IRS

Not yet

Yes

Predictive

user model

of the

relevance of

adaptive

document

documents

content

International Journal of Information Science 2018, 8(1): 1-12 11

[8] J. Allan, (2002): Challenges in information retrieval and

language modelling. Report of a workshop held at the

Centre for Intelligent Information Retrieval, University of

Massachusetts, Amherst, September 2002.

[9] D. Dinh, and L. Tamine, (2012): Towards a Context Sensitive

Approach to Searching Information based on Domain

Specific Knowledge Source. Web Semantics: Science

Services and Agents on the WWW, Vol. 12, pp. 41-52.

[10] C. Kebler, M. Raubal, and C. Wosniok, (2009): Semantic

Rules for Context-aware Geographical Information Retrieval.

In P. Barnaghi, editor, 4th European Conference on Smart

Sensing d Context, EuroSSC 2009, University of Surrey. Vol.

5741 of LNCS Springer, pp. 77-92.

[11] A. Goker, and H. Myrhaug, (2008): Evaluation of a mobile

information system in context. Information Process

Management, Vol. 44, no. 1, pp. 39-65.

[12] V. Vieira, P. Tedesco, A. C. Salgado, and P. Brézillon,

(2007): Investigating the specifics of contextual elements

management: the cemantika approach. Context, pp. 493-506.

[13] W. Li, D. Ganguly, G. J. F. Jones, (2011): Enhanced

Information Retrieval Using Domain-Specific Recommender

Models. In: Amati, G., Crestani, F. (eds.) ICTIR 2011. LNCS,

vol. 6931, pp. 201–212.

[14] O. Asfari, B. L. Doan, Y. Bourda, J. P. Sansonnet, (2009):

Personalized access to information by query reformulation

based on the state of the current task and user profile, In: The

Third International Conference on Advances in Semantic

Processing, Malta.

[15] P. Mylonas, D. Vallet, P. Castells, M. Fernandez, Y. Avrithis,

(2008): Personalized information retrieval based on context

and ontological knowledge. Knowledge Engineering Review,

Vol. 23, No. 1, pp. 73-100.

[16] S. S. Anand, and B. Mobasher, (2007): Introduction to

intelligent techniques for web personalization. ACM

Transactions on Internet Technology, Vol.7, no 4, pp. 18.

[17] D. S. Marco, N. Vidhya, E. Churchill, E. (2013): In

Proceedings of the SIGHI Conference on Human Factors in

Computing Systems, pp. 2487-2496.

[18] P. Lukowicz, A. S. Pentland, and A. Ferscha, (2011): From

Context Awareness to Socially Aware Computing, IEEE

Pervasive Computing, Vol. 11, No. 1, pp. 32-41.

[19] D. Zhou, S. Lawless, V. Wade, (2012): Improving search via

personalized query expansion using social media. Inf. Retr.

pp. 1–25. Doi: 10.1007/s10791-012-9191-2.

[20] X. Shen, B. Tan, C. Zhai, (2005): Implicit user modelling for

personalized search. In: 14th ACM International Conference

on Information and Knowledge Management (CIKM 2005),

pp. 824–831. ACM, Bremen.

[21] P. K. Shivaswamy, and T. Joachims, (2011): Online Learning

with Preference Feedback. In NIPS workshop on Choice

Models and Preference Learning.

[22] K. Agbele. (2014). “Context-Awareness for Adaptive

Information Retrieval Systems”, Unpublished PhD Thesis,

University of the Western Cape.

[23] Y. Li, and N. J. Belkin, (2008): A faceted approach to

conceptualizing tasks in information seeking. Information

Processing and Management, Vol. 44, No. 6, pp. 1822-1837.

[24] K. Jarvelin, and J. Kekalainen, (2010): IR evaluation methods

for retrieving highly relevant documents. Published in: Belkin,

N. J., Ingwersen, P. and Leong, M. K. (eds). In: Proceedings

of the 23rd Annual International ACM SIGIR Conference on

Research and Development in Information Retrieval. New

York, NY: ACM, pp. 41-48.

[25] K. K. Agbele, E. F. Ayetiran, K. D. Aruleba and D. O. Ekong,

"Algorithm for Information Retrieval optimization," 2016

IEEE 7th Annual Information Technology, Electronics and

Mobile Communication Conference (IEMCON), Vancouver,

BC, 2016, pp. 1-8. doi: 10.1109/IEMCON.2016.7746242.

[26] F. Liu, C. Yu, and W. Meng, (2014):”Personalized Web

Search for Improving Retrieval Effectiveness,” IEEE

Transactions on Knowledge and Data Engineering, vol. 16,

No.1, pp.28-40.

[27] K. Sugiyama, K. Hatano, and M. Yoshikawa, (2004):

Adaptive web search based on user profile constructed

without any effort from users. in Proceedings of the 13th

international conference on World Wide Web, (New York,

NY, USA, 2004), ACM, 675-684.

[28] P.J. Brown, J.D. Bovey, and X. Chen, (2007): Context-aware

applications: from the laboratory to the marketplace. Personal

Communications, IEEE], 4 (5). 58-64.

[29] G. D. Abowd, A.K. Dey, R. Orr, and J. Brotherton, (1997):

Context-awareness in wearable and ubiquitous computing. in

First International Symposium on Wearable Computers.

(ISWC 97), (Cambridge, MA, 1997), 179-180.

[30] T. Haveliwala, (2002): Topic-sensitive PageRank. in

Proceedings of the Eleventh International World Wide Web

Conference, (Honolulu, Hawaii, USA, 2002).

[31] K. Bharat, (2000): SearchPad: explicit capture of search

context to support Web search. in Proceedings of the 9th

international World Wide Web conference on Computer

networks: the international journal of computer and

telecommunications netowrking, (Amsterdam, The

Netherlands, 2000), North-Holland, 493-501.

[32] Z. Dou, R. Song, and J. Wen, (2007): A Large-scale

Evaluation and Analysis of Personalized Search

Strategies.pdf. in Proceedings of the 16th international World

Wide Web conference (WWW2007), (Banff, Alberta, Canada,

2007), 572-581.

[33] D. Billsus, D. Hilbert, and D. Maynes-Aminzade, (2005):

Improving proactive information systems. in IUI '05:

Proceedings of the 10th international conference on

Intelligent user interfaces, (San Diego, California, USA,

2005), ACM, 159-166.

[34] T. Bauer, and D. Leake, (2001): Real time user context

modeling for information retrieval agents. in CIKM '01:

Proceedings of the tenth international conference on

Information and knowledge management, (Atlante, Georgia,

USA, 2001), ACM, 568-570.

[35] M. Henzinger, B. Chang, B. Milch, and S. Brin, (2003):

Query-free news search. In WWW '03: Proceedings of the

12th international conference on World Wide Web, (Budapest,

Hungary, 2003), ACM, 1-10.

[36] White, R. and Kelly, D., A study on the effects of

personalization and task information on implicit feedback

12 Kehinde Agbele et al.: A Context-Adaptive Ranking Model for Effective Information Retrieval System

performance. in CIKM '06: Proceedings of the 15th ACM

international conference on Information and knowledge

management, (Arlington, Virginia, USA, 2006), ACM,

297-306.

[37] H. Kim, and P. Chan, (2003): Learning implicit user interest

hierarchy for context in personalization. in IUI '03:

Proceedings of the 8th international conference on Intelligent

user interfaces, (Miami, USA, 2003), ACM, 101-108.

[38] Finkelstein, L., Gabrilovich, E., Matias, Y., Rivlin, E., Solan,

Z., Wolfman, G. and Ruppin, E., Placing search in context:

the concept revisited. in World WideWeb, (2001), 406-414.

[39] J. Rocchio, and G. Salton: Relevance feedback in information

retrieval. Prentice-Hall, 1971.

[40] G. Salton, and C. Buckley, (1988): Improving retrieval

performance by relevance feedback. Journal of the American

Society for Information Science, 41 (4). 288-297.

[41] R. White, and D. Kelly, (2006): A study on the effects of

personalization and task information on implicit feedback

performance. in CIKM '06: Proceedings of the 15th ACM

international conference on Information and knowledge

management, (Arlington, Virginia, USA, 2006), ACM,

297-306.

[42] X. Shen, B. Tan, and C.Zhai, (2005): Implicit user modeling

for personalized search. in CIKM '05: Proceedings of the 14th

ACM international conference on Information and knowledge

management, (Bremen, Germany, 2005), ACM,824-831.

Ranking of Search Requests in the Digital Information Retrieval System Based on Dynamic Neural Networks

Article

Full-text available

Apr 2022
COMPLEXITY

The article is devoted to the problem of optimization of search request ranking algorithms in the digital information retrieval system. The algorithm of functioning of the neural network ranking unit based on Hopfield neural network is built. The ability to generate a ranked list of pages found as a result of the request in the digital information retrieval system can be provided by solving two problems of integer optimization: the problem of assignment of combinatorial sets of criteria for assessing the relevance of web page search and the problem of sorting of numbers—relevance values. The architecture of the neural network model based on the dynamic Hopfield neural network with binary output function designed for combinatorial optimization of the final list of documents found in the digital information retrieval system was synthesized. Promising variants of neural network models with binary output function of neurons for synthesis of the optimal evaluation plan with a combinatorial set of criteria by solving the problem of assignment were built. It has been proven that the built models differ in the rules for determining the coefficients of synaptic connections and external shifts; each of the created rules can be used independently or in different combinations with one another. In the course of analytical research, it was found that the optimization formulation of the problem of sorting of relevance values of search pages is identical to the problem of assignment of combinatorial groups of evaluation criteria provided that the elements of the performance matrix of the latter are defined as linear combinations of relevance values.

A Comparative Analysis of Sentence Embedding Techniques for Document Ranking

Article

Full-text available

Dec 2022
J WEB ENG

Due to the exponential increase in the information on the web, extracting relevant documents for users in a reasonable time becomes a cumbersome task. Also, when user feedback is scarce or unavailable, content-based approaches to extract and rank relevant documents are critical as they suffer from the problem of determining semantic similarity between texts of user queries and documents. Various sentence embedding models exist today that acquire deep semantic representations through training on a large corpus, with the goal of providing transfer learning to a broad range of natural language processing tasks such as document similarity, text summarization, text classification, sentiment analysis, etc. So, in this paper, a comparative analysis of six pre-trained sentence embedding techniques has been done to identify the best model suited for document ranking in IR systems. These are SentenceBERT, Universal Sentence Encoder, InferSent, ELMo, XLNet, and Doc2Vec. Four standard datasets CACM, CISI, ADI, and Medline are used to perform all the experiments. It is found that Universal Sentence Encoder and SentenceBERT outperform other techniques on all four datasets in terms of MAP, recall, F-measure, and NDCG. This comparative analysis offers a synthesis of existing work as a single point of entry for practitioners who seek to use pre-trained sentence embedding models for document ranking and for scholars who wish to undertake work in a similar domain. The work can be expanded in many directions in the future as various researchers can combine these strategies to build a hybrid document ranking system or query reformulation system in IR.

A Distributed $k$-Winners-Take-All Model With Binary Consensus Protocols

Article

Dec 2023

This article concentrates on solving the $k$ -winners-take-all $(k$ WTA) problem with large-scale inputs in a distributed setting. We propose a multiagent system with a relatively simple structure, in which each agent is equipped with a 1-D system and interacts with others via binary consensus protocols. That is, only the signs of the relative state information between neighbors are required. By virtue of differential inclusion theory, we prove that the system converges from arbitrary initial states. In addition, we derive the convergence rate as $\mathcal{O}(1/t)$ . Furthermore, in comparison to the existing models, we introduce a novel comparison filter to eliminate the resolution ratio requirement on the input signal, that is, the difference between the $k$ th and $(k+1)$ th largest inputs must be larger than a positive threshold. As a result, the proposed distributed $k$ WTA model is capable of solving the $k$ WTA problem, even when more than two elements of the input signal share the same value. Finally, we validate the effectiveness of the theoretical results through two simulation examples.

Use Of Information Retrieval System for Library Services By Undergraduate Students In Professor Aghagbo Nwako Library, Nnamdi Azikiwe University Awka, Anambra State.

Article

Full-text available

Mar 2023

The study investigated the use of information retrieval system for library services by undergraduate students in Prof. Aghagbo Nwako he study investigated the use of information retrieval system for library services by undergraduate students in Prof. Aghagbo Nwako Library Awka. The purpose of the study are to determine: The library information retrieval system available for library services to undergraduate students, the use of information retrieval system for library services to undergraduate students, the constraints to the use of information retrieval system in provision of library services to undergraduate students and the ways to improve the use of information retrieval system in provision of library services to undergraduate students. Four research questions guided the study. The study employed descriptive survey research design. The population of the study comprised of 308 regular undergraduate students (100 to 400 Levels) of Library and Information Science, Nnamdi Azikiwe University, Awka. Simple random sampling technique was used for the study; therefore, 120 undergraduate students were sampled. The sample size of the study was 30 regular undergraduate students selected from each of the levels. A self-structured questionnaire was the instruments used for data collection. The data collected was analyzed using descriptive statistics. The findings of the study revealed among others that the undergraduates were aware of information retrieval system available in and use it for their academic purposes and the study also revealed that the students faced unreliable power supply, poor maintenance culture; low bandwidth of internet access, lack of awareness among others factors that hinder the utilization of library information retrieval system. The study concludes among others that information retrieval services are available for use by undergraduate students. But the undergraduate students do not utilize library information retrieval system to a fuller extent due to unreliable power supply, poor maintenance culture, low bandwidth of internet access are among others factors that hinder the utilization of library information retrieval system. The study recommends that high Internet connectivity should be provided in the Library, so as to encourage library users to patronize the library resources and services when they are made available. Also it was recommended that university management should provide adequate fund for maintenance and alternative source of power.

A Dimensional Representation of Depressive Text

Conference Paper

Full-text available

Jan 2021

Hybrid Model with Word2vector in Information Retrieval Ranking

Chapter

Jan 2021

People have realized the importance of finding and archiving information with the computer advents for thousands of years, and storing of large amount of information became possible. It is actually not related to the fetching of the documents, it informs the user on the whereabouts and existence of the documents. In this paper, hybrid model has been used in which the document is classified using the support vector machine (SVM) classifier, and after the condition is applied, if it is satisfied, the extraction of the matched paragraph and the sentence is responsible for the generation of relevant answer. The knowledge base gets updated if condition does not match, and new updated answer will be generated. Finally, the best answer is displayed after ranking by using the PSO optimization. Word2vector is applied for feature extraction. In this paper, comparison of RankSVM, RankPSO and RankHSVM + PSO for the implementation of IR ranking is considered. Here, first SVM is used as a classifier for dividing most relevant and non-relevant results, and afterward PSO is used for the optimization of the result means extraction of the best answer or document. Selection of appropriate parameters is difficult in case of simple SVM, but for the ranking of the answers it gives potential solutions. PSO is used for optimization which has global search capability and is easy to implement and thus to optimize the ranking of document retrieval. We propose the RankHSVM + PSO model to find the fitness function. This technique improves the performance of the system as comparative to other techniques. The result shows that the algorithm applied here improves the value of performance evaluation by 4–5%. TREC 2004 QA DATA dataset is used which contains my datasets. It has a question answering track since 1999. The task was defined in each track. Retrieval of true equivalent test collection for standard retrieval is an open problem. In a retrieval test collection, the unit that is judged the document has a unique identifier.

DDoS attacks impact on data transfer in IOT-MANET-based e-healthcare for tackling COVID-19

Chapter

Full-text available

Jan 2021

The Covid-19, a pandemic situation, effects the economy of the whole world severely and is gaining much huge attention in the field of research currently across the globe. The Internet of things (IoT) technology is playing a great role for taking care of the patients by monitoring and controlling the symptoms and is very much essential for the developing countries, where monitoring of health of huge population has its own challenges. So the IoT and its amalgamation with mobile ad hoc network (MANET) acts as base of networks where devices send information among each other wirelessly thus also named as wireless mesh networks (WMN), in which various nodes are either stationary or allied with static position. Sensors and different other devices involved in e-healthcare sector used in WMN converse wirelessly and hence become the main gate to a numerous susceptibilities. The main aim of this research study is to evaluate the performance of reactive, secured and hybrid routing protocols for throughput as one of the important quality of service (QoS) parameters in absence as well as in presence of distributed denial of service (DDoS). The NS-2(network simulator) is used to simulate AODV (ad hoc on-demand vector, SAODV (secured AODV) and hybrid wireless mesh protocol) in scenario of changing nodes. The comparative analysis concludes the HWMP as most suitable protocol among the other two routing protocols with impact on throughput for handling DDoS attacks. This research study aids in providing implications to enhance existing protocols and alleviate the consequence of DDoS instigated by such attacks.

A Dimensional Representation of Depressive Text

Chapter

Full-text available

Jan 2021

Depression is presently one of society's main psychological disorders. An intensified public mental health concern has been prompted by recent experiences with the emergence of corona virus disease 2019 (COVID-19). At present, the emphasis of research on human emotional state representation has changed from basic emotions to a large number of emotions in continuous three-dimensional space owing to the complexity of describing and evaluating a vast number of emotions within a single framework. Significant considerations of 3D continuous valence, arousal and dominance space while overseeing mental health issues are important as they relate to the expression of emotion and behavioural reactions. The goal of this research is to design a machine learning regressor modal to estimate the continuous valence, arousal and dominance score which results from the process of emotional intelligence via text interpretation. In the pursuit of goal, EmoBank dataset, which contains text information as well as valence–arousal–dominance values and for validation ISEAR, a labelled corpus of categorical emotions datasets is used. We learn an embedding using three pre-trained word embeddings: word2vec, Doc2vec and BERT, and find that BERT significantly outperforms the result. In a future study, the regressor model will be adopted in depression detection by distributing the categorical negative emotions in terms of VAD.

FUDMA Journal of Arts, A publication of the faculty of arts

Article

Full-text available

Jan 2018

Large amount of unstructured designed information is difficult to deal with. Obtaining specific information is a hard mission and takes a lot of time. Information Retrieval System (IR) is a way to solve this kind of problem. The main reason for this work is to help internet and library database users to find the required information with high performance. Finding information on a particular word, subject, topic or article is useful when specific keywords such as title, author's name, ISBN/ISSN, year of publication for the object are known. The study is guided by two (2) objectives and adopted documentary research method in social research and the core aim is to provide a general understanding on information retrieval system in a digital era as well as specific process for its utilization.

Impact of ICT on Information Retrieval System in Academic Libraries: The Experience of Federal University Gashua Library, Yobe State, Nigeria

Article

Feb 2019

Bukola Agboola

Algorithm for Information Retrieval optimization

Conference Paper

Full-text available

Oct 2016

When using Information Retrieval (IR) systems, users often present search queries made of ad-hoc keywords. It is then up to the information retrieval systems (IRS) to obtain a precise representation of the user's information need and the context (preferences) of the information. To address this problem, we investigate optimization of IRS to individual information needs in order of relevance. The goal of this article is to develop algorithms that optimize the ranking of documents retrieved from IRS according to user search context. In particular, the ranking task that led the user to engage in information-seeking behaviour during search tasks. This article discusses and describes a Document Ranking Optimization (DROPT) algorithm for IR in an Internet-based or designated databases environment. Conversely, as the volume of information available online and in designated databases is growing continuously, ranking algorithms can play a major role in the context of search results. In this article, a DROPT technique for documents retrieved from a corpus is developed with respect to document index keywords and the query vectors. This is based on calculating the weight (w ij ) of keywords in the document index vector, calculated as a function of the frequency of a keyword k j across a document. The purpose of the DROPT technique is to reflect how human users can judge the context changes in IR result rankings according to information relevance. This article shows that it is possible for the DROPT technique to overcome some of the limitations of existing traditional (tƒ × idƒ) algorithms via adaptation. The empirical evaluation using metrics measures on the DROPT technique carried out through human user interaction shows improvement over the traditional relevance feedback technique to demonstrate improving IR effectiveness.

A Distributed Agent Based Web Search using a Genetic Algorithm

Article

Full-text available

Jan 2007

In this paper, the problems of current web search engines are analyzed, and the need for a new design is justified. Some ideas on how to improve current web search engines are presented, and then an adaptive method for web meta-search engines with a multi-agent specially the mobile agents is presented to make search engines work more efficiently. In the method, the cooperation between stationary and mobile agents is used to make more efficiency. The meta-search engine gives the user needed documents based on the multi-stage mechanism. The merge of the results obtained from the search engines in the network is done in parallel. Using a reduction parallel algorithm, the efficiency of this method is increased. Furthermore, a feedback mechanism gives the meta-search engine the user's suggestions about the found documents, which leads to a new query using a genetic algorithm. In the new search stage, more relevant documents are given to the user. The practical experiments were performed in Aglets programming environment. The results achieved from these experiments confirm the efficiency and adaptability of the method.

Understanding and Using Context Personal and Ubiquitous Computing Journal

Article

Full-text available

Anind K. Dey

Towards a Context Sensitive Approach to Searching Information Based on Domain Specific Knowledge Sources

Article

Jan 2011

Real time user context modeling for information retrieval agents

Conference Paper

Jan 2001

Unstructured queries based on mobile user context

Article

Nov 2012

Purpose – Many mobile devices today are equipped with diversified sensors that enable the acquisition of rich user context (e.g. GPS location, phone activity) for application utilization. With the growing usage of mobile devices in daily life, the problem of conveniently and promptly searching a piece of content that a user has viewed on his/her device before becomes more and more crucial. This paper aims to propose a context‐based query processing framework called UCQP that supports unstructured queries for content search in a user's access history. Design/methodology/approach – Beyond the keywords related to the content properties, a context query in the framework is specified with freeform phrases that describe high‐level mobile contexts of the user at a previous time when the user viewed the searched content. Findings – Experimental results on a prototype system of the framework illustrate its good accuracy and small response time. Originality/value – To tolerate the incompleteness and inaccuracy in user query texts caused by fading human memory, the authors develop several semantic query parsers that are tailored for different types of contexts using natural language processing and information retrieval techniques. The authors further propose a similarity model to rank the multiple result contents of a query by comparing context entities specified in the query and historical context values associated with each result.

Improving search via personalized query expansion using social media

Article

Jun 2012

Social tagging systems have gained increasing popularity as a method of annotating and categorizing a wide range of different web resources. Web search that utilizes social tagging data suffers from an extreme example of the vocabulary mismatch problem encountered in traditional information retrieval (IR). This is due to the personalized, unrestricted vocabulary that users choose to describe and tag each resource. Previous research has proposed the utilization of query expansion to deal with search in this rather complicated space. However, non-personalized approaches based on relevance feedback and personalized approaches based on co-occurrence statistics only showed limited improvements. This paper proposes a novel query expansion framework based on individual user profiles mined from the annotations and resources the user has marked. The underlying theory is to regularize the smoothness of word associations over a connected graph using a regularizer function on terms extracted from top-ranked documents. The intuition behind the model is the prior assumption of term consistency: the most appropriate expansion terms for a query are likely to be associated with, and influenced by terms extracted from the documents ranked highly for the initial query. The framework also simultaneously incorporates annotations and web documents through a Tag-Topic model in a latent graph. The experimental results suggest that the proposed personalized query expansion method can produce better results than both the classical non-personalized search approach and other personalized query expansion methods. Hence, the proposed approach significantly benefits personalized web search by leveraging users’ social media data.

Mobile guides: Taxonomy of architectures, context awareness, technologies and applications

Article

Jan 2013

Portable devices are increasingly employed in a wide range of mobile guidance applications. Typical examples are guides in urban areas, museum guides, and exhibition space aids. The demand is for the delivery of context-specific services, wherein the context is typically identified by a combination of data related to location, time, user profile, device profile, network conditions and usage scenario. A context-aware mobile guide is intended to provide guidance services adjusted to the context of the received request. The adjustment may refer to tailoring the user interface to the perceived context, as well as delivering the right type of information to the right person at the right time and the right location. It may also refer to intermediary adaptation, as in the case of mobile multimedia transmission. This paper offers a taxonomy of mobile guides considering multiple criteria. The taxonomy considers several aspects of the mobile applications space, including context awareness, client architectures, mobile user interfaces, as well as offered functionalities, highlighting functional, architectural, technological, and implementation issues. Existing implementations are classified accordingly and a discussion of research issues and emerging trends is offered.

Relevance Feedback in Information Retrieval

Chapter

Jan 1971

J J Rocchio

User modelling using evolutionary interactive reinforcement learning

Article

Jun 2006

As the volume and variety of information sources continues to grow, there is increasing difficulty with respect to obtaining information that accurately matches user information needs. A number of factors affect information retrieval effectiveness (the accuracy of matching user information needs against the retrieved information). First, users often do not present search queries in the form that optimally represents their information need. Second, the measure of a document’s relevance is often highly subjective between different users. Third, information sources might contain heterogeneous documents, in multiple formats and the representation of documents is not unified. This paper discusses an approach for improvement of information retrieval effectiveness from document databases. It is proposed that retrieval effectiveness can be improved by applying computational intelligence techniques for modelling information needs, through interactive reinforcement learning. The method combines qualitative (subjective) user relevance feedback with quantitative (algorithmic) measures of the relevance of retrieved documents. An information retrieval is developed whose retrieval effectiveness is evaluated using traditional precision and recall.

A Context-Adaptive Ranking Model for Effective Information Retrieval System

Abstract and Figures

Recommended publications

Research on Information Retrieval System that Supports Keyword Selection based on Generalized Concep...

FUDMA Journal of Arts, A publication of the faculty of arts

Impact of ICT on Information Retrieval System in Academic Libraries: The Experience of Federal Unive...

Algorithm for Information Retrieval optimization

Applying a Novel Query Reformulation Keywords Algorithm in A Mobile Healthcare Retrieval Context