An Integrative Semantic Framework for Image Annotation and Retrieval
Taha Osman1, Dhavalkumar Thakker1, Gerald Schaefer2, Phil Lakin3
1 School of Computing & Informatics, Nottingham Trent University, Nottingham, NG11 8NS, UK
{taha.osman, dhavalkumar.thakker}@ntu.ac.uk
2 School of Engineering & Applied Science, Aston University, Aston Triangle, Birmingham B4 7ET, UK
g.schaefer@aston.ac.uk
3 PA Photos, Pavilion House, 16 Castle Boulevard, Nottingham, NG7 1FL, UK
phil.lakin@paphotos.com
Abstract
Most public image retrieval engines utilise free-text
search mechanisms, which often return inaccurate
matches as they in principle rely on statistical analysis
of query keyword recurrence in the image annotation
or surrounding text. In this paper we present a
semantically-enabled image annotation and retrieval
engine that relies on methodically structured
ontologies for image annotation, thus allowing for
more intelligent reasoning about the image content and
subsequently obtaining a more accurate set of results
and a richer set of alternatives matchmaking the
original query. Our semantic retrieval technology is
designed to satisfy the requirements of the commercial
image collections market in terms of both accuracy
and efficiency of the retrieval process. We also present
our efforts in further improving the recall of our
retrieval technology by deploying an efficient query
expansion technique.
1. Introduction
Affordable access to digital technology and
advances in Internet communications have contributed
to the unprecedented growth of digital media
repositories (audio, images, and video). Retrieving
relevant media from these ever-increasing repositories
is an impossible task for the user without the aid of
search tools. Most public image retrieval engines rely
on analysing the text accompanying the image to
matchmake it with the user query. Various
optimisations have been developed, including weighting systems where, for instance, higher regard is given to the proximity of the keyword to the image location, or advanced text analysis techniques that use a term weighting method relying on the proximity between the anchor to an image and each word in an HTML file [1]. Despite the optimisation
efforts, these search techniques remain hampered by
the fact that they rely on free-text search that, while
cost-effective to perform, can return irrelevant results
as it primarily relies on the recurrence of exact words
in the text accompanying the image. The inaccuracy of
the results increases with the complexity of the query.
For instance, while performing this research we found that using the Yahoo™ search engine to look for images of the football player Zico returns some good pictures of the player, mixed with photos of cute dogs (apparently Zico is also a popular name for pet dogs); but adding the action of scoring to the search text seems to completely confuse the search engine, and only one picture of Zico is returned, in which he is standing still!
Any significant contribution to the accuracy of
matchmaking results can be achieved only if the search
engine can “comprehend” the meaning of the data that
describes the stored images, for instance, if the search
engine can understand that scoring is an act associated
with sport activities performed by humans. Semantic
annotation techniques have gained wide popularity in
associating plain data with “structured” concepts that
software programs can reason about [2]. This effort presents a comprehensive semantic-based solution to image annotation and retrieval, and deploys query expansion techniques to improve the recall rate. It specifically targets the commercial image
collections market and acknowledges their
requirements for high-quality recall without sacrificing the performance of the retrieval process.
The paper begins with an overview of the Semantic
web technologies. In section 3 we review the case
study that was the motivation for this work. Sections 4,
5, 6, and 7 detail the implementation roadmap of our
semantic-based retrieval system, i.e. ontology
engineering, annotation, retrieval, and query
expansion. We present our conclusions and plans for
further work in section 8.
2007 IEEE/WIC/ACM International Conference on Web Intelligence
0-7695-3026-5/07 $25.00 © 2007 IEEE
DOI 10.1109/WI.2007.69
366
2. Overview of the semantic web
2.1. Ontologies (domain conceptualisation)
The fundamental premise of the semantic web is to
extend the Web’s current human-oriented interface to a
format that is comprehensible to software programmes.
Naturally this requires a standardised and rich
knowledge representation scheme or Ontology.
One of the most comprehensive definitions of
ontologies is that expressed in [3]: “Ontology is a shared conceptualisation of a domain and typically consists of a comprehensive set of concept classes, relationships between them, and instance information showing how the classes are populated in the application domain.” This comprehensive representation of knowledge from a particular domain allows reasoning software to make sense of domain-related entities (images, documents, services, etc.) and aid in the process of their retrieval and use.
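To make the idea concrete, the class/instance structure described above can be sketched as a small in-memory model. This is an illustrative sketch only; the class names (Footballer, Athlete, Person) are invented and not taken from any published ontology:

```python
# A minimal sketch of an ontology: classes, subclass links, and instances.
# All names here are illustrative, not part of the ontology described in the text.
subclass_of = {
    "Footballer": "Athlete",
    "Athlete": "Person",
}

instance_of = {
    "DavidBeckham": "Footballer",
}

def is_a(instance, cls):
    """Reason over the class hierarchy: does `instance` belong to `cls`?"""
    current = instance_of.get(instance)
    while current is not None:
        if current == cls:
            return True
        current = subclass_of.get(current)
    return False

# A reasoner can now infer that David Beckham is a Person,
# even though that fact was never stated directly.
print(is_a("DavidBeckham", "Person"))  # True
```

This inference over subclass links is what lets a search engine conclude, for example, that a footballer is a person likely to express emotions.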
2.2. Caption-based semantic annotation
Applied to image retrieval, the semantic annotation
of images allows retrieval engines to make more
intelligent decisions about the relevance of the image
to a particular user query, especially for complex
queries. For instance to retrieve images of the football
star David Beckham expressing anger, it is natural to
type the keywords ‘David Beckham angry’ into the
Google™ Image Search engine. However, at the time of the experiment, the search engine returned 14 images of David Beckham, and he looks upset in only two of them. The other retrieved images were completely irrelevant, with one of them displaying an angry moose!
The use of Semantic technologies can significantly
improve the computer’s understanding of the image
objects and their interactions by providing a machine-
understandable conceptualisation of the various
domains that the image represents. This conceptualisation integrates concepts and inter-entity relations from different domains, such as Sport, People and Emotions in relation to the query above [4], thus allowing the search engine to infer that David Beckham is a person and thus likely to express emotions, and that he is also an English footballer playing for Real Madrid FC.
2.3. Content-based semantic annotation
The success of caption-based semantic image
retrieval largely depends on the quality of the semantic
caption (annotation) itself. However, the caption is not
always available largely because the annotation is a
labour intensive process. In such situations, image
recognition techniques are applied, which is better
known as content-based retrieval. However, the best
content-based techniques deliver only partial success
as image recognition is an extremely complex problem
[5], especially in the absence of accompanying text that can aid in inferring the relationships between the recognized objects in the image. Moreover, from a
query composition point of view, it is much easier to
use a textual interface rather than a visual interface (by
providing sample training image or sketch) [6].
3. Case study for semantic-based image
retrieval
An opportunity to experiment with our research
findings in semantic-based search technology was
gratefully provided by PA Photos™. PA Photos is a
Nottingham-based company which is part of the Press
Association Photo Group Company [7]. As well as
owning a huge image database of over 4 million annotated images dating back to the early 1900s,
the company processes a colossal amount of images
each day from varying events ranging from sport to
politics and entertainment. The company also receives
annotated images from a number of partners that rely
on a different photo indexing schema.
More significantly, initial investigation has shown that the accuracy of the result sets matching user queries does not measure up to the rich repository of photos in the company’s library.
The goal of the case study is two-fold. Initially, we
intend to investigate the use of semantic technology to
build a classification and indexing system that
critically unifies the annotation infrastructure for all the
sources of the incoming stream of photos. Subsequently, we will conduct a feasibility study aiming to improve the end-user experience of the company’s image search engine. At the moment the PA Photos search engine relies on free-text search to return a set of images matching user requests; the returned results can therefore go off on a tangent if the search keywords do not exactly recur in the photo annotations. A significant
improvement can result from semantically enabling the
photo search engine. Semantic-based image search will
ultimately enable the search engine software to
understand the “concept” or “meaning” of the user
request and hence return more accurate results
(images) and a richer set of alternatives.
It is important here to comment about the dynamics
of the retrieval process for this case study as it
represents an important and wide-spread class of
application areas where there is a commercial
opportunity for exploiting semantic technologies:
1. The images in the repository have not been
extracted from the web. Consequently the
extensive research into using the surrounding
text and information in the HTML document to improve the quality of the annotation, such as in [2] [6], is irrelevant.
2. A significant sector of this market relies on fast
relay of images to customers. Consequently this
confines advanced but time-consuming image
analysis techniques [5] to off-line aid with the
annotation of caption-poor images.
3. The usually colossal amount of legacy images annotated to a particular (non-semantic) schema
necessitates the integration of these
heterogeneous schemas into any new,
semantically-enabled and more comprehensive
ontologies.
4. Ontology development
4.1. Domain Analysis
Our domain analysis started from an advanced
point as we had access to the photo agency’s current
classification system. Hence, we adopted a top-down
approach to ontology construction that starts by
integrating the existing classification with published
evidence of more inclusive public taxonomies [8]. At
the upper level, two ontological trees were identified;
the first captures knowledge about the event (objects
and their relationships) in the image, and the second is
a simple upper class that characterises the image
attributes (frame, size, creation date, etc.), which is
extensible in view of future utilisation of content-
recognition techniques.
Building knowledge-management systems using
ontologies and reasoning engines is a more
cumbersome task than the traditional database-based
approach. Hence, it is wise to be prudent with the scale
of semantic-based projects until feasibility of the
semantic approach is ascertained, particularly in
commercial contexts, where emphasis is on
deliverables rather than the methodology. At the initial
stages of the research, we made the following
decisions:
1. To limit our domain of investigation to sport-
related images
2. Address the sports participants “action” and
“emotion” in our ontology to demonstrate the
advantage of using semantics in expressing
relationships between objects in the image.
3. Defer research into content-based methods,
which mainly targets aid in annotating legacy
images, until the feasibility of caption-based
semantic retrieval proves successful.
A bottom-up approach was used to populate the
lower tiers of the ontology class structure by
examining the free-text and non-semantic caption
accompanying a sample set of sport images. Domain
terms were acquired from approximately 65k image
captions. The terms were purged of redundancies and
verified against publicly available related taxonomies
such as the media classification taxonomy detailed in
[8]. An added benefit of this approach is that it allows
existing annotations to be seamlessly parsed and
integrated into the semantic annotation.
Wherever advantageous, we integrated external
ontologies (e.g., [9]) into our knowledge
representation. However, bearing in mind the
responsiveness requirements of on-line retrieval
applications, we applied caching methods to localise
the access in order to reduce its time overhead.
Figure 1 Subset of the ontology tree
4.2. Consistency Checking
Unlike database structures, ontologies represent knowledge rather than data; hence any structural problems will have a detrimental effect on the corresponding reasoning agents, especially as ontologies are open and distributed by nature, which might cause wide-spread propagation of any inconsistencies [10]. For
instance, in traditional structuring methodologies,
usually the part-of relationship is followed to express
relationships between interdependent concepts. So, for
players that are part-of a team performing in a
particular event, the following is a commonly taken
approach:
Figure 2 Traditional part-of relationships
[Figure 2 depicts the traditional part-of design: Player (FirstName, LastName, hasNationality, hasTeam), Team (Name, hasNationality, hasTournament) and Tournament (Name). Figure 1 depicts the ontology tree rooted at Image Collection, spanning the Sports Domain branch (Event, Sport, Federation, Team, Stadium, Person, Player, Manager, Action, Feeling, Human Characteristic) and the Image Attributes branch (Size, Contrast, Format).]
However logical the above description appears at
first sight, further analysis reveals inconsistency
problems. When a player plays for two different teams at the same time (e.g. his club and his national team) or changes clubs every year, it is almost impossible to determine which team the player plays for. Hence, the
order of definition (relationship direction) should
always be the reversal sequence of the part-of
relationship as redesigned below:
Figure 3 Re-organization of the player classification
4.3. Coverage
Although consistent, the structural solution in
Figure 3 is incomplete as players’ membership is
temporal. The same problem occurs with tournaments, as from one year to another the teams taking part in the tournament change. This problem can be solved by
adding a start and end date for the tournament (see
Figure 4), rather than by engineering more complex
object property solutions. Hence, as far as the semantic
reasoner is concerned, the “FIFA World Cup 2004” is
a different instance from “FIFA World Cup 2008”. The
same reasoning can be applied to the class team, as
players can change team every season. These
considerations, although basic for a human reasoning,
need to be explicitly defined in the ontology.
Figure 4 Resolving Coverage problems in ontology
4.4. Normalisation: reducing the redundancy
The objective of normalisation is to reduce
redundancy. In ontology design, redundancy is often caused by temporal characteristics that can generate redundant information and negatively affect the performance of the reasoning process.
Direct adoption of the ontology description in Figure 4 above will result in creating a new team each season, which is rather inefficient, as the team should be a non-temporal class regardless of the varying player membership or tournament participation every season. Hence, Arsenal or Glasgow Rangers football clubs need to remain abstract entities. Our approach was to introduce an intermediary temporal membership concept that serves as an indispensable link between teams and players, as well as between teams and tournaments, as illustrated in Figure 5 below.
The temporal instances from the Membership class
link instances from two perpetual classes as follows:
memberEntity links to a person (Player, Manager,
Supporter, Photographer, etc.)
isMemberOf refers to the organisation (Club, Press
Association, Company, etc.)
fromPeriod and toPeriod depict membership
temporal properties
Figure 5 Membership class in the final ontology
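A minimal sketch of the Membership concept might look as follows; periods are simplified to years, and the sample career spans are used purely for illustration:

```python
from dataclasses import dataclass

# Sketch of the temporal Membership concept: Player and Club stay perpetual,
# while time-bounded Membership instances link them.
@dataclass(frozen=True)
class Membership:
    member_entity: str   # memberEntity: a Player, Manager, Supporter, etc.
    is_member_of: str    # isMemberOf: a Club, Association, Company, etc.
    from_period: int     # fromPeriod (simplified to a year)
    to_period: int       # toPeriod

memberships = [
    Membership("DavidBeckham", "ManchesterUnited", 1992, 2003),
    Membership("DavidBeckham", "RealMadrid", 2003, 2007),
]

def club_in(player, year):
    """Resolve which club a player belonged to in a given year."""
    for m in memberships:
        if m.member_entity == player and m.from_period <= year < m.to_period:
            return m.is_member_of
    return None

print(club_in("DavidBeckham", 2005))  # RealMadrid
```

Because membership, not the club, carries the temporal properties, the same perpetual Player and Club instances can be reused across seasons.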
5. Image Annotation
The Protégé® ontology editor was utilised to construct the sport domain ontology. Protégé uses
frame-based knowledge representation [11] and adopts
OWL as the ontology language. The Web Ontology Language (OWL) [12] has become the de-facto standard for expressing ontologies; it adds extensive vocabulary to describe properties and classes and to express relations between them (such as disjointness), cardinality (for example, "exactly one"), equality, richer typing of properties, and characteristics of properties (such as symmetry). The Jena [13] Java API
was used to build the annotation portal to the
constructed ontology.
The central component of the annotation is the set of images stored (as OWL descriptions) in the image library, as illustrated in Figure 6. Each image comprises an object whose main features are stored within an independent object library. Similarly, the object characteristics, event location, etc. are kept distinct from the image library. This highly modular annotation model facilitates the reuse of semantic information and reduces redundancy.
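The modular library structure can be sketched as follows. The identifiers (o1, oc1, l1) mirror the naming style of Figure 6, but the records themselves are invented for illustration:

```python
# Sketch of the modular annotation model: an image entry references entries
# in separate object / characteristic / location libraries by identifier,
# so shared semantic information is stored once and reused.
object_library = {
    "o1": {"class": "Person", "name": "David Beckham"},
}
characteristic_library = {
    "oc1": {"object": "o1", "characteristic": "angry"},
}
location_library = {
    "l1": {"city": "Nottingham", "country": "England"},
}
image_library = {
    "img1": {"object": "o1", "object_characteristic": "oc1",
             "location": "l1", "date": "2007-06-01"},
}

def describe(image_id):
    """Assemble a full description by following the cross-library links."""
    img = image_library[image_id]
    obj = object_library[img["object"]]
    char = characteristic_library[img["object_characteristic"]]
    loc = location_library[img["location"]]
    return f'{obj["name"]} ({char["characteristic"]}) in {loc["city"]}'

print(describe("img1"))  # David Beckham (angry) in Nottingham
```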
[Figure 5 depicts the temporal Membership class (membershipEntity, isMemberOf, fromPeriod) linking Player instances (Thierry Henry, David Beckham, Ronaldinho) to Club instances (Arsenal FC, Real Madrid, Barcelona). Figures 3 and 4 depict the redesigned Player (FirstName, LastName, hasNationality, isPlayerOf), Team (Name, Season, hasNationality, hasPlayer, isTeamOf) and Tournament (Name, hasStartDate, hasEndDate, hasTeam) classes.]
Figure 6 Architecture of the annotation
Taking into account the dynamic, motion-oriented nature of the sport domain, our research concluded that a variation of the sentence structure suggested in [14] is best suited to our annotation template. We opted
for an “Actor – Action – Object” structure that will
allow the natural annotation of motion or emotion-type
relationships without the need to involve NLP
techniques [15]. For instance, “Beckham – Smiles –
null”, or “Gerrard – Tackles – Henry”. An added
benefit of the structure is that it simplifies the task of
the reasoner in matching actor and action annotations
with entities that have similar characteristics.
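A minimal sketch of the “Actor – Action – Object” template, using the example annotations above:

```python
from typing import NamedTuple, Optional

# Sketch of the "Actor - Action - Object" annotation sentence.
# The object is optional, as in "Beckham - Smiles - null".
class Annotation(NamedTuple):
    actor: str
    action: str
    obj: Optional[str] = None

annotations = [
    Annotation("Beckham", "Smiles"),
    Annotation("Gerrard", "Tackles", "Henry"),
]

def find(actor=None, action=None, obj=None):
    """Return annotations whose set fields match the given query terms."""
    return [a for a in annotations
            if (actor is None or a.actor == actor)
            and (action is None or a.action == action)
            and (obj is None or a.obj == obj)]

print(find(action="Tackles"))  # [Annotation(actor='Gerrard', action='Tackles', obj='Henry')]
```

In the full system the matching is of course semantic rather than exact-string, with the reasoner comparing actor and action annotations against similar concepts.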
6. Image Retrieval
The image retrieval user interface is illustrated in
Figure 7. The search query can include sentence-based
relational terms (Actor-Emotion/Action-Object) and/or
key domain terms (such as tournament and team). If multiple terms are selected for the query, the user needs to specify which term represents the main search preference (criterion).
Figure 7 Snapshot of the retrieval interface
For instance, in Figure 7 the relational term
(Gerrard Tackles Rooney) is the primary search term
and team Liverpool is the secondary search term. The
preference setting is used to improve the ranking of
retrieved images.
Figure 8 gives a high level view of the annotation
and retrieval mechanism. The semantic description
generator allows the annotator to transparently annotate
new images and also transforms the user query into
OWL format. The semantic reasoning engine applies our matchmaking algorithm in two phases: the first
retrieves images with annotations matching all
concepts in the query; in the second phase further
matchmaking is performed to improve the ranking of
the retrieved images in response to user preferences.
Figure 8 Schematic diagram of the Semantic Web
Image Retrieval software
Our reasoning engine uses a variation of the nearest
neighbour matchmaking algorithm [16] to serve both
the semantic retrieval and the ranking phases. Our algorithm continues traversing from the matched instances up through the class hierarchy, matching instances until there are no super classes left, i.e. the root of the tree is reached, giving a degree of match equal to 0. The degree of match (DoM) is calculated according to the following equation:

DoM = MN / GN (Equation 1)

where MN is the total number of matching nodes in the selected traversal path, and GN is the total number of nodes in that path. This is exemplified in Figure 9. The comparison values
[Figure 6 depicts the modular annotation architecture: the Image Library (Image#1 referencing Object#o1, ObjectCharac#oc1, Location#l1 and a date) cross-references the Object Library (Object#o1: class=person, name, size, date of creation), the Object Characteristic Library (ObjectCharac#oc1: object=o1, characteristic=angry) and the Location Library (Location#l1: City#city1, Country#country1). Figure 8 depicts the retrieval architecture: user and admin requests pass through the semantic description generator (new annotations and OWL queries) into the indexed annotation library, and the reasoning engine performs semantic-based retrieval of matching annotations followed by preference-based ranking to produce the final image set.]
are weighted using the user preferences according to
the formula [16]:
m = |lr − la|; p ∈ [0,1]; v = p^m (Equation 2)

where v is the value assigned to the comparison, m the matching level of the individuals, p the user preference setting, lr the level of the request, and la the level of the annotation.
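Assuming the reading v = p^m for Equation 2 (the preference raised to the level difference), the weighting can be sketched as follows; with p = 0.5, each level of mismatch halves the comparison value, which matches the level scores shown in Figure 9:

```python
# Sketch of the preference weighting of Equation 2, under our reading
# v = p**m where m = |lr - la| is the level difference between the
# requested and annotated concepts and p is the user preference in [0, 1].
def weighted_value(level_request, level_annotation, preference):
    m = abs(level_request - level_annotation)
    return preference ** m

# An exact-level match keeps the full weight ...
print(weighted_value(0, 0, 0.5))  # 1.0
# ... while each level of mismatch halves it when p = 0.5.
print(weighted_value(0, 1, 0.5))  # 0.5
print(weighted_value(0, 2, 0.5))  # 0.25
```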
For example, if the query is Object–
hasCharacteristic-happy, and image1 and image2 are
annotated with Object-hasCharacteristic-happy and
Object-hasCharacteristic-smile respectively, the DoM
for image1 is 1, as the instances match at the level of the leaf node (Figure 9). For image2, however, the instances match at the level of the Positive Feeling–Mild class, one level above the leaf node, giving DoM = 0.5.
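The degree-of-match computation can be sketched over a deliberately tiny, invented hierarchy in which happy and smile are siblings; an exact match then yields DoM 1 and a sibling match 0.5, in line with the worked example above:

```python
# Equation 1 sketch: DoM = MN / GN over the traversal path from the queried
# concept up to the root. The two-level hierarchy below is invented purely
# for illustration; happy and smile are siblings under Characteristic.
parent = {
    "happy": "Characteristic",
    "smile": "Characteristic",
}

def path_to_root(concept):
    """Collect the node path from a concept up to the hierarchy root."""
    path = [concept]
    while path[-1] in parent:
        path.append(parent[path[-1]])
    return path

def degree_of_match(requested, annotated):
    """MN (shared path nodes) divided by GN (all nodes on the query path)."""
    goal = path_to_root(requested)
    annotated_nodes = set(path_to_root(annotated))
    mn = sum(1 for node in goal if node in annotated_nodes)
    return mn / len(goal)

print(degree_of_match("happy", "happy"))  # 1.0
print(degree_of_match("happy", "smile"))  # 0.5
```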
Figure 9 Traversing the Ontology Tree [the feeling hierarchy: leaves such as happy, smile and proud at Level 0 score 1; the Mild/Moderate/Strong/Intense intensity classes at Level 1 score 0.5; Positive/Negative Feeling at Level 2 scores 0.25; Feeling at Level 3 scores 0.125; and the Characteristic root at Level 4 scores 0]
7. Semantic Web-based Query Expansion to achieve better precision and recall
Lately, query expansion (QE) techniques have gained much attention as a means of improving the recall of document and media queries. QE methods fit
naturally into our image retrieval technology as we rely
on computing the aggregate degree of match (ADoM)
for the semantic relations describing a particular image
to determine its match to the original query. Hence, we can easily determine the quality of the returned results in terms of accuracy and volume, and decide whether to apply QE techniques to replace or expand the query concepts to improve the quality of the recall. This is
particularly feasible for semantic-based knowledge bases, as they provide the language expressiveness for specifying the similarity of concepts (implicit and explicit) at different granularities.
Query expansion techniques can be broadly
classified into two categories: the first category uses
statistical and probabilistic methods [17] to extract
frequently occurring terms from successfully recalled
documents and image annotations. These terms are
then used to expand the keyword set of similar future
queries. The main shortcoming of statistics-based QE techniques is that they are only as good as the statistics they rely on, and they share the disadvantages of free-text-based search engines in that they lack structure and are difficult to generalise or reuse for other domains.
The second category [18] utilises lexical databases to
expand user queries. A lexical database similar to
WordNet [19] is employed, in which language nouns,
verbs, adjectives and adverbs are organized into
synonym sets that can potentially replace or expand the
original query concepts. However, lexical databases lack the semantic conceptualisation necessary to interrelate concepts in complex queries and render them comprehensible to search engines.
A semantic relations-based QE technique expands the query with related concepts rather than simple terms. Next we discuss the semantic-based QE algorithm we designed to extend our image retrieval technology.
Step 1: If the query has concept Cp as the primary search concept and Cs as the secondary search concept provided by the searcher, then we define query expansion on Cp as follows. Let Cp′ be an alternative concept, δ the distance between the Cp and Cp′ concepts, and Ψ the expected distance between two concepts that implies they are related; the expansion function is:
f(Cp) = { (Cp′_i, δ_i, Ψ_i) : δ_i ≤ Ψ_i, i = 1, …, n } (Equation 3)
The equation implies that the concepts Cp′_i are related to Cp if they are at an acceptable distance from Cp.
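The expansion function of Equation 3 can be sketched as a simple filter; the candidate concepts and distances below are invented for illustration:

```python
# Sketch of the query-expansion function (Equation 3): candidate concepts
# Cp' are kept as related to Cp when their distance d_i stays within the
# expected distance psi_i. Distances here are invented ontology-tree hops.
def expand(candidates):
    """candidates: list of (concept, distance, expected_distance) triples."""
    return [c for c, d, psi in candidates if d <= psi]

candidates = [
    ("TeamBrazil", 1, 2),     # sibling national team: close enough
    ("TeamChelsea", 2, 2),    # club team: at the limit, still accepted
    ("StadiumWembley", 4, 2), # unrelated branch: rejected
]
print(expand(candidates))  # ['TeamBrazil', 'TeamChelsea']
```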
7.1. Formalizing relatedness between two
concepts
A major concern in QE techniques is the
formalization of relatedness between two concepts in
order to select an optimal set of alternatives.
For the benefit of the discussion, we feel it is
necessary to revisit the following components of
Semantic web formalism and their representation in the
OWL ontology language:
Taxonomy Relationships (TR): Taxonomy is the concept classification system facilitated by the Semantic Web. Class and Individual are the two main elements
of this structure, where a class is simply a name and a collection of properties that describe a set of individuals. Examples of relationships between
concepts at the taxonomy level are class, subclass,
superclass, equivalent class, individual, sameAs,
oneOf, disjointWith, differentFrom, AllDifferent.
Rule-based Relationships (RR): The Semantic Web Rule Language (SWRL) defines rule-based semantics using a subset of OWL together with sublanguages of the Rule Markup Language; it extends OWL with Horn-like first-order logic rules to increase the expressivity of the language.
We use this relationship formalism to identify
explicit and implicit relatedness of concepts. To
evaluate implicit relationships we use subsumption and
classification to perform semantic tree traversal and
compare the concepts with respect to the semantic
network tree as detailed in our image retrieval
algorithm earlier. In contrast, an explicit relationship between two concepts always has a Degree of Match (DoM) of 0 or 1, as it explicitly equates or distinguishes two individuals. For example, owl:sameAs equates two individuals to unify two distinct ontology elements, while owl:differentFrom has the exact opposite effect, making the individuals mutually distinct.
If the taxonomy- and rule-based implicit and explicit relationships result in n equivalent concepts represented by {C1, C2, C3, …, Cn}, or Cp′, then to calculate the DoM for these likely replacement concepts we employ another Semantic Web relationship formalism, which we will refer to as property-based relationships.
Property Relationships (PR): Properties can be
used to state relationships between individuals or from
individuals to data values. These relationships are
achieved through data-type or object-type properties (e.g., hasTeam, hasTournament, isMemberOf).
Step 2: Assuming the query preference concept Cp has properties R_i with value instances I_i^R, and the annotation matching the alternative concept Cp′ has properties R′_i with value instances I_i^R′, then we can compare I_i^R and I_i^R′ semantically using Equation 2.
7.2. Illustrative example
In this section we illustrate how our QE algorithm works by discussing the following case. If a user is searching for pictures of the England Team, possibly in the 2006 FIFA World Cup tournament, the system treats England Team as the user’s primary search criterion and the 2006 FIFA World Cup tournament as the secondary search criterion in the query.
Without expanding the query, the retrieval
algorithm returns zero results if there are no images
annotated with Team England (Table 1). The following section explains the process of expanding the query under these circumstances using our algorithm.
Cp property (Ri)             I_i^R (property value)
hasNationality               Country (England)
hasSport                     Sport (Football)
isWinnerOf                   Tournament (Fifawc66)
hasNationalTeamTournament    Fifawc66, 70, …
Table 1 Preference Concept (England Team, Cp)
In our sports domain ontology, an implicit subsumption relationship is applied to find relevant primary concepts. For instance, to find alternative terms for Team England, the reasoner first retrieves siblings of the National Team, such as Team Brazil and Team Spain, and then less adjacent siblings among the Team instances, such as Team Chelsea and Team Barcelona. In the following step we compare the relationships as defined in Step 2, as illustrated in Table 2 below:
Property                    Query             Team Brazil       DoM   Team Chelsea      DoM
hasNationality              England           Brazil            0     England           1
hasSport                    Football          Football          1     Football          1
isWinnerOf                  Fifawc 06         Fifawc 70         0.5   Prem. 06          0
hasNationalTeamTournament   Fifawc 66, 70, …  Fifawc 66, 70, …  1     Prem. 93, 94, …   0
Aggregate DoM                                 Brazil            2.5   Chelsea           2
Table 2 Comparing relationships
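The per-property comparison of Step 2 and its aggregation into an overall score can be sketched as follows, using the scores from Table 2 (exact matches score 1, partial matches 0.5, mismatches 0):

```python
# Sketch of Step 2: per-property degrees of match are aggregated into an
# overall score for each candidate concept, which then drives the ranking.
def aggregate_dom(per_property_scores):
    return sum(per_property_scores.values())

team_brazil = {"hasNationality": 0, "hasSport": 1,
               "isWinnerOf": 0.5, "hasNationalTeamTournament": 1}
team_chelsea = {"hasNationality": 1, "hasSport": 1,
                "isWinnerOf": 0, "hasNationalTeamTournament": 0}

ranking = sorted(
    [("TeamBrazil", aggregate_dom(team_brazil)),
     ("TeamChelsea", aggregate_dom(team_chelsea))],
    key=lambda pair: pair[1], reverse=True)
print(ranking)  # [('TeamBrazil', 2.5), ('TeamChelsea', 2)]
```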
Step 3: If the ranked images from stage 2 are {X1, X2, X3, …} and Cs is the secondary search term provided by the searcher, and these ranked images have a concept Cs^X present in their annotation, then repeat Step 2 with Cp = Cs and Cp′ = Cs^X.
In our image database this results in the images retrieved in the first stage being associated with the relevant concepts, namely: Image 1 (Team Brazil in the 2006 FIFA World Cup) and Image 2 (Chelsea in the 2007 Premiership).
Property        Query       Image 1       DoM   Image 2    DoM
hasTournament   Fifawc 06   Fifawc 06     1     Prem. 07   0
Aggregate DoM               Team Brazil   2.5   Chelsea    2
Table 3 Analysing secondary terms in the query
8. Conclusions
In this paper we presented a comprehensive
solution for image retrieval applications that takes full
advantage of advances in semantic web technologies to
coherently implement the annotation, retrieval and
query expansion components of the integrative
framework. We claim that our solution is particularly
attractive to commercial image providers where
emphasis is on the efficiency of the retrieval process as
much as on improving the accuracy and volume of
returned results. For instance, we shied away from employing expensive content-based recognition techniques at the retrieval stage, deployed public ontology caching to reduce the reasoning overhead, and designed an efficient query expansion algorithm to improve the quality of the image recall.
The first stage of the development was producing
ontologies that conceptualise the objects and their
relations in the selected domain. We methodically
verified the consistency of our ontology, optimised its coverage, and applied normalisation methods to rid it of concept redundancies. Our annotation approach was
based on a variation of the “sentence” structure to
obtain the semantic-relational capacity for
conceptualising the dynamic motion nature of the
targeted sport domain.
The retrieval algorithm is based on a variation of
the nearest-neighbour search technique for traversing
the ontology tree and can accommodate complex,
relationship-driven user queries. The algorithm also
provides for user-defined weightings to improve the
ranking of the returned images and was extended to
embrace query expansion technology in a bid to
improve the quality of the recall.
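The role of user-defined weightings in the ranking can be sketched as follows. This is a minimal illustration under our own assumptions (the function names, the 0–1 score scale and the default weight of 1.0 are ours, not the paper's implementation): each query term contributes its degree-of-match score scaled by the searcher's weighting for that term.

```python
# Illustrative sketch (not the paper's implementation) of folding
# user-defined weightings into the ranking of returned images.

def weighted_score(term_scores, weights):
    # term_scores: query term -> DoM for one image; terms the searcher
    # did not weight explicitly count with a default weight of 1.0.
    return sum(dom * weights.get(term, 1.0)
               for term, dom in term_scores.items())

def rank_images(per_image_scores, weights):
    # per_image_scores: image -> {query term: DoM}
    return sorted(per_image_scores,
                  key=lambda im: weighted_score(per_image_scores[im], weights),
                  reverse=True)

# Example: the searcher doubles the importance of the tournament term.
scores = {"Image1": {"hasTournament": 1.0, "team": 1.0},
          "Image2": {"hasTournament": 0.0, "team": 0.4}}
ranking = rank_images(scores, {"hasTournament": 2.0})
```

Here Image 1 scores 1.0 × 2.0 + 1.0 = 3.0 against Image 2's 0.4, so the tournament weighting reinforces its lead.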
Although we recognise that image analysis techniques might impose a large time overhead on the on-line retrieval process, we intend to investigate utilising advances in semantically-enabled content recognition technology to aid in semi-automating the annotation of legacy, caption-poor images.