ArticlePDF Available

A fuzzy-ontology oriented case-based reasoning framework for semantic diabetes diagnosis

August 2015
Artificial Intelligence in Medicine

August 2015

DOI:10.1016/j.artmed.2015.08.003

Authors:

Shaker El-Sappagh

Sungkyunkwan University

Mohammed Elmogy

Mansoura University

Alaa el-din mohamed Riad

Mansoura University

Objective: Case-based reasoning (CBR) is a problem-solving paradigm that uses past knowledge to interpret or solve new problems. It is suitable for experience-based and theory-less problems. Building a semantically intelligent CBR that mimic the expert thinking can solve many problems especially medical ones. Methods: Knowledge-intensive CBR using formal ontologies is an evolvement of this paradigm. Ontologies can be used for case representation and storage, and it can be used as a background knowledge. Using standard medical ontologies, such as SNOMED CT, enhances the interoperability and integration with the health care systems. Moreover, utilizing vague or imprecise knowledge further improves the CBR semantic effectiveness. This paper proposes a fuzzy ontology-based CBR framework. It proposes a fuzzy case-base OWL2 ontology, and a fuzzy semantic retrieval algorithm that handles many feature types. Material: This framework is implemented and tested on the diabetes diagnosis problem. The fuzzy ontology is populated with 60 real diabetic cases. The effectiveness of the proposed approach is illustrated with a set of experiments and case studies. Results: The resulting system can answer complex medical queries related to semantic understanding of medical concepts and handling of vague terms. The resulting fuzzy case-base ontology has 63 concepts, 54 (fuzzy) object properties, 138 (fuzzy) datatype properties, 105 fuzzy datatypes, and 2640 instances. The system achieves an accuracy of 97.67%. We compare our framework with existing CBR systems and a set of five machine-learning classifiers; our system outperforms all of these systems. Conclusion: Building an integrated CBR system can improve its performance. Representing CBR knowledge using the fuzzy ontology and building a case retrieval algorithm that treats different features differently improves the accuracy of the resulting systems.

The fuzzy sets similarity matrix for age feature.

…

Ontology evaluation measures for three ontologies.

Figures - uploaded by Mohammed Elmogy

Content may be subject to copyright.

Content uploaded by Mohammed Elmogy

Content may be subject to copyright.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

Contents

lists

available

ScienceDirect

Artiﬁcial

Intelligence

Medicine

mepage:

www.elsevier.com/locate/aiim

fuzzy-ontology-oriented

case-based

reasoning

framework

for

semantic

diabetes

diagnosis

Shaker

El-Sappagha,

Mohammed

Elmogyb,∗,

A.M.

Riadc

aDepartment

Mathematics,

College

Science,

King

Saud

University,

2455,

Riyadh,

Saudi

Arabia

bInformation

Technology

Department,

Faculty

Computers

Information,

Mansoura

University,

35516,

Mansoura,

Egypt

cInformation

Systems

Department,

Faculty

Computers

Information,

Mansoura

University,

35516,

Mansoura,

Egypt

Article

history:

Received

October

2014

Received

revised

form

June

2015

Accepted

August

2015

Keywords:

Case-based

reasoning

Knowledge

based

system

Fuzzy

ontology

Semantic

retrieval

Diabetes

diagnosis

Standard

SNOMED

terminology

Objective:

Case-based

reasoning

(CBR)

problem-solving

paradigm

that

uses

past

knowledge

inter-

pret

solve

new

problems.

suitable

for

experience-based

and

theory-less

problems.

Building

semantically

intelligent

CBR

that

mimic

the

expert

thinking

can

solve

many

problems

especially

medical

ones.

Methods:

Knowledge-intensive

CBR

using

formal

ontologies

evolvement

this

paradigm.

Ontologies

can

used

for

case

representation

and

storage,

and

can

used

background

knowledge.

Using

standard

medical

ontologies,

such

SNOMED

CT,

enhances

the

interoperability

and

integration

with

the

health

care

systems.

Moreover,

utilizing

vague

imprecise

knowledge

further

improves

the

CBR

semantic

effectiveness.

This

paper

proposes

fuzzy

ontology-based

CBR

framework.

proposes

fuzzy

case-base

OWL2

ontology,

and

fuzzy

semantic

retrieval

algorithm

that

handles

many

feature

types.

Material:

This

framework

implemented

and

tested

the

diabetes

diagnosis

problem.

The

fuzzy

ontol-

ogy

populated

with

real

diabetic

cases.

The

effectiveness

the

proposed

approach

illustrated

with

set

experiments

and

case

studies.

Results:

The

resulting

system

can

answer

complex

medical

queries

semantic

understanding

medical

concepts

and

handling

vague

terms.

The

resulting

fuzzy

case-base

ontology

has

concepts,

(fuzzy)

object

properties,

138

(fuzzy)

datatype

properties,

105

fuzzy

datatypes,

and

2640

instances.

The

system

achieves

accuracy

97.67%.

compare

our

framework

with

existing

CBR

systems

and

set

ﬁve

machine-learning

classiﬁers;

our

system

outperforms

all

these

systems.

Conclusion:

Building

integrated

CBR

system

can

improve

its

performance.

Representing

CBR

knowledge

using

the

fuzzy

ontology

and

building

case

retrieval

algorithm

that

treats

different

features

differently

improves

the

accuracy

the

resulting

systems.

2015

Elsevier

B.V.

All

rights

reserved.

Introduction

Diabetes

complex,

chronic

illness

requiring

continuous

medical

care

with

multifactorial

risk-reduction

strategies

beyond

glycemic

control.

According

World

Health

Organization

(WHO),

diabetes

will

the

seventh

leading

cause

death

2030

[1].

Globally,

about

336

million

people

are

living

with

type

diabetes

mellitus,

and

this

ﬁgure

set

rise

over

552

million

2030

[2].

2014,

adults

years

and

older

had

diabetes

[1].

There

are

three

main

types

diabetes.

The

ﬁrst

type

diabetes

mel-

litus

insulin

dependent

diabetes

mellitus;

this

type

occurs

when

the

pancreas

cannot

produce

sufﬁcient

insulin.

The

second

type

∗Corresponding

author.

Tel.:

+0020

1098889791;

fax:

+0020

502223754.

E-mail

address:

melmogy@mans.edu.eg

(M.

Elmogy).

type

diabetes

mellitus

insulin-independent

diabetes

mellitus;

this

type

occurs

when

the

body

cannot

effectively

use

the

produced

insulin.

The

third

type

gestational

diabetes,

which

occurs

preg-

nant

women.

patient

diabetes

symptoms

but

not

really

diabetic

called

pre-diabetes

patient.

The

early

diagnosis

diabetes

critical

its

care

process

because

the

early

care

can

prevent

long-term

microvascular

com-

plications

such

retinopathy,

nephropathy

and

neuropathy,

and

cardiovascular

disease.

Moreover,

the

early

diagnosis

can

prevent

the

pre-diabetes

patient

become

diabetic.

present,

the

results

for

early

detection

diabetes

are

not

highly

accurate.

There-

fore,

there

need

develop

diagnosis

system

for

diabetes

that

has

better

accuracy.

Clinical

decision

support

systems

(CDSS)

can

help

this

regard.

Existing

rule-based

diagnose

diabetes

systems

are

mainly

based

the

A1C

criteria

plasma

glucose

criteria,

either

the

fasting

plasma

glucose

(FPG)

the

2-h

plasma

glucose

http://dx.doi.org/10.1016/j.artmed.2015.08.003

0933-3657/©

2015

Elsevier

B.V.

All

rights

reserved.

180

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

(2-h

PG)

value

after

75-g

oral

glucose

tolerance

test

(OGTT).

For

example,

they

take

decisions

using

rules

such

(A1C

≥

6.5%

FPG

≥

126

mg/dL

2-h

≥

200

mg/dL)

then

the

patient

dia-

betic

[3].

However,

diabetes

diagnosis

complicated

than

these

direct

decisions.

Diabetes

other

diseases

includ-

ing

renal

diseases,

heart

diseases,

foot

diseases,

etc.

Moreover,

has

symptoms

hyperglycemia

hypoglycemia.

The

true

false

decisions

about

these

symptoms,

e.g.

thirst

true,

not

enough.

Diabetes

diagnosis

theory-less

and

unstructured

problem,

and

depends

the

physician’s

experience.

For

experience-based

problem

solving,

case

based

reasoning

(CBR)

one

the

most

suit-

able

techniques

for

decision

support

[4].

CBR

imitates

human

reasoning,

and

suitable

when

cannot

formulate

problem

set

generalized

rules.

appropriate

medical

con-

text

where

symptoms

represent

the

problem,

and

diagnosis

and

treatment

represent

the

solution.

The

CBR

paradigm

has

been

suc-

cessfully

used

various

medical

ﬁelds

from

lung

disease

and

eating

disorders

diabetes

and

Alzheimer’s

disease

[5].

Many

pieces

research

utilized

CBR

for

diabetes

diagnosis

[6–9].

Although

any

CBR

system

relies

set

speciﬁc

experiences,

its

rea-

soning

power

can

improved

general

knowledge

about

the

domain

[10].

Ontologies

can

enhance

the

capabilities

CBR

cre-

ating

knowledge

intensive-CBR

(KI-CBR)

systems

[11].

can

play

many

roles

CBR

such

background

domain

ontology,

case-base

ontology,

semantic

similarity

measurement,

and

others

[12].

Ontol-

ogy

can

enhance

CBR

systems

many

dimensions,

shown

Fig.

this

ﬁgure,

suggest

three

types

KI-CBRs

paradigms.

part

(a)

Fig.

the

case-base

stored

traditional

database,

and

the

domain

knowledge

stored

ontology.

part

(b),

the

case-base

stored

crisp

ontology,

and

the

domain

knowledge

stored

ontology.

part

(c),

the

case-base

stored

fuzzy

ontology,

and

the

domain

knowledge

stored

ontol-

ogy.

have

selected

the

most

complicated

and

recent

approach

(part

c).

For

diabetes

diagnosis,

researchers

made

efforts

toward

diabetes

ontology

development

[13].

Nevertheless,

the

literature

ontology-based

CBR

for

diabetes

not

rich

with

studies

[7,8].

The

most

critical

steps

CBR

paradigm

are

the

case

repre-

sentation

and

case

retrieval.

concentrate

these

two

main

steps

improve

the

performance

medical

CBR.

The

case

base

building

process

reduces

the

efforts

and

time

build

the

system’s

knowledge

base

compared

rule-based

systems.

generalized

knowledge

required

build

successful

CBR

system.

However,

the

collection

cases

for

patients

requires

the

integration

between

the

CDSS

system

and

the

distributed

electronic

health

record

(EHR)

environment.

result,

the

standardization

CBR

knowledge

and

data

critical

achieving

interoperability.

Interoperability

between

EHR

systems

and

CDSS

facilitates

the

automatic

collection

knowledge

from

patients’

EHRs,

supports

the

integration

CDSS

the

healthcare

environment,

and

eases

the

physician’s

querying

process.

EHR

uses

standards

Health

Level

reference

information

model

(HL7

RIM)

[14]

and

systematized

nomenclature

medicine-clinical

terms

(SNOMED-CT)

[15],

SCT

for

short,

ontology

for

data

storage

and

exchange,

which

can

utilized

CBR.

RIM

can

used

standard

case-base

structure,

and

SCT

can

used

background

knowledge

enhance

semantic

retrieval

[16,17].

El-Sappagh

al.

[9]

proposed

standard

data

model

for

diabetes

case-base.

SCT

huge

ontology,

which

affects

the

performance

the

CBR

retrieval

algorithm.

Creating

reference

set

from

SCT

for

diabetes

required.

El-Sappagh

al.

[18]

proposed

diabetes

diagnosis

OWL2

standard

ontology

from

SCT

reference

set.

far

know,

there

are

studies

utilize

SCT

reference

sets

CBR

systems

for

diabetes

diagnosis,

which

considered

required

issue

for

semantic

retrieval

and

integration

CDSS

EHR

environment.

Using

the

created

SCT-based

OWL2

for

semantic

retrieval

requires

the

encoding

the

case-base

unstructured

knowledge

with

the

same

code.

The

encoding

process

not

straightforward

process,

and

requires

methodology.

El-Sappagh

al.

[19]

proposed

encoding

methodology

and

utilized

encode

the

case-base

contents.

Physicians

often

describe

patients

using

imperfect

and

linguis-

tic

data,

and

their

knowledge

and

natural

language

have

great

deal

imprecision

and

vagueness.

Zadeh

[20]

argued

much

the

knowledge

that

humans

acquire

through

experience

perception-based

and

thus

subject

imprecision

and

inaccuracy.

Such

knowledge,

when

not

treated

some

suitable

way

that

can

consider

and

convey

its

inherent

imprecision,

usually

leads

the

poor

effectiveness

the

knowledge-based

systems

that

use

it.

result,

KI-CBR

paradigm

must

handle

the

imprecise

knowledge

representation

and

reasoning

[21].

The

existing

fuzzy

CBR

systems

utilize

imprecise

knowledge

through

the

use

fuzzy

logic

for

case

representation

and

relevant

fuzzy

pattern

matching

techniques

for

similarity

assessment

[22].

survey

existing

systems

fuzzy

CBR

diabetes

diagnosis

indicates

that

there

are

few

works

this

ﬁeld.

However,

the

lack

representation

this

knowledge

onto-

logical

restricts

the

effectiveness

these

systems

because

they

did

not

take

advantage

the

reasoning

capabilities

that

ontolo-

gies

provide.

The

fuzzy

ontology

focuses

assigning

meaning

the

fuzziness

the

ontology’s

components.

important

characteristic

makes

the

fuzzy

ontology’s

imprecision

explicit,

thus

facilitating

efﬁcient

knowledge

acquisition

and

ontology

reuse.

Moreover,

enables

the

deﬁnition

effective

seman-

tic

similarity

measures,

which

facilitate

case

retrieval.

For

diabetes,

the

existing

fuzzy

CBR

systems

have

not

used

fuzzy

ontology

even

crisp

ontology

background

domain

knowledge

case-

base

ontologies

[8].

the

other

hand,

ontologies

and

fuzzy

logic

have

been

utilized

diabetes

other

reasoning

methods

such

rule-based

expert

systems

[23].

this

paper,

present

fuzzy

KI-CBR

framework

that

handles

and

exploits

imprecise

knowledge

through

the

effective

integration

fuzzy

logic

the

ontology-based

CBR

paradigm.

Fuzzy

case-base

ontology

and

fuzzy

semantic

retrieval

algorithm

are

proposed

and

integrated

build

intelligent

CBR

for

diabetes

diagnosis.

This

approach

introduces

fuzzy

semantics

CBR

two

places.

The

ﬁrst

the

representation

imprecise

knowledge

itself,

and

the

second

case

retrieval.

particular,

our

proposed

framework

Fig.

KI–CBR

frameworks.

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

181

built

using

fuzzy

ontology

that

supports

the

representation

imprecise

case-speciﬁc

knowledge

while

the

retrieval

cases

enabled

proposing

highly

customizable

fuzzy

semantic

simi-

larity

framework.

most

the

CBR

studies

did

not

implement

the

entire

cycle

[12,24],

concentrate

the

most

critical

and

most

steps

(i.e.,

case

representation

and

retrieval).

Case

adapta-

tion,

reuse,

retention,

and

case-base

maintenance

will

handled

other

works.

Importantly,

our

system

implemented

six

modules:

Case

source

preparation,

case

base

ontology

engineering,

terminology

server,

fuzzy

case-base

ontology

population,

case

retrieval

engine,

and

case

query

parser.

implement

and

test

the

proposed

frame-

work

real

case-base.

The

system

has

user-friendly

interface;

supports

the

selection

standard

medical

concepts

from

SCT

dialog,

and

implements

the

clinical

distance

the

case

retrieval

process.

result,

the

system

achieves

high-level

performance

com-

pared

the

traditional

CBR

systems,

other

CBR

systems

the

literature,

and

machine

learning

algorithms.

The

system’s

accuracy

97.67%.

Therefore,

highly

accurate

and

can

applied

real

medical

environment.

this

end,

the

remainder

the

paper

organized

follows:

Section

provides

studies

KI-CBR,

especially

for

diabetes,

and

show

its

limitations.

Section

set

preliminaries

including

our

dataset

description.

Section

illustrates

the

research

method-

ology

used

the

study.

Section

the

proposed

CBR

framework.

Implementation

and

evaluation

are

discussed

Section

Finally,

Section

concludes

the

paper

and

highlights

future

work

direc-

tions.

work

The

physician

can

depend

clinical

practice

guidelines

(CPG)

diagnose

diabetes.

However,

CPGs

are

long

plaintext

documents.

Some

languages

such

Arden

syntax

can

used

for

represent-

ing

and

sharing

this

medical

knowledge.

can

convert

CPGs

into

actionable

rules

implement

rule-based

CDSS

systems.

Samwald

al.

[25]

proposed

development

environment

including

com-

piler

and

rule

engine

for

Arden

Syntax

rules.

However,

diabetes

diagnosis

ill-formed,

theory-less,

and

experience

based

prob-

lems.

Depending

rules,

not

suitable

because

there

will

many

exceptional

cases.

Rules

results

often

require

adaptations

physician.

Rules

cannot

customized

for

speciﬁc

patient

condi-

tions.

time-consuming

build

and

maintain

large

rule-base.

CBR

one

the

most

suitable

technique

for

the

experience

based

problems

because

easier

for

expert

physician

formulate

speciﬁc

cases

that

formulate

generalized

rules.

Tra-

ditional

CBR

has

been

used

for

diabetes

diagnosis

many

studies

[4–7].

evolution

this

paradigm

the

ontology-based-CBR

[21].

This

approach

generally

effective

retrieving

sim-

ilar

cases

than

traditional

ones

[10].

Ontology

plays

many

roles

enhance

CBR

semantics

ranging

from

case

storage

and

representa-

tion

case

adaptation

and

reuse

[11].

Moreover,

case

semantic

retrieval

algorithms

can

improved

using

case-base

and

domain

background

knowledge

the

form

ontologies

[26,27].

2.1.

Regarding

the

role

ontology

diabetes

management

the

diabetes

domain,

ontology

has

been

used

many

CDSSs

[13,23,28–30].

For

example,

Chen

al.

[13]

introduced

ontology

for

diabetes

drugs

and

ontology

for

patients’

symptoms.

These

ontologies

utilized

semantic

web

rule

language

(SWRL)

and

Java

expert

system

shell

(JESS)

determine

potential

prescriptions

for

the

patients.

Rahimi

al.

[28]

developed

type

diabetes

mel-

litus

(T2DM)

ontology

(DMO)

diagnose

and

manage

patients

with

diabetes,

and

they

proposed

algorithm

query

the

ePBRN

data

repository

diagnose

T2DM.

Sherimon

al.

[29]

proposed

dynamic

adaptive

questionnaire

ontology

for

gathering

the

dia-

betic

patient’s

medical

history.

Hayuhardhika

al.

[30]

developed

ontology

for

diabetes

disease

and

used

weighted

tree

similar-

ity

algorithm

for

diagnosis.

However,

regarding

diabetes

diagnosis,

none

these

ontologies

designed

for

CBR,

and

few

studies

have

used

ontology

CBR

[6,8].

diabetes

diagnosis

systems,

ontolo-

gies

have

not

been

utilized

neither

case-base

nor

background

knowledge

nor

case

retrieval.

Jaya

and

Uma

[7]

have

listed

the

roles

ontology

diabetes

diagnosis

CBR.

El-Sappagh

al.

[31]

proposed

case-base

ontology

engineering

methodology,

and

they

proposed

diabetes

case-base

ontology.

However,

there

are

decision

support

capabilities

provided

the

study.

The

result-

ing

OWL2

ontology

can

utilized

the

current

study

store

and

retrieve

cases

semantically.

addition,

this

ontology

crisp

and

cannot

handle

the

existed

vagueness

diabetes

diagnosis

environ-

ment

[20].

2.2.

Regarding

the

encoding

medical

data

Some

medical

knowledge

stored

the

unstructured

form.

This

knowledge

not

suitable

for

CBR.

enhance

the

semantic

intelligence

CBR

system,

the

case-base

textual

contents

have

encoded

formal

way.

Samwald

al.

[32]

asserted

that

the

building

CDSS

system

requires

the

encoding

clinical

data

using

ontologies.

They

developed

CDSS

for

pharmacogenomic

knowl-

edge

representation

and

reasoning

based

OWL2

ontology

[33].

However,

using

standard

medical

ontologies,

such

SCT,

sup-

ports

the

implementation

semantically

intelligent

case

retrieval

algorithms

[34],

enhances

the

interoperability

and

seamlessly

inte-

gration

between

CDSS

and

EHR

environment

[16],

and

supports

the

creation

standard

encoded

case-base

knowledge

[35].

result,

the

unstructured

medical

data

EHR

are

standardized

into

uniﬁed

form,

which

facilitate

the

automatic

collection

cases

knowledge

the

distributed

EHR

environments.

Moreover,

the

CBR

system

becomes

intelligent

interpreting

the

meaning

medical

concepts.

addition,

case

retrieval

algorithm

can

calcu-

late

the

clinical

distance

between

patients

rather

than

geometric

semantic

distances.

the

best

our

knowledge,

standard

medi-

cal

ontologies

such

SCT

have

not

been

used

diabetes

diagnosis

CBR

systems.

El-Sappagh

al.

[18]

proposed

OWL2

ontology

for

SCT

used

background

domain

knowledge

with

CBR.

addition,

this

ontology

can

used

encode

the

diabetes

case-base

unstructured

knowledge

into

formal

and

standard

form

[19].

2.3.

Regarding

the

fuzziﬁcation

medical

data

Diagnosis

diabetes

depends

the

physician’s

experience

and

the

patient’s

description

his

case.

Most

medical

data

are

described

using

vague

terms

(i.e.,

partially

known)

[36].

Vague-

ness

can

handled

using

fuzzy

logic

(FL)

[20].

useful

for

CBR

because

CBR

fundamentally

analogical

reasoning,

which

can

operate

with

linguistic

expressions.

facilitates

the

knowledge

elicitation

from

domain

expert,

eases

the

transfer

knowl-

edge

between

domains,

and

enhances

the

similarity

measurement.

Fuzzy

logic

has

been

integrated

with

CBR

hybrid

systems

[37,38]

and

used

for

calculating

the

fuzzy

similarity

between

cases

[22].

However,

there

are

real

studies

the

literature

for

fuzzy-CBR

systems

for

diabetes

diagnosis.

Thirugnanam

al.

[39]

built

hybrid

CDSS

system

for

diabetes

diagnosis

using

neural

network,

fuzzy,

and

CBR.

This

study

used

the

fuzzy

and

CBR

reasoning

mech-

anisms

separately,

and

fuzziness

has

been

added

enhance

the

CBR

functionality.

Most

CBR

systems

the

literature

utilized

case

retrieval

step

only.

Building

fuzzy

case-base

knowledge

required

support

fuzziness

CBR

systems.

However,

these

182

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

hybrid

systems

have

not

beneﬁted

from

fuzzy

ontology

reasoning

capabilities

CBR

system.

2.4.

Regarding

the

role

fuzzy

ontology

CDSS

crisp

ontology

has

proved

its

roles

CBR,

fuzzy

ontologies

can

extend

the

capabilities

crisp

ontologies

[40].

Crisp

ontolo-

gies

are

not

suitable

address

imprecise

and

vague

knowledge,

which

inherent

real

world

domains

[41].

Fuzzy

ontology

can

come

from

two

sources:

mapping

fuzzy

database

[42]

extension

crisp

ontology

[40].

Fuzzy

ontology

has

been

used

medical

and

non-medical

systems

[41,43–47].

Ali

al.

[47]

pro-

posed

T2FOBOMIE;

this

system

opinion

mining

system

based

type-2

fuzzy

rough

ontology.

Rodríguez

al.

[41]

proposed

fuzzy

ontology-based

system

for

modeling

human

behavior.

Tor-

shizi

al.

[43]

utilized

fuzzy

ontology

build

intelligent

rule-based

system

determine

the

severity

Benign

Prostatic

Hyperplasia

and

recommend

the

appropriate

therapies.

Carlsson

al.

[44]

discussed

the

capabilities

fuzzy

ontology

over

crisp

one

and

utilized

knowledge

mobilization

application.

Mezei

al.

[45]

asserted

that

fuzzy

ontology

critical

building

actionable

knowledge

aid

complex

decisions,

and

they

proposed

fuzzy

wine

ontology.

Molinera

al.

[46]

proposed

decision

support

system

for

recommending

smartphones

using

fuzzy

ontologies.

Lee

and

Wang

[23]

used

fuzzy

ontology

build

diabetes

diag-

nosis

CDSS

system.

This

system

based

rule-based

reasoning

paradigm.

used

the

freely

available

Pima

Indians

dataset,

which

not

diabetes

representative

data.

achieved

the

accuracy

91.2%.

2.5.

Regarding

the

role

fuzzy

ontology

CBR

For

CBR

systems,

many

studies

have

utilized

fuzzy

ontology

for

case

base

representation

and

fuzzy

retrieval

processes

[21,48].

Alexopoulos

al.

[21]

proposed

fuzzy

ontology-based

CBR

system

using

fuzzy

algebra.

Ali

al.

[48]

proposed

type-2

fuzzy

ontology-based

CBR

system

for

collision

avoidance

autonomous

underwater

vehicles.

Fuzzy

ontology

can

enhance

CBR

different

ways

such

physician

can

easily

deﬁne

experience

cases

using

natural-like

language,

cases

can

indexed

efﬁciently,

and

fuzzy-semantic

retrieval

algorithms

can

implemented.

Diabetes

has

utilized

fuzzy

ontologies

many

ﬁelds

[49];

however,

there

fuzzy

ontology-based

CBR

for

diabetes

management.

CBR

effectiveness

further

improved

ontology-

based

CBR

systems

can

utilize

vague

imprecise

knowledge

[21].

argue

that

there

difference

between

ontology-based

fuzzy

CBR

[50]

and

fuzzy-ontology

based

CBR

[21].

The

former

builds

fuzzy

CBR

system

and

uses

crisp

ontology

enhance

its

functionality,

but

the

later

builds

fuzzy

ontology

for

its

case-base.

Alexopoulos

al.

[21]

have

concentrated

only

fuzzy

properties

using

fuzzy

algebra.

Fuzzy

Ontologies

can

extend

query

cases.

Fuzzy-ontology-based

KI-CBR

yet

unstudied

topic,

especially

the

medical

domains

such

diabetes

diagnosis.

Moreover,

there

are

studies

diabetes

diagnosis,

which

incorporate

subsets

standard

ontologies

such

SCT,

uniﬁed

medical

language

system

(UMLS),

gene

ontology

(GO),

international

classiﬁcation

diseases

(ICD),

disease

ontology,

logical

observation

identiﬁers

names

and

codes

(LOINC)

the

background

domain

knowledge

[51].

addition,

our

study,

are

the

ﬁrst

separate

case-base

ontology

from

background

knowledge

ontology.

This

separation

has

great

role

the

medical

domain

because

the

case

base

and

domain

ontologies

are

usually

huge;

moreover,

many

standard

ontologies

can

simultaneously

utilized

the

same

CBR

system.

shown

Fig.

the

purpose

this

paper

propose,

imple-

ment,

and

test

fuzzy

KI-CBR

framework

using

characteristics

ontology,

fuzzy

logic,

and

standard

medical

terminology

(i.e.,

SCT).

accomplish

this

purpose,

the

major

contributions

performing

this

research

can

summarized

follows:

•We

propose

integrated

fuzzy

knowledge-intensive

CBR

frame-

work.

This

system

(shown

Fig.

distinctive

its

novel

architecture

and

can

applied

the

development

variety

CDSS

systems.

•We

introduce

efﬁcient

way

develop

the

case-base

fuzzy

ontology,

which

the

backbone

the

proposed

system.

This

ontology

built

based

our

previously

published

crisp

ontol-

ogy

[31]

and

the

top-level

CBR

crisp

ontology

namely

CBROnto

proposed

[52].

The

step-by-step

tutorial

the

fuzzy

ontol-

ogy

development

process

can

helpful

for

interested

readers

conduct

experiments.

The

proposed

fuzzy

ontology

the

ﬁrst

the

medical

domain.

•We

propose

fuzzy

semantic

retrieval

algorithm

for

retrieving

cases

from

the

fuzzy

ontology

according

the

physician

new

coming

problems.

This

hybrid

algorithm

accurate

and

takes

into

account

the

types

patient’s

features

including

numerical,

fuzzy,

ordinal,

lexical,

and

semantic

types.

Moreover,

the

fuzzy

types

are

represented

fuzzy

ontology,

and

the

semantic

types

are

based

standard

diabetes

diagnosis

SCT

ontology.

•To

perform

the

case

study,

develop

JAVA-based

prototype

based

the

most

popular

CBR

APIs

(i.e.

JCOLIBRI).

The

internal

intelligent

processes

the

prototype

control

the

query

processes.

The

physician

enters

the

patient

description

data

new

case

QV.

The

system

converts

the

query

case

crisp

vector

into

fuzzy

semantic

vector

QFSV.

The

QFSV is

passed

the

retrieval

engine,

which

retrieves

the

most

similar

cases

the

QFSV case

based

the

clinical

distances

between

patients.

The

experimental

results

that

are

generated

utilizing

this

prototype

advocate

the

efﬁ-

ciency

the

proposed

architecture.

The

proposed

framework

utilizes

our

previously

proposed

ontologies

such

the

crisp

case-base

ontology

[31]

and

the

Fuzz

y CBR Ontology

Fuzzy-CBR KI-CBR

Fuzzy-ontology based CBR

Domain

ontolog

Case-base

ontolog

Fuzzy ca

se-base

ontolog

Utilizes

Subclass of

Fig.

Our

research

focus.

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

183

Case C1Case C

Case Ci

…

fi1

. . .

fin

Problem Pi

Solution Si

Case bas

e CB

Query cas

e C

fq1

. . .

fqn

Problem Pq

Solution Sq

Retrieve

Similar

ity

Fig.

The

correspondence

between

stored

cases

and

query

case.

diabetes

standard

ontology

from

SCT

taxonomy

[18].

addition,

prepare

our

case-base

contents,

utilize

the

case-base

stan-

dard

data

model

[9],

use

the

pre-processing

step

handle

noisy

data,

select

relevant

features,

and

calculate

the

weight

vector

[35].

utilized

our

encoding

methodology

encode

the

case-base

unstructured

knowledge

into

standard

form

[19].

Moreover,

utilize

our

fuzziﬁcation

methodology

fuzzify

the

case-base

vague

concepts.

Preliminaries

make

the

article

self-contained,

this

section

deﬁne

some

concepts,

deﬁnitions,

and

terminologies

before

discussing

the

proposed

framework.

3.1.

Case

base

reasoning

Generally,

CBR

technique

for

solving

problem

remembering

similar

past

experiences.

For

example,

physicians

look

for

groups

known

symptoms

and

engineers

take

many

their

ideas

from

previously

successful

solutions.

The

main

concept

CBR

“similar

problems

have

similar

solutions.”

CBR

knowledge

formed

case-base

experiences

(either

success

failure).

does

not

depend

the

explicit

model

the

problem

rule

base

reasoning

for

the

inference

process,

but

simply

uti-

lizes

the

experience

captured,

the

same

way,

the

expert

usually

inputs

and

processes

it.

The

newly

solved

problems

can

added

new

experience

the

CBR

system’s

experience-base

(case-base),

which

supports

the

auto-learning

process.

The

CBR

can

deﬁned

cyclic

process

named

“the

four

Rs”

[54]:

(i)

Retrieve

the

most

similar

cases,

(ii)

Reuse

the

cases

that

might

solve

the

problem,

(iii)

Revise

the

proposed

solution

necessary,

and

(iv)

Retain

the

new

solution

part

new

case.

The

most

important

aspects

CBR

system

are

the

case-base

knowledge

representation

and

the

case

retrieval

algorithm,

and

these

are

our

contributions

the

current

paper.

Deﬁnition

case-base

ﬁnite

set

cases{C1,C2,.

.Cm},

where

the

number

cases

the

CB.

Deﬁnition

case

contextualized

piece

knowl-

edge

representing

experience.

The

ith

experience

case

Ci∈

formally

deﬁned

Ci=

Pi,

Si,

where

Piand

Sirespectively

repre-

sent

the

case

problem

description

and

the

case

solution

features.

Deﬁnition

case

retrieval

algorithm

that

takes

input

(query

case

Cq,

case

base

CB,

and

features

weighting

vec-

tor 

W);

calculates

the

level

similarity

between

Cqand

every

case

CB;

and

ﬁnally

returns

the

solution

the

most

simi-

lar

cases.

The

k-nearest

neighbour

(k-NN)

the

most

applicable

retrieval

algorithm.

formally,

let

Cq=

Pq,

X

query

case,

where

Pqis

the

query

case’s

problem

and

denote

its

solution.

should

mentioned

that

the

main

objective

the

CBR

system

determine

the

value

which

unknown

before

the

execution

the

case

retrieval

process.

general,

multiple

features

describe

the

problem

situations

both

the

case-base

historical

cases

and

the

target

case.

Let

{1,

.n},

the

total

number

attributes.

Let

{f1,

f2.

.fn}

ﬁnite

set

features

concerning

the

prob-

lem

situations

both

the

historical

cases

and

the

target

case,

where

fjdenotes

the

jth

attribute,

∈

Let 

(w1,

w2.

.wn)T

weights

vector

case

features

which

determine

the

features

importance,

where

wjdenotes

the

weight

the

importance

degree

attribute

fj,

such

that n

j=1wj=

and

≤

wj≤

∈

Let

→

C1=

(fi1,

fi2.

.fin)Tbe

vector

feature

values

for

the

problem

situation

historical

case

Ci,

where

ﬁjdenotes

the

consequence

historical

problem

situation

Ciconcerning

attribute

fj,

∈

Let

Cq=

(fq1,

fq2.

.fqn)Tbe

vector

feature

values

for

the

prob-

lem

situation

target

case

Cq,

where

fqjdenotes

the

consequence

current

problem

situation

Cqconcerning

attribute

fj,

∈

shown

Fig.

the

correspondence

between

query

case’

and

the

historical

cases’

features

can

easily

deﬁned.

The

case

retrieval

algorithm

depends

the

level

similar-

ity

between

two

cases

SIM Ci,

Cq,

i.e.

the

global

similarity,

where

SIM Ci,

Cq∈[0,

1].

The

similarity

function

SIM Ci,

Cqis

the

col-

lection

feature-level

similarities

sim fij,

fqj,

the

local

similarity,

where

sim fij,

fqj∈[0,

1].

Many

studies

existing

CBR

assume

that

all

features

are

the

same

datatype

(e.g.

numerical)

and

pro-

vide

single

local

similarity

function

sim fij,

fqjto

measure

the

similarity

between

fij and

fqj.

This

not

the

normal

case

[55].

our

study,

propose

one

the

most

complete

similarity

measure,

which

takes

into

account

the

numerical,

nominal,

ordinal,

fuzzy,

and

semantic

feature

types,

shown

Fig.

The

global

similarity

between

the

two

cases

SIM Ci,

Cqcan

deﬁned

distance

method.

The

most

widely

used

measures

are

the

Euclidean

distance

Hamming

distance,

shown

the

following

equation:

sim (Ci,

Cq)=⎧

⎪

⎨

⎪

⎩

−

j×

dist2fij ,

fqjif

the

Euclidean

distance

used,

−

wj×

dist fij,

fqjif

the

Hamming

distance

used.

(1)

where

sim fij,

fqjfunction

deﬁned

terms

the

function

dist fij,

fqj.

3.2.

Case

representation

The

contents

case-base

must

deﬁned

the

ﬁrst

beginning

CBR

system.

These

contents

determine

all

the

sub-

sequent

steps

such

case-base

ontology,

case

base

fuzzy

ontology,

and

case

retrieval.

After

checking

with

the

domain

experts,

CPGs,

and

handbooks

case

histories

diabetes

diagnosis

domain,

our

case

will

contain

the

features

described

Table

The

data

have

been

obtained

and

managed

the

hospitals

Mansoura

Uni-

versity,

Mansoura,

Egypt.

All

the

features

that

affect

the

diabetes

diagnosis

have

been

collected

our

domain

experts.

Some

data

are

collected

from

diagnostic

biochemical

lab

(AutoLab,

Mansoura,

Egypt).

The

used

data

set

was

collected

from

January

2010

through

184

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

Data-pr

operty range-types

Numeric Nominal SCT instance Ordinal

Fuzzy

A=?

B=?

Integer or

decimal

A= “XXX”

B= “YYY”

a>b>c>d

A= a

B= c A=?

B=?

Fuzzy

number

Fig.

The

case

feature

types.

1 Patient

case

Kidn

fun

ction

test

Ser

um urea

Serum uric

acid

Serum creatinin

Serum sod

ium

Serum potassium

CaseID

Liver

disease

Nephropathy

Disease

Diabetes

CaseID

Cancer

type

Kidne

sease

erc

hol

estremia

Diagnos

CaseID

Total bil

irubin

Direct bili

rub

SGOT

AST

CaseID

SGPT ALT

Alk

phosphatase

Total

rotein

Alb

umin

Liver fun

ction

test

Triglycer

ides

HDL cholester

CaseID

Lipi

d profile

LDL cholesterol

Total cholesterol

Global symptoms

CA12

Thi

rst

Vision CaseID

Hun

ger

Urinati

on frequency

Fati

Birth

AFP

Serum

FERRITIN

Ameno

rrhea

Dysmenorrhea

Urin

ation

sympto

CaseID

Protein

Bloo

Bili

rubin

Glucose Keton

Urolibingen

PusRB

Crystals

Hematological pro file

Lymphocyte

CaseID

Redcel

count

Haematocrit

MCV

MCH

MCHC

Platelet cou nt

White cell

count

Mono

Eosi

nophi

Basophils

CaseID Diabetes lab test

HbA1C FPG

2hPG

BMI

CaseID

Age

Residence

Occ

upation

Fig.

Diabetes

diagnosis

and

other

complaints

case

base

data

model.

August

2013.

There

are

eligible

patients,

who

enrolled

this

study.

However,

seven

control

subjects

were

excluded

due

lim-

ited

blood

samples

for

testing

AFP.

Our

data

set

contains

features

for

describing

diabetic

patients

and

for

linking

diabetes

with

other

disorders

such

cancer,

kidney

diseases,

and

liver

diseases.

The

data

set

distributed

33.3%

pre-diabetic

patients,

53%

diabetic

patients,

and

13.7%

normal

patients.

Table

shows

descriptions

considered

features

this

study.

3.3.

The

structure

diabetes

diagnosis

case

Fig.

shows

Entity

Relationship

(ER)

model

for

all

entities

and

attributes

used

our

data

set.

This

data

model

compatible

with

HL7

RIM

[56].

This

compatibility

facilitates

the

integration

with

EHR

and

supports

the

auto

collection

cases.

Moreover,

this

data

model

has

been

fuzziﬁed

with

our

proposed

fuzziﬁcation

method-

ology

into

fuzzy

model,

then

converted

fuzzy

case-base

database,

which

was

the

source

instances

for

our

proposed

fuzzy

case-base

ontology.

These

entities

and

attributes

were

enriched

entities

and

attributes

diabetes

diagnosis

CPGs

the

National

Guidelines

Clearing

House1.

Entities

and

features

dia-

betes

treatment,

medications,

and

drugs

are

out

scope.

Deﬁnition

Diabetes

diagnosis

cases

are

deﬁned

according

our

data

model.

case

P,

S

deﬁned

follows:

{LFT,

1http://www.guideline.gov/.

LP,

GS,

KFT,

LT,

US,

HP,

DI}

where

LFT

liver

func-

tion

tests,

lipid

proﬁle,

global

symptoms,

age,

BMI,

residence,

gender,

occupation,

KFT

kidney

function

tests,

lab

tests,

urination

symptoms,

haematological

proﬁle,

and

where

probable

liver

problem,

probable

nephropathy

problem,

probable

cancer

type,

and

probable

hypercholesterolemia

problem.

S(P)

the

solution

part

describes

the

diagnosis

diabetes

including

diabetic,

predia-

betic,

gestational–diabetic,

and

prediabetic–gestational.

=DD

where

diabetes

diagnosis.

Our

diagnostic

features

can

numerical

features

(e.g.,

age,

lab

tests,

BMI

and

on),

ordinal

fea-

tures

(e.g.,

features

Global

symptoms

table

Fig.

5),

and

text

features

(e.g.,

sex,

occupation,

etc.).

All

these

features

have

not

been

encoded

SCT

concepts

because

their

coding

will

not

enhance

the

semantic

retrieval

algorithm

CBR.

the

other

hand,

patient

disorders

are

instance

features,

and

have

mapped

standard

SCT

concepts

another

work

[18].

concentrated

the

CBR

semantic

retrieval

aspect,

not

sharing

and

interoperability

issues.

For

example,

feature

HbA1c

6.4

encoded

SCT

|43396009:

Hemoglobin

A1c

measurement|

6.4,

this

code

enhances

semantic

interoperability

but

does

not

enhance

semantic

retrieval

process

CBR.

the

other

hand,

the

patient

has

disorder

such

nephropathy,

this

concept

has

long

sub-tree

disorders

(e.g.,

caliectasis,

amyloid

nephropathy,

calyceal

ﬁstula,

and

on),

which

can

described

different

physicians.

The

semantic

similarity

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

185

Table

The

patient

attributes

used

describe

cases.

Data

type

primitive,

instance

SCT

concept,

numerical,

categorical,

fuzzy,

ordinal}.

Feature

type Feature

name Data

type Normal

range

UoM

Min-mean-max

No.

Demographics Residence

{Urban,

Rural}

–

Occupation

{Farmer,

Police}

–

Gender

{Male,

Female}

–

Age

–

Year

29–48.117–74

BMI

18.5–25

kg/m220–33.117–45

Diabetes

lab

tests HbA1C

≤5

mmol/L

5–6.373–7.4

PG P,

F <139

mg/dL

165–202.733–235 7

FPG

<99

mg/dL

96–129.633–156

Haematological

proﬁle Prothrombin

INR

0–1

1–1.16–1.4

Red

cell

count

4.2–5.4

106/cmm

3.8–5.194–5.88

Hbg

12–16

g/dL

9.8–12.332–13.4

Haematocrit

(PCV)

37–47

vol%

31.1–35.215–36.8

MCV

F 80–90 ﬂ

26.8–71.908–76.4 13

MCH

F 27–32 pg

3.3–25.47–29.4

MCHC

30–37

1.8–35.465–41.7

Platelet

count

150–400

103/cmm

135–316.183–2000

White

cell

count P,

F 4–11 103/cmm 6–8.055–9.2

Basophils

0–1

0–1.013–5

Lymphocytes

20–45

21.2–25.768–29

Monocytes

2–10

1.7–2.942–4

Eosinophils

1–4

1–1.897–3.4

Symptoms Urination

frequency

–

Vision

–

Thirst

–

Hunger

–

Fatigue

–

Kidney

Function

Lab

tests

Serum

potassium

3.5–5.3

mEq/L

2.4–3.767–4.3

Serum

urea

5–50

mg/dL

17–31.56–67

Serum

Uric

acid P,

F 3.0–7.0 mg/dL

3–4.237–7.9

Serum

creatinine

0.7–1.4

mg/dL

0.9–1.35–3.6

Serum

sodium

135–150

mEq/L

134–137.833–158

Lipid

proﬁle LDL

cholesterol

0–130

mg/dL

50–94.917–170

Total

cholesterol

0–200

mg/dL

158–209.367–275

Triglycerides

60–160

mg/dL

78–144.767–189

HDL

cholesterol

45–65

mg/dL

30–55.533–65

Tumor

markers FERRITIN

28–397

ng/mL

–

AFP

Serum

0.5–5.5

IU/ml

–

CA-125

1.9–16.3

U/mL

–

Urine

analysis Chemical

examination Protein

–

Blood

–

Bilirubin

–

Glucose

–

Ketones

–

Urobilinogen

–

Microscopic

examination

Pus

–

RBcs

–

Crystals

–

Liver

function

tests S.

albumin

3.5–5.0

g/dL

1.9–4.082–5.4

Total

bilirubin

0.0–1.0

mg/dL

0.8–1.317–3

Direct

bilirubin

0.0–0.3

mg/dL

0.3–0.533–1.6

SGOT

(AST)

0–40

U/L

35–54.567–165

SGPT

(ALT)

0–45

U/L

35–57.317–183

Alk.

phosphatase

64–306

U/L

170–214.2–360

␥

7–32

U/L

18–35.833–98

Total

protein

6.0–8.7

g/dL

3.1–4.858–8.7

Females

history Amenorrhea

–

Birth

–

Dysmenorrhea

–

Diagnosis

Diabetes

type

–

Nephropathy

check

–

Lipid

disease

Hypercholesteremia’s

check

–

Cancer

type

Tumor

markers

–

Liver

disease

Liver

problem

–

Radiological

examination

Radiological

examination

–

these

concepts

critical

KI-CBR

retrieval

engine.

Moreover,

the

case

solution

features

are

not

encoded

because

these

features

not

participate

measuring

similarity

between

cases.

3.4.

Ontology

formal,

explicit

speciﬁcation

shared

concep-

tualization.

uniﬁed

view

domain,

which

describes

its

instances,

concepts,

and

relationships

between

them

[57].

The

main

advantage

ontology

usage

that

support

the

sharing

and

reusing

formally

represented

knowledge

explicitly

stating

the

concepts,

relationships

and

axioms

domain.

Ontology

deﬁned

particular

language.

OWL2

the

most

recent

ontology

repre-

sentation

language

deﬁned

W3C2.

addition,

ontology

mainly

2http://www.w3.org/TR/owl2-overview/.

186

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

depends

speciﬁc

description

logic

(DL).

For

example,

OWL2

based

the

SROIQ

(D)

DL.

formal

logic

that

can

enhance

the

reasoning

capabilities

CBR

systems.

Ontology

can

formally

deﬁned

Deﬁnition

Ontology

deﬁned

=I,

A

where

ontology

for

domain

interest;

set

concepts

domain;

set

individuals

instances

domain;

set

relations

among

concepts

including

object

and

data

rela-

tionships;

set

axioms

holding

among

concepts,

relations,

individual.

Axioms

provide

explicit

logical

assertions

about

these

three

elements.

3.5.

Fuzzy

sets

The

backbone

the

proposed

framework

the

case-base

fuzzy

ontology.

This

ontology

deﬁned

the

combination

fuzzy

sets

theory

with

crisp

ontology.

Fuzzy

set

theory

was

introduced

Zadeh

[58]

address

vague

and

imprecise

concepts.

Classical

sets

are

deﬁned

characteristic

functions:

Deﬁnition

Let



set

and

subset

(A

⊆

).

Then

the

function

the

following

equation:

A(x)=1

∈

/∈

A(2)

called

the

characteristic

function

the

set

.

Fuzzy

sets

introduce

the

concept

partial

membership

where

element

can

member

set

with

certain

degree

[0,1]

other

than

{0,

crisp

sets.

result,

allows

the

reasoning

linguistic

terms.

fuzzy

set

can

deﬁned

follows:

Deﬁnition

fuzzy

set

over

universe

discourse

deﬁned

membership

function

A(or

simply

which

maps

each

ele-

ment

value

between

[0,1],

shown

the

following

equation:

A(x):

→[0,

1](3)

where

the

fuzzy

set,

Ais

the

degree

membership,

∈

and

A(x)

∈

[0,

1].

fuzzy

set

can

deﬁned

set

ordered

pairs:

=x/A(x)|x

∈

X.

3.6.

Fuzzy

ontology

Vagueness

the

vital

part

any

suitable

medical

diagnosis

sys-

tem.

Fuzzy

logic-based

systems

employ

the

classical

fuzzy

logic

theory,

which

can

handle

vagueness

certain

level.

After

the

successfulness

crisp

ontology

and

the

applicability

fuzzy

logic

case

representation

and

retrieval

CBR,

the

integration

these

two

technologies

(in

fuzzy

ontology)

will

surely

enhance

the

per-

formance

CBR

systems.

formal

deﬁnitions

can

found

fuzzy

ontology

[59].

One

deﬁnition

ontology

that

uses

fuzzy

logic

provide

natural

representation

imprecise

and

vague

knowl-

edge

and

eases

reasoning

over

it.

Formally

speaking,

fuzzy

ontology

can

deﬁned

follows:

Deﬁnition

Fuzzy

OWL

ontology

consists

fuzzy

ontol-

ogy

structure

FOSand

fuzzy

ontology

instances

FOI,

(FOS,

FOI)

[42]:

FOS=

FID0∪

FAxiom0,

where

FID0=

FCID0∪

FDRID0∪

FOPID0∪

FDPID0is

set

fuzzy

class

descriptions,

and

FAxiom0is

set

fuzzy

class

and

property

axioms

deﬁned

over

FID0:

•FCID0is

set

fuzzy

classes

concepts.

Each

fuzzy

class

may

user-deﬁned

fuzzy

class,

one

two

predeﬁned

fuzzy

classes

owl:

Thing

and

owl:

Nothing.

•FDRID0is

set

fuzzy

datatypes.

Each

fuzzy

data

type

may

predeﬁned

XML

Schema

fuzzy

datatype.

•FOPID0is

set

fuzzy

object

properties.

•FDPID0is

set

fuzzy

data

properties.

•FAxiom0is

set

fuzzy

class

and

property

axioms

deﬁned

over

FID0.

FOI=

FIID0∪

FAxiom0,

where

FIID0is

set

individuals,

and

FAxiom0is

set

fuzzy

individual

axioms.

fuzzy

ABOX

fuzzy

TBOX

Fuzzy

TBOX

ﬁnite

set

fuzzy

concept

inclusion

axioms

the

form

˛



n,

and

fuzzy

role

inclusion

axioms

the

form

˛



n,

where

∈

(0,

and

␣

can

concept

inclusion

axiom

role

inclusion

axiom.

Fuzzy

ABOX

ﬁnite

set

fuzzy

concept

and

fuzzy

role

assertions

axioms

the

form

˛



n,

where

∈

(0,

and

role

concept

assertion

the

form

a:C



˛,(a,b):R



˛,(a,b):¬R



˛,a

and

The

main

idea

fuzzy

DLs

that

concepts

and

roles

are

inter-

preted

fuzzy

subsets

interpretation’s

domain.

fuzzy

DLs,

axioms

can

occur

with

certain

degree

truth.

The

notion

satis-

faction

fuzzy

axiom

fuzzy

interpretation

denoted



deﬁned

[60]

follows:

•I





≥

˛

iff

1≥

•I

(trans

R)iff

∀x,y∈I,

RI(x,

y)≥

supz∈IRI(x,

z)⊗

RI(z,

•I



R1⊆

R2iff

∀x,

∈

I·

1(x,

y)≤

2(x,

•I



(inv

R1,

R2)

iff

∀

∈

I·

1(x,

y)−

2(x,

Concept

satisﬁable

iff

there

interpretation

and

individual

∈

Isuch

that:

CI(x)

For

set

axioms

␧,

say

that

satisﬁes

iff

satisﬁes

each

element

ε.

model

iff



satisﬁes

(is

model

of)

fuzzy

A,

T,

denoted



iff

model

each

component

respectively.

axiom

logical

consequence

knowledge

base

denoted



iff

every

model

satisﬁes

3.7.

The

fuzzy-semantic

case

representation

Given

case

base

crisp

ontology,

elements

that

can

fuzziﬁed

include

datatypes,

object

properties

(through

fuzzy

modiﬁers),

and

data

properties.

Moreover,

fuzzy

case

base

ontology

can

include

crisp

assertions

side-by-side

with

fuzzy

assertions.

Cases

are

stored

fuzzy

ontology

concept

instances.

result,

case-base

deﬁned

as:

{01,02.

0m},

where

the

number

cases

and

0kis

the

k’s

case.

Each

case

the

case-base

ontology

deﬁned

follows:

Deﬁnition

case

0kis

vector

conjunctive

set

predicates

the

form:

0k−

P1∩

P2,

where

Piis

the

i’s

predicate

four

forms:

(fuzzy)

concept

assertion

a:Ci

˛,

(fuzzy)

object

property

assertion

(a,b):Ri

˛,

(fuzzy)

data

property

assertion

(a,v):Ti

˛,

for

a,b

abstract

individuals

and

literal

value.

(fuzzy)

data

property

asser-

tion

(a,v):Ti,

for

fuzzy

linguistic

term

deﬁned

using

fuzzy

datatype.

converting

the

physician

query

into

semantic

query

the

form

iQ≡

∼

PQ1∩

PQ2,

PQn,

the

similarity

calculation

between

these

predicates

becomes

straightforward.

This

similar-

ity

depends

the

inference

capabilities

the

utilized

ontology

reasoners.

The

querying

process

will

detailed

subsequent

sections.

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

187

assert,

(fuzzy)

ontology

has

many

roles

CBR.

can

play

many

roles

every

phase

CBR

including

case

representation,

indexing,

retrieval,

adaptation,

and

maintenance.

For

the

case

rep-

resentation

and

retrieval

steps,

these

roles

include

the

following:

considerably

reduces

the

knowledge

acquisition

bottleneck

[11].

allows

knowledge

engineers

use

knowledge

already

acquired,

conceptualized,

and

implemented

formal

language,

such

DLs

based

languages.

supports

persistence

cases

and

indexes

using

individuals

concepts

that

are

embedded

the

ontology

[31].

can

used

vocabulary

deﬁne

the

case

structure,

either

the

cases

are

embedded

individuals

the

ontology

itself,

the

cases

are

stored

different

persistent

media

such

database

[10].

can

play

the

role

terminology

deﬁne

the

query

vocabulary

[31].

The

user

can

better

express

his

requirements

can

use

richer

vocabulary

deﬁne

the

query.

During

the

similarity

computation,

the

ontology

allows

the

user

bridge

the

semantic

gap

between

the

query

terminology

and

the

case

base

terminology

[18].

supports

dynamic

case

storage

where

features

can

added,

updated,

deleted

from

the

case

base.

preserves

storage

space

which

many

cases

can

point

the

same

feature

values.

can

deﬁne

semantic

index

cases

for

in-memory

case

base

[61].

Ontology’s

description

logic

reasoners

such

Pellet

and

FaCT++

[62]

can

check

the

case

base

consistency,

redundancy,

and

ade-

quacy,

which

not

possible

regular

database

environment

[42].

Moreover,

reasoners

signiﬁcantly

enhance

the

effectiveness

the

case

retrieval

process

Domain’s

background

knowledge

such

SCT

medical

terminol-

ogy

can

integrated

with

case

base

ontology

create

KI-CBR

[11].

For

active

CDSS,

ontology

supports

interoperability

between

CBR-based

CDSS

and

EHR

system

database.

Ontology

provides

common

understanding

domain.

result,

supports

the

implementation

distributed

CBR

systems

[52].

using

ontology,

complex

relations

between

case

features

can

created.

For

example,

the

relationship

between

diabetes

symptoms

and

disorders

can

used

for

inference

values

missing

features.

Heterogeneous

cases,

which

have

ﬁxed

structure,

can

designed.

They

may

have

different

structures

with

different

types

and

numbers

features.

Cases

can

have

relationships

with

each

other

such

Cause,

ISA,

Part

Of,

Result

From.etc.

These

relationships

can

handle

incom-

plete

cases

and

allow

default

values

(by

inheritance)

[11].

Compound

features,

which

contain

many

other

simple

and

com-

pound

features,

can

deﬁned.

Utilizing

(fuzzy)

ontology

engineering

methodologies

can

help

making

the

CBR

knowledge

acquisition

process

efﬁcient.

the

medical

domain,

where

there

are

many

standard

ontolo-

gies

SCT,

GO.etc.,

ontologies

reuse

has

many

beneﬁts

such

standardization

the

CDSS

knowledge,

interoperability,

distribu-

tion

knowledge,

and

on.

Moreover,

many

ontologies

can

integrated

with

the

CBR,

where

each

case

feature

can

semanti-

cally

connected

with

ontology.

For

example,

the

patient

disease

feature

can

connected

Disease

ontology;

patient

gene

feature

can

connected

ontology;

patient

lab

test

features

can

connected

LOINC

ontology.etc.

Ontology-based

representation

cases

enables

reusing

and

adaptation

variety

application

scenarios

[21].

Creating

fuzzy

case-base

ontology

from

fuzzy

case-base

database

supported

methodologies

[40],

languages

[63],

tools

[48],

and

reasoners

[60].

These

fuzzy

ontologies

add

vagueness

the

KI-CBR

systems.

Research

methodology

shown

Fig.

speciﬁc

methodology

ﬁnish

this

study.

accomplish

the

purpose

this

study,

have

utilized

some

existing

technologies

and

studies.

Moreover,

have

utilized

our

research

studies

complete

some

speciﬁc

steps.

the

ﬁgure,

make

clear

cut

between

the

current

study

goals

and

the

other

utilized

works.

the

ﬁrst

step,

the

detailed

understanding

the

nature

diabetes

mellitus

disease

and

its

diagnosis

process

requires

deep

interviews

with

the

domain

experts.

The

step

involves

the

col-

lection

patients

EHR

records

implement

the

case-base

fuzzy

ontology.

This

dataset

will

determine

the

structure

the

case-base

ontology,

and

will

used

populate

the

ontology.

However,

the

collected

medical

data

needed

preparation

processes

includ-

ing

(pre-processing

enhance

the

quality

data

and

calculate

the

weight

vector,

coding

formalize

the

unstructured

contents

medical

data

using

standard

medical-ontology,

and

fuzziﬁ-

cation

fuzzify

some

numerical

features).

Moreover,

standard

ontology

needs

created

from

the

huge

SCT

ontology

used

the

domain

background

knowledge

similarity

calcula-

tion

process.

utilize

our

studies

accomplish

this

Existing utilized works

SNOMED C

HL7 RIM

Our SCT ont

olog

Our stand

ard

diabetes

data mod

Proté

gé

OWL2

Fuzzy OWL2 plugin

Crisp ontology

reasoners

(Pellet

)

Fuzzy ontolo

rea

son

ers (

uzzD

Our EHR

raw ca

ses

Our diabetes case

ase cris

ontol

Our ontology

inee

rin

method

Our encoding methodology

Our pr

e-process

ing method

olog

yOur fuzzification methodology

er’s s

ecific work

CBROnto

In depth

interview

s with

domain

exper

System testing (each module and as a whole)

System implemen

tatio

Design a fuzzy semantic case retrieval algorithm

Design of the fuzzy KI-CBR framework

Fuz

zy ontolog

y con

stru

ction

and

popul

ation

Case bas

e pr

e-process

ing

, encoding,

and

fuzzific

ation

IKARUS-O

nto

Fig.

Research

structure.

188

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

step

[9,18,19,35].

The

step

involves

the

construction

and

pop-

ulation

the

case-base

fuzzy

ontology.

this

step

complex,

extend

the

previously

proposed

crisp

ontology

[40]

using

high-

level

methodology

[31]

and

create

fuzzy

OWL2

ontology

using

protégé

tool.

far

know,

there

are

fuzzy

case-base

ontolo-

gies

for

medical

CBR

systems.

the

step,

propose

fuzzy

KI–CBR

framework.

This

framework

integrated

set

modules.

Each

module

for

speciﬁc

purpose,

and

each

one

has

inputs

and

outputs.

The

framework

will

detailed

the

section.

Next,

for

the

fuzzy

case

base

ontology,

and

for

handling

the

supported

feature

types,

design

hybrid

semantic

retrieval

algorithm.

The

step

the

implementation

our

framework

using

JAVA

pro-

gramming

language.

Finally,

test

the

implemented

system

using

case

base

real

diabetics.

The

proposed

fuzzy

KI-CBR

framework

for

diabetes

diagnosis

This

section

provides

description

our

proposed

fuzzy-

ontology

based

CBR

system

for

diabetes

diagnosis.

The

architecture

this

system

shown

Fig.

has

six

modules:

Case

source

preparation,

case

base

ontology

engineering,

terminology

server,

fuzzy

case-base

ontology

population,

case

retrieval

engine,

and

case

query

parser.

The

main

steps

the

framework

are

case-base

preparation

and

case

retrieval.

The

case-base

preparation

step

achieved

the

case

source

preparation,

case-base

ontology

engineering,

terminology

server,

and

fuzzy

case-base

ontology

population

modules

fol-

lows:

The

case-source

preparation

module

takes

the

EHR

raw

data

and

converts

into

pre-processed,

encoded,

and

fuzziﬁed

relational

database.

The

encoding

process

based

SCT

codes

from

the

terminology

server

module.

The

case-base

ontology-engineering

module

builds

the

case-base

crisp

ontology

and

extends

fuzzy

ontology.

The

fuzzy

case-base

ontology

population

module

populates

the

resulting

fuzzy

ontology

step

with

the

fuzzy

relational

database

step

The

case

retrieval

step

achieved

the

terminology

server,

case

retrieval

engine,

and

case

query

parser

modules.

The

case

query-parser

module

takes

the

user

query

vector

and

converts

semantic

query

vector

according

the

case

base

fuzzy

ontology

terminologies.

The

case

retrieval-engine

module

takes

the

created

semantic

query

vector

generated

step

and

searches

for

the

most

similar

cases

the

fuzzy

case-base

ontology.

The

clinical

similarity

between

medical

concepts

semantic

features

based

the

SCT

ontology

the

terminology

server

module.

5.1.

Case

source

preparation

module

This

module

prepared

the

EHR

raw

data

case-base

structure

and

content.

collected

the

patient’s

features

diabetes

diagnosis

from

distributed

EHR

systems

and

stored

opera-

tional

data

store

(ODS).

have

collected

cases,

which

describe

Fig.

The

proposed

CBR

framework.

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

189

Fig.

The

proposed

SCT

reference

set

for

diabetes

diagnosis.

diabetic

patients,

shown

Table

These

cases

are

descriptive

all

types

cases

[64],

which

used

cases

only.

Next,

these

data

were

anonymized,

cleaned,

and

normalized.

Features’

weights

were

calculated

using

machine

learning

algorithms

includ-

ing

genetic

algorithm,

decision

tree,

and

others.

El-Sappagh

al.

[35]

have

proposed

case-base

preparation

process

and

applied

the

used

case-base

data.

Moreover,

the

data

were

converted

case

base

structure

using

our

proposed

standard

data

model

[9].

addition,

the

prepared

case-base

was

coded

according

SCT

reference

set

that

was

created,

which

specialized

for

dia-

betes

diagnosis

[18].

Finally,

the

encoded

case-base

was

fuzziﬁed

fuzzy

relational

database

using

our

proposed

methodology

another

work.

The

works

El-Sappagh

al.

[18,35,64]

are

uti-

lized

this

module

prepare

the

used

EHR

medical

data.

The

resulting

database

the

source

instances

(ABOX)

for

our

pro-

posed

fuzzy

case-base

ontology.

5.2.

Terminology

server

module

This

module

creates

the

domain

background

ontology.

This

knowledge

critical

two

places:

(1)

semantic

similarity

mea-

surement,

and

(2)

query

formulation.

The

domain

knowledge

ontology

can

built

locally,

can

depend

standard

medical

ontology

such

SCT

[56].

Unfortunately,

ontologies

are

typically

created

ad-hoc

manner,

which

may

inﬂuence

the

accuracy

the

similarity

calculations

[64].

The

second

choice

better

because

clinical

ontologies

are

mature,

and

they

include

all

required

medical

concepts

and

relationships.

Moreover,

this

standardiza-

tion

enhances

the

interoperability,

reuse,

sharing,

and

integration

with

the

EHR

environment.

SCT

was

the

terminology

used

this

study.

Building

complete

ontology

not

realistic

and

using

the

whole

SCT

CBR

affects

the

retrieval

algorithm

because

very

large

ontology

(i.e.,

contains

361,800

concepts).

have

collected

all

SCT

concepts

diabetes

according

our

pro-

posed

methodology

[18],

and

built

its

OWL

ontology

(TBOX),

shown

Fig.

This

ontology

only

contains

550

concepts.

When

measuring

semantic

similarity

with

JCOLIBRI

API,

between

concept

instances;

however,

SCT

contains

only

concepts.

have

solved

this

problem

creating

instance

for

each

selected

con-

cept

with

the

same

name

(ABOX).

Moreover,

have

represented

the

selected

concepts

using

its

conceptIDs.

Fully

speciﬁed

names,

symptoms,

and

preferred

names

can

added

annotations

with

their

corresponding

names.

shown

Fig.

this

ontology

not

user

readable.

resolve

this

issue

our

future

work.

Each

con-

cept

name

begins

with

the

pattern

“C

”

readable

JCOLIBRI

API3as

concept

and

differentiate

from

instances.

The

resulting

ontology

directed

acyclic

graph

(DAG),

which

supports

single

inheritance

only,

but

the

whole

SCT

supports

multiple

inheritances.

ontology

has

structured

format

with

relationships

between

concepts.

The

“IS

A”

relationship

between

parent

and

child

the

core

relationship,

whereas

other

semantic

relationships

pro-

vide

additional

associations

between

terms

(such

“part-of”

“active-ingredient-of”).

Our

ontology

concentrates

the

“IS

A”

relationship

only

form

taxonomy

concepts.

Enriching

the

ontology

with

other

relationships

and

axioms

will

considered

future

work.

5.3.

Case-base

ontology

engineering

module

This

module

converts

our

crisp

case

base

ontology

created

our

work

[31]

into

fuzzy

case-base

ontology.

apply

the

procedural

steps

IKARUS-Onto

[40]

methodology

for

con-

verting

crisp

ontology

fuzzy

ontology.

The

IKARUS-Onto

high-level

and

abstract

methodology

add

fuzziﬁcation

aspects

crisp

ontology.

customize

this

methodology

according

our

requirements.

the

most

accurate

and

complete

method-

ology.

Moreover,

the

resulting

ontology

represented

Bobillo

and

Straccia

syntax

OWL

ontology

using

Fuzzy

OWL2

2.1.1

plug-in

Protégé

4.1

[63].

This

syntax

adds

the

fuzzy

compo-

nents

annotations

for

concepts

and

relationships

(i.e.,

datatype

and

object

properties).

Moreover,

allows

the

creation

hedges

and

fuzzy

data

types.

The

default

reasoners

such

Pellet

[62]

and

default

modeling

tools

such

protégé

can

used

with

the

result-

ing

ontology

because

all

fuzzy

aspects

are

coded

annotations

(i.e.,

FuzzyLabel

annotation).

Every

annotation

delimited

start

tag

and

end

tag

</fuzzyOwl2>,

with

attribute

fuzzyType

specifying

the

fuzzy

element

being

tagged.

3http://gaia.fdi.ucm.es/research/colibri/jcolibri

190

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

Fig.

The

crisp

case

base

ontology.

5.3.1.

Crisp

ontology

customization

Before

starting

the

fuzziﬁcation

process,

our

previously

created

crisp

case-base

ontology

[31]

customized

according

our

case-

base’s

fuzzy

database

contents

and

the

CBROnto

standard

ontology

JCOLIBRI

API

[52].

Fig.

shows

our

crisp

case

base

ontology

after

customization.

This

customization

includes:

outcome

concept

our

new

ontology,

have

removed

the

temporal

aspect

because

not

provide

the

treatment

plan

for

the

diabetic

patient,

and

our

data

set

does

not

have

multiple

values

over

time

for

case

features,

The

context

has

been

removed,

and

will

propose

indexing

methodology

another

work,

The

diagnoses

are

Normal,

Prediabetic,

Prediabetic

Gestational,

Diabetic,

and

Diabetic

Gestational

only.

Our

data

set

cannot

determine

the

type

diabetes

(i.e.,

Type

and

the

type

pre-diabetes

(i.e.,

IFG,

IGT),

our

data

set,

many

the

problem

description

features

are

new

and

not

modeled

the

ontology

[31],

The

hierarchy

the

ontology

simpliﬁed

much

possible

compatible

with

CBROnto,

Dealing

with

rules

the

form

SWRL

will

future

work

enhance

the

semantic

our

case

base.

shown

Fig.

CASE

INDEX

subsumes

all

the

case

fea-

tures,

CBRCASE

subsumes

case

instances,

and

HAS-COMPONENT

subsumes

the

two

parts

the

case.

This

way,

utilize

OntoBridge

API

JCOLIBRI2

address

ontology

storage,

retrieval,

and

manip-

ulation

straightforward

way

[52].

ontology-based

CBR,

cases

are

represented

concept

instances

and

their

attributes

are

repre-

sented

ontology

relations

properties.

The

values

that

relation

attributes

may

take

are

instances

deﬁned

within

some

domain

ontology.

For

example,

consider

small

fragment

our

case

base

containing

only

age,

gender,

cancer,

and

labTest.

Fig.

10,

all

the

case

base

data

and

structure

are

inside

the

case-base

ontology.

may

implement

this

ontology

two

sepa-

rate

components:

case

base

structure

stored

OWL2

ontology,

and

instances

cases

and

features

stored

database,

shown

Fig.

11.

Each

choice

has

its

advantages

and

limitations,

and

chose

the

ﬁrst

one.

5.3.2.

Case-base

ontology

fuzziﬁcation

process

Our

proposed

fuzzy

KI-CBR

(FKI-CBR)

framework

operates

two

axes,

namely

the

ontology-based

representation

imprecise

knowledge

and

the

utilization

this

knowledge

for

effective

case

retrieval.

For

the

ﬁrst

axis,

fuzzy

ontology

proposed.

For

case

retrieval,

algorithm

that

utilizes

ontology

and

fuzzy

proposed.

Fig.

10.

small

fragment

crisp

case

base

instantiation

structure.

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

191

Fig.

11.

Case

instances

stored

database.

ontology

may

deﬁned

set

concepts,

instances,

prop-

erties

(data

type

properties)

and

relations

(object

properties).

concept

represents

set

class

entities

within

domain

while

the

entities

that

belong

concept

are

called

instances

this

con-

cept.

relation

turn

links

concept

instance

another

instance

while

property

links

instance

standard

data

type

such

string,

integer,

ﬂoat,

Boolean,

etc.

5.3.2.1.

The

case-base

fuzzy

ontology

design.

this

paper,

handle

only

vagueness

(i.e.,

imprecision),

but

uncertainty

not

handled

(i.e.,

probability,

ambiguity,

inexactness).

fuzzy

ontology

may

informally

deﬁned

ontology

that

expresses

vague

knowledge

using

fuzzy

set

(fuzzy

concept)

namely

degree-vagueness

and

fuzzy

relation

and

properties

namely

combinatory-vagueness

[40].

Because

crisp

ontology

special

case

fuzzy

ontology,

which

all

relation

and

property

degrees

are

equal

fuzzy

ontology-based

CBR

retains

the

characteris-

tics

the

traditional

ontology-based

CBR

paradigm.

Crisp

elements

that

can

fuzziﬁed

include

data

types,

object

properties

(through

fuzzy

modiﬁers),

and

data

properties

(through

fuzzy

modiﬁed

data

types).

other

words,

the

fuzziness

ontology

includes

modeling

[40]:

Fuzzy

concepts:

concepts

whose

instances

may

belong

certain

degrees,

such

youngPatient

are

fuzzy

concepts.

Because

young

vague

predicate,

the

concept

also

vague

and,

there-

fore,

can

represented

fuzzy

one;

allows

the

fuzzy

concept

assertions

such

“patient

instance

youngPatient

degree

0.7.”

Fuzzy

relations:

there

are

two

main

types,

(2.1)

Fuzzy

object

rela-

tions,

which

link

concept

instances

certain

degree,

and

allows

fuzzy

role

assertions

“patient

has-Disease

degree

0.8.”

(2.2)

Fuzzy

data

type

relations,

which

either

assign

literal

value

concept

instances

certain

degrees

(e.g.,

patient

has-

Residence

“Rural”

degree

0.4),

fuzzy

datatype

assigned

concept

instance

(e.g.,

patient

has-Fuzzy-Age

young),

which

includes

the

age

fuzzy

predicate.

There

are

many

fuzzy

ontology

construction

methodologies

IKARUS-Onto

[40],

UFOC

[65],

UPFON

[66]

and

OntoMethodology

[67].

Moreover,

fuzzy

ontology

representation

languages

have

been

proposed

[63,68].

Fuzzy

ontology

reasoners

include

FuzzyDL,

Fire,

and

DeLorean.

Fuzzy

reasoners

use

fuzzy

description

logics

fuzzy

SROIQ

(D),

F-ALC,

fuzzy

SHIN,

and

fuzzy

SHOID

(D).

shown

Table

our

case,

the

fuzzy

case-base

ontology

con-

struction

process,

store

fuzzy

cases

about

diabetic

patient,

used

this

IKARUS-Onto

methodology,

OWL

fuzzy

extension

[63],

the

FuzzyDL4reasoner

using

fuzzy

SROIQ

(D)

[60],

and

protégé

tool

with

the

fuzzy

OWL

plugin

[69].

The

plug-in

does

not

translate

fuzzy

representations

into

OWL

but

rather

eases

their

represen-

tation

allowing

speciﬁcation

the

type

fuzzy

logic

used,

the

deﬁnition

fuzzy

data

types,

fuzzy

modiﬁed

concepts,

weighted

concepts,

weighted

sum

concepts,

fuzzy

nominals,

fuzzy

modiﬁers,

fuzzy

modiﬁed

roles

and

data

types,

and

fuzzy

axioms.

Table

shows

the

execution

steps

the

IKARUS-Onto

methodology

our

case

study.

5.3.2.2.

The

case-base

fuzzy

ontology

implementation.

For

the

fuzzi-

ﬁcation

our

crisp

case-base

ontology,

use

the

Fuzzy

OWL2

2.1.1

plug-in5in

Protégé

4.16.

the

following,

detail

fuzzy

concepts,

data

types,

relations,

and

data

types.

fuzzy

data

type

pair

D,

Dwhere

Dis

concrete

interpreta-

tion

domain,

and

Dis

set

fuzzy

concrete

predicates

with

arity

and

interpretation

d1:

n

D→

[0,

1],

which

n-ary

fuzzy

relation

over

D[63].

For

fuzzy

data

types,

the

functions

allowed

Fuzzy

OWL

deﬁned

over

inter-

val

[k1,

k2]

⊆

are

→

{left(k1,k2,a,b)(ﬁg.

13c),

right(k1,k2,a,b)(ﬁg.

13d),

Triangle(k1,k2,a,b,c)

(ﬁg.

13b),

Trapizoidal(k1,k2,a,b,c,d)

(ﬁg.

13a),

linear(k1,k2,c)

ﬁg.

13e,

mod(d)}

The

formalization

each

ele-

ment

the

ontology

conducted

follows:

5.3.2.2.1.

Fuzzy

data

types

and

fuzzy

concrete

roles

(data

prop-

erties).

For

each

the

numerical

features

our

case

base,

our

domain

experts

have

deﬁned

their

ranges,

and

fuzzy

member-

ship

functions,

their

shapes,

and

parameters.

For

fuzziﬁcation

these

values,

deﬁne

two

things:

(1)

fuzzy

data

type,

(2)

fuzzy

concrete

role.

Because

have

∼70

features,

and

most

them

are

numerical,

only

give

examples

here.

cooperation

with

our

domain

experts,

have

used

MATLAB

deﬁne

the

fuzzy

membership

functions

and

their

ranges,

shapes,

and

equa-

tions,

shown

Fig.

12.

Experience

suggests

that

the

overlap

triangle-to-triangle

and

trapezoid-to-triangle

fuzzy

regions

aver-

ages

somewhere

between

25%

and

50%

the

fuzzy

set

base

[70].

our

case,

our

domain

expert

has

recommended

ﬁxing

the

normal

ranges

and

overlapping

low

and

high

ranges

50%

the

normal

range,

see

Fig.

12b.

Considering

HbA1c

lab

test

values,

let

assume

its

range

[71,71]

and

its

linguistic

terms

are

lowA1c

(left

shoulder

5.7,

6.05),

normalA1c

(triangle

(5.7,

6.05,

6.4)),

and

highA1c

(right-

shoulder

(6.05,

6.4)).

Firstly,

create

fuzzy

data

type

for

each

these

vague

terms.

shown

Fig.

14,

have

used

the

protégé

plugin

[65]

create

datatype

lowA1c

and

then

annotate

fuzzy

datatype.

This

action

repeated

for

every

linguistic

term

each

4FuzzyDL

Reasoner:

http://gaia.isti.cnr.it/straccia/software/fuzzyDL/fuzzyDL.

html.

5Fuzzy

OWL2

2.1.1

plug-in:

http://www.straccia.info/software/FuzzyOWL/.

6Protégé

4.1:

http://protege.stanford.edu/.

192

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

Table

The

fuzzy

case-base

ontology

construction

process.

fuzzy

variable

our

case

base

ontology.

Next,

for

each

numerical

feature,

have

deﬁned

concrete

role

for

each

its

linguistic

val-

ues.

The

previously

deﬁned

fuzzy

datatypes

are

used

ranges

for

these

roles.

Continuing

with

HbA1c,

deﬁne

three

fuzzy

concrete

roles

hasLowA1c,

hasNormalA1c,

and

hasHighA1c.

For

example,

the

hasLowA1c

modeled

hasLowA1c

(HbA1c,

lowA1c)

where

HbA1c

crisp

concept

and

lowA1c

fuzzy

data

type.

5.3.2.2.2.

Fuzzy

modiﬁers,

fuzzy

modiﬁed

data

types,

and

fuzzy

modiﬁed

roles.

Modiﬁers

can

improve

the

expressiveness

the

ontology

and

semantic

queries.

The

degree

membership

fuzzy

Fig.

12.

example

fuzzy

datatypes.

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

193

Fig.

13.

Membership

functions

[16]

for

fuzzy

data

type

deﬁnition

(i.e.,

fuzzy

concrete

domains)

and

fuzzy

modiﬁer

functions:

(a)

trapezoidal

function;

(b)

triangular

function;

(c)

left-shoulder

function;

(d)

right-shoulder

function;

and

(e)

linear.

data

types

may

changed

using

fuzzy

modiﬁers.

fuzzy

modi-

ﬁer

function

fmod :

[0,

→

[0,

1],

which

applies

fuzzy

set

change

its

membership

function,

which

can

linear

(c)

(Fig.

12e)

triangular

(a,

and

(Fig.

12b).

the

help

domain

expert,

deﬁned

modiﬁes

values

very,

slightly,

somewhat

etc.

with

the

help

domain

expert.

For

example,

have

deﬁned

fuzzy

modiﬁers

very

linear

(0.85).

our

work,

these

modiﬁers

have

two

purposes

(Fig.

13):

modify

data

type,

such

the

new

data

type

veryLowA1c,

which

modiﬁed

version

lowA1c

shown

•<fuzzyOwl2

fuzzyType

“datatype”>

•<Datatype

type

“modiﬁed”

modiﬁer

“very”

base

“lowA1c”/>

•</fuzzyOwl2>

modify

fuzzy

concrete

role

shown

next.

5.3.2.2.3.

Fuzzy

modiﬁed

data

type

properties.

The

other

type

fuzzy

data

type

properties

are

Degree-vagueness

has-Disease

and

lived-In

attributes

shown

Table

they

are

modeled

fuzzy

modiﬁed

roles.

For

example,

has-Disease

role

can

modiﬁed

very

modiﬁer

new

role

very-has-Disease

•<fuzzyOwl2

fuzzyType

“role”>

•<Role

type

“modiﬁed”

modiﬁer

“very”

base

“has-Disease”/>

•</fuzzyOwl2>

5.3.2.2.4.

Fuzzy

logic

the

ontology.

have

selected

Zadeh

fuzzy

logic

for

our

ontology

where:

t-Norm

⊗

min

{˛,

ˇ},

Conorm

⊗ˇ

max

{˛,

ˇ},

Negation



−

˛,

and

Implication

⇒

max

−

˛,

ˇ}.

This

annotation

the

ontology

level

•<fuzzyOwl2

fuzzyType

“ontology”>

•<FuzzyLogic

logic

“zadeh”/>

•</fuzzyOwl2>

The

resulting

fuzzy

ontology

structure

(TBOX)

contains

classes,

object

properties,

138

(fuzzy)

datatype

properties,

105

fuzzy

datatypes.

After

creating

the

fuzzy

ontology

structure,

the

step

create

the

ontology

instances.

The

instances

the

cases

and

the

instances

its

describing

features

are

populated

from

our

fuzzy

case

base

relational

database.

populate

the

ontology

with

real

world

diabetes

diagnosis

individual

cases.

5.4.

Fuzzy

case-base

ontology

population

module

Fuzzy

ontology

population

from

the

fuzzy

relational

database

has

been

studied

[42].

Moreover,

there

are

protégé

plugins

auto-

mate

the

process

such

FRDB2FOnto

[42],

which

convert

the

fuzzy

database

schema

and

content

fuzzy

ontology

structure

and

instance.

the

other

hand,

for

storage

large

ontologies,

fuzzy

ontologies

can

stored

semantic

preserved

databases

[73].

selected

the

ﬁrst

choice

compatible

with

the

JCOLIBRI2

frame-

work.

Inspired

ontology

population

approaches,

developed

our

procedure

ﬁll

the

resulting

case-base

fuzzy

ontology

with

cases

(i.e.,

instances)

from

our

previously

modeled

case-base

fuzzy

relational

database.

show

the

process

single

fuzzy

table

Age,

which

includes

the

fuzzy

components

feature

age.

Case

base

crisp

model

Fig.

has

been

previously

fuzziﬁed

and

implemented

into

fuzzy

relational

database.

shown

Fig.

15,

the

Age

feature

table

Patient

Case

(Fig.

15a)

has

been

fuzziﬁed

into

Age

table

(Fig.

15b).

According

our

resulting

fuzzy

ontology,

can

map

between

fuzzy

concrete

properties

has-Young-Age

and

the

attributes

Age

relation

youngAge.

Moreover,

the

has-Age

object

property

connects

the

instances

from

classes

the

Case

and

Age

as:

ClassAsser-

tion

(Case

C1);

ClassAssertion

(Age

A1);

ObjectPropertyAssertion

(has-Age

A1).

The

same

process

done

Fig.

for

the

whole

case-base

crisp

ontology

was

performed

for

the

fuzzy

ontology.

Our

mapping

rules

database

instances

ontology

instances

are

Fig.

14.

example

fuzzy

data

type

deﬁnition.

194

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

Fig.

15.

Correspondence

between

fuzzy

relational

database

and

fuzzy

ontology.

guided

W3C

rules7and

the

rules

Zhang

al.

[74]

adapt-

ing

their

rules

work

with

the

fuzzy

relational

database

(e.g.,

table,

tuple,

attribute,

primary

key,

and

foreign

key

are

mapped

concept,

instance,

data

type

property,

axiom,

and

object

property,

respectively).

The

resulting

ontology

now

ready

with

the

sets

TBOX

and

ABOX.

After

populating

the

ontology

with

cases,

contains

2640

concept

instances.

The

resulting

case-base

collection

case

instances

the

ontology.

Case

attributes

are

represented

fuzzy

data

properties

and

fuzzy

object

properties,

follows:

1,nCBRase,

where

the

number

cases,

CBRCasei≡

(∃Has

ID.string)

∩

(∃CASE

COMPONENT.CBR

DESCRIPTION)

∩

(∃has

Solution.CBR

Solution)

∩(∃Has

Residence.string)

∩

(∃Has

Age.Age)

∩

(∃Has

BMI.BMI)

∩

(∃Has

Occupation.string)

∩(∃Has

Disease.Disease)

∩

(∃Has

FemaleHistory.FemaleHistory)

∩(∃Has GlobalSymptom.GlobalSymptom)

∩

(∃Has

Hemato

log

icalProfile.Hemato

log

icalProfile)

∩(∃Has

KidneyFunctionTest.KidneyFunctionTest)

∩

(∃Has

LabTest.LabTest)

∩(∃Has

LipidProfile.LipidProfile)

∩

(∃Has

LiverFunctionTest.LiverFunctionTest)

∩(∃Has

Radio

log

icalExa

min

ation.Radio

log

icalExa

min

ation)

∩(∃Has

UyinationSymptom.UyinationSymptom)

∩

Has

Gender.{female, male}

∩

Moreover,

case

diagnosis

part

nominal

concept

the

form

DIAGNOSIS

{diabetic,

preDiabetic,

normal,

diabeticGestational,

prediabeticGestational}.

result,

the

module

(case

query

parser

module),

the

query

case

will

modeled

the

same

format

CBRCasei,

and

the

case

retrieval

module

(Section

5.6),

systematic

comparison

between

the

cases’

predicates

can

calculate

the

similarity

levels

between

cases.

5.5.

Case

query

parser

module

For

new

patient

diagnosis

problem,

the

physician

enters

the

new

patient

description

the

query

form;

this

forms

the

new

case

without

solution.

have

asserted

before

that

our

cases

have

homogeneous

structure.

Implementing

heterogeneous

case

structure

will

discussed

another

study.

result,

all

the

necessary

patient

features

are

known

advance,

but

the

physician

may

not

know

all

the

values

these

features

when

describing

the

patient,

and

their

entry

may

time-consuming.

Ontologies

especially

standard

medical

ontologies

support

the

integration

CBR

system

and

EHR

[18].

The

query

module

can

the

patient

record

for

the

necessary

ﬁelds.

Moreover,

can

implement

rule

base

link

features

and

infer

the

missing

ones.

Next,

the

query

fuzziﬁed

and

coded

with

the

same

methods

used

for

the

case-base

ontology

facilitate

similarity

and

mapping.

The

new

problem

structure

transformed

into

the

fuzzy

case-base

7http://www.w3.org/2001/sw/rdb2rdf/wiki/Database-Instance-Only

and

Database-Instances-and-Schema

Mapping.

ontology

vocabulary

some

strategy;

then,

the

semantic

query

sent

the

Case

Retrieval

Engine

compute

the

similarity

between

the

query

concepts

and

the

concepts

the

new

semantic-query

problem.

The

semantic

query

conjunctive

query

the

logic

form

ˆi(Øi)



˛,

where

Øiis

conjunction

terms

the

form

A(x),R(x,y),

for

atomic

concept

and

atomic

role

are

either

individuals

variables

names

∈

(0,1],

and



∈

{>,≥,≤,<}.

this

end,

let

take

semantic

query

example.

After

acquir-

ing

the

query

case

from

physician,

represented

vector

<attributei=valuei>,

for

the

number

features.

Our

cases

are

represented

with

features,

writing

seman-

tic

queries

using

all

these

features

will

create

long

and

complicated

query.

very

small

fragment

these

features

<Age

38,

Residence

“Rural”,

Fatigue

“++”,

Gender

“Male”,

disease

“Malignant

tumor

involving

left

ovary

direct

extension

from

endometrium”.

.>.

This

vector

enters

two

main

prepa-

ration

steps:

fuzziﬁcation

numerical

data,

and

coding

unstructured

data.

After

the

fuzziﬁcation

process,

the

vector

<(young

0.2,

middleAged

0.8,

old

fuzzyLabel

middleAged,

Age

38),

Residence

“Rural”,

Fatigue

“++”,

Gender

“Male”,

dis-

ease

“Malignant

tumor

involving

left

ovary

direct

extension

from

endometrium”.

.>.

After

the

encoding

the

query

our

SNOMED

domain

OWL2

ontology;

this

step

encodes

unstructured

data

into

standard

codes.

The

resulting

vector

<(young

0.2,

middleAged

0.8,

old

fuzzyLabel

middleAged,

Age

38),

Residence

“Rural”,

Fatigue

“++”,

Gender

“Male”,

dis-

ease

“369524001”.

.>.

The

other

ordinal

and

categorical

features

remain

the

same.

The

vector

needs

transformed

into

semantic

query.

This

query

conjunction

set

predi-

cates

P1∩

P2∩

Pnwhere

Piis

predicate

four

forms:

(fuzzy)

concept

assertion

a:Ci

˛,

(fuzzy)

object

property

asser-

tion

(a,b):Ri

(fuzzy)

data

property

assertion

(a,v):Ti

˛,

for

a,b

abstract

individuals

and

literal

value,

(fuzzy)

data

prop-

erty

assertion

(a,v):Ti

˛,

for

fuzzy

linguistic

term

deﬁned

using

fuzzy

datatype.

According

the

vocabulary

our

fuzzy

case-base

ontology,

the

vector

transformed

into

semantic

query

containing

OWL

individuals

and

property

(i.e.,

data

and

object)

instances

the

form

<concept

instance,

object

property,

concept

instance;

˛>,

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

195

Fig.

16.

part

the

semantic

representation

fuzzy

query

case.

Fig.

17.

part

semantic

query

over

the

case-base.

<concept

instance,

data

property,

fuzzy

value;

˛>,

<concept

instance,

data

property,

literal;

˛>,

shown

Fig.

15c.

Fig.

shows

example

using

OWL2

Functional-Style

Syntax8for

the

query

vector

concentrate

the

fuzzy

values

representation

and

assume

that

this

point,

have

two

options.

ﬁrstly

for

exact

match

between

this

query

case

and

case

the

fuzzy

ontol-

ogy.

this

case,

SPARQL-DL

query

can

used

query

the

case-base

ontology

and

retrieve

the

diagnosis

the

matched

case,

shown

Fig.

17.

The

other

general

option

the

existence

partial

similar-

ity

between

the

query

case

and

all

cases

the

case-base.

this

case,

use

set

APIs

including

JCOLIBRI,

OntBridge,

Pel-

let,

and

fuzzyDL

APIs

implement

java

project

implement

the

retrieval

algorithm.

The

algorithm

uses

the

proposed

simi-

larity

function

the

section

retrieve

the

most

suitable

cases

the

fuzzy

case-base

ontology.

The

algorithm

calculates

the

clinical

similarity

between

the

query

case

and

all

cases

the

case-base

according

the

inference

capabilities

the

Pellet

and

fuzzyDL

reasoners.

The

solutions

the

most

similar

cases

are

selected

and

retrieved

the

physician

guide

his

decision

pro-

cess.

5.6.

Case

retrieval

engine

module

can

state

that

case

equivalent

another

case

both

cases

have

exactly

the

same

structure

and

attribute

values.

crisp

ontology-based

CBR,

the

retrieval

cases

involves

the

exploitation

the

structure

and

the

content

the

ontology

for

computing

the

semanticsimilarity

between

the

attribute

values

8http://www.w3.org/TR/owl2-syntax/.

and

consequently,

for

the

cases.

There

some

ontology-speciﬁc

similarity

functions

that

utilize

ontological

knowledge

dif-

ferent

manner

[72].

None

these

measures

utilizes

imprecise

knowledge

any

way.

Case

retrieval

can

implemented

with

neural

network

(NN),

rule-based

(RB),

case

indexing

(CI),

and

decision

tree

(DT).

However,

hard

determine

the

cor-

responding

structure

and

parameters

and

DT,

addition,

extraction

and

the

choice

rules

and

indexes

are

largely

depen-

dent

the

experience

the

knowledge

engineers

well

[75].

this

paper,

propose

case

retrieval

algorithm

that

involves

combining

the

reasoning

capabilities

classical

ontologies

(i.e.,

semantic

similarity)

with

fuzzy

similarity

for

numerical

features

order

create

powerful

hybrid

reasoning

mechanism.

assume

that

all

case

classes

have

unique

structure

(i.e.,

the

same

set

attributes).

The

performance

similarity

measure

totally

depends

the

type

and

the

importance

features.

have

used

set

machine

learning

algorithms

calculate

feature

weights,

another

study

[35].

First,

calculated

the

local

similarity

each

feature

accord-

ing

its

type

[12];

next,

used

global

similarity

function

based

distance

function

Euclidian

Minkowski.

Accord-

ing

feature

types,

our

proposed

similarity

algorithm

had

two

stages

similarity.

The

ﬁrst

stage

depends

syntactic

features

only

retrieve

set

potentially

similar

cases,

and

the

second

depends

the

remaining

semantic

features

select

the

most

similar

case.

5.6.1.

Similarity

calculation’s

ﬁrst

stage

Consider

query

case

Cq,

stored

cases

Cifor

and

the

number

cases

the

case-base,

and

feature

weights

wi.

All

instance

features

have

weight

wi=

The

ﬁrst

layer

calcu-

lates

SIMsyntactic Cq,

Ci.

This

global

similarity

function

SIMsyntactic

returns

the

most

similar

cases

according

the

similarity

between

Cqand

Ciusing

syntactic

similarity

syntactic

features

(fuzzy

and

196

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

not

fuzzy),

see

the

following

equation:

SIMsyntactic Cq,

Ci

=n

j=1wi×

sim fqj,

fij

n

j=1wi=m

j=1wi×

simLAj,

Zj

n

j=1wi

+k

j=m+1wi×

simOBj,

Zj

n

j=1wi+r

j=k+1wi×

simFCj,

Zj

n

j=1wi

+n

j=r+1wi×

simNDj,

Zj

n

j=1wi

(4)

The

A,B,C,

and

are

the

sets

nominal,

ordinal,

fuzzy,

and

numerical

features

the

query

case,

and

contains

the

corre-

sponding

features.

Moreover,

can

add

importance

for

each

type

features

using

weights

w1,

w2,

w3,

w4as

shown

Eq.

(5),

where

w1,

w2,

w3,

and

w4∈

(0,1]

and

w1+

w2+

w3,

w4=

SIMsyntactic =

w1×m

j=1wi×

simLAj,

Zj

n

j=1wi+

×k

j=m+1wi×

simOBj,

Zj

n

j=1wi+

×r

j=k+1wi×

simFCj,

Zj

n

j=1wi+

×n

j=r+1wi×

simNDj,

Zj

n

j=1wi

(5)

Depending

the

type

feature,

the

local

similarity

sim

selected

follows:

the

feature

nominal,

the

exact

match

used

the

fol-

lowing

equation:

simLAj,

Zj=1

Aj=

Aj/=

(6)

the

feature

ordinal,

our

domain

experts

proposed

similarity

matrix

for

each

ordinal

feature,

and

the

similarity

simO(Bj,

Zj)

calculated

based

this

matrix.

Due

space

restrictions,

not

show

matrices.

the

feature

fuzzy,

have

two

options:

(1)

The

feature

value

numerical.

Our

proposed

fuzzy

similarity

measure

utilizes

all

the

fuzzy

sets

compared

features

cal-

culating

similarity.

the

case-base

fuzzy

ontology

store

case

with

fuzziﬁed

features,

the

input

query

numerical

features

fuzziﬁed

using

the

same

fuzzy

sets,

and

comparison

conducted

between

stored

and

query

fuzzy

values.

The

normalized

Euclidean

distances

between

fuzzy

sets

feature

are

used

calculate

similarity

the

following

equation:

DistFCj,

Zj=n

k=1cjk −

zjk2

√n(7)

where

Cj=

crisp

value

feature

query,

Zj=

crisp

value

fea-

ture

case,

number

fuzzy

sets

for

feature

cjk and

zjk

are

k’s

fuzzy

values

for

query

and

stored

cases’

feature,

respectively.

The

similarity

calculated

using

the

following

equation:

simFCj,

Zj=

−

Dist Cj,

Zj(8)

After

testing

this

function,

found

insensitive

for

extreme

values

because

the

membership

functions

are

equal

zero

except

one

function,

for

example,

for

ages

and

70,

DistF(60,74)

and

simF(60,74)

solve

this

problem,

calculate

fuzzy

similarity,

take

the

average

crisp

similarity

(Eq.

(9))

and

fuzzy

one.

(2)

The

feature

value

vague

term.

patient

can

described

using

vague

terms

for

numerical

features

(e.g.,

Age

young,

BMI

obese,

FPG

low).

Our

case-base

fuzzy

ontology

supports

all

types

similarities.

shown

Fig.

15,

the

has-Fuzzy-Age

data

type

property

stores

the

linguistic

term

young

for

the

numerical

age

36.

When

patient

described

linguistic

term,

proposed

similarity

matrices

our

domain

expert

are

used

(see

Table

3).

addition,

fuzzy

hedges

such

“very”,

“quite”,

“somewhat”,

“not”,

“extremely”

are

possible

query

case

description.

shown

Section

4.3.1,

possible

deﬁne

hedges

the

case

base

ontology.

The

stored

and

entered

hedges

can

compared

using

similarity

matrices

proposed

our

domain

expert.

the

feature

simple

numerical,

then

the

similarity

calcu-

lated

using

the

following

equation:

simNCj,

Zj=

−|Dj−

Zj|

Max

−

Min (9)

5.6.2.

Similarity

calculation’s

second

stage

Medical

concepts

similarity

can

conducted

non-semantically

lexically,

can

done

semantically

using

standard

ontologies

SCT

[16,64].

selected

the

second

choice

measure

the

sim-

ilarity

meaning

between

concepts.

The

retrieved

cases

from

the

ﬁrst

layer

(i.e.,

SIMsyntactic(Cq,

Ci))

enter

another

evaluation

based

the

semantic

similarity

between

the

instance

features.

Lexical

exact

similarity

cannot

used

compare

ontology

concepts.

All

syntactic

features

have

wi=

The

SIMsyntactic(Cq,

Ci))

utilizes

our

proposed

SCT

domain

ontology

calculate

the

semantic

similarity

between

compared

SCT

concepts

[18].

Instance

features

have

the

data

type

Table

Not

relatedness,

semantic

similarity

measures

how

similar

the

meaning

con-

cepts

are

based

the

IS–A

relationship

only

[55].

These

measures

include

edge-based,

node-based

(i.e.,

information

content

and

features-based),

and

hybrid

measures.

Garla

and

Brandt

[27]

have

provided

recent

survey

all

existing

measures.

Most

these

measures

are

suitable

for

WordNet

nouns

only.

not

uti-

lize

Information

Content

(IC),

neither

corpus

nor

intrinsic,

because

none

its

calculation

methods

applicable

SCT;

its

calcu-

lation

time

consuming

[55];

inaccurate

due

shallow

annotations

[27].

The

most

popular

methods

for

intrinsic

are

Seco

al.

[76]

using

Eq.

(10)

and

Sánchez

al.

[77]

using

Eq.

(11).

ICSeco(u)

−log (D(u))

log

|(C)|(10)

ICSanchez =

−

log Leaves(u)/A(u)+1

Max

leaves

1(11)

with

D(u)

−

{v|v

u},

the

set

all

concepts

the

ontology,

leaves

(u)

the

number

leaves

subsumed

the

concept

and

Max

Leaves

the

number

terminal

concepts

the

ontol-

ogy.

propose

new

hybrid

measure

based

path

length

and

concept

features.

First,

for

path

length,

our

similarity

based

Table

The

fuzzy

sets

similarity

matrix

for

age

feature.

Query

Stored

case Young

Middle-aged

Old

Young

0.5

0.1

Middle-aged

0.5

0.6

Old

0.1

0.6

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

197

Fig.

18.

The

structure

the

system

implementation.

the

depth

the

Least

Common

Ancestor

(LCA)

the

two

con-

cepts

and

the

closeness

level

concepts

their

LCA

our

SCT

sub-ontology

based

IS-A

relationship

only.

other

words,

(1)

the

deeper

the

LCA,

the

speciﬁc

considered

and,

thus,

the

similar

the

compared

concepts

are

assumed;

(2)

the

closer

the

two

concepts

are

their

LCA,

the

similar

they

are.

Second,

quantify

similarity

for

concept

features,

the

commonalities

and

differences

between

concepts

must

consid-

ered

[55].

JCOLIBRI

API9uses

four

semantic

similarity

measures:

path-based

such

fdeep

basic

and

fdeep,

and

feature-based

such

cosine,

and

detail

[52].

These

measures

have

been

tested

the

API’s

tutorial;

however,

path-based

measures

not

take

account

the

depth

concepts

from

their

LCA,

and

feature-based

measures

depend

only

the

commonalities

between

compared

concepts.

Our

proposed

measure

overcomes

these

limitations

and

integrates

path

based

and

feature

based

approaches.

The

proposed

composite

similarity

measure

uses

the

equation

the

following

equation:

SIMSemantic (u,

v)=

w1simpath (u,

v)+

w2simfeature (u,

v)(12)

where

w1,

w2∈

(0,

are

weights

for

w1+

w2=

and

simpath(u,

(Eq.

(13))

adapted

version

and

Palmer

[69]

(Eq.

(14))

because

simwu

and

palmer(u,

which

violates

the

Identity

the

Indiscernibles

property

(IOI)

[55].

simpath(u,

=1

simwu

and

palmer otherwise (13)

simwu

and

palmer (u,

depth (lca (u,

v))

shortest

path (u,

lca (u,

v)) +

shortest

path (v,

lca (u,

v)) +

depth (lca(u,

v))

(14)

addition,

simFeature(u,v)

based

Batet

al.

[26],

Eqs.

(15)

and

(16):

simFeature (u,

v)=

−

DistBatet (u,

v)(15)

DistBatet (u,

v)−

log21

+|A(u)\A(v)|

|A(v)\A(u)|

|A(u)\A(v)|

|A(v)\A(u)|

|A(u)

∩

A(v)|

(16)

where

A(u)

the

set

ancestors

i.e.,

A(u)

{v|uv},

A(u)/A(v)

speciﬁcity

and

A(u)

∩

A(v)

the

commonality

between

and

tried

calculate

the

clinical

similarity

between

two

concepts

rather

than

the

semantic

distance.

Clinical

similarity

9http://gaia.fdi.ucm.es/research/colibri/jcolibri.

inﬂuenced

the

clinical

granularity

concepts.

For

example,

consider

the

hierarchy

“megacalycosis

is-a

caliectasis

is-a

kidney-disease”

from

SCT,

semantic

similarity

(kidney

disease,

kidney

disease)

but

clinical

similarity

(kidney

disease,

kidney

disease)

because

kidney

disease

general

and

abstract

concept,

which

means

other

diseases

well.

Moreover,

clinical

similarity

(caliecta-

sis,

caliectasis)

clinical

similarity

(kidney

disease,

kidney

disease)

decided.

The

main

rule

that

the

deeper

the

concept,

the

spe-

ciﬁc

is.

Finally,

the

solution

for

the

most

similar

case

suggested

for

the

new

problem.

Implementation

and

evaluation

6.1.

System

implementation

CBR

system

was

developed

Java

extending

the

APIs

the

JCOLIBRI2

CBR

framework

[52].

shown

Fig.

18,

the

pro-

posed

customization

has

three

layers,

and

each

layer

has

speciﬁc

tasks.

Due

space

restrictions,

not

discuss

this

framework

detail.

The

persistence

layer

prepares

the

fuzzy

case-base

ontology.

The

CBR

application

layer

the

core

the

framework

contains

the

whole

CBR

cycle.

The

interface

layer

accepts

query

from

the

physician

and

returns

the

most

similar

case.

have

implemented

the

case

representation

and

retrieval

steps

only;

case

adaptation

and

retention

are

out

scope.

Due

space

restrictions,

select

only

seven

from

our

∼70

fea-

tures

implement

our

system.

These

features

are

representative

the

dataset

because

includes

fuzzy

features

Age,

HbA1c,

and

BMI;

instance

features

lipid

disease,

liver

disease,

and

nephropa-

thy;

and

nominal

features

gender.

The

fuzziﬁcation

numerical

features

has

been

done

Matlab,

and

for

space

limitation,

will

not

discuss

this

process.

Moreover,

instance

features

have

been

encoded

using

our

SCT

reference

set.

Fig.

shows

the

query

screen

used

collect

patient

attributes.

For

instance

features,

the

user

selects

instance

from

shown

ontology.

Fig.

shows

the

sim-

ilarity

conﬁguration

window;

allows

the

dynamic

selection

similarity

functions

and

weights

for

each

feature;

the

selection

the

number

cases

retrieve

(i.e.,

k).

have

implemented

all

the

proposed

similarity

function

Section

5.5

including

fuzzy

and

semantic.

Spinner

used

let

the

user

choose

from

range

values

control

the

number

retrieved

cases.

The

slider

used

set

the

weight.

Fig.

shows

the

retrieved

cases

with

their

level

similarity.

6.2.

Evaluation

the

proposed

CBR

system

Each

component

the

proposed

system

evaluated

upon

completion.

These

evaluations

have

provided

proof

concept,

illuminated

system

strengths,

and

weaknesses

and

guided

system

198

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

Fig.

19.

new

case

description.

development.

The

proposed

framework

the

ﬁrst

integrate

the

capabilities

standard

medical

ontologies

(i.e.,

SCT),

fuzzy

logic,

ontology,

and

CBR

the

hybrid

system.

This

combination

has

powerful

beneﬁts

CBR

functionality.

the

sub-sections,

evaluate

the

proposed

fuzzy

case-base

ontology

(Section

6.2.1).

Moreover,

evaluate

the

proposed

semantic

retrieval

functions

small

fragment

the

SCT

medical

ontology

(Section

6.2.2).

addition,

the

overall

performance

the

system

evaluated

using

the

case-base

ontology,

the

overall

retrieval

algorithm,

and

the

domain

standard

ontology

(Section

6.2.3).

6.2.1.

Case-base

fuzzy

ontology

evaluation

6.2.1.1.

Our

fuzzy

ontology

evaluation

includes

three

dimensions.

First,

the

ontology

consistency

has

been

checked

using

set

Fig.

20.

Similarity

measures

setting.

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

199

Fig.

21.

Retrieved

similar

cases.

reasoners

including

HermiT

1.3.8,

Fact++,

and

Pellet

2.3.0.

More-

over,

its

fuzziness

consistency

has

been

checked

fuzzyDL

1.1.

Pitfalls

found

the

ontology

modeling

process

were

detected

using

the

OOPS!

Pitfall

Scanner

[78]

and

carefully

corrected.

Second,

checking

correctness,

accuracy,

and

completeness

typically

manual

[48].

Our

Domain

experts

have

validated

the

correctness,

accuracy,

and

completeness

the

built

case-base

fuzzy

ontology.

Regarding

the

correctness,

our

two

domain

experts

reviewed

each

fuzzy

element

(i.e.

fuzzy

datatype,

fuzzy

object

prop-

erty,

and

fuzzy

data

property)

and

asserted

that

these

elements

convey

meaning,

which

indeed

vague

diabetes

diagnosis

domain.

Regarding

the

accuracy,

our

domain

experts

have

uniﬁed

the

fuzziﬁcation

process.

They

asserted

that

for

each

fuzzy

variable

such

HbA1c

lab

test

its

normal

range

modeled

using

triangu-

lar

fuzzy

set

and

the

other

fuzzy

values

such

low

and

high

will

modeled

using

left

and

right

shoulder

functions;

these

shoul-

der

functions

are

overlapped

with

the

normal

range

50%.

All

these

aspects

have

been

reviewed

the

domain

experts,

and

they

asserted

that

the

vagueness

has

been

done

intuitively

accurate

way.

Regarding

the

completeness,

this

fuzzy

ontology

exten-

sion

our

crisp

ontology

[31].

First,

the

crisp

ontology

complete

because

has

been

tested

using

set

competency

questions

and

using

all

medical

concepts

diabetes

diagnosis

domain.

other

words,

our

domain

experts

have

collected

328

medical

terms

from

some

diabetes

diagnosis

CPGs

such

Canadian

Diabetes

Guideline,

and

they

have

tested

the

coverage

the

ontology

for

all

these

terms.

The

ontology

has

100%

concept

coverage

for

all

medical

con-

cepts

required

describe

diabetic

patient

cases.

Second,

domain

experts

have

checked

the

completeness

vagueness.

used

set

SPARQL

and

protégé

queries

verify

the

ability

the

ontology

answer

any

fuzzy

queries

deﬁned

domain

experts;

experts

have

veriﬁed

that

all

the

vagueness

needed

for

diabetes

domain

has

been

represented

the

ontology.

Third,

have

evaluated

our

ontology

using

criteria-based

and

data-driven

approaches.

Brewster

al.

[79]

argued

that

precision

and

recall

are

not

appropriate

for

ontology

evaluation

because

they

depend

comparison

between

concepts

evaluated

ontology

and

standard

one.

There

are

standard

ontology

evaluation

mechanisms

[80].

measure

the

quality

our

ontology,

can

use

criteria-based

data-driven

evaluation

mechanisms.

Regard-

ing

criteria-based

evaluation

mechanisms,

need

compare

with

other

ontologies

the

same

domain.

There

are

other

(fuzzy)

case-base

ontologies

the

medical

domain

compare

our

ontology

with

it.

Alexopoulos

al.

[21]

have

proposed

fuzzy

case-

base

ontology

for

electricity

market

CBR

system.

the

other

hand,

there

are

some

crisp

case-base

ontologies

such

ArgCBROnto

[81]

for

argumentation,

[82]

for

mould

design,

and

[83]

for

resource

management.

There

are

many

proposed

criteria

quantify

the

quality

ontologies

[84].

Some

these

criteria

such

consis-

tency

can

successfully

determined

using

semantic

reasoners.

Some

criteria,

such

clarity

difﬁcult

evaluate

there

are

means

place

determine

them.

Most

the

proposed

criteria

are

overlapped.

From

the

set

criteria

proposed

the

literature,

depend

criteria

proposed

Djedidi

al.

[84],

where

each

criterion

can

measured

metrics.

These

criteria

include:

Complexity

criterion,

which

assesses

structural

and

semantic

links

between

ontology

entities

and

the

navigability

ontology

structure,

Cohesion

criterion,

which

takes

into

account

the

connected

ontol-

ogy

components

(i.e.

classes),

Conceptualization

criterion,

which

corresponds

design

richness

the

ontology

content,

Abstraction

criterion,

which

indicates

class

abstraction

level

(generalization/specialization)

measuring

the

depth

sub-

sumption

hierarchies,

200

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

Table

Ontology

evaluation

quality

metrics.

Measure

Ontology

The

proposed

ontology

ArgCBROnto

Criteria

Metrics

Complexity The

average

number

paths

reach

class

from

the

root 5

Average

number

semantic

relations

(object

properties)

per

class

1.4

1.3

Abstraction

Average

depth

the

ontology

Cohesion

Average

number

connected

components

(classes)

Conceptualization Semantic

richness:

Ratio

the

total

number

semantic

relations

assigned

classes,

divided

the

total

number

ontology

relations

(object

properties

and

subsumption

relations)

58/(58

59)

0.495

38/(38

23)

0.62

Attribute

richness:

Ratio

the

ontology’s

total

number

attributes

(i.e.,

the

data

properties),

divided

the

total

number

classes

138/62

2.26 24/26

0.92

Inheritance

richness:

Average

number

subclasses

per

class

5.0

2.875

Comprehension Documentation

the

properties

2.04%

0.0%

Documentation

the

classes

88.71%

0.0%

Table

Ontology

evaluation

measures

for

three

ontologies.

Knowledge

coverage

measures

The

measure

Number

classes

Number

properties

Number

axioms

Maximum

number

Parents

Documentation

the

classes

Documentation

the

properties

Properties

with

domain

Properties

with

range

Number

individuals

The

proposed

ontology

196

1316

88.71%

2.04%

98.47%

98.98%

2640

ArgCBROnto

[81] 26

446

85.48%

77.41%

Alexopoulos

al.

[21]

N/A

Sclerosi

Microcystic

Withou

t nephrocalcinosi

Cortical cystic

Kidne

disea

Glomerular disease

Glome

rulon

ephrit

Glomer

uloscl

erosis

Acute

Chronic

Acute prolifer

ati

Idiopathic

crescentic

Type I

Type II

Chronic focal Membranous

Stage II

Stage

Stage II

Focal segmental

Hyperfiltration

Classical

Autosom

al recess

ive

Structural

and

nal abnormalities

Cystic dis

ease of kidn

Congenital

Medullary sponge kidney

With nephrocalcinosis

hrocalcino

sis

Macrosco

eonatal

Cortical Medullary

oma

tosis

reni

sUrem

Uremic acidosisUrem

ic neuro

ath

Fig.

22.

sub-graph

SCT

for

kidney

disease.

Completeness

criterion,

which

evaluates

the

ontology

covers

domain

relevant

properties;

This

criterion

has

been

evaluated

previously

the

ontology

concept

coverage,

Comprehension

criterion,

which

assesses

the

facility

under-

standing

ontology.

ArgCBROnto

the

most

complete

ontology,

and

the

other

stud-

ies

have

OWL2

ontologies.

Table

represents

comparison

between

ArgCBROnto

and

our

ontology

regarding

these

metrics.

For

calculating

metrics,

have

used

the

equations

proposed

Zhang

al.

[85].

The

ontology

parameters

used

for

metrics

are

calculated

using

protégé

4.3

ontology

editor’s

evaluation

plugin

(i.e.,

Ontology

Evaluation10).

Protégé

has

other

automatic

evalua-

tion

plugins

such

OntoClean

and

AEON

(Automatic

Evaluation

ONtologies)11.

shown

the

table,

regarding

data-driven

evaluation

mech-

anisms,

Fernández

al.

[86]

have

proposed

another

measure

for

data-driven

evaluation.

They

measure

the

structure

the

ontology

including

number

classes,

properties,

axioms,

and

individuals.

Protégé

Ontology

Evaluation

plugin

calculates

these

metrics

and

others

including

naming

conventions,

class

hierarchy,

object

prop-

erties

hierarchy,

data

type

properties

hierarchy,

documentation,

10 http://protegewiki.stanford.edu/wiki/Ontology

Evaluation.

11 http://code.google.com/p/aeon-project/.

properties

domain

and

range,

disjointness

restrictions,

and

lex-

ically

similar

concepts

and

properties.

The

application

these

measures

summarized

Table

where

our

ontology

does

over-

weight

the

compared

ontologies.

6.2.2.

Evaluation

the

proposed

semantic

retrieval

algorithm

The

proposed

retrieval

algorithm

supports

ﬁve

types

features

including

numerical,

nominal,

fuzzy,

ordinal,

and

semantic.

The

last

type

(i.e.

semantic

type)

measures

the

clinical

distance

between

the

compared

SCT

standard

medical

concepts.

this

section,

the

proposed

semantic

similarity

algorithm

evaluated

comparing

with

the

most

popular

semantic

similarity

algorithms

CBR

(i.e.

with

JCOLIBRI2

[52]).

shown

Fig.

22,

this

done

doing

experiments

using

sub-ontology

from

our

SCT

ontology

for

kidney

diseases,

assuming

that

and

are

0.5

Eq.

(9).

argue

that

there

difference

between

the

lexical,

seman-

tic,

and

clinical

similarity.

Lexical

similarity

depends

the

level

textual

similarity

between

the

two

concepts.

Therefore,

the

lexi-

cal

similarity

SIMlexical (Chronic

focal,

Membranous)

equal

and

this

not

accurate

because

both

197618004|chronic

focal

glomeru-

lonephritis

and

77182004|membranous

glomerulonephritis

are

both

20917003|chronic

glomerulonephritis.

The

semantic

similarity

adds

some

intelligence

this

process.

compare

two

patients

and

with

diseases

“kidney

disease”

and

“renal

disorder”,

then

the

semantic

distance

SimSemantic (D1,

D2)

Another

example,

“autosomal

dominant

focal

segmental

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

201

Table

The

comparison

between

JCOLIBRI

semantic

similarity

methods

and

our

proposed

one.

Method

Similarity

Fdeep

basic

Fdeep

Cosine

Detail

Proposed

method

Case

no.

Sim(Type

Type

7/7

13/14

Sim(“lipomatosis

renis”,

uremia)

1/2

0.11

Sim(Cortical,

Classical) 0

1/5

1/2

0.04

Sim(Chronic,

Stage

II)

4/7

4/6

2/√6

0.66

Sim(Glomerulosclerosis,

Type

2/7

2/√21

0.47

Sim(Glomerulosclerosis,

acute)

2/7

1/2

2/√12

0.383

Sim(Acute,

chronic)

3/7

3/4

5/6

0.889

Sim(Type

chronic)

3/7

3/√21

0.423

glomerulosclerosis”

and

“hyperﬁltration

focal

segmental

glomerulosclerosis”

then

SimSemantic (D1,

D2)

Semantic

similar-

ity

depends

the

ontology

structure

infer

the

level

similarity

between

two

concepts.

However,

the

two

patients

the

second

example

are

similar

than

the

ﬁrst

example.

This

because

while

both

patients

with

“kidney

disease”

from

semantic

perspective

have

the

same

concept

and

therefore

semantic

distance

zero,

when

applying

these

concepts

the

patient

case,

“kidney

dis-

ease”

could

mean

many

other

disease

entities

including

“Medullary

sponge

kidney”,

“medullary

cystic

disease

OS”,

“caliectasis”,

“amy-

loid

nephropathy”,

“hypertensive

renal

disease”,

and

other.

the

other

hand,

the

second

example,

444977005|autosomal

dominant

focal

segmental

glomerulosclerosis

and

236405006|hyperﬁltration

focal

segmental

glomerulosclerosis

refer

speciﬁc

disease

entity.

propose

handle

this

issue

using

the

clinical

simi-

larity

measurement.

clinical

similarity,

the

SimClinical (“kidney

disease”,

“kidney

disease”)

SimClinical (“autosomal

dominant

focal

segmental

glomerulosclerosis”,

“hyperﬁltration

focal

segmental

glomerulosclerosis”).

result,

the

three

similarities

are

not

equal

regarding

accuracy,

i.e.

SimLexical /=

SimClinical /=

SimSemantic.

Our

proposed

similarity

measure

takes

into

account

the

level

speci-

ﬁcity

concept

that

subsumes

the

two

compared

concepts

and

the

level

commonality

between

the

compared

concepts.

result,

shown

Table

the

similarity

Sim

(Type

Type

because

Type

and

Type

are

very

speciﬁc

the

ontology.

The

sim-

ilarity

Sim

(Acute,

Chronic)

0.889

because

these

concepts

are

not

speciﬁc;

they

contains

many

sub-concepts.

Our

algorithm

very

sensitive

the

level

similarity

between

the

compared

concepts.

can

see

Table

Fdeep

basic

and

Fdeep

not

take

account

the

depth

concepts

from

their

LCA

(i.e.,

the

closeness

between

concepts)

cases

Moreover,

Cosine

and

Detail

not

account

for

the

differences

between

concepts

such

cases

What

more,

there

are

distributed

inefﬁciencies

Detail

(Type

Type

Cosine

(“lipomatosis

renis”,

Uremia)

etc.

the

other

hand,

the

proposed

similarity

measure

provides

logically

con-

sistent

results

for

all

types

problems

because

accounts

for

into

account

the

depth

the

compared

concepts

from

their

LCA,

and

takes

the

differences

between

compared

concepts

well

com-

monalities.

Eqs.

(17)–(19)

are

the

implementation

the

semantic

equations

the

JCOLIBRI

OntoBridge

API

environment,

where

(−)

the

difference.

SimWU+Palmer (u,

v)=2

max

ProfLCS (u,

(profConcept (u)−

maxProfLCS (u,

v)) +(profConcept (v)−

maxProfLCS (u,

v)) +(2

maxProfLCS (u,

v)) (17)

Simfeature (u,

v)=

−Math.

log (1

Math.

log (2)(18)

=(super (u,

CN)) −(super (v,

CN))+(super (v,

CN)) −(super (u,

CN))

(super (u,

CN)) −(super (v,

CN))+(super (v,

CN)) −(super (u,

CN))+(super (u,

CN)) −(super (v,

CN))+(super (v,

CN)) ∀(super (u,

CN))(19)

6.2.3.

Performance

the

proposed

system

domain

experts

knowledge

are

known

the

most

rel-

evant

for

evaluating

the

CDSS

performance,

one

measure

the

performance

our

system

the

extent

which

the

proposed

system

decisions

are

matched

with

domain

experts

decisions

[87].

After

system

development,

our

domain

experts

have

conducted

realistic

experiments

test

the

accuracy,

correctness,

ﬂexibility,

applicability,

and

ease

use

the

proposed

diabetes

diagnosis

CDSS

framework.

The

testing

environment

the

Mansura

Univer-

sity

Hospitals,

and

they

have

reported

the

results.

The

results

show

that

our

implemented

CDSS

realistic

model

the

real

world

diabetes

diagnosis.

Patient

symptoms

and

tests

are

collected

real-time

using

crisp,

fuzzy,

text,

and

semantic

values.

the

crisp

value

attribute

not

available,

the

domain

expert

can

set

descriptive

vague

value

according

the

patient’s

description

these

condi-

tions.

The

determination

patient’s

current

diseases

(e.g.,

kidney,

liver,

cancer,

etc.)

selected

from

the

SCT

ontology

form.

SCT

pro-

vides

the

most

comprehensive

and

standard

interface

for

selecting

concepts

that

describe

patient

diseases.

These

values

form

new

query

case,

and

this

case

formatted

the

form

semantic

query

according

the

case

base

fuzzy

ontology.

The

CBR

retrieval

engine

retrieves

the

most

similar

cases.

The

value

selected

the

domain

expert.

The

system’s

produced

decisions

are

compared

with

the

experts’

diagnoses

the

case.

After

execution

the

sys-

tem

for

many

times,

domain

experts

have

evaluated

our

system

100%

regarding

ﬂexibility,

adequacy,

and

ease

use.

Figs.

and

illustrate

the

screen

shots

our

prototype

application

test-

ing

scenario.

have

applied

this

study

case-base

containing

cases

from

Mansura

University

Hospitals.

Out

method

shows

promising

results.

These

results

can

considered

ﬁrst

step

for

real

world

testing

our

proposed

system.

did

the

evaluation

our

system

using

set

measures.

First,

used

the

leave-one-in

evaluation

technique

check

the

accuracy

our

system

retrieve

existing

cases.

Our

system

was

100%

accurate

when

retrieving

existing

cases.

Second,

used

the

leave-one-out

technique

measure

the

performance

for

non-existing

cases.

Namely,

cases

are

taken

out

from

the

case-base

one

one,

and

have

computed

the

simi-

larity

this

case

with

all

the

remaining

cases

the

case-base.

particular

case

cross-validation.

has

been

used

evaluate

many

CBR

systems

including

radiotherapy

planning

system

[88],

diabetes

management

system

[89],

and

Fuzzy

CBR

systems

[90].

The

domain

experts

evaluate

the

performance

the

implemented

framework

organizing

set

experiments.

The

test

cases

are

selected

manner

that

allowed

them

span

the

majority

topics

and

content

represented

the

case

base.

Each

test

query

fed

into

the

system,

and

the

corresponding

response

was

recorded.

202

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

Table

The

system

performance

evaluation.

Test

case

FKI-CBR

decision

Expert

decision

Retrieved

cases

decision

Conﬁdence

(%)

Case

1Diabetic

90.1 Diabetic

Diabetic

88.2

Diabetic

Case

2Pre-diabetic

92 Diabetic

Diabetic

Pre-diabetic

gestational

Case

3Normal

98.1 Normal

Normal

97.7

Normal

95.7

Case

4Normal

98.2 Normal

Normal

94.1

Case

5Pre-diabetic

99.4 Pre-diabetic

Diabetic

gestational

Diabetic

Case

6Pre-diabetic

99 Pre-diabetic

Diabetic

gestational

Diabetic

Case

7Diabetic

gestational

100 Diabetic

gestational

Diabetic

Pre-diabetic

Case

8Diabetic

97 Diabetic

Diabetic

Case

9Diabetic

95 Diabetic

Diabetic

Case

10 Diabetic

94 Diabetic

Diabetic

gestational

Pre-diabetic

Case

11 Diabetic

97 Diabetic

Diabetic

Case

12 Diabetic

98 Diabetic

Diabetic

Case

13 Diabetic

98 Diabetic

Pre-diabetic

Diabetic

91.6

Case

14 Diabetic

94 Diabetic

Diabetic

Case

15 Diabetic

87 Diabetic

Diabetic

Case

16 Diabetic

86 Diabetic

Diabetic

Case

17 Diabetic

93 Diabetic

Diabetic

Normal

Case

18 Diabetic

93 Diabetic

Diabetic

92.5

Diabetic

Case

19 Pre-diabetic

84 Pre-diabetic

Diabetic

Pre-diabetic

Case

20 Pre-diabetic

92 Pre-diabetic

Diabetic

Case

21 Normal

91.5 Normal

Normal

Pre-diabetic

Case

22 Diabetic

94 Diabetic

Diabetic

gestational

Pre-diabetic

Case

23 Diabetic

98.2 Diabetic

Diabetic

93.5

Diabetic

92.6

Case

24 Diabetic

96.5 Diabetic

Pre-diabetic

Diabetic

gestational

Table

(Continued)

Test

case

FKI-CBR

decision

Expert

decision

Retrieved

cases

decision

Conﬁdence

(%)

Case

25 Diabetic

92 Diabetic

Diabetic

Case

26 Diabetic

98 Diabetic

Pre-diabetic

Diabetic

Case

27 Diabetic

93 Diabetic

Diabetic

91.9

Case

28 Diabetic

95.43 Diabetic

Diabetic

95.2

Diabetic

Case

29 Diabetic

95 Diabetic

Pre-diabetic

93.7

Diabetic

92.6

Case

30 Normal

97.74 Normal

Normal

97.6

Normal

94.6

Case

31 Pre-diabetic

91.97 Pre-diabetic

Diabetic

90.8

Diabetic

89.01

Case

32 Diabetic

92.1 Diabetic

Diabetic

89.9

Diabetic

87.7

Case

33 Normal

91.5 Normal

Normal

90.3

Diabetic

Case

34 Diabetic

95.5 Diabetic

Diabetic

87.5

Diabetic

87.2

Case

35 Normal

93.05 Normal

Pre-diabetic

92.2

Pre-diabetic

Case

36 Diabetic

87 Diabetic

Pre-diabetic

Diabetic

85.9

Case

37 Pre-diabetic

92.06 Pre-diabetic

Diabetic

90.8

Diabetic

90.3

Case

38 Diabetic

90.9 Diabetic

Diabetic

Pre-diabetic

Case

39 Pre-diabetic

90.4 Pre-diabetic

Diabetic

87.84

Case

40 Normal

95.79 Normal

Normal

94.63

Normal

94.1

Case

41 Diabetic

97.52 Diabetic

Diabetic

97.03

Diabetic

95.1

Case

42 Normal

92.27 Normal

Pre-diabetic

90.5

Normal

90.3

Case

43 Diabetic

97.5 Diabetic

Diabetic

94.5

Diabetic

93.8

The

proposed

system’s

decisions

are

compared

with

the

domain

expert

ones

[21,71],

and

the

“system’s

effectiveness”

referred

the

amount

right

answers,

that

say,

the

answers

that

verify

what

the

expert

had

said.

other

words,

the

accuracy

inversely

proportional

the

amount

the

system’s

failures.

shown

Table

our

CDSS

takes

decisions

similar

those

domain

expert

for

all

cases

the

test

set.

The

table

contains

three

main

columns

the

proposed

system

decision,

the

conﬁdence

these

decisions,

and

the

corresponding

domain

expert

decisions.

This

study

testiﬁed

the

performance

the

proposed

CBR

approach

through

experiment.

The

system

results

are

contrasted

with

the

domain

expert

decisions

determine

the

results

matched

the

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

203

Table

The

ROC

confusion

matrix.

System

decision

Domain

expert

decision

Positive

Negative

Positive

Negative

diagnosis

expected

the

expert

not.

With

these

data,

the

accu-

racy,

precision,

recall,

accuracy,

and

f-measure

the

system

could

measured.

have

selected

assert

the

system

behavior.

For

example,

case

the

system

has

decided

that

this

case

has

Diabetic

diagnose

for

the

three

choices

with

similarities

90.1,

88.2,

and

88.

The

system

performs

right

for

most

types

diagno-

sis,

e.g.

Pre-diabetes,

Diabetic

(cases

10),

and

Normal.

can

see

Table

there

only

one

false

decision

case

where

the

patient

diabetic,

but

the

system

diagnose

per-diabetic.

The

semantic

performance

the

system

97.67%,

compared

66%

using

Node

Distance

(ND)

metrics

only,

79%

using

similarity

metric

only,

and

82%

using

combination

both

and

[91].

Based

results

Table

use

ROC

confusion

matrix

calculate

the

evaluation

metrics

our

system.

For

Diabetic

deci-

sions

only,

the

values

TP,

FP,

FN,

Table

can

interpreted

as:

•TP

the

CBR

system

decides

the

diabetic

case,

and

domain

expert

decides

diabetic

case.

•FP

the

CBR

system

decides

diabetic

case,

but

the

domain

expert

not.

•FN

the

CBR

system

decides

not

diabetic

case,

but

the

domain

expert

decides

diabetic.

•TN

the

CBR

system

decides

not

diabetic

case,

and

the

expert

decides

not

diabetic

case.

The

above

parameters

can

evaluated

for

Pre-diabetic

and

Normal

well.

For

space

restrictions,

calculate

Precision

(P),

Recall

(R),

Accuracy

(A),

Sensitivity

(S),

Effectiveness

(E),

and

Neg-

ative

Prediction

Value

(NPV)

for

Diabetic

decisions

only

follows.

The

metrics

and

NPV

are

calculated

using

the

following

equations:

Effectiveness (E)=

−

Measure (Score)=1

1/2P+1/2R(20)

Table

Diabetic

decision

confusion

matrix.

System

decision

Domain

expert

decision

Positive

Negative

Positive

Negative

Prediction

Value (NPV)=TN

FN (21)

From

Table

have

calculated

the

values

Table

for

the

proposed

systems.

The

regarding

diabetic

diagnosis

are

=27

27+0=

100%,

=27

27+1=

96.43%,

=27+15

27+15+0+1=

97.67%,

15+0=

100%,

(1/2×(1))+(1/2×(0.9643)) =

98.18%,

and

NPV

15+1=

93.75%

Although,

the

pre-diabetic

and

normal

patients

form

less

than

half

the

case-base,

the

proposed

system

accuracy

for

predicting

them

100%.

The

performance

our

proposed

system

enhanced

because

its

similarity

measures

take

into

account

the

nature

all

features.

6.2.4.

comparison

between

the

proposed

and

other

CBR

systems

Most

the

existing

diabetes

diagnosis

CBR

systems

are

tradi-

tional,

and

they

did

not

provide

adequate

evaluations

[6,8].

Fig.

shows

comparison

with

two

diabetes

diagnosis

systems

[92],

and

asserts

that

our

system

has

better

performance

than

these

sys-

tems.

Montani

al.

[93]

proposed

traditional

CBR

system

for

dia-

betes

care

with

the

accuracy

83%.

The

4DSS

hybrid

CBR–RBR

(Rule

Based

Reasoning)

system

proposed

Marling

[89]

has

retrieval

accuracy

80%.

Fuzzy

case-based

reasoning

has

not

been

used

for

diagnosis

diabetes

before;

however,

has

been

used

for

develop-

ing

other

medical

systems

the

diagnosis

stress

[94].

The

results

this

system

are

Precision

79.16%

and

Recall

79.96%.

Utilizing

fuzzy

ontology

with

rule-based

system

for

diabetes

management

[23]

has

enhanced

the

accuracy

91.2%.

However,

Lee

and

Wang

[23]

used

the

rule-based

reasoning

technique,

which

not

suitable

for

experience-based

problems

such

diabetes

diagnosis.

Moreover,

Lee

and

Wang’s

study

used

the

Pima

Indians

Dataset,

but

use

real

cases

from

Mansura

University

Hospitals

Egypt.

Fig.

23.

comparison

between

the

proposed

system

and

traditional

ones.

204

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

Table

comparison

between

the

accuracy

proposed

system

and

other

studies.

Reasoning

type

Domain

System

name

Purpose

Accuracy

(%)

Fuzzy

CBR Medical

and

semantic

The

proposed

system

Diabetes

diagnosis

97.67

Medical ConFuCiuS

[95]

Diabetes

diagnosis

75.53

CBFDT

[96]

Diagnosis

liver

disorder

Begum

al.

[97]

Diagnosis

stress

Petrovic

al.

[88]

Radiotherapy

planning

84.72

Non-medical Li

al.

[98]

Financial

application

92.36

Arias-Aranda

al.

[99] Knowing

the

relationship

between

ﬂexibility

and

operations

strategy

89.23

Khanum

al.

[22]

Facial

expression

recognition

Han

al.

[100]

Endpoint

prediction

basic

oxygen

furnace

(BOF)

91.98

Sushmita

al.

[101]

Financial

application

Xiong

al.

[102]

Hybrid

rule-CBR

93.25

Martins-Bede

al.

[103]

Classifying

the

prevalence

Schistosomiasis

the

state

Minas

Gerais

Brazil

Jin

al.

[104]

Customer-driven

design

Traditional

CBR Medical System

implemented

our

dataset

Diabetes

diagnosis 57.14

T-IDDM

[93] Diabetes

treatment

and

monitoring

using

conventional

insulin

therapy

Marling

al.

[105]

Type

diabetes

management

insulin

pump

therapy

77.5

Balakrishnan

al.

[106]

Predictive

system

for

diabetic

retinopathy

Bellazzi

al.

[107]

Diabetes

therapy

Marling

al.

[89] 4DSS

system

for

diabetes

diagnosis 80

Based

our

case-base

knowledge,

have

implemented

CBR

system

without

any

semantic

capabilities

(i.e.

neither

case

base

fuzzy

ontology

nor

domain

standard

ontology).

The

resulting

system

has

achieved

precision

85.7%,

recall

42.85%,

accuracy

57.14%,

speciﬁcity

85.7%,

effectiveness

57.13%,

and

NPV

42.9%.

Our

system

has

achieved

better

performance,

which

explains

the

effects

case

base

knowledge

preparation

and

seman-

tic

case

retrieval

algorithms.

Moreover,

our

system

has

only

one

false

case

(Case

Table

7).

One

the

most

important

features

CBR

the

ability

retrieve

similar

cases

the

current

problem.

our

systems,

just

consider

then

our

system

will

have

false

negative

cases,

and

the

accuracy

will

100%.

compare

our

system

with

studies,

Table

compares

our

system’s

performance

with

set

existing

medical

and

non-medical

CBR

studies.

6.2.5.

comparison

the

proposed

system

and

machine

learning

classiﬁers

Shankaracharya

al.

[108]

presented

review

diabetes

diag-

nosis

techniques.

Techniques

such

artiﬁcial

neural

networks

(ANN),

support

vector

machines

(SVMs),

neuro-fuzzy

systems

and

expert

systems

that

developed

different

authors

have

been

dis-

cussed.

Firstly,

all

these

studies

have

lower

performance

than

ours.

However,

these

systems

mostly

depend

Pima

Indians

Dataset12.

compare

our

system

with

these

techniques,

better

run

these

algorithms

our

dataset.

This

dataset

has

been

prepared

before,

and

all

noise

and

missing

data

have

been

handled

[35].

For

the

comparing

purpose,

apply

some

machine

learning

classi-

ﬁers

including

C4.5,

k-NN,

SVM,

Bayesian

classiﬁer,

and

ANN

our

dataset

and

measure

their

performance.

use

the

2-fold,

3-fold,

4-fold.10-fold

The

cross-validation

technique

the

eval-

uation

process.

Cross-validation

statistical

technique

useful

determining

the

robustness

model.

The

n-fold

cross

validation

divides

the

whole

data

set

into

folds.

The

−

folds

are

used

for

training,

and

one

fold

used

for

testing.

This

process

continued

until

each

fold

from

used

for

testing.

12 https://archive.ics.uci.edu/ml/datasets/Pima+Indians+Diabetes.

The

overall

performance

these

algorithms

presented

Table

11.

For

the

k-NN

algorithm,

select

done

our

system;

however,

its

performance

low.

C4.5

achieves

the

best

performance

(about

89.19%)

among

machine-learning

techniques;

however,

our

system

outperforms

it.

After

testing

the

machine

learning

algorithms

using

from

2-fold

10-fold

cross-validation

techniques,

calculate

the

average

performance

each

fold,

and

make

comparison

different

folds’

results.

Fig.

shows

that

the

best

performance

achieved

with

5-fold

cross

validation.

calculate

the

average

precision,

recall,

accuracy,

f-measure,

and

speciﬁcity

for

all

folds.

These

averages

are

compared

with

the

proposed

system,

the

5-fold

cross

validation,

and

the

traditional

(i.e.

not

fuzzy

and

not

semantic)

system,

shown

Fig.

25.

Our

ﬁnd-

ings

show

that

the

fuzzy

KI-CBR

can

classify

data

accurately

than

the

other

machine

learning

techniques

and

conventional

CBR.

can

seen

Fig.

that

the

machine

learning

classiﬁers

have

better

performances

than

conventional

CBR

systems.

This

means

that

our

study

makes

high

improvement

the

CBR

performance.

The

average

accuracies

C4.5,

conventional

CBR,

and

proposed

system

are

88.88%,

57.14%,

and

98.18%,

respectively.

The

proposed

approach

demonstrates

major

improvement

than

machine

learn-

ing

techniques

and

conventional

CBR

system.

The

results

this

study

clearly

indicate

that

the

hybridization

CBR

with

fuzzy

ontology

and

medical

ontologies

the

most

suitable

technique

for

solving

medical

diagnosis

problems.

The

enhanced

performance

our

system

result

couple

reasons.

Firstly,

the

proposed

CBR

framework

integrated

and

complete.

All

com-

ponents

have

been

fully

implemented

and

tested.

The

knowledge

representation

formalism

using

fuzzy

ontology

integrates

the

rea-

soning

capabilities

fuzzy

logic,

description

logic,

and

CBR.

There

are

many

studies,

which

use

each

these

reasoning

mechanisms

individually,

but

they

have

not

achieved

high

accuracy.

The

second

reason

the

preparation

case-base

data.

These

data

have

been

pre-processed,

fuzziﬁed,

and

encoded

before

populated

into

the

case-base

knowledge.

result,

accurate

data

will

produce

accu-

rate

decisions.

The

third

reason

the

usage

suitable

weight

vector

for

the

used

case

features;

the

global

similarity

function

has

produced

suitable

similarities.

The

fourth

reason

the

proposed

semantic

retrieval

algorithm.

have

handled

most

the

possi-

ble

datatypes,

which

appear

the

medical

domain.

The

fuzzy

types

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

205

Table

Performance

machine

learning

algorithms

our

dataset.

Machine

learning

algorithms

Fold

Algorithm

Precision

(%)

Recall

(%)

Accuracy

(%)

F-measure

(%)

Speciﬁcity

(%)

2-Fold C4.5

93.1

93.33

93.1

93.54

k-NN

3) 63.3 63.3 63.33

63.3

64.5

SVM

6.3

58.6

63.33

60.7

67.74

Naive

Bayes

81.8

62.1

70.6

87.09

ANN

65.5

66.66

65.5

67.74

3-Fold C4.5

93.1

91.66

91.5

90.32

k-NN

59.9

64.51

SVM

75.9

73.33

73.3

70.96

Naive

Bayes 65.4 58.6 65

61.8 70.96

ANN

72.4

73.33

72.4

74.19

4-Fold C4.5

89.7

90.32

k-NN

68.7

68.3

68.33

77.41

SVM

70.96

Naive

Bayes

77.3

58.6

71.66

66.7

83.87

ANN

75.9

76.66

75.9

77.41

5-Fold C4.5

92.9

89.7

91.66

91.2

93.54

k-NN

68.3

68.33

68.3

70.96

SVM

78.6 75.9 78.33 77.2

80.64

Naive

Bayes

77.3

58.6

71.66

66.7

83.87

ANN

78.6

75.9

78.33

77.2

80.64

6-Fold C4.5

89.3

86.2

88.33

87.7

90.32

k-NN

61.7

61.66

61.5

67.74

SVM

67.7

72.4

67.74

Naive

Bayes

61.5

55.2

61.66

58.2

67.74

ANN

73.3 75.9 75

74.6

74.19

7-Fold C4.5

89.7

90.32

k-NN

73.6

73.3

73.33

73.2

80.64

SVM

69.7

79.3

73.33

74.2

67.74

Naive

Bayes

70.4

65.5

67.9

74.19

ANN

71.9 79.3 75

75.4

70.96

8-Fold C4.5

89.7

k-NN

3) 68.7

68.3

68.33

77.41

SVM

74.2

79.3

76.66

76.7

74.19

Naive

Bayes

82.6

65.5

76.66

73.1

87.09

ANN

72.4

71.66

71.2

70.96

9-Fold C4.5

89.3

86.2

88.33

87.7

90.32

k-NN

3) 66.8

66.7

66.66

66.4

74.19

SVM

82.8

78.33

78.7

74.19

Naive

Bayes

79.2

65.5

71.7

83.87

ANN

77.4

82.8

77.41

10-Fold C4.5

74.2

79.3

76.66

76.7

90.32

k-NN

73.1

65.5

71.66

69.1

70.96

SVM

77.4

82.8

77.41

Naive

Bayes 79.2 65.5 75

71.7

83.87

ANN

74.2

79.3

76.66

76.7

74.19

Average

(%)

73.88

73.39

75.1

74.04

78.04

Conventional

CBR

system

85.7

42.85

57.14

57.13

85.7

Proposed

fuzzy

KI-CBR

system

100

96.43

97.67

98.18

100

Fig.

24.

comparison

between

the

n-folds

cross

validation

results.

206

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

Fig.

25.

Classiﬁcation

results

comparison.

support

the

reasoning

using

linguistic

terms

and

enhance

the

simi-

larity

calculation.

Ordinal

features’

similarity

based

the

expert

domain

knowledge

the

form

similarity

matrixes.

Semantic

fea-

tures

support

the

calculation

clinical

similarities

between

SCT

concepts.

addition

its

enhanced

performance,

the

proposed

sys-

tem

tested

for

problems

that

are

complex

and

cannot

solved

traditional

systems.

For

example,

the

case

base

contains

case

(age

20,

disease

“Acute

proliferative”,

uri-

nation

frequency

“++”.)

and

the

query

case

(age

young,

disease

“Idiopathic

crescentic”,

urination

frequency

“Nil”.);

the

traditional

CBR

systems,

these

cases

are

not

similar

and

will

not

returned.

For

fuzzy

systems,

the

age

matched

right

age

the

same

age

young

(i.e.,

Young(20)

1).

How-

ever,

the

comparison

semantic

and

ordinal

features

fails

get

the

similarity.

semantic

CBR

systems,

they

fail

get

the

similar-

ity

fuzzy

and

ordinal

features.

Due

these

conditions,

the

results

these

systems

might

prove

not

accurate.

our

proposed

system,

have

proposed

algorithms

handle

all

these

types.

Conclusion

This

paper

proposes

fuzzy

ontology-based

semantic

CBR

system

and

its

implementation

for

decision

support

system

for

diabetes

diagnosis.

This

system

enhances

the

decision

maker

efﬁciency

the

diagnosing

process.

The

proposed

approach

has

many

contributions

and

novelties:

(1)

builds

case-base

fuzzy

ontology

compatible

with

the

most

famous

CBR

framework,

i.e.

JCOLIBRI,

(2)

builds

and

uses

standard

medical

terminology

subset

for

diabetes

diagnosis

from

SCT,

which

the

most

complete

medical

ontology,

and

(3)

proposes

fuzzy-semantic

similarity

algorithm

for

case

retrieval.

Our

implemented

fuzzy

ontology

has

followed

formal

methodology,

and

has

represented

using

fuzzy

OWL2

language.

The

proposed

fuzzy-semantic

retrieval

algorithm

outweighs

all

the

JCOLIBRI

algorithms,

and

covers

their

limitations.

The

integration

path-based

similarity

measures

and

feature-based

measures

enhances

the

accuracy

calculating

clinical

distances

between

concepts.

Our

system

has

achieved

performance

97.67%.

These

results

show

that

the

proposed

system

has

high

accuracy,

and

physicians

can

consult

when

diagnosing

patients.

the

future,

will

implement

the

rest

the

CBR

steps

especially

the

case

adaptation

process.

will

utilize

fuzzy

ontology

the

other

steps

CBR

case

adaptation,

retention,

and

case-base

maintenance.

Moreover,

will

try

integrate

multiple

medical

ontologies

our

system

because

SCT

has

limitation

many

aspects

lab

tests

and

genes

represen-

tation.

Fortunately,

there

are

many

standard

medical

ontologies

for

theses

domains

such

LOINC

for

lab

tests

and

for

genes

representation.

The

integration

CBR

with

EHR

environment

will

enhance

the

automation

the

decision

support

process.

Finally,

will

beneﬁt

from

the

relational

database

for

storing

and

query-

ing

the

case

base

fuzzy

ontology.

The

relational

database

supports

storage

large

case-base

using

semantic

preserving

method.

Acknowledgments

This

project

was

supported

King

Saud

University,

Deanship

Scientiﬁc

Research,

College

Sciences,

Research

Centre.

The

authors

would

thank

Dr.

Farid

Badria,

Prof.

Pharma-

cognosy,

Department

and

head

Liver

Research

Lab,

Mansoura

University,

Egypt;

and

Dr.

Hosam

Zaghloul,

Prof.

Clinical

Pathol-

ogy

Department,

Faculty

Medicine,

Mansoura

University,

Egypt,

for

their

efforts

this

work.

References

[1]

World

Health

Organization

(WHO).

Diabetes;

2015.

http://www.who.int/

mediacentre/factsheets/fs312/en

(accessed:

May

2015).

[2]

Ofori

Unachukwu

Holistic

approach

prevention

and

management

type

diabetes

mellitus

family

setting.

Diabetes

Metab

Syndr

Obes

2014;7:159–68.

[3]

AlJarullah

Decision

tree

discovery

for

the

diagnosis

type

diabetes.

In:

International

conference

innovations

information

technology.

Abu

Dhabi,

UAE:

IEEE;

2011.

303–7.

[4]

Begum

Ahmed

Funk

Xiong

Folke

Case-based

reasoning

systems

the

health

sciences:

survey

recent

trends

and

developments.

IEEE

Trans

Syst

Man

Cybernet,

2010;7(1):39–59.

[5]

Marlinga

Montanib

Bichindaritzc

Funkd

Synergistic

case-based

rea-

soning

medical

domains.

Expert

Syst

Appl

2014;41(2):249–59.

[6]

Jha

Pakhira

Chakraborty

Diabetes

detection

and

care

applying

CBR

techniques.

Int

Soft

Comput

Eng

(IJSCE)

2013;2(6):132–7.

[7]

Jaya

Uma

Role

ontology

case-based

reasoning

(CBR)

for

diagnosing

diabetes.

Inf

Technol

2009;5(3):17–23.

[8]

Chen

Chang

Diabetes

care

decision

support

system.

In:

The

2nd

international

conference

industrial

and

information

systems

(IIS).

2010.

323–6,

[9]

El-Sappagh

Elmogy

Riad

CBR

system

for

diabetes

mellitus

diagnosis:

case-base

standard

data

model.

Int

Med

Eng

Inf

2015;7(3).

[10]

Dendani

Khadir

Guessoum

Use

domain

ontology

develop

knowl-

edge

intensive

CBR

systems

for

fault

diagnosis.

In:

International

conference

information

technology

and

e-Services

(ICITeS).

Sousse,

Tunisia:

IEEE;

2012.

1–6.

[11]

Diaz-Agudo

Gonzalez-Calero

architecture

for

knowledge

intensive

CBR

systems,

advances

case-based

reasoning,

Enrico

Blanzieri

and

Luigi

Portinale,

vol.

1898.

Berlin,

Heidelberg,

Germany:

Springer;

2000.

37–48.

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

207

[12]

Amailef

Ontology-supported

case-based

reasoning

approach

for

intelligent

m-Government

emergency

response

services.

Decis

Support

Syst

2013;55(1):79–97.

[13]

Chen

Huang

Bau

Chen

recommendation

system

based

domain

ontology

and

SWRL

for

anti-diabetic

drugs

selection.

Expert

Syst

Appl

2012;39(4):3995–4006.

[14]

Health

Level

Seven

International

(HL7),

http://www.hl7.org/

(accessed:

August

2015).

[15]

The

International

Health

Terminology

Standards

Development

Organi-

zation

(IHTSDO),

SNOMED

CT:

The

Global

Language

Healthcare,

http://www.ihtsdo.org/snomed-ct

(accessed:

August

2015).

[16]

Melton

Parsons

Inter-patient

distance

metrics

using

SNOMED

deﬁning

relationships.

Biomed

Inf

2006;39:697–705.

[17]

Jirathitikul

Nithisansawadikul

Tongphu

Suntisrivaraporn

sim-

ilarity

measuring

service

for

SNOMED-CT

structural

analysis

concepts

ontology.

In:

The

11th

international

conference

electrical

engineer-

ing/electronics,

computer,

telecommunications

and

information

technology

(ECTI-CON).

2014.

1–6.

[18]

El-Sappagh

Elmogy

El-Masri

Riad

diabetes

diagnostic

domain

ontology

for

CBR

system

from

the

conceptual

model

SNOMED

CT.

In:

The

second

international

conference

engineering

and

technology

(ICET

2014).

Cairo,

Egypt:

IEEE;

2014.

1–7.

[19]

El-Sappagh

Elmogy

Riad

Zaghloul

Badria

proposed

SNOMED

ontology-based

encoding

methodology

for

diabetes

diagnosis

case-base.

In:

The

ninth

international

conference

computer

engineering

and

systems

(ICCES

2014).

Cairo,

Egypt:

IEEE;

2014.

184–91.

[20]

Zadeh

From

engines

question-answering

systems

the

need

for

new

tools,

advances

web

intelligence,

Ernestina

Menasalvas,

Javier

Segovia,

and

Piotr

Szczepaniak,

vol.

2663.

Berlin,

Heidelberg,

Germany:

Springer;

2003.

15–7.

[21]

Alexopoulos

Wallace

Kafentzis

Askounis

Utilizing

imprecise

knowl-

edge

ontology-based

CBR

systems

means

fuzzy

algebra.

Int

Fuzzy

Syst

2010;12(1):1–14.

[22]

Khanum

Mufti

Javed

Shaﬁq

Fuzzy

case-based

reasoning

for

facial

expression

recognition.

Fuzzy

Sets

Syst

2009;160:231–50.

[23]

Lee

Wang

fuzzy

expert

system

for

diabetes

decision

support

applica-

tion.

IEEE

Trans

Syst

Man

Cybernet,

Cybernet

2011;41(1):139–53.

[24]

Zhaoa

Cui

Zhao

Qiu

Chen

Learning

HAZOP

expert

system

case-

based

reasoning

and

ontology.

Comput

Chem

Eng

2009;33:371–8.

[25]

Samwald

Fehre

Bruin

Adlassnig

The

Arden

syntax

standard

for

clinical

decision

support:

experiences

and

directions.

Biomed

Inf

2012;45:

711–8.

[26]

Sánchez

Batet

Isern

Valls

Ontology-based

semantic

similarity:

new

feature-based

approach.

Expert

Syst

Appl

2012;39:7718–28.

[27]

Gan

Dou

Jiang

From

ontology

semantic

similarity:

calculation

ontology-based

semantic

similarity.

Sci

World

2013;2013:1–11.

[28]

Rahimi

Liaw

Taggart

Ray

Validating

ontology-based

algo-

rithm

identify

patients

with

type

diabetes

mellitus

electronic

health

records.

Int

Med

Inf

2014;83(10):768–78.

[29]

Sherimon

Vinu

Krishnan

Takroni

AlKaabi

AlFars

Adaptive

ques-

tionnaire

ontology

gathering

patient

medical

history

diabetes

domain.

Proceedings

the

ﬁrst

international

conference

advanced

data

and

infor-

mation

engineering,

Tutut

Herawan,

Mustafa

Mat

Deris,

and

Jemal

Abawajy,

vol.

285.

Tannery

Lane,

Singapore:

Springer;

2014.

453–60.

[30]

Sugiyanto

Hayuhardhika

Sarno

Sidiq

Weighted

ontology

and

weighted

tree

similarity

algorithm

for

diagnosing

diabetes

mellitus.

In:

Inter-

national

conference

computer,

control,

informatics

and

its

applications

(IC3INA).

IEEE;

2013.

267–72.

[31]

El-Sappagh

El-Masri

Elmogy

Riad

AM,

Saddik

ontological

case

base

engineering

methodology

for

diabetes

management.

Med

Syst

2014;38(8):67–81.

[32]

Samwald

Stenzhorn

Dumontier

Marshall

Luciano

Adlassnig

KP.

Towards

interoperable

information

infrastructure

providing

deci-

sion

support

genomic

medicine.

Stud

Health

Technol

Inform

2011;169:

165–9.

[33]

Samwald

Antonio

Giménez

Boyce

Freimuth

Adlassnig

KP,

al.

Pharmacogenomic

knowledge

representation,

reasoning

and

genome-based

clinical

decision

support

based

OWL

ontologies.

BMC

Med

Inf

Decis

Making

2015;15:pp.12.

[34]

Liaw

Taggart

Khorzoughi

Using

ontologies

identify

patients

with

diabetes

electronic

health

records.

In:

International

semantic

web

conference.

Berlin,

Heidelberg,

Germany:

Springer-Verlag;

2013.

77–80.

[35]

El-Sappagh

Elmogy

Riad

Zaghlol

Badria

EHR

data

prepara-

tion

for

case

based

reasoning

construction.

The

proceedings

the

second

international

conference

advanced

machine

learning

technologies

and

applications

(AMLTA14),

communications

computer

and

information

sci-

ence

(CCIS),

Aboul

Ella

Hassanien,

Mohamed

Tolba,

and

Ahmad

Azar,

vol.

488.

Cham

(ZG),

Switzerland:

Springer

International

Publishing;

2014.

483–97.

[36]

Adlassnig

Fuzzy

set

theory

medical

diagnosis.

IEEE

Trans

Syst

Man

Cyber-

net

1986;16(2):260–5.

[37]

Abdul

Muhammad

Mustapha

Muhammad

Ahmad

Database

workload

management

through

CBR

and

fuzzy

based

characterization.

Appl

Soft

Comput

2014;22:605–21.

[38]

Ekong

Inyang

Onibere

Intelligent

decision

support

system

for

depression

diagnosis

based

neuro-fuzzy-CBR

hybrid.

Modern

Appl

Sci

2012;6(7):79–88.

[39]

Thirugnanam

Kumar

Srivatsan

Nerlesh

Improving

the

prediction

rate

diabetes

diagnosis

using

fuzzy,

neural

network,

case

based

(FNC)

approach.

Procedia

Eng

2012;38:1709–18.

[40]

Alexopoulos

Wallace

Kafentzis

Askounis

IKARUS-Onto:

methodology

develop

fuzzy

ontologies

from

crisp

ones.

Knowl

Inf

Syst

2012;32(3):667–95.

[41]

Rodríguez

Cuéllar

Lilius

Calvo-Flores

fuzzy

ontology

for

semantic

modelling

and

recognition

human

behaviour.

Knowl

Based

Syst

2014;66:46–60.

[42]

Zhang

Yan

Reasoning

fuzzy

relational

databases

with

fuzzy

ontologies.

Int

Intel

Syst

2012;27:613–34.

[43]

Torshizi

Zarandi

Torshizi

Eghbali

hybrid

fuzzy-ontology

based

intelligent

systemto

determine

level

severity

and

treatment

recommen-

dation

for

Benign

Prostatic

Hyperplasia.

Comput

Methods

Programs

Biomed

2014;113(1):301–13.

[44]

Carlsson

Brunelli

Mezei

Decision

making

with

fuzzy

ontology.

Soft

Comput

2012;16:1143–52.

[45]

Mezei

Wikström

Carlsson

Aggregating

linguistic

expert

knowledge

type-2

fuzzy

ontologies.

Appl

Soft

Comput

2015;(March),

http://dx.doi.

org/10.1016/j.asoc.2015.03.023.

ISSN:1568-4946;

pii:S1568494615001799.

[46]

Molinera

Galvez

Wikstrom

Viedma

Carlsson

Designing

decision

support

system

for

recommending

smartphones

using

fuzzy

ontologies.

Intel

Syst

2015;323:323–34.

[47]

Ali

Kim

Type-2

fuzzy

ontology-based

opinion

mining

and

infor-

mation

extraction:

proposal

automate

the

hotel

reservation

system.

Appl

Intel

2015;42(3):481–500.

[48]

Ali

Kim

Type-2

fuzzy

ontology-based

semantic

knowledge

for

colli-

sion

avoidance

autonomous

underwater

vehicles.

Inf

Sci

2015;295:441–64.

[49]

Lee

Wang

Hsu

Chen

Type-2

fuzzy

set

and

fuzzy

ontology

for

diet

application.

Adv

Type-2

Fuzzy

Sets

Syst

2013;301:237–56.

[50]

Park

Benedictos

Lee

Wang

Ontology-based

fuzzy-CBR

support

system

for

ship’s

collision

avoidance.

International

conference

machine

learning

and

cybernetics,

vol.

IEEE;

2007.

1845–50.

[51]

The

Open

Biological

and

Biomedical

Ontologies,

http://www.obofoundry.

org/

(accessed:

August

2015).

[52]

Recio-García

Díaz-Agudo

González-Calero

In:

Montani

Stefania,

Jain

Lakhmi,

editors.

The

COLIBRI

platform:

tools,

features

and

working

examples,

successful

case-based

reasoning

applications-2,

vol.

494.

Berlin,

Heidelberg,

Germany:

Springer;

2014.

55–85.

[54]

Aamodt

Plaza

Case-based

reasoning

foundational

issues,

methodological

variations,

and

system

approaches.

Commun

1994;7(1):39–59.

[55]

Harispe

Sanchez

Ranwez

Janaqi

Montmain

framework

for

uni-

fying

ontology-based

semantic

similarity

measures

study

the

biomedical

domain.

Biomed

Inf

2014;48:38–53.

[56]

The

SNOMED

technical

implementation

guide,

July

2014

Interna-

tional

Release,

International

Health

Terminology

Standards

Development

Organization,

http://ihtsdo.org/ﬁleadmin/user

upload/doc/download/

doc TechnicalImplementationGuide

Current-en-US

INT

20140731.pdf

(accessed:

August

2015).

[57]

Gruber

Towards

principles

for

the

design

ontologies

used

for

knowledge

sharing.

Int

Hum

Comput

Stud

1995;43(5–6):907–28.

[58]

Zadeh

Fuzzy

sets.

Inf

Control

1965;8:338–53.

[59]

Bobillo

Managing

vagueness

ontologies.

Spain:

University

Granada;

2008

(Ph.

Thesis).

[60]

Bobillo

Straccia

fuzzyDL:

expressive

fuzzy

description

logic

reasoner.

In:

International

conference

fuzzy

systems

(FUZZ-08).

IEEE

Computer

Soci-

ety;

2008.

923–30.

[61]

Recio-Garía

Díaz-Agudo

In:

Ellis

Richard,

Allen

Tony,

Andrew

Tuson,

editors.

Ontology

based

CBR

with

jCOLIBRI

applications

and

innovations

intelligent

systems

XIV.

London,

WC1X

8HB,

United

Kingdom:

Springer-

Verlag;

2007.

149–62.

[62]

Description

logics,

http://dl.kr.org/

(accessed:

August

2015).

[63]

Bobillo

Straccia

Fuzzy

ontology

representation

using

OWL

Int

Approx

Reason

2011;52(7):1073–94.

[64]

Akmal

Shih

Batres

Ontology-based

similarity

for

product

information

retrieval.

Comput

Ind

2014;65(1):91–107.

[65]

Wang

Ling

Shi

Building

fuzzy

ontology

edutainment

using

OWL.

Comput

Sci—ICCS

2007;4489:591–4.

[66]

Baldarrago

Santos

Prado

UPFON:

uniﬁed

process

for

building

fuzzy

ontology.

In:

The

ninth

international

conference

fuzzy

systems

and

knowl-

edge

discovery

(FSKD).

Chongqing,

Sichuan,

China:

IEEE;

2012.

617–22.

[67]

Ghorbel

Bahri

Bouaziz

Fuzzy

ontologies

building

method:

fuzzy

ontomethodology.

In:

Annual

meeting

the

North

American

Fuzzy

Infor-

mation

Processing

Society

(NAFIPS).

Toronto,

Canada:

IEEE;

2010.

1–8.

[68]

Yaguinuma

Santos

Camargo

Nicoletti

Fuzz-onto:

meta-ontology

for

representing

fuzzy

elements

and

supporting

fuzzy

classiﬁcation

rules.

In:

The

12th

international

conference

intelligent

systems

design

and

applica-

tions

(ISDA).

Cochin,

India:

IEEE;

2012.

166–71.

[69]

Palmer

Verb

semantics

and

lexical

selection.

In:

Proceedings

the

32nd

annual

meeting

association

for

computational

linguistics.

San

Francisco,

CA:

Morgan

Kaufmann

Publishers;

1994.

133–8.

[70]

Ghanea-Hercock

Applied

evolutionary

algorithms

Java.

2003

ed.

New

York,

NY:

Springer-Verlag;

2003.

[71]

Lin

Shih

Lin

Strategy

selection

for

product

service

systems

using

case-based

reasoning.

Afr

Bus

Manage

2010;4(6):987–94.

208

El-Sappagh

al.

Artiﬁcial

Intelligence

Medicine

(2015)

179–208

[72]

Garla

Brandt

Semantic

similarity

the

biomedical

domain:

evaluation

across

knowledge

sources.

BMC

Bioinf

2012;13(1):pp.

261.

[73]

Zhang

Yan

Cheng

Fuzzy

ontology

knowledge

bases

storage

fuzzy

databases,

fuzzy

knowledge

management

for

the

semantic

web.

Stud

Fuzz

Soft

Comput

2014;306:233–42.

[74]

Zhang

Fan

Wang

automatic

fuzzy

semantic

web

ontology

learn-

ing

from

fuzzy

object-oriented

database

model.

Database

Expert

Syst

Appl

2010;6261:16–30.

[75]

Peng

Wang

Zhang

case

retrieval

method

combined

with

similarity

measurement

and

multi-criteria

decision

making

for

concurrent

design.

Expert

Syst

Appl

2009;36:10357–66.

[76]

Seco

Veale

Hayes

intrinsic

information

content

metric

for

semantic

similarity

WordNet.

The

16th

Eur

Conf

Artif

Intell

(ECAI),

vol.

16.

Amster-

dam,

Netherlands:

IOS

Press;

2004.

1089–90.

[77]

Sánchez

Batet

Isern

Ontology-based

information

content

computa-

tion.

Knowledge-Based

Syst

2011;24:297–303.

[78]

Poveda-Villalón

Suárez-Figueroa

Gómez-Pérez

Validating

ontologies

with

OOPS!

Knowl

Eng

Knowl

Manage

2012;7603:267–81.

[79]

Brewster

Alani

Dasmahapatra

Wilks

Data

driven

ontology

evalua-

tion.

In:

Proceedings

the

international

conference

language

resources

and

evaluation.

2004.

164–8.

[80]

Bright

Furuya

Kuperman

Cimino

Bakken

Development

and

evalu-

ation

ontology

for

guiding

appropriate

antibiotic

prescribing.

Biomed

Inf

2012;45:120–8.

[81]

Heras

Botti

Julian

ArgCBROnto:

knowledge

representation

formalism

for

case-based

argumentation.

Agreement

Technol

2013;8068:105–19.

[82]

Guo

Peng

CBR

system

for

injection

mould

design

based

ontol-

ogy:

case

study.

Comput

-Aided

Des

2012;44(6):496–508.

[83]

Zhukova

Kultsova

Navrotsky

Dvoryankin

Intelligent

support

decision

making

human

resource

management

using

case-based

reasoning

and

ontology.

Knowledge-Based

Softw

Eng

2014;466:172–84.

[84]

Djedidi

Aufaure

ONTO-EVOAL

ontology

evolution

approach

guided

pattern

modeling

and

quality

evaluation.

In:

Foundations

information

and

knowledge

systems,

Sebastian

Link

and

Henri

Prade.

Berlin,

Heidelberg,

Germany:

Springer;

2010.

286–305.

[85]

Zhang

Yang

evaluation

method

for

ontology

complexity

analysis

ontology

evolution,

managing

knowledge

world

networks,

Steffen

Staab

and

Vojtˇ

ech

Svátek,

4248.

Berlin,

Heidelberg,

Germany:

Springer-

Verlag;

2006.

214–21.

[86]

Fernández

Overbeeke

Sabou

Motta

What

makes

good

ontology?

case

study

Fine-Grained

knowledge

reuse.

Semant

Web

2009;5926:61–75.

[87]

Satter

Cohen

Ortiz

Kahol

Mackenzie

Olson

al.

Avatar-based

simulation

the

evaluation

diagnosis

and

management

mental

health

disorders

primary

care.

Biomed

Inform

2012;45:1137–50.

[88]

Petrovic

Mishra

Sundar

novel

case

based

reasoning

approach

radiotherapy

planning.

Expert

Syst

Appl

2011;38:10759–69.

[89]

Marling

Wiley

Cooper

Bunescu

Shubrook

Schwartz

The

dia-

betes

support

system:

case

study

CBR

research

and

development.

In:

Ram

Ashwin,

Wiratunga

Nirmalie,

editors.

The

proceeding

the

19th

inter-

national

conference

case-based

Reasoning

(ICCBR).

Berlin,

Heidelberg,

Germany:

Springer;

2011.

137–50.

[90]

Armengol

Esteva

Godo

Torra

learning

similarity

relations

fuzzy

case-based

reasoning.

Trans

Rough

Sets

2005;3135:14–32.

[91]

Fernandes

Grosse

Krishnamurty

Witherell

Wileden

Semantic

meth-

ods

supporting

engineering

design

innovation.

Adv

Eng

Inf

2011;25:185–92.

[92]

Anouncia

Madonna

Jeevitha

Nandhini

Design

diabetic

diagnosis

system

using

rough

sets.

Cybern

Inf

Technol

2013;13(3):124–39.

[93]

Montani

Bellazzi

Portinale

d’Annunzio

Fiocchi

Stefanelli

Diabetic

patients

management

exploiting

case-based

reasoning

techniques.

Comput

Methods

Programs

Biomed

2000;62:205–18.

[94]

Mobyen

Begum

Funk

Xiong

Schéele

Case-based

reasoning

for

diagnosis

stress

using

enhanced

cosine

and

fuzzy

similarity.

Trans

Case-

Based

Reasoning

Multimedia

Data

2008;1(1):3–19.

[95]

Rodriguez

Garcia

Baets

Morell

Bello

connectionist

fuzzy

case-

based

reasoning

model,

MICAI:

advances

artiﬁcial

intelligence,

Alexander

Gelbukh

and

Carlos

Reyes-Garcia.

Berlin,

Heidelberg,

Germany:

Springer;

2006.

176–85.

[96]

Fan

Chang

Lin

Hsieh

hybrid

model

combining

case-based

reason-

ing

and

fuzzy

decision

tree

for

medical

data

classiﬁcation.

Appl

Soft

Comput

2011;11:632–44.

[97]

Begum

Ahmed

Funk

Xiong

Schéele

case-based

decision

support

system

for

individual

stress

diagnosis

using

fuzzy

similarity

matching.

Int

Comput

Intell

Appl

2009;25(3):180–95.

[98]

Predicting

ﬁnancial

activity

with

evolutionary

fuzzy

case-based

reasoning.

Expert

Syst

Appl

2009;36:411–22.

[99]

Arias-Aranda

Castro

Navarro

Zurita

CBR

system

for

knowing

the

relationship

between

ﬂexibility

and

operations

strategy.

Found

Intell

Syst

2009;5722:463–72.

[100]

Han

Cao

improved

case-based

reasoning

method

and

its

appli-

cation

end

point

prediction

basic

oxygen

furnace.

Neurocomputing

2015;149(Part

C):1245–52.

[101]

Sushmita

Chaudhury

Hierarchical

fuzzy

case

based

reasoning

with

multi-

criteria

decision

making

for

ﬁnancial

applications.

Pattern

Recognit

Mach

Intell

2007;4815:226–34.

[102]

Xiong

Learning

fuzzy

rules

for

similarity

assessment

case-based

reason-

ing.

Expert

Syst

Appl

2011;38:10780–6.

[103]

Godo

Sandri

Dutra

Freitas

Carvalho

Guimarães

al.

Clas-

siﬁcation

schistosomiasis

prevalence

using

fuzzy

case-based

reasoning.

Bio-Inspired

Syst:

Comput

Ambient

Intell

2009;5517:1053–60.

[104]

Jin

Jie

Ying-hong

Wei-ming

Zhen-fei

New

weighted

fuzzy

case

retrieval

method

for

customer-driven

product

design.

Shanghai

Jiaotong

Univ

(Sci)

2010;15(6):641–50.

[105]

Marling

Shubrook

Schwartz

Case-based

decision

support

for

patients

with

type

diabetes

insulin

pump

therapy.

Advances

case-based

reasoning:

ninth

European

conference

(ECCBR),

Klaus-Dieter

Althoff,

Ralph

Bergmann,

Mirjam

Minor,

and

Alexandre

Hanft,

vol.

5239.

Berlin,

Heidelberg,

Germany:

Springer;

2008.

325–39.

[106]

Balakrishnan

Shakouri

Hoodeh

Loo

Predictions

using

data

mining

and

case-based

reasoning:

case

study

for

retinopathy.

World

Acad

Sci

Eng

Technol

2012;63:573–6.

[107]

Bellazzi

Montani

Portinale

Retrieval

prototype-based

case

library:

case

study

diabetes

therapy

revision.

In:

Smyth

Barry,

Cunningham

Pádraig,

editors.

Advances

Case-Based

Reasoning,

vol.

1488.

Berlin,

Heidelberg,

Germany:

Springer;

1998.

64–75.

[108]

Shankaracharya

Odedra

Vidyarthi

Samanta

Computational

intelli-

gence

early

diabetes

diagnosis:

review.

Rev

Diabetes

Stud

2010;7:252–62.

A fuzzy description logic based IoT framework: Formal verification and end user programming

Article

Full-text available

Mar 2024
PLOS ONE

The Internet of Things (IoT) has become one of the most popular technologies in recent years. Advances in computing capabilities, hardware accessibility, and wireless connectivity make possible communication between people, processes, and devices for all kinds of applications and industries. However, the deployment of this technology is confined almost entirely to tech companies, leaving end users with only access to specific functionalities. This paper presents a framework that allows users with no technical knowledge to build their own IoT applications according to their needs. To this end, a framework consisting of two building blocks is presented. A friendly interface block lets users tell the system what to do using simple operating rules such as “if the temperature is cold, turn on the heater.” On the other hand, a fuzzy logic reasoner block built by experts translates the ambiguity of human language to specific actions to the actuators, such as “call the police.” The proposed system can also detect and inform the user if the inserted rules have inconsistencies in real time. Moreover, a formal model is introduced, based on fuzzy description logic, for the consistency of IoT systems. Finally, this paper presents various experiments using a fuzzy logic reasoner to show the viability of the proposed framework using a smart-home IoT security system as an example.

A Comparative Analysis of Various Machine Learning Methods to Predict Diabetes Mellitus

Article

Full-text available

Apr 2022

The recent advancements in the field of health sciences have produced substantial amount of data such as clinical information that is generated by patient records which is used in AI applications for better diagnosis and predictions. Diabetes belongs to a group of metabolic disorders that affects 422 million people worldwide. This is primarily due to lack of predictive and forecasting measures. Research on several aspects of diabetes has generated huge amounts of data which makes it suitable for application of AI based methods. Presently, several methods have been used for predicting diabetes on the basis of certain factors. However, results of this study show that Support Vector Machine (SVM) and Linear regression when combined with statistical methods, provide much better results compared to AI methods.

Utilization of Fuzzy Ontology for the Meaning of Homonymous and Homophones Ambiguous Sentences

Article

Full-text available

Nov 2023

The ambiguous sentences Homonyms and Homophones become a big problem when processed by computers. From these problems, a Novelty was found; the Novelty created a system that was able to recognize ambiguous sentences of Homonyms and Homophones. The process that the system runs for the first time is to test the proximity of the ambiguous sentences entered with the data set; from this process, the ambiguous sentences entered can already be recognized as the meaning of the sentence. The resulting result is how many per cent the level of similarity. Then the results are processed with the fuzzy ontology method. The results of the Fuzzy Ontology are low similarity level, moderate similarity level, and high similarity level. The method used to analyze this research is the confusion matrix, the precision results obtained were 92%, recall was 100%, and accuracy was 96%. In the future, this research can be used to refine translation results in a translation system.

Type-2 diabetes identification from toe-photoplethysmography using Fourier decomposition method

Article

Full-text available

Nov 2023
NEURAL COMPUT APPL

Type-2 diabetes mellitus (DM-2) is a complicated endocrine and metabolism condition recognized as the most major non-communicable disease in the world. The complications associated with DM-2 involve cardiovascular disease, diabetic retinopathy and neuropathy. This article proposes the Fourier decomposition method for non-invasive automated type-2 diabetes detection using photoplethysmography (PPG) signals. The proposed research work comprises three major phases. In the first phase, the 5-min duration of the toe PPG signal is split into 10-s segments and decomposed into frequency subbands known as Fourier intrinsic band functions (FIBFs). Two features from each FIBF are extracted in the second phase, including kurtosis and log energy entropy. The last stage involves passing the features on to various machine learning techniques. The least-square support vector machine (radial basis function) algorithm yielded better classification results with an accuracy of 98.61%, a sensitivity of 98.96%, and a selectivity of 98.26%.

Data Mining Algorithms for Pharmacovigilance

Article

Full-text available

Dec 2019

In this paper, various data mining algorithms for pharmacovigilance is analyzed and a decision support system for hospital is proposed.. Overall analysis of adverse events of a specific drug helps in finding the potential danger of using the specific drug. Decision support system with good classification accuracy to improve its use in hospital for computer aided diagnosis by doctors is also analyzed

Indeterminacy Handling of Adaptive Neuro-fuzzy Inference System Using Neutrosophic Set Theory: A Case Study for the Classification of Diabetes Mellitus

Article

Full-text available

Jun 2023
IJISA

Early diabetes diagnosis allows patients to begin treatment on time, reducing or eliminating the risk of serious consequences. In this paper, we propose the Neutrosophic-Adaptive Neuro-Fuzzy Inference System (N-ANFIS) for the classification of diabetes. It is an extension of the generic ANFIS model. Neutrosophic logic is capable of handling the uncertain and imprecise information of the traditional fuzzy set. The suggested method begins with the conversion of crisp values to neutrosophic sets using a trapezoidal and triangular neutrosophic membership function. These values are fed into an inferential system, which compares the most impacted value to a diagnosis. The result demonstrates that the suggested model has successfully dealt with vague information. For practical implementation, a single-value neutrosophic number has been used; it is a special case of the neutrosophic set. To highlight the promising potential of the suggested technique, an experimental investigation of the well-known Pima Indian diabetes dataset is presented. The results of our trials show that the proposed technique attained a high degree of accuracy and produced a generic model capable of effectively classifying previously unknown data. It can also surpass some of the most advanced classification algorithms based on machine learning and fuzzy systems.

iCBR Techniques Enabled Cervical Intraepithelial Neoplasia Detection System (CINDS)

Conference Paper

Dec 2023

A deep neural network with modified random forest incremental interpretation approach for diagnosing diabetes in smart healthcare

Article

Dec 2023
APPL SOFT COMPUT

Wearable sensor platform in real time monitoring and early warning of metabolic disorders in human health

Article

Aug 2023

Nowadays, the prevalence of metabolic syndromes (MSs) has attracted increasing concerns as it is closely related to overweight and obesity, physical inactivity and overconsumption of energy, making the diagnosis and real-time monitoring of the physiological range essential and necessary for avoiding illness due to defects in the human body such as higher risk of cardiovascular disease, diabetes, stroke and diseases related to artery walls. However, the current sensing techniques are inconvenient and do not continuously monitor the health status of humans. Alternatively, the use of recent wearable device technology is a preferable method for the prevention of these diseases. This can enable the monitoring of the health status of humans in different health domains, including environment and structure. The use wearable devices with the purpose of facilitating rapid treatment and real-time monitoring can decrease the prevalence of MS and long-time monitor the health status of patients. This review highlights the recent advances in wearable sensors toward continuous monitoring of blood pressure and blood glucose, and further details the monitoring of abnormal obesity, triglycerides and HDL. We also discuss the challenges and future prospective of monitoring MS in humans.

Fuzzy ontology-based Approach for Liver Fibrosis Diagnosis

Article

Aug 2023

Fuzzy Ontologies Building Method: Fuzzy OntoMethodology

Conference Paper

Full-text available

Apr 2010

Building ontologies is very important for diverse domains and especially for semantic Web. We find in the literature many methods and tools for this building. However, the fuzzy aspect is not enough studied in these methods and tools, whereas information systems can include uncertainties and imperfections. The goal of the definition of fuzzy ontologies is to integrate these characteristics. So, we must be able to modulate uncertainties, on the one hand, and to product representations accessible and understandable by machines, on the other hand. If we find actually many building methods and editors for classic ontologies (i.e., crisp or exact), we do not find such methods for fuzzy ontologies. Then, this paper defines our work for fuzzy ontologies building. It presents our fuzzy ontologies building method "Fuzzy OntoMethodology".

A fuzzy ontology for semantic modelling and recognition of human behaviour. Knowl

Article

Full-text available

Jan 2014

Designing a Decision Support System for Recommending Smartphones Using Fuzzy Ontologies

Chapter

Jan 2015

Nowadays, smartphones have become indispensable items for everybody. Thanks to them, people can communicate and access Internet at any time regardless of where they are located. New smartphones belonging to a high amount of labels and with different features and prices keep appearing constantly in the market. This way, there is a need of tools that help buyers to select and buy the smartphone that better fits their necessities. In this article, a decision support system build over a fuzzy ontology has been designed in order to help people to select the perfect smartphone for them. Linguistic labels are used in order to provide the buyer with a comfortable way of expressing himself/herself.

Data Driven Ontology Evaluation

Conference Paper

Jan 2004

The evaluation of ontologies is vital for the growth of the Semantic Web. We consider a number of problems in evaluating a knowledge artifact like an ontology. We propose in this paper that one approach to ontology evaluation should be corpus or data driven. A corpus is the most accessible form of knowledge and its use allows a measure to be derived of the ‘fit’ between an ontology and a domain of knowledge. We consider a number of methods for measuring this ‘fit’ and propose a measure to evaluate structural fit, and a probabilistic approach to identifying the best ontology.

Adaptive Questionnaire Ontology in Gathering Patient Medical History in Diabetes Domain

Article

Jan 2014

Clinical Decision Support System (CDSS) can be used to prepare diagnosis from different patient's details and hence physicians or nurses can review this diagnosis for improving the final decision. Due to the lack of CDSS in diabetes and related diseases in Sultanate of Oman, an Ontology based CDSS is proposed here. The deployed key components of the system are Adaptive Questionnaire Ontology, patient's semantic profile, guideline ontology and risk assessment reasoner. We here propose a model for gathering the patient medical history based on dynamic questionnaire ontology. Ontology is among the most powerful tools to encode medical knowledge semantically. It is an abstract model which represents a common and shared understanding of a domain. The model is explained and implemented for diabetes domain.

Fuzzy Ontology Knowledge Bases Storage in Fuzzy Databases

Article

Jan 2014

In the context of the Semantic Web, fuzzy extensions to OWL (the W3C standard ontology language) and Description Logics (DLs, the logical foundation of OWL) have been extensively investigated as introduced in Chap. 4, and many real knowledge bases based on fuzzy DLs and fuzzy OWL tend to become very large to huge. Therefore, how to store fuzzy knowledge bases has become an important issue. Based on the widespread investigation of fuzzy relational databases, in this chapter, we briefly introduce how to store fuzzy knowledge bases in fuzzy relational databases. Until now, there are a few papers discussing fuzzy DL or ontology knowledge base storage, which is still an open problem. Much work about fuzzy DL and ontology knowledge base storage may be needed for supporting the fuzzy knowledge management in the Semantic Web.

The COLIBRI platform: Tools, features and working examples

Article

Sep 2014

COLIBRI is an open source platform for the development of Case-based reasoning (CBR) systems. It supports the development of different families of specialized CBR systems: from Textual CBR to Knowledge Intensive applications. This chapter provides a functional description of the platform, its capabilities and tools. These features are illustrated with real examples of working systems that have been developed using COLIBRI. This overview should serve to motivate and guide those readers that plan to develop CBR systems and are looking for a tool that eases this task.

An improved case-based reasoning method and its application in endpoint prediction of basic oxygen furnace

Article

Feb 2015
NEUROCOMPUTING

Case retrieval and case revise (reuse) are core parts of case-based reasoning (CBR). According to the problems that weights of condition attributes are difficult to evaluate in case retrieval, and there are few effective strategies for case revise, this paper introduces an improved case-based reasoning method based on fuzzy c-means clustering (FCM), mutual information and support vector machine (SVM). Fuzzy c-means clustering is used to divide case base to improve efficiency of the algorithm. In the case retrieval process, mutual information is introduced to calculate weights of each condition attribute and evaluate their contributions to reasoning results accurately. Considering the good ability of the support vector machine for dealing with limited samples, it is adopted to build an optical regression model for case revise. The proposed method is applied in endpoint prediction of Basic Oxygen Furnace (BOF), and simulation experiments based on a set of actual production data from a 180 t steelmaking furnace show that the model based on improved CBR achieves high prediction accuracy and good robustness.

Type-2 Fuzzy Set and Fuzzy Ontology for Diet Application

Chapter

Jun 2013

Nowadays, most people can get enough energy to maintain one-day activity, while few people know whether they eat healthily or not. It is quite important to analyze nutritional facts of foods eaten for those who are losing weight or suffering chronic diseases such as diabetes. However, diet is a problem with a high uncertainty, and it is widely pointed out that classical ontology is not sufficient to deal with imprecise and vague knowledge for some real-world applications like diet. On the other hand, a fuzzy ontology can effectively help handle and process uncertain data and knowledge. This chapter proposes a type-2 fuzzy set and fuzzy ontology for diet application and uses the type-2 fuzzy markup language (T2FML) to describe the knowledge base and rule base of the diet, including ingredients and the contained servings of six food categories of some common foods in Taiwan. The experimental results show that type-2 fuzzy logic system (FLS) performs better than type-1 FLS, proving that type-2 FLS can provide a powerful paradigm to handle the high level of uncertainties present in diet.

Aggregating linguistic expert knowledge in type-2 fuzzy ontologies

Article

Mar 2015
APPL SOFT COMPUT

In many industrial contexts, knowledge and data provided by experts are imprecise as there seems to be an understanding that “experts do not need precise details as they understand anyway what is meant”. The imprecision inherent in the knowledge that experts acquire in their practice require decision support tools that can be tailored to the specific application contexts to aid complex decisions. As a specific example, expert knowledge expressed in linguistic terms is not precisely structured and concepts are not defined specifically enough in order to be easy to use and process. If we want to represent and use expert knowledge for knowledge-based systems on a general level, that is easily adaptable, we need to find ways to represent and process knowledge elements; our approach is to use interval-valued fuzzy sets, fuzzy ontology and aggregation operators. We show that these instruments will offer us a novel approach for aggregation of imprecise data to obtain actionable knowledge to aid complex decisions. The framework is described and the approach is shown through the context of a fuzzy wine ontology; the problem formulation resembles many features of important and complex decision making problems found in different industries. We describe the potential application of the framework in the case of paper machine maintenance. A web-based application is introduced to better demonstrate the benefits decision-makers can receive from the proposed framework. Additionally, we present an approach to utilize the framework in finding consensual solutions in situations involving several experts.

A fuzzy-ontology oriented case-based reasoning framework for semantic diabetes diagnosis

Abstract and Figures

Recommended publications

Integrating Medical Ontologies into Radiology Reporting Templates

Analog and mixed signal circuit and system ontology

Are SNOMED CT browsers ready for institutions? Introducing MySNOM

Automated Mapping of Observation Archetypes to SNOMED CT Concepts