ArticlePDF Available

Review of heart disease prediction system using data mining and hybrid intelligent techniques

July 2013
International Journal of Soft Computing 3(4):2229-6956

July 2013
3(4):2229-6956

DOI:10.21917/ijsc.2013.0087

License
CC BY-NC-SA 4.0

Authors:

Chitra Jegan

Karunya University

The Healthcare industry generally clinical diagnosis is done mostly by doctor’s expertise and experience. Computer Aided Decision Support System plays a major role in medical field. With the growing research on heart disease predicting system, it has become important to categories the research outcomes and provides readers with an overview of the existing heart disease prediction techniques in each category. Neural Networks are one of many data mining analytical tools that can be utilized to make predictions for medical data. From the study it is observed that Hybrid Intelligent Algorithm improves the accuracy of the heart disease prediction system. The commonly used techniques for Heart Disease Prediction and their complexities are summarized in this paper.

Content uploaded by Chitra Jegan

Content may be subject to copyright.

ISSN: 2229-6956(ONLINE) ICTACT JOURNAL ON SOFT COMPUTING, JULY 2013, VOLUME: 03, ISSUE: 04

605

REVIEW OF HEART DISEASE PREDICTION SYSTEM USING DATA MINING AND

HYBRID INTELLIGENT TECHNIQUES

R. Chitra1 and V. Seenivasagam2

1Department of Computer Science and Engineering, Noorul Islam Centre for Higher Education, India

E-mail: jesi_chit@yahoo.co.in

2Department of Information Technology, National Engineering College, India

E-mail: yespee1094@yahoo.com

Abstract

The Healthcare industry generally clinical diagnosis is done mostly by

doctor’s expertise and experience. Computer Aided Decision Support

System plays a major role in medical field. With the growing research

on heart disease predicting system, it has become important to

categories the research outcomes and provides readers with an

overview of the existing heart disease prediction techniques in each

category. Neural Networks are one of many data mining analytical

tools that can be utilized to make predictions for medical data. From

the study it is observed that Hybrid Intelligent Algorithm improves the

accuracy of the heart disease prediction system. The commonly used

techniques for Heart Disease Prediction and their complexities are

summarized in this paper.

Keyword:

Neural Network, Hybrid Intelligent Algorithm, Heart Disease

Prediction, Computer Aided Decision Support System

1. INTRODUCTION

Heart Diseases remain the biggest cause of deaths for the last

two decades. Recently computer technology and machine

learning techniques to develop software to assist doctors in

making decision of heart disease in the early stage. The

diagnosis of heart disease depends on clinical and pathological

data. Heart disease prediction system can assist medical

professionals in predicting heart disease status based on the

clinical data of patients. In biomedical field data mining plays an

essential role for prediction of diseases In biomedical diagnosis,

the information provided by the patients may include redundant

and interrelated symptoms and signs especially when the

patients suffer from more than one type of disease of the same

category. The physicians may not able to diagnose it correctly.

Data mining with intelligent algorithms can be used to tackle

the said problem of prediction in medical dataset involving

multiple inputs. Now a day’s Artificial neural network has been

used for complex and difficult tasks. The neural network is

trained from the historical data with the hope that it will discover

hidden dependencies and that it will be able to use them for

predicting. Feed forward neural networks trained by back-

propagation have become a standard technique for classification

and prediction tasks.

The healthcare industry collects huge amounts of healthcare

data and that need to be mined to discover hidden information

for effective decision making. Discover of hidden patterns and

relationships often go unexploited [6].

Clinicians and patients need reliable information about an

individual’s risk of developing Heart Disease. Ideally, they

would have entirely accurate data and would be able to use a

perfect model to estimate risk. Such a model would be able to

categorize people with heart disease and others. Indeed, the

perfect model would even be able to predict the timing of the

disease’s onset. The risk factors for heart disease can be divided

into modifiable and non modifiable. Modifiable risk factors

include obesity, smoking, lack of physical activity and so on.

The non modifiable risk factors for heart disease are like age,

gender, and family history. Many people have at least one heart

disease risk factor.

Some kinds of heart disease are cardiovascular diseases,

heart attack, coronary heart disease and Stroke. Stroke is a type

of heart disease it is caused by narrowing, blocking, or

hardening of the blood vessels that go to the brain or by high

blood pressure [12, 13]. The rest of the paper is organized as

follows. Section 2 describes the heart disease prediction system

using data mining techniques and the intelligent and hybrid

technique with feature subset selection are discussed in section 3

and 4 respectively.

2. HEART DISEASE PREDICTION USING

DATAMINING

In this section the demining systems used for the

classification of heart disease is analyzed.

N. Deepika et al. proposed Association Rule for

classification of Heart-attack patients [1]. The extraction of

significant patterns from the heart disease data warehouse was

presented. The heart disease data warehouse contains the

screening clinical data of heart patients. Initially, the data

warehouse preprocessed to make the mining process more

efficient. The first stage of Association Rule used preprocessing

in order to handle missing values. Later applied equal interval

binning with approximate values based on medical expert advice

on Pima Indian heart attack data. The significant items were

calculated for all frequent patterns with the aid of the proposed

approach. The frequent patterns with confidence greater than a

predefined threshold were chosen and it was used in the design

and development of the heart attack prediction system. The,

Pima Indian Heart attack dataset used was obtained from the

UCI machine learning repository. Characteristics of the patients

like number of times of chest pain and age in years were

recorded. The actions comprised in the preprocessing of a data

set are the removal of duplicate records, normalizing the values

used to represent information in the database, accounting for

missing data points and removing unneeded data fields.

Moreover it might be essential to combine the data so as to

reduce the number of data sets besides minimizing the memory

and processing resources required by the data mining algorithm

[15]. In the real world, data is not always complete and in the

R CHITRA AND V SEENIVASAGAM: REVIEW OF HEART DISEASE PREDICTION SYSTEM USING DATA MINING AND HYBRID INTELLIGENT TECHNIQUES

606

case of the medical data, it is always true. To remove the number

of inconsistencies which are associated with data we use Data

preprocessing.

K. Srinivas et al. presented Application of Data Mining

Technique in Healthcare and Prediction of Heart Attacks [2].

The potential use of classification based data mining techniques

such as Rule based, Decision tree, Naïve Bayes and Artificial

Neural Network to the massive Volume of healthcare data.

Tanagra data mining tool was used for exploratory data analysis,

machine learning and statistical learning algorithms. The

training data set consists of 3000 instances with 14 different

attributes. The instances in the dataset are representing the

results of different types of testing to predict the accuracy of

heart disease. The performance of the classifiers is evaluated and

their results are analyzed. The results of comparison are based

on 10 tenfold cross-validations. According to the attributes the

dataset is divided into two parts that is 70% of the data are used

for training and 30% are used for testing. The comparison made

among these classification algorithms out of which the naive

Bayes algorithm considered as the best performance algorithm.

The performance of various algorithms is listed below [2].

Table.1. Performance Study of Data mining Algorithms

The algorithm used

Accuracy

Time taken

Naïve Bayes

52.33%

609ms

Decision list

52%

719ms

K-NN

45.67%

1000ms

Diagnosis of heart disease was used Naïve Bayes, K-NN,

Decision List in this Naïve Bayes has taken a time to run the

data for accurate result when compared to other algorithms.

Sudha et al. [11] to propose the classification algorithm like

Naïve Bayes, Decision tree and Neural Network for predicting

the stroke diseases. The classification algorithm like decision

trees, Bayesian classifier and back propagation neural network

were adopted in this study. The records with irrelevant data were

removed from data warehouse before mining process occurs.

Data mining classification technology consists of classification

model and evaluation model. The classification model makes use

of training data set in order to build classification predictive

model. The testing data set was used for testing the classification

efficiency. Then the classification algorithm like decision tree,

naive Bayes and neural network was used for stroke disease

prediction. The performance evaluation was carried out based on

three algorithms and compared with various models used and

accuracy was measured. While comparing these classification

algorithms, the observation shows the neural network

performance was more than the other two algorithms.

M A. Jabbar et al. proposed Association Rule mining based

on the sequence number and clustering for heart attack

prediction [16]. The entire database is divided into partitions of

equal size. The dataset with 14 attributes was used in that work

and also each cluster is considered one at a time for calculating

frequent item sets. This approach reduces main memory

requirement. To predict the heart attack in an efficient way the

patterns are extracted from the database with significant weight

calculation. The frequent patterns having a value greater than a

predefined threshold were chosen for the valuable prediction of

heart attack. Three mining goals were defined based on data

exploration and all those models could answer complex queries

in predicting heart attack.18].

Mai Shouman, et al. [21] proposed k-means clustering with

the decision tree method to predict the heart disease. In their

work they suggested several centroid selection methods for k-

means clustering to increase efficiency. The 13 input attributes

were collected from Cleveland Clinic Foundation Heart disease

data set. The sensitivity, specificity, and accuracy are calculated

with different initial centroids selection methods and different

numbers of clusters. For the random attribute and random row

methods, ten runs were executed and the average and best for

each method were calculated. When comparing integrating k-

means clustering and decision tree with traditional decision tree

applied previously on the same data set, integrating k-means

clustering with decision tree could enhance the accuracy of

decision tree in diagnosing heart disease patients. In Addition,

integrating k-means clustering and decision tree could achieve

higher accuracy than the paging algorithm in the diagnosis of

heart disease patients. The accuracy achieved was 83.9% by the

enabler method with two clusters.

3. HEART ATTACK PREDICTION USING

INTELLIGENT TECHNIQUES

In this section the role of intelligent technique such as neural

network for heart disease prediction is explained

Latha Parthiban and R. Subramanian presented Intelligent

Heart Disease Prediction System using CANFIS and Genetic

Algorithm [4]. Adaptable based fuzzy inputs are adapted with a

modular neural network to rapidly and accurately approximate

complex functions. The CANFIS model combined the neural

network adaptive capabilities and the fuzzy logic quantitative

approach then integrated with genetic algorithm to diagnosis the

presence of the disease. Coactive neuro-fuzzy inference system

model has good training performance and classification

accuracies. Dataset of heart disease was obtained from UCI

Machine Learning Repository .Coactive Neuro-fuzzy modeling

was proposed as a dependable and robust method developed to

identify a nonlinear relationship and mapping between the

different attributes.

Dangare et al. proposed Improved Study of Heart Disease

Prediction System using Data Mining Classification Techniques

[6]. Prediction System for heart disease used system contains

huge amount of data, used to extract hidden information for

making intelligent medical diagnosis. The main objective of this

research was to build Intelligent Heart Disease Prediction

System that gives diagnosis of heart disease using historical

heart database. To develop the system, medical terms such as

sex, blood pressure, and cholesterol like 13 input attributes are

used. To get more appropriate results, two more attributes i.e.

obesity and smoking, as attributes were considered as important

attributes for heart disease. A Multi-layer Perceptron Neural

Networks (MLPNN) that maps a set of input data onto a set of

appropriate. It consists of 3 layers input layer, hidden layer &

output layer. There is connection between each layer & weights

are assigned to each connection. The primary function of

neurons of input layer is to divide input into neurons in hidden

layer. The dataset consists of total 573 records in heart disease

ISSN: 2229-6956(ONLINE) ICTACT JOURNAL ON SOFT COMPUTING, JULY 2013, VOLUME: 03, ISSUE: 04

607

database. The total records are divided into two data sets one is

used for training consists of 303 records & another for testing

consists of 270 records. Initially dataset contained some fields,

in which some value in the records was missing. These were

identified and replaced with most appropriate values using

Replace Missing Values filter. The Replace Missing Values

filter scans all records & replaces missing values with mean

mode method known as Data Preprocessing. After pre-

processing the data, data mining classification techniques such

as Neural Networks, is used for classification. Many problems in

business, science, industry, and medicine can be treated as

classification problems. Owing to the wide range of applicability

of ANN and their ability to learn complex and nonlinear

relationships including noisy or less precise information, neural

networks are well suited to solve problems in biomedical

engineering. So here use for the neural network technique is

classification of medical dataset 14 attributes by considering the

single and multilayer neural network models [19].

Olatubosun Olabode et al. [22] to classify the

Cerebrovascular disease by using artificial neural network with

back propagation error method. The Multi-layer perceptrons

artificial neural networks with back-propagation error method

were feed-forward nets with one or more layers of nodes

between the input and output nodes. These additional layers

contain hidden units or nodes that were not directly connected to

both the input and output nodes. The neural network was trained

using back propagation algorithm with sigmoid function on one

hidden layer with the 16 input attributes. Predictive models were

used in variety of domains for the diagnosis. Dataset for this

work were collected 100 records (60 males and 40 females) from

federal medical fields. The input values obtained from the

records of the forms the input variables in the input layer with 16

nodes. The neural network weights were initialized randomly.

This work range of the weights was between [-0.5 and 0.5] and

the learning rate was set between 0.1 and 0.9. The training,

validation, generalization accuracy was measured.

4. HEART DISEASE WITH FEATURE SUBSET

SELECTION

Feature subset selection is one of the technique used for

dimensionality reduction and thus to reduce the complexity of

the algorithm. But the selection of appropriate feature is

challenging one. In this section the heart disease prediction

system with evolutionary feature selection is explained.

M. Anbarasi et al. proposed Enhanced Prediction of Heart

Disease with Feature Subset Selection using Genetic Algorithm

[5]. Originally 13 attributes involved in prediction of heart

disease, proposed enhanced prediction of heart disease with

feature subset selection using genetic algorithm using 10

attributes for predicting and data mining techniques after

incorporating feature subset selection with high model

construction time. Classification techniques are Naïve Bayes,

Decision Tree and Classification by clustering. The genetic

search starts with zero attributes, and an initial population with

randomly generated rules. Based on the idea of survival of the

fittest, new population is constructed to comply with fittest rules

in the current population, as well as offspring of these rules.

Feature Extraction is the process of detecting and eliminating

irrelevant, weakly relevant or redundant attributes or dimensions

in a given data set. The goal of feature selection is to find the

minimal subset of attributes such that the resulting probability

distribution of data classes is close to original distribution

obtained using all attributes. The reduced data set fed to three

classification models. K fold cross validation method is used as

the test mode. Genetic algorithm is used to determine the

attributes which contribute more towards the diagnosis of heart

ailments which indirectly reduces the number of tests which are

needed to be taken by a patient. The 13 attributes are reduced to

6 attributes using genetic search. Subsequently, three classifiers

like Naive Bayes.

D. Shanthi, et al. [7] proposed to functional model of ANN

to aid existing diagnosis methods. The Back propagation

algorithm was used to train the ANN architecture and the same

had been tested for the various categories of stroke disease. The

data for this study had been collected from 50 patients who had

symptoms of stroke disease. The data had standardized so as to

be error free in nature. All the fifty cases were analyzed after

careful scrutiny with the help of the Physicians. Data were

analyzed in the dataset to define column parameters and data

anomalies. Data analysis information needed for correct data

preprocessing. After data analysis, the values had been identified

as missing, wrong type values or outliers and which columns

were rejected as unconvertible for use with the neural network.

In this study, backward stepwise method was used for input

feature selection. The removal of insignificant inputs had

improved the generalization performance of a neural network.

This method begins with all inputs and it works by removing

one input at each step. At each step, the algorithm finds an input

that least deteriorates the network performance and becomes the

candidate for removal from the input set. The architecture of the

neural network used in this study was the multilayered feed-

forward network architecture with 20 input nodes, 10 hidden

nodes, and 10 output nodes. The number of input nodes was

determined by the finalized data; the number of hidden nodes

was determined through trial and error; and the numbers of

output nodes were represented as a range showing the disease

classification. The most widely used neural-network learning

method is the BP algorithm. Learning in a neural network

involves modifying the weights and biases of the network in

order to minimize a cost function.

In this prediction of combinations of several targets attributes

for intelligent and effective heart attack prediction using data

mining. For predicting heart attack, significantly 15 attributes

are listed and with basic data mining technique other approaches

e.g. ANN, Time Series, Clustering and Association Rules, soft

computing approaches etc. can also be incorporated and

performance were analyzed. In compare to the performance of

predictive data mining technique on the same dataset and the

outcome reveals that Decision Tree outperforms and some time

Bayesian classification is having similar accuracy as of decision

tree but other predictive methods like KNN, Neural Networks,

Classification based on clustering are not performing well. The

second conclusion is that the accuracy of the Decision Tree and

Bayesian Classification further improves after applying genetic

algorithm to reduce the actual data size to get the optimal subset

of attribute sufficient for heart disease prediction [17].

R CHITRA AND V SEENIVASAGAM: REVIEW OF HEART DISEASE PREDICTION SYSTEM USING DATA MINING AND HYBRID INTELLIGENT TECHNIQUES

608

The Application of Artificial Neural Network (ANN) can be

time-consuming due to the selection of input features for the

Multi Layer Perceptron. The number of layers, number of

neurons in each layer was also determined by the input

attributes. Reducing the dimensionality, or selecting a good

subset of features, without sacrificing accuracy, was of great

importance for neural networks to be successfully applied to the

area. D. Shanthi [5] propose a neuro-genetic approach to feature

selection in disease classification.

Fig.1. Architecture of MLP

The Fig.1 shows schematically a typical representation of a

MLP with input neurons, four hidden neurons, and one output

neuron. Each of the input neurons connects to each of the hidden

neurons, and each of the hidden neurons connects to the output

neuron. The Back propagation algorithm, in particular,

adaptively changes the internal network free parameters based

on external after trained, a neural network can make predictions

about the membership of every test example. MLP is trained

with the Back propagation algorithm suffers from the high

number of parameters that need to be tuned, like learning rate,

number of neurons, momentum rate, etc. However, the

motivations to select this algorithm arise after observing that

they had been used to solve problems in different domains,

moreover, the output can be directly used for ranking purposes.

The Back propagation algorithm, in particular, adaptively

changes the internal network free parameters based on external

stimulus. After trained, a neural network can make predictions

about the membership of every test example. MLP was trained

with the Back propagation algorithm suffers from the high

number of parameters that need to be tuned, like learning rate,

number of neurons, momentum rate, etc. However, the

motivations to select this algorithm arise after observing that

they have been used to solve problems in different domains,

moreover, the output can be directly used for ranking purposes.

The fuzzy neuro expert system with genetic feature reduction

is used for diagnosis of heart disease [20].This system will help

the doctors to arrive at a decision about the presence or absence

of heart disease in patients.

5. DISCUSSION

The numerous heart attack predicting system techniques

presented in this paper. In this paper Heart attack prediction

system methodology is categorized in three types. At first type

data mining technique (mainly classification technique) are

analyzed. The second type intelligent techniques used for heart

disease prediction are analyzed. The final type the role of feature

subset in the heart disease prediction is discussed.

In data mining approach the heart disease data warehouse

contains the screening clinical data of heart patients used for

heart disease diagnosis. The classification technique is used in

all proposed work.

In intelligent technique neural network is used for disease

prediction. The MLFFNN with back propagation algorithm is

proposed in many papers discussed in this section. The main

drawback of this system is training time and complexity .The

offline training of the neural network can be advised to reduce

the time complexity. In the final model feature subset selection

only the more significant attributed are extracted to predict

accurate result. Data mining techniques combined with

intelligent and evolutionary computation are discussed in the

reviewed paper. From the result it is observed that the accuracy

of the system improved with the Feature subset selection. But in

this technique also time complexity is high. Selection of the

algorithm for feature reduction is still challenging.

In many papers author used the dataset of Heart disease was

obtained from UCI Machine Learning Repository, University of

California. Hence it might be a good choice for training the

network.

6. CONCLUSION

Heart disease is one of the leading causes of death worldwide

and the early prediction of heart disease is important. The

computer aided heart disease prediction system helps the

physician as a tool for heart disease diagnosis. Some Heart

Disease classification system is reviewed in this paper. From the

analysis it is concluded that, data mining plays a major role in

heart disease classification. Neural Network with offline training

is a good for disease prediction in early stage and the good

performance of the system can be obtained by preprocessed and

normalized dataset. The classification accuracy can be improved

by reduction in features.

REFERENCE

[1] N. Deepika and K. Chandra shekar, “Association rule for

classification of Heart Attack Patients”, International

Journal of Advanced Engineering Science and

Technologies, Vol. 11, No. 2, pp. 253 – 257, 2011.

[2] K. Srinivas, B. Kavitha Rani and Dr. A. Govrdhan,

“Application of Data Mining Techniques in Healthcare and

Prediction of Heart Attacks”, International Journal on

Computer Science and Engineering, Vol. 02, No. 02, pp.

250 - 255, 2011.

[3] Asha Rajkumar and B. Sophia Reena, “Diagnosis Of Heart

Disease Using Data mining Algorithm” , Global Journal

of Computer Science and Technology, Vol. 10, No. 10, pp.

38 - 43, 2010

[4] Latha Parthiban and R. Subramanian, “Intelligent Heart

Disease Prediction System using CANFIS and Genetic

ISSN: 2229-6956(ONLINE) ICTACT JOURNAL ON SOFT COMPUTING, JULY 2013, VOLUME: 03, ISSUE: 04

609

Algorithm”, International Journal of Biological and Life

Science, Vol. 15, pp. 157 - 160, 2007.

[5] M. Anbarasi, E. Anupriya and N.CH.S.N. Iyenga,

“Enhanced Prediction of Heart Disease with Feature Subset

Selection using Genetic Algorithm”, International Journal

of Engineering Science and Technology, Vol. 2, No. 10, pp.

5370 - 5376, 2010

[6] Chaitrali S. Dangare and Sulabha S. Apte, “Improved

Study of Heart Disease Prediction System using Data

Mining Classification Techniques”, International Journal

of Computer Applications, Vol. 47, No. 10, pp. 0975 – 888,

2012

[7] D. Shanthi, G. Sahoo and Dr. N. Saravanan, “Designing an

Artificial Neural Network Model for the Prediction of

Thrombo-embolic Stroke”, International Journal of

Biometric and Bioinformatics, Vol. 3, No. 1, pp. 250 - 255,

2008.

[8] P. K. Anooj, “Clinical decision support system: Risk level

prediction of heart disease using weighted fuzzy rules”,

Journal of King Saud University Computer and

Information Sciences, Vol. 11, pp. 309 - 314, 2011.

[9] J. C. Obi and A. A. Imainvan, “Decision Support System

for the Intelligent Identification of Alzheimer using

Neuro Fuzzy logic”, International Journal on Soft

Computing , Vol. 2, No. 2, pp. 25 - 38, 2011.

[10] D. Shanthi, G. Sahoo and Dr. N. Saravanan, “Evolving

Connection Weights of Artificial Neural Network Using

Genetic Algorithm With Application to the Prediction

Stroke Diseases”, International Journal of Soft Computing,

Vol. 2, pp. 95 - 101, 2009.

[11] A. Sudha, P. Gayathiri and N. Jaisankar, “Effective

Analysis and Predictive Model of Stroke Disease using

Classification Methods”, International Journal of

Computer Applications, Vol. 43, No. 14, pp. 0975 – 8887,

2012.

[12] Tom Dent, “Predicting the risk of coronary heart disease”,

PHG foundation publisher, 2010.

[13] World Health Organization, “Global status report on no

communicable diseases”, 2010.

[14] World Health Organization, “Pocket Guidelines for

Assessment and Management of Cardiovascular Risk”,

2007.

[15] D. Shanthi, G. Sahoo and N. Saravanan “Input Feature

Selection using Hybrid Neuro-Genetic Approach in the

Diagnosis of Stroke Disease”, International Journal of

Computer Science and Network Security, Vol. 8, No.12,

pp. 99 - 106, 2008.

[16] M A. Jabbar, Priti Chandra and B. L. Deekshatulu, “Cluster

based association rule mining for heart attack prediction”,

Journal of Theoretical and Applied Information

Technology, Vol. 32, No.2, pp. 197 - 201, 2011.

[17] Jyoti Soni, Ujma Ansari and Dipesh Sharma, “Predictive

Data Mining for Medical Diagnosis: An Overview of Heart

Disease Prediction”, International Journal of Computer

Applications, Vol. 17, No. 8, pp. 43 – 48, 2011.

[18] K. Srinivas, G. Raghavendra Rao and A. Govardhan,

“Survey on prediction of heart morbidity using data mining

techniques”, International Journal of Data Mining &

Knowledge Management Process, Vol. 1, No. 3, pp. 14 -

34, 2011.

[19] K. Usha Rani, “Analysis of heart diseases dataset using

neural network approach”, International Journal of Data

Mining and Knowledge Management Process ,Vol. 1, No.

5, pp. 1 - 8, 2011.

[20] E. P. Ephzibah and V. Sundarapandian, “A neuro fuzzy

expert system for heart disease diagnosis”, An International

Journal Computer Science & Engineering, Vol. 2, No. 1,

pp. 17 - 23, 2012.

[21] Mai Shouman, Tim Turner and Rob Stocker, “Integrating

Decision Tree and K-Means Clustering with Different

Initial Centroid Selection Methods in the Diagnosis of

Heart Disease Patients”, Proceedings of the International

Conference on Data Mining, 2012.

[22] Olatubosun Olabode and Bola Titilayo Olabode,

“Cerebrovascular Accident Attack Classification Using

Multilayer Feed Forward Artificial Neural Network with

Back Propagation Error”, Journal of Computer Science,

Vol. 8, No. 1, pp.18 - 25, 2012.

A Comprehensive Study on Early Prevention and Detection of Cardiac Health Issues Using Machine Learning and Deep Learning Algorithms

Research

Full-text available

Dec 2023

Heart disease is a major global health concern and a primary cause of death and morbidity, affecting millions of people worldwide. Machine learning techniques, such as data mining tools, have been applied to improve the performance of medical diagnosis for heart disease. The complexity of heart disease calls for a thorough investigation of its various facets, including coronary artery disease, heart failure, arrhythmias, valvular heart disorders, and congenital heart defects. Risk factors for heart disease include genetics, environmental influences, nutrition, exercise, smoking, excessive alcohol use, obesity, diabetes, and hypertension. Early detection and risk assessment are made possible by advanced imaging techniques and diagnostic technologies, leading to timely interventions and improved cardiac health. In this work, we proposed both Machine learning algorithms including Support Vector Machine (SVM), Decision Tree, KNN, XGB classifier, and Logistic Regression and Deep learning algorithm based on the CNN model to show some useful results to improve the diagnostics related to cardiac issues. The study aims to explore the prevalence, risk factors, diagnostic techniques, therapeutic approaches, and efforts to lessen the negative effects of heart disease on public health.

A Comprehensive Study on Early Prevention and Detection of Cardiac Health Issues Using Machine Learning and Deep Learning Algorithms

Article

Full-text available

Dec 2023

Analysis of Cardiovascular Disease Prediction using different techniques of Machine learning

Conference Paper

Full-text available

Jul 2023

Cardiovascular illnesses have been the leading cause of mortality in both advanced and emerging economies during the last several decades. Early diagnosis of heart disorders, as well as continuous clinical monitoring, can help to prevent mortality. However, reliable identification of cardiac illnesses in all situations and 24-hour consultation by a surgeon is also not an option because it requires more intellect, time, and ability. In this study, a basic notion of a computerized heart disease diagnostic was established to identify impending heart illness using Machine learning techniques.. An effective machine learning approach generated from a different examination of many machine learning techniques in an Open Source Public Access Data Analysis Platform, WEKA, should be employed for the accurate identification of heart disease.

Predicting Presence Of Heart Diseases Using Machine Learning Section A-Research Paper Eur

Article

Full-text available

Jun 2023

cardiac illness covers several cardiac issues. According to WHO statistics, cardiovascular illnesses kill 17.9 million people worldwide. This worrying rise in heart disease cases shows how important it is to find it early. The health of the patient is closely linked to early and accurate diagnosis, so this is a very important issue. To deal with this, programmes that use machine learning have been made to find and identify people with heart disease. In this project, we made a heart disease prediction system that uses a patient's medical background to figure out how likely it is that they will be diagnosed with heart disease. The method looks at things like chest pain, high blood pressure, heart attack, high cholesterol, and high blood pressure. Anaconda Python software has been used for machine learning algorithm implementation, experimental data processing, and visualisation. This project demonstrates how Python and machine learning can detect cardiac problems. This project used UCI data. It has 3000 cases, and each one has 14 different characteristics. These examples show the results of different tests done to predict how well heart disease can be diagnosed. The dataset has been split into two parts so that the success of the algorithms can be judged. Seventy percent of the data will be used for training, and the other thirty percent will be used for tests. The study uses SVM, KNN, LR, RF, and DT algorithms to enhance heart disease detection. The goal of these programmes is to be able to tell if a person has heart disease or not. With this study, we want to compare how well different models work and look at the results to improve how well heart disease can be found. Using machine learning techniques and Python code together could be a good way to deal with this world health problem. The accuracy is calculated and analysed using these values. It has 14 classification attributes. We found that random forest is performing well out of all five algorithms giving an accuracy of 98.05%, Decision tree is performing next to random forest giving an accuracy of 97.08%, Support vector machine is giving accuracy of 90.25%, K-nearest neighbour and logistics regression are giving accuracy of 81.82%.

Heart Disease Prediction Using Machine Learning

Article

Full-text available

Mar 2024

Seema S Awathare

Deep Neural Network based Algorithm for Stroke Prediction

Conference Paper

Dec 2022

Predictive Modeling of Cardiovascular Disease using Machine Learning Techniques

Conference Paper

May 2023

Software process management in heart disease prediction using KDD

Conference Paper

Jan 2023

Decentralized Blockchain Application using ReactJs

Article

May 2023

It is a decentralised blockchain application created with the help of ReactJS. It is made up of SOLANA digital currency. Because we can update smart contracts in Solana far more readily than we can in Solidity, Solana is more trustworthy overall. Solana is one of the world's fastest blockchains, processing more than 65,000 transactions per second (TPS). Solana has maintained a spot in the top 10 cryptocurrencies by market cap for more than a year thanks to its speed, cheap transaction cost, and developer-friendly development strategy. This application is basically a clone of Instagram platform in this application we can create the post which is stored on cloud storage and when we add the post it will cost some SOL and it will be stored on the blockchain. Once the post is stored on the blockchain we cannot remove it. To do this process we need a cryptocurrency wallet so for this project we are using the Phantom cryptocurrency wallet. Phantom wallet is a non-custodial Web3.0 wallet, and it is the most popular cryptocurrency wallet for the Solana blockchain..

Agricultural Crop Recommendation Based on Productivity and Season

Conference Paper

Mar 2023

Applications of data mining techniques in healthcare and prediction of heart attacks

Article

Full-text available

Jan 2010

Decision Support System for the Intelligient Identification of Alzheimer using Neuro Fuzzy logic

Article

Full-text available

May 2011

Alzheimer Disease (AD) is a form of dementia; it is a progressive, degenerative disease. Alzheimer is abrain disease that causes problems with memory, thinking and behavior. It is severe enough to interferewith daily activities. Alzheimer symptoms are characterized by memory loss that affects day-to-dayfunction, difficulty performing familiar tasks, problems with language, disorientation of time and place,poor or decreased judgment, problems with abstract thinking, misplacing things, changes in mood andbehavior, changes in personality and loss of initiative. Neuro-Fuzzy Logic explores approximationtechniques from neural networks to find the parameter of a fuzzy system. In this paper, the traditionalprocedure for the medical diagnosis of Alzheimer employed by physician is analyzed using neuro-fuzzyinference procedure. The proposed system is a useful decision support approach for the diagnosis ofAlzheimer.

Clinical decision support system: Risk level prediction of heart disease using weighted fuzzy rules

Article

Full-text available

Jan 2012

Anooj P K

As people have interests in their health recently, development of medical domain application has been one of the most active research areas. One example of the medical domain application is the detection system for heart disease based on computer-aided diagnosis methods, where the data are obtained from some other sources and are evaluated based on computer-based applications. Earlier, the use of computer was to build a knowledge based clinical decision support system which uses knowledge from medical experts and transfers this knowledge into computer algorithms manually. This process is time consuming and really depends on medical experts’ opinions which may be subjective. To handle this problem, machine learning techniques have been developed to gain knowledge automatically from examples or raw data. Here, a weighted fuzzy rule-based clinical decision support system (CDSS) is presented for the diagnosis of heart disease, automatically obtaining knowledge from the patient’s clinical data. The proposed clinical decision support system for the risk prediction of heart patients consists of two phases: (1) automated approach for the generation of weighted fuzzy rules and (2) developing a fuzzy rule-based decision support system. In the first phase, we have used the mining technique, attribute selection and attribute weightage method to obtain the weighted fuzzy rules. Then, the fuzzy system is constructed in accordance with the weighted fuzzy rules and chosen attributes. Finally, the experimentation is carried out on the proposed system using the datasets obtained from the UCI repository and the performance of the system is compared with the neural network-based system utilizing accuracy, sensitivity and specificity.

Integrating Naive Bayes and K-means clustering with different initial centroid selection methods in the diagnosis of heart disease patients

Conference Paper

Full-text available

Dec 2012

Heart disease is the leading cause of death in the world over the past 10 years. Researchers have been using several data mining techniques to help health care professionals in the diagnosis of heart disease. Naïve Bayes is one of the data mining techniques used in the diagnosis of heart disease showing considerable success. K-means clustering is one of the most popular clustering techniques; however initial centroid selection strongly affects its results. This paper demonstrates the effectiveness of an unsupervised learning technique which is kmeans clustering in improving supervised learning technique which is naïve bayes. It investigates integrating K-means clustering with Naïve Bayes in the diagnosis of heart disease patients. It also investigates different methods of initial centroid selection of the K-means clustering such as range, inlier, outlier, random attribute values, and random row methods in the diagnosis of heart disease patients. The results show that integrating k-means clustering with naïve bayes with different initial centroid selection could enhance the naïve bayes accuracy in diagnosing heart disease patients. It also showed that the two clusters random row initial centroid selection method could achieve higher accuracy than other initial centroid selection methods in the diagnosis of heart disease patients showing accuracy of 84.5%.

Predicting the risk of coronary heart disease

Conference Paper

Sep 2010

Tom Dent

Cluster based Association rule mining for Heart Attack Prediction

Article

Oct 2011

This Paper Focuses on Analyzing Medical Data using Association Rule mining for Heart Attack Prediction. Association rule mining is one of the fundamental research topics in data mining and knowledge discovery that finds interesting association or correlation relation ship among a large set of data items and predicts the associative and correlative behaviors for new data. The Association rule mining algorithms must perform efficiently. In this paper we suggest a new approach for association rule mining based on sequence number and clustering the transactional data base for heart attack prediction. Experimental results shows effectiveness of our proposed algorithm.

Cerebrovascular Accident Attack Classification Using Multilayer Feed Forward Artificial Neural Network with Back Propagation Error

Article

Jan 2012
J Comput Sci

Olabode

Problem statement: Most important problems of medical diagnosis. When there is a cerebrovascular accident attach the chances of a successful treatment depends essentially on the early diagnosis. In practice the part of medical errors while diagnosing a stroke type comes to 20-45% even for experienced doctors and the scope of methods of neurovisualization at stroke diagnosis are limited. Approach: In this research study, attempt was made to model the application of Artificial Neural Networks to the classification of patient Cerebrovascular Accident Attack. The Network for the consisted of a three-layer feed forward artificial neural network with back-propagation error method. Results: Data were collected from 100 records of patients at Federal Medical Centre Owo, Nigeria and the Artificial Neural Networks classifier was trained using gradient decent backward propagation algorithm with flexible sigmoid activation function at one hidden layer, with 16 inputs nodes representing stroke onset symptoms at the input layer, 10 nodes at the hidden layer and one node at the output layer representing the type of the attack. Conclusion: The learning Rate γ was set between 0.1 and 0.9 while the epoch set at 150. Initial weight set at Rand (-0.5 and 0.5). The simulation results showed that the model was capable of producing a reasonable forecasting accuracy in short.

Evolving connection weights of artificial neural networks using genetic algorithm with application to the prediction of stroke disease

Article

Jan 2009

In Artificial Neural Network (ANN), the selection of connection weights is a key issue. The usual weights' randomization methods are used to initialize the network weights before training. The main aim of weights randomization techniques is to avoid sigmoid saturation problems that cause slow training. There are different weights randomization methods such as Manual, Automatic, optimized for uniform distribution of networks, optimized for Gaussian distribution of network input and Random seed available. In this study, we proposed a new hybrid model of Neural Networks and Genetic Algorithm (GA) to initialize and optimize the connection weights of ANN so as to improve the perfonnance of the ANN and the same has been applied in a medical problem of predicting stroke disease for verification of the results.

Diagnosis Of Heart Disease Using Datamining Algorithm

Article

Nov 2009

The diagnosis of heart disease is a significant and tedious task in medicine. The healthcare industry gathers enormous amounts of heart disease data that regrettably, are not "mined" to determine concealed information for effective decision making by healthcare practitioners. The term Heart disease encompasses the diverse diseases that affect the heart. Cardiomyopathy and Cardiovascular disease are some categories of heart diseases. The reduction of blood and oxygen supply to the heart leads to heart disease. In this paper the data classification is based on supervised machine learning algorithms which result in accuracy, time taken to build the algorithm. Tanagra tool is used to classify the data and the data is evaluated using 10-fold cross validation and the results are compared.

A Neuro Fuzzy Expert System for Heart Disease Diagnosis

Article

Feb 2012

Heart disease in India is one of the major causes of death. This disease is common not only in old and middle aged people but also in young people. It is caused due to improper diet habits. The proposed system finds a solution to diagnose the disease using some of the evolutionary computing techniques like genetic algorithm, fuzzy rule based learning and neural networks. The neuro fuzzy classification of the disease with the help of genetic algorithms for feature selection is the frame work of the proposed system.

Review of heart disease prediction system using data mining and hybrid intelligent techniques

Abstract

Recommended publications

Analysis of myocardial infarction risk factors in heart disease data set

Coronary Artery Disease Detection Using a Fuzzy-Boosting PSO Approach

A hybrid approach for Heart Disease Prediction using Artificial Neural Network and K-means

An Enhanced Approach for the Prioritisation in Patent Ductus Arteriosus (PDA) Services Using Data Mi...