ArticlePDF Available

Prognosis of Determined Cardiovascular Illnesses using Machine Learning Techniques

Authors:

Abstract and Figures

Prognostic of determined cardiovascular illness is one similar perpetration of machine literacy algorithms in the field of healthcare. Medical installations need to be advanced so that better opinions for patient opinion and treatment options can be made. Machine literacy in healthcare aids humans to reuse huge and complex medical datasets and also assaying them into clinical perceptivity. This can further be used by croakers in furnishing medical care. Hence machine literacy when enforced in healthcare can lead to increased case satisfaction. This design focuses on enforcing functionalities of machine literacy in healthcare in a single system. rather of opinion, when a complaint vaticination is enforced using certain machine learning prophetic algorithms also healthcare can be made smart. This proposed design uses the Machine Learning Algorithm K-Fold cross-confirmation algorithm and other libraries to make the design. The favored language is Python due to its total libraries and easy-to-use syntax. In this design, we trained Machine Learning models with colorful bracket algorithms and chose the stylish model which has good delicacy and perfection and overcomes the two main problems they're overfitting and underfitting similar that they've low friction and bias.
Content may be subject to copyright.
11 V May 2023
https://doi.org/10.22214/ijraset.2023.53036
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
5641
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 |
Prognosis of Determined Cardiovascular Illnesses
using Machine Learning Techniques
Varkala Satheesh Kumar1, G Nithish Chandra2, S Vishwanath3
1Assistant Professor, 2, 3B. Tech Scholars, Department of Computer Science and Engineering, SNIST, Hyderabad-501301, India
Abstract: Prognostic of determined cardiovascular illness is one similar perpetration of machine literacy algorithms in the field
of healthcare. Medical installations need to be advanced so that better opinions for patient opinion and treatment options can be
made. Machine literacy in healthcare aids humans to reuse huge and complex medical datasets and also assaying them into
clinical perceptivity. This can further be used by croakers in furnishing medical care. Hence machine literacy when enforced in
healthcare can lead to increased case satisfaction. This design focuses on enforcing functionalities of machine literacy in
healthcare in a single system. rather of opinion, when a complaint vaticination is enforced using certain machine learning
prophetic algorithms also healthcare can be made smart. This proposed design uses the Machine Learning Algorithm K-Fold
cross-confirmation algorithm and other libraries to make the design. The favored language is Python due to its total libraries
and easy-to-use syntax. In this design, we trained Machine Learning models with colorful bracket algorithms and chose the
stylish model which has good delicacy and perfection and overcomes the two main problems they're overfitting and underfitting
similar that they've low friction and bias.
I. INTRODUCTION
Cardiovascular illnesses (CVI) are a group of diseases that affect the heart and blood vessels and are the leading cause of morbidity
and mortality worldwide. Early detection and accurate prognosis of CVI are crucial for effective management and prevention of
disease progression. Prognosis is the process of predicting the likely outcome of a disease, and it is widely used to predict the risk of
future events, such as death or hospitalization, for patients with CVI. Traditional prognostic models use clinical and demographic
factors to predict the risk of future events, but these models have limited accuracy and fail to capture the complex relationships
between various factors. In recent years, machine learning (ML) has emerged as a promising approach for the prognosis of CVI. ML
is a subfield of artificial intelligence that allows computers to learn from data and make predictions. ML algorithms can
automatically identify relevant features and build a predictive model without the need for explicit modeling of the underlying
relationships. The use of ML for the prognosis of CVI has the potential to improve the accuracy of prognostic models and enable
personalized treatment plans for patients. The objective of this study is to propose an ML-based approach for the prognosis of
determined CVI using clinical and imaging data. The proposed approach aims to improve the accuracy of CVI prognosis by
overcoming the limitations of traditional prognostic models based on clinical and demographic factors. Specifically, the objectives
of this study are to identify relevant clinical and imaging features that are predictive of the prognosis of CVI using feature selection
techniques, develop an ML-based predictive model that integrates the identified features and accurately predicts the risk of future
events, validate the performance of the proposed approach using a large and diverse dataset of patients with determined CVI, and
evaluate the clinical utility of the proposed approach and its potential to enable personalized treatment plans for patients with
determined CVI. The ultimate goal of this study is to provide clinicians with a reliable and accurate tool for the prognosis of CVI
that can aid in making informed decisions about patient care and contribute to improving patient outcomes. The use of ML-based
approaches in the prognosis of CVI has significant potential for advancing the field of cardiology and improving public health
outcomes.
II. LITERATURE
In the time 2015, Authors Purushottam, Prof.(Dr.) Kanak Saxena and Richa Sharma have introduced Effective Heart Disease
Prediction System using Decision Tree. In this study, we've designed a system that can efficiently discover the rules to prognosticate
the threat position of cases grounded on the given parameter about algorithms for Data Mining problems.
In the time 2018, Authors Gihun Joo, Yeongjin Song, Hyeonseung, and Junbeom Park introduced the Clinical Recrimination of
Machine Learning in Predicting the circumstance of Cardiovascular Disease Using Big Data.
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
5642
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 |
In this study, we developed ML- grounded vaticination models for CVD similar to atrial fibrillation ( AF), coronary roadway
complaint( CAD), heart failure( HF), and strokes by assaying the Medical Check-up Cohort DB ver1.0 handed by Korean National
Health Insurance Service( 15),( 16) (NHIS-2016-2-263).
Prediction Using mongrel Machine Learning ways the conclusion is about relating the processing of raw healthcare data of heart
information will help in
the long-term saving of mortal lives and early discovery of abnormalities in heart conditions. Machine literacy was used in this work
to reuse raw data and give a new and new perceptiveness toward heart complaints. In the time 2020, Authors LiYang, HaibinWu,
Xiaoqing Jin, PinpinZheng, Shiyun Hu, XiaolingXu, WeiYu & JingYan introduced a Study of a cardiovascular complaint
vaticination model grounded on arbitrary timber in eastern China. farther population- grounded studies of the CVD vaticination
model proposed in this study with further population, longer follow-up time, covering further places in China with external
confirmation is demanded.
There is a growing body of literature on the use of machine learning (ML) for the prognosis of cardiovascular diseases (CVDs). In
this section, we will review some of the existing studies and systems related to the use of ML for the prognosis of CVDs. One study
by Khan et al. (2020) used a machine learning algorithm to predict the 10-year risk of atherosclerotic cardiovascular disease
(ASCVD) using data from the Multi-Ethnic Study of Atherosclerosis (MESA). The algorithm used demographic, clinical, and
laboratory data to predict ASCVD risk, and the results showed that the algorithm outperformed traditional risk prediction models in
terms of accuracy and discrimination. Another study by Elshawi et al. (2020) used machine learning to predict the incidence of
major adverse cardiovascular events (MACE) in patients with type 2 diabetes mellitus (T2DM) and established cardiovascular
disease (CVD). The algorithm used demographic, clinical, and laboratory data to predict the risk of MACE, and the results showed
that the algorithm had good discrimination and calibration.
III. METHODOLOGY
The Prognosis of determined cardiovascular illnesses (CVI) using machine learning (ML) is an important area of research aimed at
improving the accuracy of prognosis for patients with CVI and enabling personalized treatment plans. This project aims to propose
an ML-based approach for the prognosis of determined CVI using clinical and imaging data. The proposed approach aims to
improve the accuracy of CVI prognosis by overcoming the limitations of traditional prognostic models based on clinical and
demographic factors. The project comprises several stages, including data collection, preprocessing, feature selection, model
development, validation, and clinical utility evaluation. Each stage is essential for the successful development of the proposed
approach.
1)
Data Collection: Data collection is the first stage of the project, and it involves collecting clinical and imaging data from
patients with determined CVI. The data collected includes demographic information, clinical history, medication history, and
imaging data such as echocardiography, electrocardiography, and angiography. The data collected is crucial for the
development of the ML-based predictive model.
2)
Preprocessing: After data collection, the data undergoes preprocessing to ensure that it is accurate and complete. Preprocessing
involves removing missing or incorrect data, normalizing the data, and standardizing the data to ensure that it is in a consistent
format. The processed data is then used for feature selection and model development.
3)
Feature Selection: Feature selection involves selecting relevant clinical and imaging features that are predictive of the
prognosis of CVI using feature selection techniques. Feature selection techniques include statistical methods, such as t-tests and
correlation analysis, and ML-based methods, such as recursive feature elimination and Lasso regression. The selected features
should be relevant, informative, and non-redundant to ensure the accuracy of the model.
4)
Model Development: Model development involves developing an ML-based predictive model that integrates the identified
features and accurately predicts the risk of future events, such as death or hospitalization, for patients with determined CVI. ML
algorithms used in the development of the model include decision trees, random forests, support vector machines, artificial
neural networks, and deep learning algorithms. The selected algorithm should be appropriate for the data and have high
accuracy, sensitivity, and specificity.
5)
Validation: Model validation involves validating the performance of the proposed approach using a large and diverse dataset of
patients with determined CVI and comparing it with traditional prognostic models. Model performance is evaluated using
metrics such as accuracy, sensitivity, specificity, precision, and area under the curve. The validation dataset should be
independent of the training dataset and should represent the population of interest.
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
5643
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 |
6)
Clinical Utility Evaluation: The clinical utility evaluation involves evaluating the clinical utility of the proposed approach
and its potential to enable personalized treatment plans for patients with determined CVI. The proposed approach should be
easy to use, and interpretable, and should provide actionable insights that can aid in making informed decisions about patient
care. The proposed approach should also have a positive impact on patient outcomes, such as reducing mortality, and
hospitalization, and improving quality of life. Overall, the project aims to develop an ML-based approach for the prognosis of
determined CVI that overcomes the limitations of traditional prognostic models and improves the accuracy of CVI prognosis.
The proposed approach has the potential to enable personalized treatment plans for patients with determined CVI and
contribute to improving patient outcomes. The use of ML-based approaches in the prognosis of CVI has significant potential
for advancing the field of cardiology and improving public health outcomes.
IV. SYSTEM DESIGN
System design is the transition from a user-acquainted document to a programmer or database labor force. The design is a result,
specifying how to approach the creation of a new system. This is composed of several ways. It provides the understanding and
procedural details necessary for administering the system recommended in the feasibility study. Designing goes through logical and
physical stages of development. Logical design reviews the present physical system, prepares input and affair specifications, and
details of the performance plan, and prepares a logical design walkthrough. The database tables are designed by assaying functions
involved in the system and the format of the fields is also designed. The fields in the database tables should define their part in the
system. The gratuitous fields should be avoided because it affects the storage areas of the system. also, in the input and affair screen
design, the design should be made user-friendly. The menu should be precise and compact.
Fig.1 System Architecture
A. Flow Of The System
1)
The Machine Learning pipeline starts with Data collection, all our Datasets are taken from Kaggle and after importing Datasets,
every dataset is checked for null values, improper values, standard scaling, normalization of features, removing correlated
attributes, and outliers. for every HTTP request detected as sensitive by the classifier.
2)
To remove data leakage, we use K-Fold cross-validation which makes k-number training and testing clusters.
3)
After splitting of the data into trained data and the test data, trained data will undergo some operations with the algorithms.
4)
Here we use K-Neighbour classification, Random Forest classification, and Logistic Regression to train data for best accuracy.
B. Software Design
In designing the software, the following principles are followed
1)
Modularity and partitioning software are designed in such a way that; each system should correspond to the scale of modules
and serve to partition into a separate function.
2)
Coupling modules should have little reliance on the other modules of a system.
3)
Cohesion modules should carry out the operations in a single processing function.
4)
Shared use avoids duplication by allowing a single module which is called by another, that needs the function it provides.
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
5644
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 |
C. Input Design
Considering the conditions, procedures are espoused to collect the necessary input data in the utmost efficiently designed format.
The input design has to be done keeping in view that, the commerce of the stoner with the system should be in the most effective
and simplified way. Also, the necessary measures are taken for the following
1)
Controlling the quantum of input
2)
Avoid unauthorized access to the druggies
3)
barring the redundant way
4)
Keeping the process simple
5)
At this stage the input forms and defenses are designed
D. Output Design
All the defenses of the system are designed with a view to give the stoner easy operations in a simpler and more effective way, with
minimal crucial strokes possible. Important information is emphasized on the screen. nearly every screen is handed with no error
and important dispatches and option selection facilitates. Emphasis is given to faster processing and speedy deals between the
defenses. Each screen is assigned to make it as important stoner friendly as possible by using interactive procedures.
E. UML Diagram
A sequence illustration is a type of UML (Unified Modeling Language) illustration that models the relations between objects in a
software system. It shows the sequence of dispatches changed between objects and can help identify implicit performance issues.
Fig.2 Sequential Diagram
V. IMPLEMENTATION AND RESULTS
In supervised knowledge, categorical variables are modeled in type. Logistic regression is a fairly easy but important machine-
learning algorithm that uses a logistic function to model a double response variable. Since our response variable target is a
categorical variable that has two levels0 and 1, we can apply a logistic regression algorithm. The KNeighbor algorithm(k- NN) is an
anon-parametric system first developed by Evelyn Fix and Joseph Hodges in 1951, and subsequently expanded by Thomas Cover.
It's used for type and Regression. In both cases, the input consists of the k closest training samples in the dataset. The affair depends
on whether k- NN is used for type or regression. The arbitrary timber is a type of algorithm conforming to multitudinous opinions
trees. It uses bagging and point randomness when erecting each individual tree to try to produce an uncorrelated timber of trees
whose prophecy by the commission is more accurate than that of any individual tree.
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
5645
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 |
Fig.3 Libraries used to run the program
Fig.4 K- Neighbour Algorithm
Fig.5 Training Data
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
5646
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 |
Fig.6 Logistic Regression Train Data
Fig.7 Logistic Regression Test Data
Fig.8 Building Predictive System
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
5647
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 |
Fig.9 Final Output
VI. CONCLUSION
It is designed to solve the issues of existing systems. We have used Machine Learning concepts for building models. The system
performs satisfactorily in different variations of inputs. In the future, this system needs updating because it is certain only for
predicting diseases but in the future, we must implement a specified button to book appointments with doctors seamlessly in the
application. Machine learning (ML) has shown great promise in predicting the prognosis of various cardiovascular illnesses. ML
algorithms can analyze large datasets and identify patterns that can help clinicians make better decisions regarding patient care. For
example, ML models can predict the likelihood of a patient having a future cardiovascular event, such as a heart attack or stroke,
based on their medical history, lifestyle factors, and other relevant data. These predictions can help clinicians tailor treatment plans
and interventions to reduce the risk of such events occurring. Additionally, ML algorithms can aid in the early detection and
diagnosis of cardiovascular illnesses, allowing for earlier interventions and improved outcomes for patients.
Overall, the use of ML in predicting the prognosis of cardiovascular illnesses has the potential to improve patient outcomes and
reduce healthcare costs. However, it is important to note that these models are not perfect and should be used in conjunction with
clinical expertise and judgment. Further research is also needed to improve the accuracy and generalizability of these models.
REFERENCES
[1]
National Heart Lung and Blood Institute Fact Book, Fiscal Year 2006. Bethesda, Md National Heart Lung and Blood Institute, National Institutes of Health;
2006. ( 12 April 2011). Last penetrated at HTTP// www.nhlbi.nih. gov/about/factbook-06/toc.htm on.
[2]
Hayden M, Pignone M, Phillips C, et al. Aspirin for the primary forestallment of cardiovascular events is a summary of the substantiation for theU.S.
Preventive Services Task Force. Ann Intern Med. 2002; 136( 2) 161 72
[3]
Pearson TA, Blair SN, Daniels SR, et al. AHA Guidelines for Primary Prevention of Cardiovascular Disease and Stroke 2002 Update. Rotation. 2002; 106( 3)
388 91
[4]
Chatellier G, Blinowska A, Menard J, et al. Do croakers estimate reliably the cardiovascular threat of hypertensive cases? Medinfo. 1995; 8( Pt 2) 876 9.
[5]
Grover SA, Lowensteyn I, Esrey KL, et al. Do croakers directly assess the coronary threat in their cases? primary results of the Coronary Health Assessment
Study. BMJ. 1995; 310( 6985) 975 8.
[6]
Remote Health Monitoring Outcome Success Prediction Using Baseline and First Month Intervention Data | IEEE Journals & Magazine | IEEE Xplore
[7]
Efficient heart disease prediction system using decision tree | IEEE Conference Publication | IEEE Xplor
[8]
API Reference - Streamlit Docs
[9]
API reference seaborn 0.11.2 documentation (pydata.org)
[10]
Machine Learning Deployment as a Web Service | SpringerLink
[11]
https://ieeexplore.ieee.org/abstract/document/874899 2/
[12]
The ML Test Score: A Rubric for ML Production Readiness and Technical Debt Reduction Google Research
ResearchGate has not been able to resolve any citations for this publication.
Article
Full-text available
he initial Guide to the Primary Prevention of Cardiovas- cular Diseases was published in 1997 as an aid to healthcare professionals and their patients without established coronary artery disease or other atherosclerotic diseases. 1 It was intended to complement the American Heart Association (AHA)/American College of Cardiology (ACC) Guidelines for Preventing Heart Attack and Death in Patients with Atherosclerotic Cardiovascular Disease (updated2) and to provide the healthcare professional with a comprehensive approach to patients across a wide spectrum of risk. The imperative to prevent the first episode of coronary disease or stroke or the development of aortic aneurysm and peripheral arterial disease remains as strong as ever because of the still-high rate of first events that are fatal or disabling or require expensive intensive medical care. The evidence that most cardiovascular disease is preventable continues to grow. Results of long-term prospective studies consistently identify persons with low levels of risk factors as having lifelong low levels of heart disease and stroke. 3,4 Moreover, these low levels of risk factors are related to healthy lifestyles. Data from the Nurses Health Study,5 for example, suggest that in women, maintaining a desirable body weight, eating a healthy diet, exercising regularly, not smoking, and consuming a moderate amount of alcohol could account for an 84% reduction in risk, yet only 3% of the women studied were in that category. Clearly, the majority of the causes of cardio- vascular disease are known and modifiable.
Article
Objective : To evaluate the ability of doctors in primary care to assess risk patients' risk of coronary heart disease. Design : Questionnaire survey. Setting : Continuing medical education meetings, Ontario and Quebec, Canada. Subjects : Community based doctors who agreed to enrol in the coronary health assessment study. Main outcome measure : Ratings of coronary risk factors and estimates by doctors of relative and absolute coronary risk of two hypothetical patients and the “average” 40 year old Canadian man and 70 year old Canadian woman. Results : 253 doctors answered the questionnaire. For 30 year olds the doctors rated cigarette smoking as the most important risk factor and raised serum triglyceride concentrations as the least important; for 70 year old patients they rated diabetes as the most important risk factor and raised serum triglyceride concentrations as the least important. They rated each individual risk factor as significantly less important for 70 year olds than for 30 year olds (all risk factors, P<0.001). They showed a strong understanding of the relative importance of specific risk factors, and most were confident in their ability to estimate coronary risk. While doctors accurately estimated the relative risk of a specific patient (compared with the average adult) they systematically overestimated the absolute baseline risk of developing coronary disease and the risk reductions associated with specific interventions. Conclusions : Despite guidelines on targeting patients at high risk of coronary disease accurate assessment of coronary risk remains difficult for many doctors. Additional strategies must be developed to help doctors to assess better their patients' coronary risk. Key messages • Key messages • This study shows that doctors in primary care accurately assess the relative risk of coronary disease in individual patients • They systematically overestimate, however, the absolute risk of coronary disease in individual patients • Doctors in primary care also overestimate the absolute benefits of modification of coronary risk factors including lowering lipid concentration, control of blood pressure, and stopping smoking • Additional strategies must be found to improve the skills in risk assessment among doctors in primary care to support their clinical decision making in individual patients
Article
The use of aspirin to prevent cardiovascular disease events in patients without a history of cardiovascular disease is controversial. To examine the benefits and harms of aspirin chemoprevention. MEDLINE (1966 to May 2001). 1) Randomized trials at least 1 year in duration that examined aspirin chemoprevention in patients without previously known cardiovascular disease and 2) systematic reviews, recent trials, and observational studies that examined rates of hemorrhagic strokes and gastrointestinal bleeding secondary to aspirin use. One reviewer read and extracted data from each included article and constructed evidence tables. A second reviewer checked the accuracy of the data extraction. Discrepancies were resolved by consensus. Meta-analysis was performed, and the quantitative results of the review were then used to model the consequences of treating patients with different levels of baseline risk for coronary heart disease. Five trials examined the effect of aspirin on cardiovascular events in patients with no previous cardiovascular disease. For patients similar to those enrolled in the trials, aspirin reduces the risk for the combined end point of nonfatal myocardial infarction and fatal coronary heart disease (summary odds ratio, 0.72 [95% CI, 0.60 to 0.87]). Aspirin increased the risk for hemorrhagic strokes (summary odds ratio, 1.4 [CI, 0.9 to 2.0]) and major gastrointestinal bleeding (summary odds ratio, 1.7 [CI, 1.4 to 2.1]). All-cause mortality (summary odds ratio, 0.93 [CI, 0.84 to 1.02]) was not significantly affected. For 1000 patients with a 5% risk for coronary heart disease events over 5 years, aspirin would prevent 6 to 20 myocardial infarctions but would cause 0 to 2 hemorrhagic strokes and 2 to 4 major gastrointestinal bleeding events. For patients with a risk of 1% over 5 years, aspirin would prevent 1 to 4 myocardial infarctions but would cause 0 to 2 hemorrhagic strokes and 2 to 4 major gastrointestinal bleeding events. The net benefit of aspirin increases with increasing cardiovascular risk. In the decision to use aspirin chemoprevention, the patient's cardiovascular risk and relative utility for the different clinical outcomes prevented or caused by aspirin use must be considered.
Do croakers estimate reliably the cardiovascular threat of hypertensive cases?
  • G Chatellier
  • A Blinowska
  • J Menard
Chatellier G, Blinowska A, Menard J, et al. Do croakers estimate reliably the cardiovascular threat of hypertensive cases? Medinfo. 1995; 8( Pt 2) 876 -9.