ArticlePDF Available

AI based Automated Diagnosis of COVID-19 Patients

April 2022
International Journal of Computer Applications 184(9):7-12

April 2022
184(9):7-12

DOI:10.5120/ijca2022922020

Authors:

Jalpaiguri Government Engineering College

The recent Corona Virus Disease 2019 (COVID-19) pandemic has placed severe stress on healthcare systems worldwide, which is amplified by the critical shortage of COVID-19 tests. Effective screening of SARS-CoV-2 enables quick and efficient diagnosis of COVID-19 and can mitigate the burden on healthcare systems. In this study, a satisfactory accurate automated diagnosis model of COVID-19 based on patient symptoms has been proposed by applying several Artificial Intelligence (AI) models. For training, COVID-19 data has been collected from Israeli Ministry of Health publicly released data. We have used Artificial Neural Network and Decision Tree for classification or prediction purpose. The proposed model predicted COVID-19 test results with satisfactory accuracy. Hence, this AI based diagnostic framework can be used, among other considerations, to prioritize testing for COVID-19 when testing resources are limited.

Confusion Matrix for Cross Validation Using ANN

…

Confusion Matrix for Testing New Case Using ANN

…

Confusion Matrix for Cross Validation Using Decision Tree

…

Figures - uploaded by Sudip Mandal

Content may be subject to copyright.

Content uploaded by Sudip Mandal

Content may be subject to copyright.

International Journal of Computer Applications (0975 – 8887)

Volume 184 – No.9, April 2022

AI based Automated Diagnosis of COVID-19 Patients

Sudip Mandal

ECE Department, Jalpaiguri

Government Engineering College

Jalpaiguri, India

Shankhalika Mallick

ECE Department, Jalpaiguri

Government Engineering College

Jalpaiguri, India

Arkaprava Roy

EE Department, Jalpaiguri

Government Engineering College

Jalpaiguri, India

Arghyadip Paul

EE Department,

Jalpaiguri Government Engineering College

Jalpaiguri, India

ABSTRACT

The recent Corona Virus Disease 2019 (COVID-19)

pandemic has placed severe stress on healthcare systems

worldwide, which is amplified by the critical shortage of

COVID-19 tests. Effective screening of SARS-CoV-2 enables

quick and efficient diagnosis of COVID-19 and can mitigate

the burden on healthcare systems. In this study, a satisfactory

accurate automated diagnosis model of COVID-19 based on

patient symptoms has been proposed by applying several

Artificial Intelligence (AI) models. For training, COVID-19

data has been collected from Israeli Ministry of Health

publicly released data. We have used Artificial Neural

Network and Decision Tree for classification or prediction

purpose. The proposed model predicted COVID-19 test

results with satisfactory accuracy. Hence, this AI based

diagnostic framework can be used, among other

considerations, to prioritize testing for COVID-19 when

testing resources are limited.

Keywords

COVID-19; Artificial Intelligence (AI); Artificial Neural

Network (ANN); Decision Tree; Automated Disease

Diagnosis; Classification

1. INTRODUCTION

The novel corona virus (COVID-19 or SARS-COV-2)

epidemic has been acquainted as a global pandemic. More

than 21.9 crore individuals have been infected worldwide,

leading to more than 45.5 lakhs deaths as of 7th October 2021

[1]. Rapid human-to-human transmission accompanied by

unrevealed nature of the virus had led to this tremendous

outbreak of COVID-19. It affects people in various ways. The

main symptoms are main symptoms are fever, cough, myalgia

or fatigue, sputum, and dyspnea [2]. However, 80% of

patients get well from the disease without facing any serious

complication. It has been observed that one out of every six

affected people get seriously sick and develop difficulty in

breathing due to infection and inflammation in lungs caused

by the Corona virus [3].

This pandemic continues to challenge medical systems

worldwide in many aspects, including sharp increases in

demands for hospital beds and critical shortages in medical

equipment, while many healthcare workers have themselves

been infected. Thus, the capacity for immediate clinical

decisions and effective usage of healthcare resources is

crucial. The most validated diagnosis test for COVID-19,

using reverse transcriptase polymerase chain reaction (RT-

PCR), has long been in shortage in developing countries. This

contributes to increased infection rates and delays critical

preventive measures.

Effective screening enables quick and efficient diagnosis of

COVID-19 and can mitigate the burden on healthcare

systems. Prediction models that combine several features to

estimate the risk of infection have been developed, in the hope

of assisting medical staff worldwide in triaging patients,

especially in the context of limited healthcare resources. To

defeat the COVID-19 outbreak [4], appropriate and evidence-

based actions must be taken worldwide. For this purpose,

prediction models can help not only allocating medical

resources but also raising the preparedness of healthcare

systems involved. In this regard, mathematical, computational

and statistical methods have been utilized to predict if a

person is affected by COVID-19 or not using Artificial

Intelligence (AI) techniques.

In literature, several AI or machine learning models have

already proposed for automated diagnosis of the COVID-19.

Mainly, two different approaches were being used: COVID

affected Lungs X-ray / CT scan images analysis [5], [6] and

symptoms based database [7], [8] analysis using different AI

techniques. In case of X-ray / CT scan images analysis,

different researchers have contributed in this domain using

image processing techniques. Li et al. [9] used CT scan

images to classify the images in either COVID-19 or normal

and they also calculate severity of infection. Hassanien et al.

[10] used support vector machine and multilevel thresholding

to classify the X-ray images of lungs. On the hand, Sing et al.

[11] use multi-objective differential evolution–based

convolutional neural networks to classify CT scan images of

chest affected by COVID-19. Yan et al. [12] used image

processing tools to identify the severity of infection for

COVID-19. Rajinikanth et al. [13] used Harmony Search

(HS) algorithm along with multilevel thresholding to identify

the infection severity in lungs CT scan images that created

due COVID-19. S. Mandal [14] used Elephant Swarm Water

Search Algorithm (ESWSA) for multilevel thresholding based

on Otsu’s and Kapur’s method and further calculation of

severity of infections of lungs images.

On the other hand, several authors also proposed different AI

based models for diagnosis of COVID-19 with the help of

symptoms based clinical data of COVID-19 patients. Khanday

et al. [15] have used different machine learning techniques for

the classification of COVID clinical data. Iwendi et al. [16]

have used boosted random forest algorithm for COVID-19

International Journal of Computer Applications (0975 – 8887)

Volume 184 – No.9, April 2022

patient health prediction. Chen et al. [17] used machine

learning techniques for early prediction of mortality rate for

COVID-19. Sudre et al. [18] have developed an app that helps

to predict if a person is affected by COVID-19 or not using by

analyzing the COVID cases and their symptoms. Zoabi et al.

[19] used Gradient boosting predictor for automated diagnosis

of COVID-19 patients using Israeli Ministry of Health data

[28] which is publicly available for the researchers. The

obtained results showed very satisfactory accuracy. Dutta et

al. [20] utilized three different machine learning models

namely bagging algorithm, k-nearest neighbour, and random

forest for prediction. They have used the real-time COVID-19

dataset from India Government website [29] and obtained

satisfactory accuracy. Mei et al. [20] used AI based

techniques for rapid diagnosis of COVID-19 cases to reduce

the burden of health workers.

Through this study, an automated prediction model based on

Artificial Intelligence models are proposed to predict the

COVID-19 through a given set of data comprising of eight

different symptoms. Artificial Neural Network (ANN) and

Binary Decision Tree are employed for classification purpose.

Firstly, a record of confirmed cases of a desired place is taken.

Then, the corresponding data were divided into two parts as

training and test data. The former was used to train the

models, while the latter was used for validation purposes.

Thus, the estimated accumulative confirmed cases of the test

data were compared with those of actual target values. The

rest of the manuscript is structured as follows. Preliminary

background of Artificial Neural Network and Binary Decision

Tree are discussed in next section. Propose methodology and

data collections processes have been elaborated in Section 3.

Results using ANN and Decision tree for classification of

COVID-19 data have been shown in Section 4 followed by

the Conclusion section.

2. THEORETICAL BACKGROUND

In this section, some preliminary concept on Artificial Neural

Network and Decision Tree are discussed those are required

to understand this research work.

2.1 Artificial Neural Network

An ANN [22] is an information processing paradigm that

is inspired by the way the biological nervous system (such as

the brain) that process information. An ANN is configured for

a specific application such as pattern recognition or data

classification [23] through a learning process. An ANN is

typically defined by three types of parameters:

1. The interconnection pattern between the different

layers of neurons. Three types of layers are observed: input,

hidden and output layer.

2. The activation function that converts a neuron's

weighted input to its output activation.

3. The learning process for updating the weights of the

interconnections.

The main characteristic of ANN is self-learning without

prior knowledge of the complex non-linear relationships that

exist between the input and output variables. Another

advantage is that this type of approach also makes it possible

to use several predictor variables simultaneously.

2.2 Decision Tree

Decision tree [24] is one of the most popular and efficient

technique in data mining which is established and well-

explored by many researchers. Decision trees are categorized

as a supervised method that trying to find the relationship

between input attributes and target attributes which represent

the relationship in structure as a model. The model

constructed by using input attributes to predict target

However, some decision tree algorithms may produce a large

structure of tree size such as J48 which is an implementation

of C4.5 algorithm [25]. C4.5 was a version earlier algorithm

developed by J. Ross Quinlan.

3. PROPOSED METHODOLOGY

The overall work has two parts:

1. Data collection, preprocessing and preparation of database.

2. Application of AI based model to the dataset for validation

of the used techniques to predict the COVID-19 status i.e.

either the person is affected or not.

The publicly released COVID data form Israeli Ministry of

Health has been utilized for the preparation of the required

dataset. Next, this dataset has been used for both training and

validation for the Artificial Neural Network and Decision tree

on this classification problem. All these AI based models are

implemented using MATLAB 2018 in a laptop containing i3

processor and 4 GB processor. The details of the proposed

methodology have been elaborated in following sub-sections.

3.1 Data Collection and Preparation

The Israeli Ministry of Health [26], [27], [28] publicly

released data of individuals who were tested for SARS-CoV-2

via RT-PCR assay of a nasopharyngeal swab. The initial

dataset contains daily records of all the residents who were

tested for COVID-19 nationwide. Various information that

was provided in the datasheet includes clinically tested

symptoms like cough, fever, sore throat, shortness of breath,

headache. Based on these data, Artificial Neural Network and

Decision Tree model will be trained that will help to predict

COVID19 test results using eight binary features: gender, age

60 years or above, known contact with an infected individual,

and five initial clinical symptoms. The original data collected

for the months of March and April-2020. Then it was

processed to make it feasible for further analysis. A few steps

were executed to convert the large data into a clean data set.

The steps that were followed are as follows:

1. Data for a time period was segregated

2. Rows containing missing data were eliminated

3. Erroneous and wrong data were removed

4. Non binary data was converted to 0 and 1 in the

following manner

i. Gender: male -1; female -0

ii. Age above 60: yes-1; no-0

iii. Other information: contact with

confirmed-1; others-0

iv. Corona result:positive-1; negative-0

The modified training validation data set consisted of records

from 7,968 tested individuals (of whom 3214 were confirmed

to have COVID-19), from the period March 22th, 2020

through March 31st, 2020. The test set contained data from

the subsequent week, April 1st through April 7th (5,732 tested

individuals, of whom 1952 were confirmed to have COVID-

19).

The following list describes each of the dataset’s features used

by the model for training and testing:

A. Basic information:

1. Sex (male/female).

2. Age ≥60 years (true/false)

International Journal of Computer Applications (0975 – 8887)

Volume 184 – No.9, April 2022

B. Symptoms:

3. Cough (true/false).

4. Fever (true/false).

5. Sore throat (true/false).

6. Shortness of breath (true/false).

7. Headache (true/false).

C. Other information:

8. Known contact with an individual confirmed to have

COVID-19 (true/false).

3.2 Artificial Neural Network Based Model

To classify the COVID-19 data using ANN based model,

Multilayer Feed Forward Artificial Neural Network with one

hidden layer which consist of 20 nodes has been used. On the

other hand, the input layer consists of 8 nodes which are

different symptoms and features of COVID-19 patients.

Output layer of ANN model consists of only 1 node that

indicates the predicted value (COVID 1 or 0) i.e. the status of

the patients.

Initially, the training dataset is used to train the ANN model

with the use Back Propagation Algorithm. The trained ANN

model is then cross validated using the training data itself and

the performance and accuracy was noted. Cross-validation is a

resampling procedure used to evaluate machine learning

models on a limited data sample. Next, the trained model is

tested against a new dataset that contained inputs for the

period 1st April through 7th April, 2020. The performance

and accuracy are also noted for testing new case. All the

results are given in results section.

3.3 Decision Tree Based Model

C4.5 algorithm produces decision tree classification for a

given dataset by recursive division of the data. The decision

tree is grown using Depth-first strategy. On data testing, this

algorithm will emphasized on splitting dataset and by

selecting a test that will give best result in information gain.

With no of fold 10, and confidence factor 0.25, the algorithm

is implemented for construction of decision tree. Decision

Tree provides vary fast prediction but it has least accuracy

than other approaches. Decision Tree shows that one

directional path or inference for classification of new data

based on different attribute value. Results related with cross

validation and testing new data are given in next section.

4. RESULTS AND DISCUSSION

In this section, the detailed results corresponding to two cases

of AI models i.e. using ANN and Decision Tree to detect if a

person is affected be COVID or not are shown and discussed.

Following figure shows the Neural Network model (using

MATLAB) to detect if any person is COVID Positive or not,

based on some input symptoms and features.

Fig. 1: ANN training model for COVID detection

The Table 1 summarizes the classification accuracy for two

tested cases i.e. cross validation and testing new cases for

ANN.

Table 1: Accuracy for ANN model

Process

Percentage Accuracy

Cross Validation

84.80%

New Dataset

85.80%

The Table 2 shows the classification accuracy of decision tree

the two tested cases i.e. cross validation and testing new

cases.

Table 2: Accuracy for Decision Tree model

Process

Percentage Accuracy

Cross Validation

83.75%

New Dataset

85.79%

From above two tables, it has been observed that classification

accuracy is better for ANN over the decision tree for both

Cross validation and testing new cases. Hence, it can be stated

that the ANN is more suitable over decision tree in terms of

classification accuracy of COVID-19 dataset.

Figure 2 shows the output Decision Tree model for to detect if

any person is COVID-19 Positive or not.

International Journal of Computer Applications (0975 – 8887)

Volume 184 – No.9, April 2022

Fig. 2: Decision Tree model for COVID detection

Now, a comparison with respect to runtime of the ANN and

Decision Tree for this particular problem of classification of

COVID-19 has been observed. From Table 3, it can be clearly

noted that Decision Tree is very fast classification algorithm

compare to the ANN as it required only 38.18 sec for decision

tress training whereas ANN required almost 10 times.

Table 3: Runtime for ANN and Decision Tree

Process

Runtime (Sec)

ANN

363.61

Decision Tree

38.18

However, percentage of error for decision tree is slightly

higher than the ANN for both cross validation and testing new

case. Following Fig. 3 shows the percentage of error for

different cases during classification.

Fig. 3: Percentage of Error for different cases

Next, we shall show confusion matrix for the different case of

testing using ANN and Decision Tree. Table 4 and 5 shows

the confusion matrix for cross validation and testing new case

using ANN respectively.

Table 4: Confusion Matrix for Cross Validation Using

ANN

Target Value

Predicted

Value

4321

777

433

2437

Table 5: Confusion Matrix for Testing New Case Using

ANN

Target Value

Predicted

Value

3332

368

448

1584

Table 6 and 7 shows the confusion matrix for cross validation

and testing new case using decision tree respectively.

Table 6: Confusion Matrix for Cross Validation Using

Decision Tree

Target Value

Predicted

Value

4336

418

877

2337

15.2

16.25

14.2

14.21

ANN Cross

Validation

Decion Tree

Cross

validation

ANN Test New

Case

Decion Tree

Test New Case

Percentage of Error

International Journal of Computer Applications (0975 – 8887)

Volume 184 – No.9, April 2022

Table 7: Confusion Matrix for Testing New Case Using

Decision Tree

Target Value

Predicted

Value

3331

449

365

1587

From the above confusion matrix, it is very clear to us that

both ANN and Decision tree both not able to detect COVID

cases for few cases (i.e. False Negative or FN) and also they

detect few case as COVID positive by mistake (i.e. False

Positive or FP). As an example, for cross validation using

ANN, 777 numbers of FN and 433 numbers of FP are

detected which are not desirable. For this reason,

classification accuracy has been reduced to 84.80%. The

reason behind this kind of outputs is the asymptotic case for

COVID-19. As it is already known to us that many person do

not show any symptoms but they are affected by the COVID.

Hence, the proposed models also detect them as normal case

i.e. FNs are detected which of obvious a drawback of the AI

based automated diagnosis process. Similarly, it is also

possible that showing symptoms like fever; cough etc. does

not guarantee the COVID. The person may suffer due to

others disease. In this scenario, computer can predict them as

COVID positive case i.e. FPs are detected. However, the AI

based COVID-19 diagnosis is still very promising techniques

where medical facility like RTPCR test has limited

availability.

5. CONCLUSION

Here, the aim is to develop an automated diagnosis system for

COVID-19 using two popular AI based models namely

Artificial Neural Network and Decision Tree by observing the

symptoms and features of the respective patients. This paper

focuses on how ANN and decision tree can be used to detect

if any person is COVID positive or not, based on eight input

parameters. The dataset has been collected from Israeli

Ministry of Health website where all data are available

publicly for further analysis. In this study, both ANN and

decision tree model have been trained and tested for

validation of the proposed methodology. It has observed that

performance of the ANN is superior over decision tree in

terms of classification accuracy for both cross validation and

testing new cases. However, the runtime for decision tree is

very small compare to ANN. It is an advantage of using

decision tree for classification problem. It is possible to

achieve satisfactory classification accuracy for both models

although other advanced AI model or algorithm may be

applied to improve the performance in future. If healthcare

facility (i.e. RTPCR test) is limited, this AI based diagnostic

framework can be used where automated decision can be

taken by the computer or machine instantly without waiting

for the report from the pathologist.

6. REFERENCES

[1] World Health Organization. Report of the WHO-China

Joint Mission on Corona virus Disease 2019 (COVID-

19) Geneva: World Health Organization; 2020

https://www.who.int/docs/default-

source/coronaviruse/who-china-joint-mission-on-covid-

19-finalreport.

[2] Ministry of Health & Family Welfare Government of

India.

https://www.mohfw.gov.in/pdf/PreventionandManageme

ntofCOVID19FLWEnglish.pdf [Accessed on 20th May

2020].

[3] Liu J, Liao X, Qian S et al. Community transmission of

severe acute respiratory syndrome coronavirus 2,

Shenzhen, China, 2020. Emerg Infect Dis 2020

doi.org/10.3201/eid2606.200239

[4] Novel Corona Virus Map,

https://infographics.channelnewsasia.com/covid-

19/map.html [Accessed on 24th May 2020]

[5] Chung M, Bernheim A, Mei X, et al. CT Imaging

Features of 2019 Novel Coronavirus (2019-

nCoV). Radiology. 2020;295(1):202-207.

doi:10.1148/radiol.2020200230.

[6] Pan, Feng, et al. "Time course of lung changes on chest

CT during recovery from 2019 novel coronavirus

(COVID-19) pneumonia." Radiology (2020): 200370.

[7] Huang C, Wang Y, Li X, et al. Clinical features of

patients infected with 2019 novel coronavirus in Wuhan,

China. Lancet 2020; 395: 497–506.

[8] Liu Y, Yan LM, Wan L et al. Viral dynamics in mild and

severe cases of CVOID-19. Lancet Infect Dis

doi.org/10.1016/S1473-3099(20)30232-2

[9] Li, K., Fang, Y., Li, W. et al. CT image visual

quantitative evaluation and clinical classification of

coronavirus disease (COVID-19). Eur Radiol (2020).

https://doi.org/10.1007/s00330-020-06817-6.

[10] Aboul Ella Hassanien Sr., Lamia Nabil Mahdy Jr., Kadry

Ali Ezzat Jr., Haytham H. Elmousalami Jr.,Hassan Aboul

Ella Jr. Automatic X-ray COVID-19 Lung Image

Classification System based on Multi-Level

Thresholding and Support Vector Machine,

doi: https://doi.org/10.1101/2020.03.30.20047787.

[11] Singh, D., Kumar, V., Vaishali et al. Classification of

COVID-19 patients from chest CT images using multi-

objective differential evolution–based convolutional

neural networks. Eur J Clin Microbiol Infect Dis (2020).

https://doi.org/10.1007/s10096-020-03901-z.

[12] Yan, R. et al. Chest CT Severity Score: An Imaging Tool

for Assessing Severe COVID-19. Radiology:

Cardiothoracic Imaging 2020, 2(2).

https://doi.org/10.1148/ryct.2020200047.

[13] Rajinikanth, V., Nilanjan Dey, Alex Noel Joseph Raj,

Aboul Ella Hassanien, K. C. Santosh, and N. Raja.

"Harmony-search and otsu based system for coronavirus

disease (COVID-19) detection using lung CT scan

images." arXiv preprint, arXiv:2004.03431 (2020).

[14] S. Mandal, “Identification of Severity of Infection for

COVID-19 Affected Lungs Images using Elephant

Swarm Water Search Algorithm” International Journal of

Modelling and Simulation, 2021, doi:

10.1080/02286203.2021.1934797.

[15] A.M.U.D. Khanday et al., Machine learning based

approaches for detecting COVID-19 using clinical text

data, Int. J. Inf. Technol. (2020),

https://doi.org/10.1007/s41870-020-00495-9.

[16] C. Iwendi, et al., COVID-19 patient health prediction

using boosted random forest algorithm, Front. Public

Health 8 (2020),

https://doi.org/10.3389/fpubh.2020.00357.

International Journal of Computer Applications (0975 – 8887)

Volume 184 – No.9, April 2022

[17] X. Chen, Z. Liu, Early prediction of mortality risk among

severe COVID-19 patients using machine learning,

preprint, Epidemiology (2020),

https://doi.org/10.1101/2020.04.13.20064329.

[18] Carole H. Sudre et al, Attributes and predictors of Long-

COVID: analysis of COVID cases and their symptoms

collected by the COVID Symptoms Study App, medRxiv

preprint (2020) doi:

https://doi.org/10.1101/2020.10.19.20214494.

[19] Yazeed Zoabi et al, Machine learning-based prediction of

COVID-19 diagnosis based on symptoms, npj Digital

Medicine (2021) 4:3 ; https://doi.org/10.1038/s41746-

020-00372-6.

[20] Pijush Dutta, Shobhandeb Paul, Asok Kumar,

Comparative analysis of various supervised machine

learning techniques for diagnosis of COVID-19,

Electronic Devices, Circuits, and Systems for Biomedical

Applications (2021). https://doi.org/10.1016/B978-0-

323-85172-5.00020-4.

[21] Mei, X. et al. Artificial intelligence–enabled rapid

diagnosis of patients with COVID-19. Nat. Med. 26,

1224–1228 (2020).

[22] S. Mandal, G. Saha, and R. K. Pal, “A Comparative

Study on Disease Classification Using Different Soft

Computing Techniques”, The SIJ Transactions on

Computer Science Engineering & its Applications

(CSEA), vol. 1(3), pp. 59-66, 2014

[23] S. Mandal, G. Saha, and R. K. Pal, “An Approach

towards Automated Disease Diagnosis & Drug Design

Using Hybrid Rough-Decision Tree from Microarray

Dataset”, Journal of Computer Science and System

Biology, vol. 6(6), pp. 337-343, 2013,

DOI:10.4172/jcsb.1000130.

[24] S. Mandal, and I. Banerjee, “Cancer Classification Using

Neural Network”, International Journal of Emerging

Engineering Research and Technology, vol. 3(7), pp.

172-178, 2015.

[25] S. Mandal, G. Saha, and R. K. Pal, “Neural Network

Training Using Firefly Algorithm”, Global Journal on

Advancement in Engineering and Science, vol. 1(1), pp.

07-11, 2015.

[26] COVID-19-Government Data.

https://data.gov.il/dataset/covid-19 (2020).

[27] The Novel CoronavirusIsrael Ministry of Health.

https://govextra.gov.il/ministry-ofhealth/corona/corona-

virus-en/ (2020).

[28] COVID-19-Government Data Information.

https://data.gov.il/dataset/covid-19/resource/3f5c975e-

7196-454b-8c5b-ef85881f78db/download/-readme.pdf

(2020).

[29] Covid-19 India data information. https:

www.covid19india.org (2020)

IJCATM : www.ijcaonline.org

Medical Images Based Covid-19 Detection Survey

Article

Sep 2023

In December 2019, COVID-19 appeared for the first time in Wuhan (Hubei Province, China), after which it quickly spread over the entire earth. The World Health Organization quickly designated COVID-19 a pandemic because of the high number of deaths and rapid global spread of the disease. Because of this, many facets of society have been impacted, and those effects may last for years to come. Therefore, COVID-19 detection methods were the focus of numerous research projects in the past. This has led to the development of the COVID-19 AI Detector, a specialized area of artificial intelligence-based research. In this paper, we survey all significant current efforts that have taken advantage of machine learning to COVID-19 detection and prediction. We first surveyed all datasets used in relevant research, and then summarized them in a table containing the link to that data. Then, we mention all the methodologies employed to detect the presence of COVID-19 using such datasets. Later, the challenges and difficulties that facing the concerned researches are reported, while the results of all interesting related work that lay ahead in this field before concluding this paper were discussed fairly

Machine learning-based prediction of COVID-19 diagnosis based on symptoms

Article

Full-text available

Jan 2021

Effective screening of SARS-CoV-2 enables quick and efficient diagnosis of COVID-19 and can mitigate the burden on healthcare systems. Prediction models that combine several features to estimate the risk of infection have been developed. These aim to assist medical staff worldwide in triaging patients, especially in the context of limited healthcare resources. We established a machine-learning approach that trained on records from 51,831 tested individuals (of whom 4769 were confirmed to have COVID-19). The test set contained data from the subsequent week (47,401 tested individuals of whom 3624 were confirmed to have COVID-19). Our model predicted COVID-19 test results with high accuracy using only eight binary features: sex, age ≥60 years, known contact with an infected individual, and the appearance of five initial clinical symptoms. Overall, based on the nationwide data publicly reported by the Israeli Ministry of Health, we developed a model that detects COVID-19 cases by simple features accessed by asking basic questions. Our framework can be used, among other considerations, to prioritize testing for COVID-19 when testing resources are limited.

Attributes and predictors of Long-COVID: analysis of COVID cases and their symptoms collected by the Covid Symptoms Study App

Preprint

Full-text available

Oct 2020

Reports of “Long-COVID”, are rising but little is known about prevalence, risk factors, or whether it is possible to predict a protracted course early in the disease. We analysed data from 4182 incident cases of COVID-19 who logged their symptoms prospectively in the COVID Symptom Study app. 558 (13.3%) had symptoms lasting >28 days, 189 (4.5%) for >8 weeks and 95 (2.3%) for >12 weeks. Long-COVID was characterised by symptoms of fatigue, headache, dyspnoea and anosmia and was more likely with increasing age, BMI and female sex. Experiencing more than five symptoms during the first week of illness was associated with Long-COVID, OR=3.53 [2.76;4.50]. A simple model to distinguish between short and long-COVID at 7 days, which gained a ROC-AUC of 76%, was replicated in an independent sample of 2472 antibody positive individuals. This model could be used to identify individuals for clinical trials to reduce long-term symptoms and target education and rehabilitation services.

Machine learning based approaches for detecting COVID-19 using clinical text data

Article

Full-text available

Jun 2020

Technology advancements have a rapid effect on every field of life, be it medical field or any other field. Artificial intelligence has shown the promising results in health care through its decision making by analysing the data. COVID-19 has affected more than 100 countries in a matter of no time. People all over the world are vulnerable to its consequences in future. It is imperative to develop a control system that will detect the coronavirus. One of the solution to control the current havoc can be the diagnosis of disease with the help of various AI tools. In this paper, we classified textual clinical reports into four classes by using classical and ensemble machine learning algorithms. Feature engineering was performed using techniques like Term frequency/inverse document frequency (TF/IDF), Bag of words (BOW) and report length. These features were supplied to traditional and ensemble machine learning classifiers. Logistic regression and Multinomial Naïve Bayes showed better results than other ML algorithms by having 96.2% testing accuracy. In future recurrent neural network can be used for better accuracy.

COVID-19 Patient Health Prediction Using Boosted Random Forest Algorithm

Article

Full-text available

Jul 2020

Integration of artificial intelligence (AI) techniques in wireless infrastructure, real-time collection, and processing of end-user devices is now in high demand. It is now superlative to use AI to detect and predict pandemics of a colossal nature. The Coronavirus disease 2019 (COVID-19) pandemic, which originated in Wuhan China, has had disastrous effects on the global community and has overburdened advanced healthcare systems throughout the world. Globally; over 4,063,525 confirmed cases and 282,244 deaths have been recorded as of 11th May 2020, according to the European Centre for Disease Prevention and Control agency. However, the current rapid and exponential rise in the number of patients has necessitated efficient and quick prediction of the possible outcome of an infected patient for appropriate treatment using AI techniques. This paper proposes a fine-tuned Random Forest model boosted by the AdaBoost algorithm. The model uses the COVID-19 patient's geographical, travel, health, and demographic data to predict the severity of the case and the possible outcome, recovery, or death. The model has an accuracy of 94% and a F1 Score of 0.86 on the dataset used. The data analysis reveals a positive correlation between patients' gender and deaths, and also indicates that the majority of patients are aged between 20 and 70 years.

Artificial intelligence–enabled rapid diagnosis of patients with COVID-19

Article

Full-text available

Aug 2020
NAT MED

For diagnosis of coronavirus disease 2019 (COVID-19), a SARS-CoV-2 virus-specific reverse transcriptase polymerase chain reaction (RT–PCR) test is routinely used. However, this test can take up to 2 d to complete, serial testing may be required to rule out the possibility of false negative results and there is currently a shortage of RT–PCR test kits, underscoring the urgent need for alternative methods for rapid and accurate diagnosis of patients with COVID-19. Chest computed tomography (CT) is a valuable component in the evaluation of patients with suspected SARS-CoV-2 infection. Nevertheless, CT alone may have limited negative predictive value for ruling out SARS-CoV-2 infection, as some patients may have normal radiological findings at early stages of the disease. In this study, we used artificial intelligence (AI) algorithms to integrate chest CT findings with clinical symptoms, exposure history and laboratory testing to rapidly diagnose patients who are positive for COVID-19. Among a total of 905 patients tested by real-time RT–PCR assay and next-generation sequencing RT–PCR, 419 (46.3%) tested positive for SARS-CoV-2. In a test set of 279 patients, the AI system achieved an area under the curve of 0.92 and had equal sensitivity as compared to a senior thoracic radiologist. The AI system also improved the detection of patients who were positive for COVID-19 via RT–PCR who presented with normal CT scans, correctly identifying 17 of 25 (68%) patients, whereas radiologists classified all of these patients as COVID-19 negative. When CT scans and associated clinical history are available, the proposed AI system can help to rapidly diagnose COVID-19 patients.

Classification of COVID-19 patients from chest CT images using multi-objective differential evolution-based convolutional neural networks

Article

Full-text available

Jul 2020
EUR J CLIN MICROBIOL

Early classification of 2019 novel coronavirus disease (COVID-19) is essential for disease cure and control. Compared with reverse-transcription polymerase chain reaction (RT-PCR), chest computed tomography (CT) imaging may be a significantly more trustworthy, useful, and rapid technique to classify and evaluate COVID-19, specifically in the epidemic region. Almost all hospitals have CT imaging machines; therefore, the chest CT images can be utilized for early classification of COVID-19 patients. However, the chest CT-based COVID-19 classification involves a radiology expert and considerable time, which is valuable when COVID-19 infection is growing at rapid rate. Therefore, an automated analysis of chest CT images is desirable to save the medical professionals' precious time. In this paper, a convolutional neural networks (CNN) is used to classify the COVID-19-infected patients as infected (+ve) or not (-ve). Additionally, the initial parameters of CNN are tuned using multi-objective differential evolution (MODE). Extensive experiments are performed by considering the proposed and the competitive machine learning techniques on the chest CT images. Extensive analysis shows that the proposed model can classify the chest CT images at a good accuracy rate.

Early prediction of mortality risk among severe COVID-19 patients using machine learning

Preprint

Full-text available

Apr 2020

Background Coronavirus disease 2019 (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection has been spreading globally. The number of deaths has increased with the increase in the number of infected patients. We aimed to develop a clinical model to predict the outcome of severe COVID-19 patients early. Methods Epidemiological, clinical, and first laboratory findings after admission of 183 severe COVID-19 patients (115 survivors and 68 nonsurvivors) from the Sino-French New City Branch of Tongji Hospital were used to develop the predictive models. Five machine learning approaches (logistic regression, partial least squares regression, elastic net, random forest, and bagged flexible discriminant analysis) were used to select the features and predict the patients' outcomes. The area under the receiver operating characteristic curve (AUROC) was applied to compare the models' performance. Sixty-four severe COVID-19 patients from the Optical Valley Branch of Tongji Hospital were used to externally validate the final predictive model. Results The baseline characteristics and laboratory tests were significantly different between the survivors and nonsurvivors. Four variables (age, high-sensitivity C-reactive protein level, lymphocyte count, and d-dimer level) were selected by all five models. Given the similar performance among the models, the logistic regression model was selected as the final predictive model because of its simplicity and interpretability. The AUROCs of the derivation and external validation sets were 0.895 and 0.881, respectively. The sensitivity and specificity were 0.892 and 0.687 for the derivation set and 0.839 and 0.794 for the validation set, respectively, when using a probability of death of 50% as the cutoff. The individual risk score based on the four selected variables and the corresponding probability of death can serve as indexes to assess the mortality risk of COVID-19 patients. The predictive model is freely available at https://phenomics.fudan.edu.cn/risk_scores/. Conclusions Age, high-sensitivity C-reactive protein level, lymphocyte count, and d-dimer level of COVID-19 patients at admission are informative for the patients' outcomes.

Automatic X-ray COVID-19 Lung Image Classification System based on Multi-Level Thresholding and Support Vector Machine

Preprint

Full-text available

Apr 2020

The early detection of SARS-CoV-2, the causative agent of (COVID-19) is now a critical task for the clinical practitioners. The COVID-19 spread is announced as pandemic outbreak between people worldwide by WHO since 11/ March/ 2020. In this consequence, it is top critical priority to become aware of the infected people so that prevention procedures can be processed to minimize the COVID-19 spread and to begin early medical health care of those infected persons. In this paper, the deep studying based totally methodology is usually recommended for the detection of COVID-19 infected patients using X-ray images. The help vector gadget classifies the corona affected X-ray images from others through usage of the deep features. The technique is useful for the clinical practitioners for early detection of COVID-19 infected patients. The suggested system of multi-level thresholding plus SVM presented high accuracy in classification of the infected lung with Covid-19. All images were of the same size and stored in JPEG format with 512 * 512 pixels. The average sensitivity, specificity, and accuracy of the lung classification using the proposed model results were 95.76%, 99.7%, and 97.48%, respectively.

Comparative analysis of various supervised machine learning techniques for diagnosis of COVID-19 25

Article

Sep 2021

Coronavirus is a large family of a viruses that causes illness ranging from a normal cold to severe disease. COVID-19 is another strain that has not been distinguished in humans before. As this virus is rapidly spreading all over the globe, we need to implement a mathematical model to estimate the prediction of new cases as well as how to classify that a person is COVID-19 positive or not by considering the practical scenario in India. In this research, we proposed three different supervised machine learning techniques for diagnosis of COVID-19. We have compared classification results of different techniques, i.e., bagging algorithm, k-nearest neighbor, and random forest for classifying the datasets of COVID-19. For the classification purpose, we took symptoms from a Covid-19 tracker in India, whereas India has entered into the second stage. The performance of each technique is evaluated using various performance measures. The classification results show that the random forest gives better results, employing accuracy of 85.71% and F1 score of 0.833.

Identification of Severity of Infection for COVID-19 Affected Lungs Images using Elephant Swarm Water Search Algorithm

Article

Jul 2021

Sudip Mandal

Due to the outbreak of the pandemic COVID-19 or ‘novel corona virus disease’, the world is facing a global emergency. In case of severe infection, lungs are affected by COVID-19 significantly, whichmay lead to the death of the patient. In this paper, an automated image-assisted system based on artificial intelligence is proposed to extract infected sections from lung CT scan images that are caused due to COVID-19. Multilevel thresholding is a typical example of maximization problem of optimization to identify the threshold(s) levels for image segmentation. In this paper, Elephant Swarm Water Search Algorithm (ESWSA) has been used for multilevel thresholding based on Otsu’s and Kapur’s method and further analysis of lungs images. It has been observed from the obtained simulated results that ESWSA performs better than other state-of-the-art optimization techniques for multilevel thresholding. Moreover, location of infection and severity of infection has also been extracted from the pixel ratio between the infection and lung sections from the lung CT scan images. It is expected that the proposed methodology will support the doctors as it will reduce diagnostic burden and respective treatment process can be planned faster as the severity of infection by COVID-19 is found by this automated methodology.

AI based Automated Diagnosis of COVID-19 Patients

Abstract and Figures

Recommended publications

The Use of Artificial Intelligence in Automation of Planning and Operational Management of Organizat...

Detection of Novel Coronavirus From Chest X-Ray Radiograph Images Via Automated Machine Learning and...

Invention is the need of the hour: A unique Data Accumulation and Analysis Platform (DAAP) for covid...

“An Automated Social Distance Monitoring & Alarm System based on Human Structure Using Video Surveil...