Conference PaperPDF Available

An Automated System for the Classification of Bronchiolitis and Bronchiectasis Diseases using Lung Sound Analysis

March 2023

March 2023

DOI:10.1109/ICRAI57502.2023.10089608

Conference: 2023 International Conference on Robotics and Automation in Industry (ICRAI)
At: Pakistan

Authors:

Syed Ali Faraz Jaffery

University of Engineering and Technology, Taxila

Sumair Aziz

University of Canberra

Muhammad Umar Khan

University of Engineering and Technology, Taxila

Syed Zohaib Hassan Naqvi

University of Engineering and Technology, Taxila

Show all 6 authorsHide

The main goal of this paper is to develop a classification model and a technique to identify bronchiolitis and bronchiectasis using lung sound analysis. In this paper, we develop a methodology to automatically identify lung disease through an intelligent system. ICBHI lungs sound database was used for this study. A total of 64 lung recordings, selected from three pulmonary classes namely normal, bronchiectasis and bronchiolitis were used for this purpose. To accomplish the task, we first split all the recorded signals into four parts to increase the number of input data. Discrete wavelet transform was used to denoise and segment the pulmonological data. Mel frequency cepstral coefficients were then computed from the cleaned signal. After extensive experimentation with various classifiers, the highest recognition rate of 99.6% was found by using K-Nearest Neighbors.

Content uploaded by Sumair Aziz

Content may be subject to copyright.

An Automated System for the Classification of

Bronchiolitis and Bronchiectasis Diseases using

Lung Sound Analysis

Syed Ali Faraz Jaffery, Sumair Aziz, Muhammad Umar Khan*, Syed Zohaib Hussain Naqvi, Muhammad Faraz

and Adil Usman

Department of Electronics Engineering,University of Engineering and Technology Taxila,Pakistan

*email: sa.umarkhan@gmail.com

Abstract— The main goal of this paper is to develop a

classification model and a technique to identify bronchiolitis and

bronchiectasis using lung sound analysis. In this paper, we

develop a methodology to automatically identify lung disease

through an intelligent system. ICBHI lungs sound database was

used for this study. A total of 64 lung recordings, selected from

three pulmonary classes namely normal, bronchiectasis and

bronchiolitis were used for this purpose. To accomplish the task,

we first split all the recorded signals into four parts to increase

the number of input data. Discrete wavelet transform was used

to denoise and segment the pulmonological data. Mel frequency

cepstral coefficients were then computed from the cleaned

signal. After extensive experimentation with various classifiers,

the highest recognition rate of 99.6% was found by using K-

Nearest Neighbors.

Keywords; Bronchiolitis, Bronchiectasis, lung sounds,

Discrete Wavelet Transform, MFCC, K-Nearest Neighbors,

Feature extraction

I. INTRODUCTION

Over the past few years, pulmonary abnormalities have

become a common issue all over the world. Chronic

obstructive pulmonary disease (COPD), asthma,

bronchiectasis, bronchiolitis, upper respiratory tract infection

(URTI), pneumonia, lower respiratory tract infection (LRTI)

and other diseases are some major respiratory diseases.

Among them, the mortality rates for bronchiolitis and

bronchiectasis are frequently underestimated due to

differences in diagnostic criteria. In the U.S., 110,000 infants

are hospitalized every year due to bronchiolitis [1]. Typically,

bronchiolitis is caused by a viral infection, which, in most

cases, is caused by the respiratory syncytial virus. RSV virus

is quite prevalent and is easily spread by coughs and sneezes.

Almost all infants have it by the age of two [2]. Infants with

bronchiolitis have damage in the small airways that can cause

coughing, wheezing, and breathing difficulties.

In another study by the Australian Institute of Health and

Welfare (AIHW), there were 983 deaths reported in 2018 that

had bronchiectasis described either as the underlying cause of

death (387) or as an associated cause (596) of death [3].

Bronchiectasis is a chronic disease that causes the abnormal

widening of the lungs' airways. Because of these harmed

airways, mucus and stones gather in the lungs. It causes lung

damage and prevents healthy airway function by blocking

them with mucus [4]. Bronchiectasis can also result in

wheezing cough pain, joint pain, and shortness of breath, and

all these symptoms can also be caused by bronchiolitis. But,

bronchiectasis is induced by pneumonia, pertussis,

tuberculosis, and nontuberculous mycobacterial infections

that lead to abnormal windings in the lungs of older

individuals. In contrast, bronchiolitis is an acute lower

respiratory infection that is commonly induced by viral

infections in infants [5]. The major difference between these

two diseases is that bronchiectasis is a chronic condition that

mostly affects older people. However, bronchiolitis typically

affects children and can be treated at home.

The clinical options for identifying respiratory diseases are

many. Imaging techniques including magnetic resonance

imaging (MRI), computed tomography (CT) scans, and chest

X-rays are used to diagnose respiratory illnesses. Contrarily,

adopting such imaging modalities presents several difficulties,

particularly for many patients in third-world countries. These

difficulties include the risk of receiving repeated dosages of

hazardous rays, the expense of the equipment, and the

inconvenience of deploying it in remote areas [6]. Despite

faster and continuous technological advances in the diagnosis

of chest diseases, auscultation keeps the most extensively used

and effective lung disease diagnostic tool [7]. Respiratory

sounds captured using a stethoscope are a direct indicator of

lung health and abnormalities. They yield essential details and

significant information regarding the condition of the lungs.

The use of a stethoscope provides a restricted and biased

perspective of breathing sounds. Subjectivity causes

differences in the perception of lung sounds (LS) by different

medical specialists. Subjectivity and discrepancy are made by

the pathologist's ability to hear, experience, discriminate and

define various sound patterns. Moreover, the stethoscope

output is sensitive to noise. Such noisy signals obscure critical

aspects of LS signals and result in inaccurate lung illness

diagnoses [8].

Today's era of computing technology has achieved

significant strides in the early and quick identification of a

variety of respiratory disorders. To overcome the limitations

of physical diagnosis, various machine/deep learning

algorithms for automated lung disease recognition based on

lung sound data have been presented in the literature.

Moreover, most of these pulmonary disease classification

approaches are developed for a particular and limited lung

disease classification that is less precise, and ineffective for

the evaluation of non-stationary LS signals. However, this

study describes the development of an automated and

intelligent system for the accurate detection and classification

of two lung disorders, bronchiolitis, and bronchiectasis, using

signal processing and machine learning algorithms.

Authorized licensed use limited to: University of Canberra. Downloaded on June 10,2023 at 14:01:12 UTC from IEEE Xplore. Restrictions apply.

II. RELATED WORK

Currently, a machine learning model has been developed

to identify a single pulmonary pathology from the analysis of

lung sounds. [9]. A timely and precise diagnosis can reduce

the risk of mortality. However, the subjective nature of

abnormal noises like coughs has made the identification of

pneumonia, bronchiolitis, and other lung conditions extremely

challenging[10]. In the study conducted by Z. Tariq et al. [11],

the ICBHI dataset was used and six pulmonary diseases are

classified by using a convolution neural network (CNN), and

the feature vector is computed from the Spectrogram of

annotated lung sound samples. The classification accuracy

was found to be 97%. Garcia-Ordas et. al [12] presented a

convolution neural network (CNN) with variational

autoencoders (VAE) to classify lung disorders. The audio

signals were represented by Mel spectrograms. By using the

CNN model, they identify six distinct pathological diseases.

This study achieved 99.1% sensitivity and 99.4% specificity.

V. Base et al. in [13] purposed the deep neural network model

for the identification of the LS. These researchers have used

the MFCC features with the RNN classifier and achieved

95.67% accuracy on the six pulmonary diseases classification.

Another study conducted by [14] S. I. Khan et al. computed

tomparal and frequency domain features from the first four

IMFs of EMD and these IMFs were then further processed for

the two dimensions and higher dimensions space

representation. These researchers achieved the highest

performance of the ensembled bagged tree classifier with an

accuracy of 96.29% on 10-fold cross-validation. In [15] M.

Orders et al. implement the convolution neural network

(CNN) to identify pulmonary abnormalities. They used the

Mel Spectrogram for a visual representation of lung sounds.

The recognition rate for the six different classes was achieved

by 99%. In another research [16] L. Brumes et al. purposed a

neural network-based model for the identification of

pulmonary diseases. The feature vector consists of the

combination of time, cepstral and spectral for the input of the

classification learner. The model achieved an F-score of

98.3%. In [17] L. Fragipan et al. derived a novel method to

recognize the multi-class classification of pulmonary illness

from LS analysis. They used a boosted decision trees classifier

with Shannon entropy, log entropy, and spectral entropy

features to classify the pulmonary illness and achieved

98.27% accuracy. In [18] J. Acharya et. al presented a hybrid

CNN-RNN learning model consisting of three stages namely

deep CNN, Bi-LSTM, and SoftMax layers for the

classification of pulmonary diseases. These researchers also

implement a subject-specific classification model in which the

last 3rd stage of the proposed hybrid CNN-RNN is updated

according to the patient-specific data. Moreover, for the

reduction of memory cost, a weight quantization technique is

also discussed in the same study in which the amplitude of

every layer's weight is quantified in the log domain. Such

models are difficult to train due to the challenges of collecting

a vast amount of subject-specific data. While a subject-

specific model demands more time and effort from medical

experts for gathering and grouping the data. Previous studies

for computerized-based respiratory LS detection have been

conducted by using the application of signal processing and

machine learning ML algorithms to automate the diagnosing

mechanism however a great deal of work is still required in

this research area. A complete comparison of previous studies

is given in Table I. CNN requires a large dataset in order to

attain better results. The dataset in the case of lung sound is

limited. CNN also needs more computing resources, so can

not be deployed on low-cost embedding devices which are

battery-powered. There is a substantial need for an accurate

TABLE I. COMPARISON WITH PREVIOUS STUDIES

Study Database

Feature

extraction

Classification

Results

[11]

ICBHI

2017

Spectrogram

CNN

Sensitivity (97%)

[12]

ICBHI

2017

Mel

Spectrogram

CNN

99.1

% (

for

muti

class classification)

98.8% (for binary classification)

[13]

ICBHI

2017

MFCC Features

RNN

Sensitivity (95.67%)

F1-Score (95.66%)

Kappa (94.74%)

[14]

ICBHI

2017

our IMF features

calculated from EMD

Ensemble of

agged tree

Classifier

Sensitivity (9

6.29

[15]

ICBHI

2017

Mel

Spectrogram

CNN

Sensitivity (99.00%)

F1-Score (90.00%)

[16]

ICBHI

2017

Time, cepstral and spectral

feature from

recorded sound

eural

etwork

Algorithm

score

(

98.3%

)

[17]

ICBHI

2017 +

Self-

Recorded

Entropy features

Boosted DT

Sensitivity

(98.27%)

Specificity (95.28%)

F1-score (98.9%)

Kappa (92.28%)

[18]

ICBHI

2017

Mel

Spectrogram

Hybrid CNN

RNN

Patient Specific Model

(

66.31%

) for hybrid model

(71.81%) for patient specific Model

Authorized licensed use limited to: University of Canberra. Downloaded on June 10,2023 at 14:01:12 UTC from IEEE Xplore. Restrictions apply.

lung sound analysis based on pattern recognition and machine

learning.

III. PROPOSED METHODOLOGY

Figure 1 illustrates the proposed methodology of our

intelligent system for the detection of respiratory lung sound

LS diseases.

Firstly, bronchiolitis, bronchiectasis, and normal sounds

are taken from the Lung sound LS database and split each

recording into four parts to increase the total number of data

samples. In the second stage, pre-processing of these signals

is performed by using DWT. In the next stage, MFCCs

features are extracted and then classification is performed on

selected features by using the KNN classifier.

A. Lungs sound database

The publicly available dataset used in this research is

ICBHI 2017 which contains lung sound files as shown in

Figure 2. The human lung sounds in the ICBHI database were

recorded over several years by two different research teams.

The dataset contains a 5.5-hour audio recording of 126

subjects' lung sounds which are recorded with four types of

recording equipment. The lung sound recording contains the

6898 respiratory cycles that include the crackles, wheezes, a

combination of both, and annotated audio samples. The

dataset includes normal, bronchiectasis, bronchiolitis,

pneumonia, asthma, COPD, URTI, and LRTI, which describe

different lung conditions. Among these sounds, we used 64

audios in this research, which are normal, bronchiectasis, and

bronchiolitis. To increase the number of audio

samples, we divided the selected samples into four parts.

Table II provides details about the dataset used in this work.

B. Preprocessing

Preprocssing is very important step in signal processing

pipeline. For feature extraction purposes, it is preffered to

have clear audio samples of lungs. There are certain sounds

like heart sounds, background sounds, and vocalization that

make complexity in the identification of certain lung diseases

because inappropriate features are measured due to these

strange sounds. These noise elements hides little differences

that exists between sounds of different categories, and thus

make a classification task very complicated. To avoid

complication, it is always preffered to remove the unwanted

components and nosie from signal before further processing.

In this study, we applied discrete wavelet transform on all the

signals for preprocessing puspose [19-22]. Figures 3 and 4

represent the time domain of respiratory signals with and

without removing unwanted sounds. It was observed that after

preprocessing the lung sound signals, high freuqney noise was

significantly reduced. Frequency analysis was also carried out

to make sure that the preprocessed signal contains the relevant

frequency bands of lung sounds.

C. Feature Extraction

Features are the most important and essential components

that are fed to the classifier to differentiate between different

classes [23, 24]. Features represent the input signal data into

numeric components, thus reducing the data dimensionality,

and removing redundant information. An appropriate feature

vector must contain the relevant data and remove unnecessary

information. There are two main properties of strong features:

1) the Mean of features within the same class must be having

a minimum variance, and 2) Centroids of features of different

classes must be having significant distance. We use MFCC

information of denoised audio signals for classification in this

research. Mel frequency cepstral coefficients [25-27] is

computed from the discrete cosine transform of a log power

spectrum on a Mel frequency scale as described in equation 1

below.

   

 (1)

where f is equal to the frequency in hertz.

Fig. 2. ICBHI respiratory database of lungs sound

TABLE II. DATASET DETAILS

ID. Name of

Class

Audio

Files

Segmented Audio

Files

1. Healthy 35 140

2. Bronchiolitis 13 52

3. Bronchiectasis 16 64

Total 64 256

Fig.1. Proposed Method for Classification of bronchiolitis and bronchiectasis

Authorized licensed use limited to: University of Canberra. Downloaded on June 10,2023 at 14:01:12 UTC from IEEE Xplore. Restrictions apply.

To determine the MFCC coefficient from the audio input,

there are five steps to follow:

1. Frame the input wave with fixed-size windows.

2. Calculate the FFT of each frame

3. Use equation 1 to obtain the Mel spectrogram of each

segment.

4. Apply the logarithm to each Mel spectrogram to get the

log power spectrum.

5. Calculate the discrete cosine transform (DCT) of each

logarithmic power spectrum to get MFCC coefficients.

As a result, we obtain thirteen MFCC features/coefficients.

These features were further analyzed to identify best

performing parameters using back elimination method.

D. Classification

The final stage of this research is classification. After

extracting the MFCC features our audio signals pattern is

ready for classification. For the KNN classifier to recognize

the pattern of the different sounds, ten-fold cross-validations

were carried out to make sure random factors don't enter into

the equation [28-30]. The K-Nearest Neighbor algorithm [31]

is an instance-based learning model that classifies the objects

based on their nearest training instances in the feature. It used

a supervised learning algorithm to solve both regression and

classification problems. To classify a problem a KNN

algorithm requires only two parameters a distance feature and

the value of the K. The distance function for continuous

variables is calculated as follows

   



 (2)

!"#$ % &



 (3)

#'()'#  * # +



 ,+

- (4)

And for the categorical variables, the distance function is

calculated as follows

./  



  (5)

The most effective and strong technique for categorization

and prediction is the decision tree. The decision tree

classification algorithm is constructed using nodes, branches,

and leaf nodes. Fine Tree is a version of the decision tree, with

a huge number of branches and decision nodes. SVM is a

widely used classification method based. SVM differentiates

the input features using a hyperplane based on support vectors.

SVM classifies complex data patterns using nonlinear kernels

such as the Cubic operator. The kernel trick is used to increase

the dimension of input data. The extended dimensions are

employed to make a better separation between the two classes.

For multiclass problems, SVM adopts one-vs-one or one-vs-

rest approaches. Multiple models are trained and results are

predicted based on voting strategy. Boosted Tree is a kind of

ensemble method, which is a combination of many weak

learners. The prediction performance is enhanced in the

ensemble method due to the combination of different

classifiers. The performance of the proposed scheme was also

tested using kernel naive Bayes classifier, which is easy to

develop but required the input features to be independent.

Quadratic discriminant analysis (QD) is another type of

classifier that requires the input features to follow Gaussian

distribution. QD is also widely employed classifier in

biomedical domain.

IV. RESULTS AND DISCUSSION

The discrete wavelet transform algorithm is applied after

augmenting the three types of lung sound data: healthy,

bronchiolitis, and bronchiectasis. It split the signals into

detailed and approximate coefficients and removes the

approximate coefficient containing lower frequencies. We

analyze that our region of interest (ROI) lies in the

approximated coefficient of DWT due to the presence of the

desired frequency of LS in it. MFCCs, which are calculated

from the detail coefficient, are mostly used to classify LS

illnesses using different classifier models. The back-

elimination method is performed for the MFCCs features

fusion in this research and after all, we get the maximum

accuracy at 8 selected MFCC features out of 13. The extracted

features were tested with a several classification methods such

as Fine Tree (FT), Fine KNN, Cubic Support Vector Machine

(SVM), Boosted Trees (BT), Kernel Naïve Bayes (KNB), and

Quadratic Discriminant Classifier (QDC). The classification

Fig. 3. Raw LS for healthy, bronchiolitis, and bronchiectasis

Fig. 4. Preprocessed LS for healthy, bronchiolitis, and

bronchiectasis

Authorized licensed use limited to: University of Canberra. Downloaded on June 10,2023 at 14:01:12 UTC from IEEE Xplore. Restrictions apply.

performance was evaluated using 10-fold cross-validation. In

10-fold cross-validation, the dataset is divided into ten parts.

In each iteration, only one part is used as a "test set", while the

remaining nine parts are combined to create a "training set".

In the next iteration, the second part is used as a test set, and

all other parts are merged to construct a training set. This

process is repeated ten times, and ten models are evaluated.

The results of all iterations are averaged to get the final

performance measures. 10-fold cross-validation is a

significant scheme for smaller datasets.

The performance in terms of accuracy for differentiating

different LS features is illustrated in Figure 5. All

classification algorithms show comparable performance,

which confirms that processing and extracted features have

strong discrimination ability. Accuracies of 85.2%, 88.7%,

82%, and 87.9% were obtained with FT, BT, KNB, and QDC,

respectively. The best results of 99.3% were obtained using

Fine KNN, followed by the second best (94.1%) obtained

through Cubic SVM.

The experimental process was further extended by

exploring different versions of KNN. Figure 6 illustrated the

results using Fine KNN, Medium KNN, Coarse KNN, Cubic

KNN, Cosine KNN, and Weighted KNN. The lowest

performance of 62.2% was provided by the Coarse KNN.

Medium KNN predicts the class of input features by

comparing the results of the 10 nearest neighbors. Similar

results of 84.4% accuracy were obtained via Medium KNN,

Cosine KNN, and Cubic KNN. The best performance of

99.3% was yielded by Fine KNN for differentiating different

lung sounds.

The confusion matrix presented in Figure 7 shows

classwise results of classification using Fine KNN. Out of 140

Healthy sounds, only 1 was mispredicted as Bronchiolitis.

Similarly, only one sound out of 64 was mispredicted in the

case of Bronchiectasis. All 52 signals of Bronchiolitis were

correctly classified.

V. C

ONCLUSION AND FEATURE WORK

In this paper we propose a novel method to detect

bronchiolitis and bronchiectasis pathologies in LS. In the

studies of LS data, it is very usual to have a less amount of

data. One of the limitations that we are challenging during this

work is the lack of data, so we decided to split every audio

sample into four parts. For denoising the audio samples from

irrelevant frequencies, discrete wavelet transformations are

applied. The DWT yields approximate and detailed

coefficients that contain un-noised information. The

approximate coefficient contains the low frequencies part of

the input signal while the detailed coefficient contains the

higher frequencies part of the input signal. We extracted

MFCCs feature from the approximate coefficient of DWT for

the input of training models. We tested these features with

classification models and found that the fine KNN classifier

achieved the best results with 99.6% accuracy on selected

features. With this promising result, we conclude that the Fine

KNN machine learning model with the input of selected

MFCC features that extract from the approximate coefficient

of DWT has demonstrated excellent performance in the

classification of bronchiolitis and bronchiectasis illnesses.

Future works could include new feature exploration,

change in classification methods, and then integration of the

system on hardware. This research work can also be enhanced

for the diagnosing of other respiratory lung diseases by using

the same methodology.

EFERENCES

[1] T. Ooka et al., "Proteomics endotyping of infants with severe

bronchiolitis and risk of childhood asthma," Allergy, 10/23/2022.

[2] (19 october 2022). Bronchiolitis. Available:

https://www.nhs.uk/conditions/bronchiolitis/#:~:text=Bronchiolit

is%20is%20caused%20by%20a,children%20it%20can%20cause

%20bronchiolitis.

[3] Australian Institute of Health and Welfare 2020. Bronchiectasis.

Cat. no. ACM 36. Canberra: AIHW. Viewed 07 December 2020,

https://www.aihw.gov.au/reports/chronic-respiratory-

conditions/bronchiectasis

[4] A. E. O’Donnell, "Bronchiectasis—A Clinical Review," New

England Journal of Medicine, vol. 387, no. 6, pp. 533-545, 2022.

[5] S. R. Dalziel et al., "Bronchiolitis," The Lancet, 2022.

[6] B. A. Tessema, H. D. Nemomssa, and G. L. Simegn, "Acquisition

and Classification of Lung Sounds for Improving the Efficacy of

Auscultation Diagnosis of Pulmonary Diseases," Medical Devices

(Auckland, NZ), vol. 15, p. 89, 2022.

Fig. 5. Accuracies of various classifiers

85.20% 99.30% 94.10% 88.70% 82.00%

87.90%

Fine Tree

Fine KNN

Cubic SVM

Boosted Trees

Kernel Naive Bayes

Quadratic

Discriminant

Fig. 7. Confusion Matrix of the proposed system

Fig. 6. Accuracies of different KNN Kernels

99.30% 84.40%

60.20%

84.40% 84.40% 84.40%

Fine KNN

Medium KNN

Coarse KNN

Cosine KNN

Cubic KNN

Weighted

Authorized licensed use limited to: University of Canberra. Downloaded on June 10,2023 at 14:01:12 UTC from IEEE Xplore. Restrictions apply.

[7] G. Altan, Y. Kutlu, Y. Garbi, A. Ö. Pekmezci, and S. Nural,

"Multimedia Respiratory Database (RespiratoryDatabase@TR):

Auscultation Sounds and Chest X-rays," ArXiv, vol.

abs/2101.10946, 2021.

[8] M. Aykanat, Ö. Kılıç, B. Kurt, and S. Saryal, "Classification of

lung sounds using convolutional neural networks," EURASIP

Journal on Image and Video Processing, vol. 2017, no. 1, pp. 1-9,

2017.

[9] H. I. Hussein, H. I. Dino, R. J. Mstafa, and M. M. Hassan, "Person-

independent facial expression recognition based on the fusion of

HOG descriptor and cuttlefish algorithm," Multimedia Tools and

Applications, vol. 81, no. 8, pp. 11563-11586, 2022.

[10] S. Z. H. Naqvi and M. A. Choudhry, "An automated system for

classification of chronic obstructive pulmonary disease and

pneumonia patients using lung sound analysis," Sensors, vol. 20,

no. 22, p. 6512, 2020.

[11] Z. Tariq, S. K. Shah, and Y. Lee, "Lung Disease Classification

using Deep Convolutional Neural Network," 2019 IEEE

International Conference on Bioinformatics and Biomedicine

(BIBM), pp. 732-735, 2019.

[12] J. Saldanha, S. Chakraborty, S. Patil, K. Kotecha, S. Kumar, and

A. Nayyar, "Data augmentation using Variational Autoencoders

for improvement of respiratory disease classification," Plos one,

vol. 17, no. 8, p. e0266467, 2022.

[13] V. Basu and S. Rana, "Respiratory diseases recognition through

respiratory sound with the help of deep neural network," 2020 4th

International Conference on Computational Intelligence and

Networks (CINE), Kolkata, India, 2020, pp. 1-6, doi:

10.1109/CINE48825.2020.234388.

[14] S. I. Khan and R. B. Pachori, “Automated classification of lung

sound signals based on empirical mode decomposition,” Expert

Syst. Appl., vol. 184, p. 115456, Dec. 2021, doi:

10.1016/J.ESWA.2021.115456

[15] M. T. García-Ordás, J. A. Benítez-Andrades, I. García-Rodríguez,

C. Benavides, and H. Alaiz-Moretón, "Detecting Respiratory

Pathologies Using Convolutional Neural Networks and

Variational Autoencoders for Unbalancing Data," Sensors, vol.

20, no. 4, p. 1214, 2020.

[16] L. Brunese, F. Mercaldo, A. Reginelli, and A. Santone, "A Neural

Network-Based Method for Respiratory Sound Analysis and Lung

Disease Detection," Applied Sciences, vol. 12, no. 8, p. 3877,

2022.

[17] L. Fraiwan, O. Hassanin, M. Fraiwan, B. Khassawneh, A. M.

Ibnian, and M. Alkhodari, "Automatic identification of respiratory

diseases from stethoscopic lung sound signals using ensemble

classifiers," Biocybernetics and Biomedical Engineering, vol. 41,

no. 1, pp. 1-14, 2021.

[18] J. Acharya and A. Basu, "Deep neural network for respiratory

sound classification in wearable devices enabled by patient

specific model tuning," IEEE transactions on biomedical circuits

and systems, vol. 14, no. 3, pp. 535-544, 2020.

[19] M. U. Khan, S. J. T. J. o. E. E. Aziz, and C. Sciences, "A novel

pulse plethysmograph signal analysis method for identification of

myocardial infarction, dilated cardiomyopathy, and hypertension,"

vol. 29, no. 2, pp. 962-977, 2021.

[20] M. U. Khan et al., "Expert hypertension detection system featuring

pulse plethysmograph signals and hybrid feature selection and

reduction scheme," vol. 21, no. 1, p. 247, 2021.

[21] F. Amjad, A. Malik, M. Bilal, M. U. Khan, and S. Aziz, "Diagnosis

of Cardiac Disorders Featuring Pulse Plethysmograph Signals," in

2021 Mohammad Ali Jinnah University International Conference

on Computing (MAJICC), 2021, pp. 1-6: IEEE.

[22] M. U. Khan, S. Aziz, F. Amjad, and M. Mohsin, "Detection of

dilated cardiomyopathy using pulse plethysmographic signal

analysis," in 2019 22nd International multitopic conference

(INMIC), 2019, pp. 1-5: IEEE.

[23] M. U. Khan, S. Aziz, K. Iqtidar, R. J. B. S. P. Fernandez-Rojas,

and Control, "Computer-aided diagnosis system for cardiac

disorders using variational mode decomposition and novel cepstral

quinary patterns," vol. 81, p. 104509, 2023.

[24] S. Aziz, M. U. Khan, K. Iqtidar, S. Ali, A. N. Remete, and M. A.

J. E. S. Javid, "Pulse plethysmograph signal analysis method for

classification of heart diseases using novel local spectral ternary

patterns," p. e13011, 2022.

[25] Z. Kanji, "Classification of Auscultation Sounds Using a Smart

System," Massachusetts Institute of Technology, 2022.

[26] K. Iqtidar, U. Qamar, S. Aziz, M. U. J. C. i. B. Khan, and

Medicine, "Phonocardiogram signal analysis for classification of

Coronary Artery Diseases using MFCC and 1D adaptive local

ternary patterns," vol. 138, p. 104926, 2021.

[27] S. Aziz, M. U. Khan, M. Alhaisoni, T. Akram, and M. J. S. Altaf,

"Phonocardiogram signal processing for automatic diagnosis of

congenital heart disorders through fusion of temporal and cepstral

features," vol. 20, no. 13, p. 3790, 2020.

[28] M. U. Khan et al., "A two-stage classification model integrating

feature fusion for coronary artery disease detection and

classification," vol. 81, no. 10, pp. 13661-13690, 2022.

[29] U. Riaz, S. Aziz, M. Umar Khan, S. A. A. Zaidi, M. Ukasha, and

A. J. C. I. Rashid, "A novel embedded system design for the

detection and classification of cardiac disorders," vol. 37, no. 4,

pp. 1844-1864, 2021.

[30] S. Aziz, M. U. Khan, A. Rehman, Z. Tariq, and K. J. E. S. Iqtidar,

"Computer‐aided diagnosis of COVID‐19 disease from chest x‐

ray images integrating deep feature extraction," vol. 39, no. 5, p.

e12919, 2022.

[31] F. Bouaziz, D. Boutana, and H. Oulhadj, "Diagnostic of ECG

arrhythmia using wavelet analysis and K-nearest neighbor

algorithm," in 2018 International conference on applied smart

systems (ICASS), 2018, pp. 1-6: IEEE.

Authorized licensed use limited to: University of Canberra. Downloaded on June 10,2023 at 14:01:12 UTC from IEEE Xplore. Restrictions apply.

Machine Learning-Based Classification of Pulmonary Diseases through Real-Time Lung Sounds

Article

Full-text available

Oct 2023

The study presents a computer-based automated system that employs machine learning to classify pulmonary diseases using lung sound data collected from hospitals. Denoising techniques, such as discrete wavelet transform and variational mode decomposition, are applied to enhance classifier performance. The system combines cepstral features, such as Mel-frequency cepstrum coefficients and gammatone frequency cepstral coefficients, for classification. Four machine learning classifiers, namely the decision tree, k-nearest neighbor, linear discriminant analysis, and random forest, are compared. Evaluation metrics such as accuracy, recall, specificity, and f1 score are employed. This study includes patients affected by chronic obstructive pulmonary disease, asthma, bronchiectasis, and healthy individuals. The results demonstrate that the random forest classifier outperforms the others, achieving an accuracy of 99.72% along with 100% recall, specificity, and f1 scores. The study suggests that the computer-based system serves as a decision-making tool for classifying pulmonary diseases, especially in resource-limited settings.

Pulmo-TS2ONN: A Novel Triple Scale Self Operational Neural Network for Pulmonary Disorder Detection Using Respiratory Sounds

Article

Jan 2024

Pulmonary disorders (PDs) are one of the substantial hazards to human life, which can be diagnosed by a variety of clinical modalities, including peak flow meter and spirometry measurements, chest auscultation-based respiratory sound (RS) measurements, etc. Analyzing the acoustic RS measurements is one of the inexpensive yet essential diagnostic methods for identifying PDs as these RSs are correlated with structural flaws of the lungs that occur due to PDs. Additionally, the development of the digital stethoscope facilitates the continuous measurement of acoustic RSs of any individual, which can be exploited to identify a variety of PDs. In this paper, we have proposed a triple time-frequency feature set (TTFFS) driven triple-scale self-operational neural network (TS2ONN) architecture, namely Pulmo-TS2ONN, to classify a wide spectrum of PDs using the RSs. The proposed pulmo-TS2ONN comprises three major stages: pre-processing, TTFFS extraction, and finally classification of seven class PDs by using TS2ONN architecture which utilizes the improved non-linear neural backbone of self-operational neural network (SONN) in place of the linear neural architecture used in conventional deep learning networks. Upon experimental evaluation, the proposed framework outperforms the existing noteworthy research works by achieving the highest performance rates of 98.88%, 98.27%, and 99.84% for accuracy, sensitivity, and specificity, respectively. Lastly, the proposed framework is implemented on a quad-core ARM-A7-based Raspberry Pi-4 microcontroller, allowing the possibility of translating the research into real clinical situations for RS-based PD screening.

Computer-aided diagnosis system for cardiac disorders using variational mode decomposition and novel cepstral quinary patterns

Article

Full-text available

Dec 2022
BIOMED SIGNAL PROCES

Cardiac disorders cause a large number of human mortalities every year. This raises a sheer need for an early and accurate diagnosis of cardiac disorders to provide early meaningful intervention. Ischemic heart disease (IHD) and rheumatic heart disease (RHD) are the leading causes of heart failure. In this article, we proposed a novel framework for the classification of IHD and RHD using Pulse Plethysmograph (PuPG) signals obtained from a subject’s fingertip. The presented framework comprises a combination of variational mode decomposition (VMD), cosine-based soft segmentation, and novel cepstral quinary patterns (CQPs). The PuPG signals were first preprocessed through VMD by decomposing them in various modes. After an extensive time–frequency analysis, only relevant modes were selected and combined to reconstruct a preprocessed PuPG signal. The preprocessed signals were segmented through developed cosine-based soft segmentation to eliminate similar content in various classes. Features were extracted from the preprocessed signal using novel CQPs. CQPs were able to extract the hidden discriminative information about the disease through cepstrum transformed representation. The extracted CQP features were further reduced through the ReliefF ranking algorithm. The extracted reduced features were exposed to a range of well-known classification methods such as Support Vector Machines (SVM) with linear and non-linear kernels, Ensemble classifiers, and K-nearest neighbors. SVM-Gaussian (SVMG) provides the best performance of 99% accuracy using 10-fold cross-validation. The proposed CQPs were also compared with time, frequency, and cepstral features. Comparative analysis confirms that the proposed method outperforms the existing renowned techniques for the diagnosis of cardiac disorders.

Data augmentation using Variational Autoencoders for improvement of respiratory disease classification

Article

Full-text available

Aug 2022
PLOS ONE

Computerized auscultation of lung sounds is gaining importance today with the availability of lung sounds and its potential in overcoming the limitations of traditional diagnosis methods for respiratory diseases. The publicly available ICBHI respiratory sounds database is severely imbalanced, making it difficult for a deep learning model to generalize and provide reliable results. This work aims to synthesize respiratory sounds of various categories using variants of Variational Autoencoders like Multilayer Perceptron VAE (MLP-VAE), Convolutional VAE (CVAE) Conditional VAE and compare the influence of augmenting the imbalanced dataset on the performance of various lung sound classification models. We evaluated the quality of the synthetic respiratory sounds’ quality using metrics such as Fréchet Audio Distance (FAD), Cross-Correlation and Mel Cepstral Distortion. Our results showed that MLP-VAE achieved an average FAD of 12.42 over all classes, whereas Convolutional VAE and Conditional CVAE achieved an average FAD of 11.58 and 11.64 for all classes, respectively. A significant improvement in the classification performance metrics was observed upon augmenting the imbalanced dataset for certain minority classes and marginal improvement for the other classes. Hence, our work shows that deep learning-based lung sound classification models are not only a promising solution over traditional methods but can also achieve a significant performance boost upon augmenting an imbalanced training set.

A Neural Network-Based Method for Respiratory Sound Analysis and Lung Disease Detection

Article

Full-text available

Apr 2022

Background: Respiratory sound analysis represents a research topic of growing interest in recent times. In fact, in this area, there is the potential to automatically infer the abnormalities in the preliminary stages of a lung dysfunction. Methods: In this paper, we propose a method to analyse respiratory sounds in an automatic way. The aim is to show the effectiveness of machine learning techniques in respiratory sound analysis. A feature vector is gathered directly from breath audio and, thus, by exploiting supervised machine learning techniques, we detect if the feature vector is related to a patient affected by a lung disease. Moreover, the proposed method is able to characterise the lung disease in asthma, bronchiectasis, bronchiolitis, chronic obstructive pulmonary disease, pneumonia, and lower or upper respiratory tract infection. Results: A retrospective experimental analysis on 126 patients with 920 recording sessions showed the effectiveness of the proposed method. Conclusion: The experimental analysis demonstrated that it is possible to detect lung disease by exploiting machine learning techniques. We considered several supervised machine learning algorithms, obtaining the most interesting performance with the neural network model, with an F-Measure of 0.983 in lung disease detection and equal to 0.923 in lung disease characterisation, increasing the state-of-the-art performance.

Pulse plethysmograph signal analysis method for classification of heart diseases using novel local spectral ternary patterns

Article

Full-text available

Apr 2022
EXPERT SYST

Cardiac disorders are one of the prime reasons for an increasing global death rate. Reliable and efficient diagnosis procedures are imperative to minimize the risk posed by heart disorders. Computer‐aided diagnosis, based on machine learning and biomedical signal analysis, has recently been adopted by researchers to accurately predict cardiac ailments. Multi‐channel Electrocardiogram signals are mostly used in scientific literature as an indicator to diagnose cardiac disorders. Recently pulse plethysmograph (PuPG) signal got attention as an evolving biosignal and promising diagnostic tool to detect heart disorders since it has a simple sensor with low cost, non‐invasive, reliable, and easy to handle technology. This article proposes a computer‐aided diagnosis system to detect Myocardial Infarction, Dilated Cardiomyopathy, and Hypertension from PuPG signals. Raw PuPG signal is first preprocessed through empirical mode decomposition (EMD) by removing the redundant and useless information content. Then, highly discriminative features are extracted from preprocessed PuPG signal through novel local spectral ternary patterns (LSTP). Extracted LSTPs are input to a variety of classification methods such as support vector machines (SVM), K‐nearest neighbours, decision tree, and so on. SVM with cubic kernel yielded the best classification performance of 98.4% accuracy, 96.7% sensitivity, and 99.6% specificity with 10‐fold cross‐validation. The proposed framework was trained and tested on a self‐collected PuPG signals database of heart disorders. A comparison with previous studies and other feature descriptors shows the superiority of the proposed system. This research provides better insights into the contributions of PuPG signals towards reliable detection of heart disorder through low‐cost and non‐invasive means.

Acquisition and Classification of Lung Sounds for Improving the Efficacy of Auscultation Diagnosis of Pulmonary Diseases

Article

Full-text available

Apr 2022

Purpose Lung diseases are the third leading cause of death worldwide. Stethoscope-based auscultation is the most commonly used, non-invasive, inexpensive, and primary diagnostic approach for assessing lung conditions. However, the manual auscultation-based diagnosis procedure is prone to error, and its accuracy is dependent on the physician’s experience and hearing capacity. Moreover, the stethoscope recording is vulnerable to different noises that can mask the important features of lung sounds which may lead to misdiagnosis. In this paper, a method for the acquisition of lung sound signals and classification of the top 7 lung diseases has been proposed for improving the efficacy of auscultation diagnosis of pulmonary disease. Methods An electronic stethoscope has been constructed for signal acquisition. Lung sound signals were then collected from people with COPD, upper respiratory tract infections (URTI), lower respiratory tract infections (LRTI), pneumonia, bronchiectasis, bronchiolitis, asthma, and healthy people. Lung sounds were analyzed using a wavelet multiresolution analysis. To choose the most relevant features, feature selection using one-way ANOVA was performed. The classification accuracy of various machine learning classifiers was compared, and the Fine Gaussian SVM was chosen for final classification due to its superior performance. Model optimization was accomplished through the application of Bayesian optimization techniques. Results A test classification accuracy of 99%, specificity of 99.2%, and sensitivity of 99.04%, have been achieved for the 7 lung diseases using the optimized Fine Gaussian SVM classifier. Conclusion Our experimental results demonstrate that the proposed method has the potential to be used as a decision support system for the classification of lung diseases, especially in those areas where the expertise and the means are limited.

Person-independent facial expression recognition based on the fusion of HOG descriptor and cuttlefish algorithm

Article

Full-text available

Mar 2022
MULTIMED TOOLS APPL

This paper proposes an efficient approach for person-independent facial expression recognition based on the fusion of Histogram of Oriented Gradients (HOG) descriptor and Cuttlefish Algorithm (CFA). The proposed approach employs HOG descriptor due to its outstanding performance in pattern recognition, which results in features that are robust against small local pose and illumination variations. However, it produces some irrelevant and noisy features that slow down and degrade the classification performance. To address this problem, a wrapper-based feature selector, called CFA, is used. This is because CFA is a recent bio-inspired feature selection algorithm, which has been shown to effectively select an optimal subset of features while achieving a high accuracy rate. Here, support vector machine classifier is used to evaluate the quality of the features selected by the CFA. Experimental results validated the effectiveness of the proposed approach in attaining a high recognition accuracy rate on three widely adopted datasets: CK+ (97.86%), RaFD (95.15%), and JAFFE (90.95%). Moreover, the results also indicated that the proposed approach yields competitive or even superior results compared to state-of-the-art approaches.

Classification of Auscultation Sounds Using a Smart System

Thesis

May 2022

Zahra Kanji

Respiratory diseases are a leading cause of death worldwide. Despite modern medicine, treatment of lung diseases is limited by the tools available to diagnose these disorders, especially in low resource settings. While tools such as chest x-ray and CT scans are highly accurate, their high cost provides a high barrier for many patient populations. The physical exam has been a long standing tried and true method that provides a low cost solution for for diagnosis of many common lung diseases including pneumonia. However, this method is subjective and its sensitivity is limited to the operator ability. Lung sound classification and using a digital stethoscope can be used to provide an immediate diagnostic for respiratory-related diseases. The International Conference on Biomedical and Health Informatics (ICBHI) created a sound data base in 2017 that is annotated with a classification of the lung sound by physicians. In this thesis, artificial intelligence libraries are used in a deeo learning architecture to identify and classify the lung sounds. The data set was split into training and test data and evaluated using standard performance metrics: precision, 92.3%, accuracy, 87.3%, sensitivity (recall), 87.1%, specificity, 87.5% and F1 Score, 0.89%. Because the data set is skewed right, the best evaluation metric is the F1 Score, which is a weighted average of precision and sensitivity. The F1 score was found to be better than other comparable known attempts on this same data set. The space for new, innovative, portable and affordable diagnostic devices that aid patients towards pulmonary health and wellness will likely push the development further of the acceptance of electronic auscultations. As telemedicine grows, this will also drive up the demands for such devices. Other holistic measures that are used in medicine will likely also be be developed as the landscape of healthtech changes what is possible.

Bronchiectasis - A Clinical Review

Article

Aug 2022
NEW ENGL J MED

Anne E O'Donnell

Bronchiolitis

Article

Jul 2022
LANCET

Viral bronchiolitis is the most common cause of admission to hospital for infants in high-income countries. Respiratory syncytial virus accounts for 60-80% of bronchiolitis presentations. Bronchiolitis is diagnosed clinically without the need for viral testing. Management recommendations, based predominantly on high-quality evidence, advise clinicians to support hydration and oxygenation only. Evidence suggests no benefit with use of glucocorticoids or bronchodilators, with further evidence required to support use of hypertonic saline in bronchiolitis. Evidence is scarce in the intensive care unit. Evidence suggests use of high-flow therapy in bronchiolitis is limited to rescue therapy after failure of standard subnasal oxygen only in infants who are hypoxic and does not decrease rates of intensive care unit admission or intubation. Despite systematic reviews and international clinical practice guidelines promoting supportive rather than interventional therapy, universal de-implementation of interventional care in bronchiolitis has not occurred and remains a major challenge.

Proteomics endotyping of infants with severe bronchiolitis and risk of childhood asthma

Article

May 2022

Background: Bronchiolitis is the leading cause of hospitalization in U.S. infants and a major risk factor for childhood asthma. Growing evidence supports clinical heterogeneity within bronchiolitis. We aimed to identify endotypes of infant bronchiolitis by integrating clinical, virus, and serum proteome data, and examine their relationships with asthma development. Methods: This is a multicenter prospective cohort study of infants hospitalized for physician-diagnosis of bronchiolitis. We identified bronchiolitis endotypes by applying unsupervised machine learning (clustering) approaches to integrated clinical, virus (respiratory syncytial virus [RSV], rhinovirus [RV]), and serum proteome data measured at hospitalization. We then examined their longitudinal association with the risk for developing asthma by age 6 years. Results: In 140 infants hospitalized with bronchiolitis, we identified three endotypes: 1) clinicalatopic virusRV proteomeNFκB-dysregulated , 2) clinicalnon-atopic virusRSV/RV proteomeTNF-dysregulated , and 3) clinicalclassic virusRSV proteomeNFκB/TNF-regulated endotypes. Endotype 1 infants were characterized by high proportion of IgE sensitization and RV infection. These endotype 1 infants also had dysregulated NFκB pathways (FDR<0.001) and significantly higher risks for developing asthma (53% vs. 22%; adjOR 4.04; 95%CI, 1.49-11.0; P=0.006), compared with endotype 3 (clinically resembling "classic" bronchiolitis). Likewise, endotype 2 infants were characterized by low proportion of IgE sensitization and high proportion of RSV or RV infection. These endotype 2 infants had dysregulated tumor necrosis factor (TNF)-mediated signaling pathway (FDR<0.001) and significantly higher risks for developing asthma (44% vs. 22%; adjOR 2.71; 95%CI, 1.03-7.11, P=0.04). Conclusion: In this multicenter cohort, integrated clustering of clinical, virus, and proteome data identified biologically distinct endotypes of bronchiolitis that have differential risks of asthma development.

An Automated System for the Classification of Bronchiolitis and Bronchiectasis Diseases using Lung Sound Analysis

Abstract

Recommended publications

Expert System for Diagnosis of Multiple Neuromuscular Disorders using EMG Signals

Diagnosis of Diabetes Mellitus using Pulse Plethysmograph

Intelligent System for the Diagnosis of Schizophrenia featuring Brain Textures from EEG

Detecting Lung Infections with Empirical Mode Decomposition and Neural Networks