Conference PaperPDF Available

FPGA implementation of an automatic wheezes detector based on MFCC and SVM

March 2016

March 2016

DOI:10.1109/ATSIP.2016.7523173

Conference: 2016 2nd International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)

Authors:

Ons Boujelben

Université du Québec à Rimouski UQAR

Mohammed Bahoura

Université du Québec à Rimouski UQAR

The present paper proposes a new hardware implementation to categorize lung sounds into wheeze and normal groups. The suggested architecture employs the Mel-Frequency Cepstral Coefficients (MFCC) for feature extraction and the Support Vector Machine (SVM) for classification. In the current study, the SVM parameters are obtained during the training phase using the LIBSVM library in MATLAB, while the testing phase is performed on FPGA. The used database is composed of 12 normal respiratory sounds and 12 respiratory sounds containing wheezes. The classification results obtained with FPGA are compared to those obtained with MATLAB.

Principle of classification using the SVM techniques for two classes

…

Classification of normal (a) and wheezing (b and c) respiratory sounds into of segments into normal {+1} and wheezing {−1} frames. Every subfigure contains the spectrogram of the tested sample (top) and the classification results using fixed-point of XSG (middle) and floating-point of MATLAB (bottom).

…

Figures - uploaded by Mohammed Bahoura

Content may be subject to copyright.

Content uploaded by Mohammed Bahoura

Content may be subject to copyright.

FPGA Implementation of an Automatic Wheezes

Detector based on MFCC and SVM

Ons Boujelben1

1Department of Engineering,

Universit´

e du Qu´

ebec `

a Rimouski,

300, all´

ee des Ursulines, Rimouski, Canada

Mohammed Bahoura1,2

2Department of Electronics,

Universit´

e Saad Dahlab de Blida,

Route de Soumˆ

aa, Blida, Algeria.

Abstract—The present paper proposes a new hardware im-

plementation to categorize lung sounds into wheeze and normal

groups. The suggested architecture employs the Mel-Frequency

Cepstral Coefﬁcients (MFCC) for feature extraction and the

Support Vector Machine (SVM) for classiﬁcation. In the current

study, the SVM parameters are obtained during the training

phase using the LIBSVM library in MATLAB, while the testing

phase is performed on FPGA. The used database is composed

of 12 normal respiratory sounds and 12 respiratory sounds

containing wheezes. The classiﬁcation results obtained with FPGA

are compared to those obtained with MATLAB.

Keywords—Respiratory sounds, Wheezes, Classiﬁcation, FPGA,

SVM, MFCC.

I. INTRODUCTION

Asthma is a Chronic Obstructive Pulmonary Disease

(COPD), where the number of the people affected is constantly

increasing. Computer-based lung sound analysis provides an

objective tool to diagnose the respiratory diseases. Many

researchers have been interested by the lung sounds recog-

nition problems, where different signal processing techniques

have been developed to classify lung sounds. Since 1980,

scientists try to identify automatically the presence of wheez-

ing [1]. To classify respiratory sounds, different combinations

of feature extraction and classiﬁcation methods have been

proposed in the literature. Mel-frequency cepstral coefﬁcients

(MFCC) was combined with support vector machine (SVM),

k-Nearest Neighbour (kNN) [2] and Gaussian mixture models

(GMM) [3]. The wavelet transform was proposed with artiﬁcial

neural networks (ANN) [4], [5], and other combination can be

found in [5], [6]. Among these techniques, the combination

MFCC-SVM has been efﬁciently applied to detect wheezes

sounds, it can achieve a classiﬁcation accuracy higher than

95% [1].

Despite its advantages, the respiratory sounds analysis has

not reach yet the step in which it can be used the clinical

environment. On the other hand, real-time implementation of

various signal processing of feature extraction and pattern

classiﬁcation remains a great challenge. Therefore, we are

interested in this research to carry out a hardware implementa-

tion of an automatic system for detecting and classifying lung

sounds into normal and wheezes. The literature review shows

that MFCC-based feature extraction for respiratory sounds has

been implemented on FPGA [7], while the SVM classiﬁer

was implemented on FPGA for Persian handwritten digits

recognition [8].

This paper proposes an FPGA-based real-time system to

detect wheezes episode at asthmatic patients using Xilinx

System Generator (XSG). The hardware design is generated

and veriﬁed in MATLAB/SIMULINK.

II. FE ATUR E EXTRACTION ALGORITHM

MFCC-based method is performed to describe lung sounds

in order to approximate the response of human auditory

system. The extracted feature describes ﬁrmly the sound that

can be heard over the stethoscope [1].

Sampled at 6000 Hz, the lung sound is ﬁrst segmented

into frames of Nsamples, and then multiplied by Hamming

window. For the mth segment s(m, n), the Discrete Fourier

Transform (DFT), S(m, k), is computed using the Fast Fourier

Transform (FFT) algorithm, the M-mel ﬁlter bank is applied

to the resulting energy spectrum. The logarithmic energy

output of the lth ﬁlter for the current frame mis deﬁned as

e(m, l) = log(

N−1

k=0

|S(m, k)|2Hl(k)) (1)

where Hl(k)represent the frequency response of the given

ﬁlter, where l= 1, ..., M. The MFCC coefﬁcients are obtained

by discrete cosine transform (DCT)

cm(n) =

l=1

e(m, l)cos(n(l−0.5)π/M)(2)

where nrepresent the index of the cepstral coefﬁcient. In this

case, 15 MFCC have been used: cm(3), ..., cm(17).

III. CLASSIFICATION ALGORITHM

Support vector machine (SVM) technique was proposed for

regression and classiﬁcation problems. It is based on a kernel

learning algorithm that classify binary or multiple data. The

SVM operates in both training and testing steps. During the

training step, SVM builds a predictive pattern using training

samples and the label values of the proper class, and then it

uses this model to classify the test set. Considering a linear

problem, the main purpose of SVM is to deﬁne an hyperplane

such that: the class labels of data {±1}are located on each

side of the hyperplane and the distance of the nearest vector

of the hyperplane (both classes) is maximum.

Fig. 1. Principle of classiﬁcation using the SVM techniques for two classes

The parameters wand bin Fig. 1 are obtained by solving

the following dual Lagrange problem

max(Ld(α)) =

i=1

αi−1

j,i=1

αiαjyiyjxjxT

i(3)

based on (0≤αi≤C

i=1 αiyi= 0

The equation that determines the optimal hyperplane separat-

ing the two classes with the highest margin is deﬁned by :

M(x) = wTx+b(4)

The decision function in the context of linear data is deﬁned

by the sign of the hyperplane in (4):

d(x) = sign(wTx+b)(5)

In the case of data non-linearly separated, SVM maps data

into a richer feature space (H) including non-linear features,

then constructs an hyperplane in that space.

ϕ:Rn→H(6)

x→ϕ(x)(7)

In this case the vector xis transformed into ϕ(x). The

kernel function is deﬁned by the following inner product:

k(xi, xj) = ϕ(xi)×ϕ(xj)(8)

For non-linear data, the SVM make a decision satisfying the

following equation

d(x) = sign(wT×ϕ(x) + b(9)

The software tests show that the use of the linear kernel gives

the maximum classiﬁcation accuracy. In this research, we pro-

pose to use the linear kernel function, because it demonstrates

a quite efﬁcient for classifying respiratory sounds.

TABLE I. RESOURCE UTILIZATION SUMMARY AND MAXIMUM

FREQUENCY OBTAINED FOR THE VIRTEX -6 XC6VLX240T CHIP.

Resource utilization

Flip Flops (301,440) 13,398 (4.4%)

LUTs (150,720) 18,943 (12.6%)

Bonded IOBs (600) 561 (93.5%)

RAMB18E1s (832) 4 (0.5%)

DSP48E1s (768) 154 (20.0%)

Slice (37,680) 5,959 (15%)

Maximum Operating Frequency 30.361 MHz

IV. FPGA IMPLEMENTATION

Figure 2 shows the block diagram of hardware architecture

design for MFCC feature extraction and two-class SVM-based

classiﬁer. The hardware implementation uses Xilinx System

Generator (XSG) tool and the Virtex-6 FPGA ML605 evalua-

tion board. Fig. 2 represents an optional subsystem designed

with SIMULINK blocks, which select one decision of classi-

ﬁcations for every frame. More details on the FPGA imple-

mentation of MFCC feature extraction technique and the SVM

classiﬁer can be found in [7] and [8], respectively. The training

phase of the classiﬁer is achieved ofﬂine with LIBSVM [9],

while the testing phase is done on hardware. Table I shows

the hardware resources used in Virtex-6XC6VLX240T device

and the maximum operating frequency of the implemented

architecture, as reported by Xilinx ISE Design Suite 13.4.

V. RE SU LTS AND DISCUSSION

To evaluate the proposed architecture, two classes of respi-

ratory sound (normal and wheezing) are used for training and

testing samples. Database is constructed from 12 records of

each class (total of 24 records), some wheezes sounds include

monophonic and polyphonic wheezes. The used lung sounds

are sampled at 6000 Hz. Wheezing sounds are manually

labelled. We named class1 with label {+1}for normal frame,

class2 with label {−1}for wheezing frame. The {±1}labels

represent the class of the tested frame.

The classiﬁcation results of normal and pure wheezing

respiratory sounds, presented in Fig. 3(a,b), shows that both the

designed architectures with Xilinx System Generator (XSG)

and MATLAB software provide the same classiﬁcation results.

The respiratory sound record presented in Fig. 3(c) contains

normal and wheezes sounds. In this case, both architectures

(XSG and MATLAB) can distinguish between the frame

containing normal lung sounds from those that containing

wheezes. The difference (one misclassiﬁed frame) can be

justiﬁed by the quantization errors in System Generator [8].

Finally, the designed architecture implemented with ﬁxed-

point XSG gives equivalent performances than those obtained

with the ﬂoating-point MATLAB.

VI. CONCLUSION

In this paper, FPGA architecture of an automatic wheezes

detector based on MFCC and SVM has been proposed. Based

on the tested records, the classiﬁcation performances obtained

with hardware implementation are analogous to those obtained

with the ﬂoating-point MATLAB. The designed architecture

Fig. 2. MFCC-SVM system based on Xilinx System Generator (XSG) blockest for wheezes classiﬁcation. The complete implemented system is presented on

the top, the subsystem details are presented. The green blocks are build using the XSG blocks (blue). The white blocks are the standard SIMULINK blocks.

The feature extraction MFCC is the same as described in [7].

Fig. 3. Classiﬁcation of normal (a) and wheezing (b and c) respiratory sounds into of segments into normal {+1}and wheezing {−1}frames. Every subﬁgure

contains the spectrogram of the tested sample (top) and the classiﬁcation results using ﬁxed-point of XSG (middle) and ﬂoating-point of MATLAB (bottom).

can be generalized to other respiratory sound classes. As a

future work, the proposed architecture will be tested on a large

database. The implementation of others feature extraction is

recommended to improve the identiﬁcation accuracy.

ACKNOWLEDGMENT

This research is supported by the NSERC of Canada.

REFERENCES

[1] I. Mazic, M. Bonkovic, and B. Daja, “Two-level coarse-to-ﬁne classiﬁca-

tion algorithm for asthma wheezing recognition in children’s respiratory

sounds,” Biomedical Signal Processing and Control, vol. 21, pp. 105–

118, 2015.

[2] R. Palaniappan, K. Sundaraj, and S. Sundaraj, “A comparative study

of the svm and k-nn machine learning algorithms for the diagnosis

of respiratory pathologies using pulmonary acoustic signals,” BMC

bioinformatics, vol. 15, no. 1, p. 223, 2014.

[3] M. Bahoura and C. Pelletier, “Respiratory sounds classiﬁcation using

cepstral analysis and gaussian mixture models,” in Engineering in

Medicine and Biology Society, 2004. IEMBS’04. 26th Annual Interna-

tional Conference of the IEEE, vol. 1, 2004, pp. 9–12.

[4] A. Kandaswamy, C. S. Kumar, R. P. Ramanathan, S. Jayaraman, and

N. Malmurugan, “Neural classiﬁcation of lung sounds using wavelet

coefﬁcients,” Computers in Biology and Medicine, vol. 34, no. 6, pp.

523–537, 2004.

[5] M. Bahoura, “Pattern recognition methods applied to respiratory sounds

classiﬁcation into normal and wheeze classes,” Computers in biology and

medicine, vol. 39, no. 9, pp. 824–843, 2009.

[6] R. Palaniappan, K. Sundaraj, and N. U. Ahamed, “Machine learning in

lung sound analysis: a systematic review,” Biocybernetics and Biomedical

Engineering, vol. 33, no. 3, pp. 129–135, 2013.

[7] M. Bahoura and H. Ezzaidi, “Hardware implementation of MFCC feature

extraction for respiratory sounds analysis,” in 8th Workshop on Systems,

Signal Processing and their Applications, 2013, pp. 226–229.

[8] D. Mahmoodi, A. Soleimani, H. Khosravi, M. Taghizadeh et al., “FPGA

simulation of linear and nonlinear support vector machine,” Journal of

Software Engineering and Applications, vol. 4, no. 05, p. 320, 2011.

[9] C.-C. Chang and C.-J. Lin, “LIBSVM: A library for support vector

machines,” ACM Transactions on Intelligent Systems and Technology

(TIST), vol. 2, no. 3, p. 27, 2011.

System-Level Power Consumption Analysis of the Wearable Asthmatic Wheeze Quantification

Article

Full-text available

Apr 2018

Long-term quantification of asthmatic wheezing envisions an m-Health sensor system consisting of a smartphone and a body-worn wireless acoustic sensor. As both devices are power constrained, the main criterion guiding the system design comes down to minimization of power consumption, while retaining sufficient respiratory sound classification accuracy (i.e., wheeze detection). Crucial for assessment of the system-level power consumption is the understanding of trade-off between power cost of computationally intensive local processing and communication. Therefore, we analyze power requirements of signal acquisition, processing, and communication in three typical operating scenarios: (1) streaming of uncompressed respiratory signal to a smartphone for classification, (2) signal streaming utilizing compressive sensing (CS) for reduction of data rate, and (3) respiratory sound classification onboard the wearable sensor. Study shows that the third scenario featuring the lowest communication cost enables the lowest total sensor system power consumption ranging from 328 to 428 μ W. In such scenario, 32-bit ARM Cortex M3/M4 cores typically embedded within Bluetooth 4 SoC modules feature the optimal trade-off between onboard classification performance and consumption. On the other hand, study confirms that CS enables the most power-efficient design of the wearable sensor (216 to 357 μ W) in the compressed signal streaming, the second scenario. In such case, a single low-power ARM Cortex-A53 core is sufficient for simultaneous real-time CS reconstruction and classification on the smartphone, while keeping the total system power within budget for uncompressed streaming.

Asthmatic Wheeze Detection From Compressively Sensed Respiratory Sound Spectra

Article

Dec 2017

Quantification of wheezing by a sensor system consisting of a wearable wireless acoustic sensor and smartphone performing respiratory sound classification, may contribute to the diagnosis, long-term control, and lowering treatment costs of asthma. In such battery-powered sensor system, compressive sensing (CS) was verified as a method for simultaneously cutting down power-cost of signal acquisition, compression, and communication on the wearable sensor. Matching real-time CS reconstruction algorithms, such as orthogonal matching pursuit (OMP), have been demonstrated on the smartphone. However, their lossy performance limits the accuracy of wheeze detection from CS-recovered short-term Fourier spectra (STFT), when using existing respiratory sound classification algorithms. Thus, here we present a novel, robust algorithm tailored specifically for wheeze detection from the CS-recovered STFT. Proposed algorithm identifies occurrence and tracks multiple individual wheeze frequency lines using hidden Markov model (HMM). Algorithm yields 89.34% of sensitivity, 96.28% specificity, and 94.91% of accuracy on Nyquist-rate sampled respiratory sounds STFT. It enables for less than 2% loss of classification accuracy when operating over STFT reconstructed by OMP, at the signal compression ratio of up to 4x (classification from only 25% signal samples). It features execution speed comparable to referent algorithms, and offers good prospects for parallelism.

Spectral Analysis of Lungs sounds for Classification of Asthma and Pneumonia Wheezing

Conference Paper

Full-text available

Jun 2020

World Health Organization Statistics declares the pulmonic illness as the class of deadly illness. Wheezing is a key indicator for the diagnosis of pulmonic illnesses like Asthma and pneumonia. In this research article, the identification of wheeze sound in asthma and pneumonia subjects is done from breathing sound. The analysis is performed through signal processing and machine learning practices. Overall, data is acquired from 300 subjects. It includes 100 Asthma, 100 Pneumonia, and 100 Normal subjects This research work proposes a complete design for accurate classification of wheezing signals. It includes pre-processing by normalization, denoising by filtration, segmentation to remove the non-breathing and silent parts, feature extraction from the spectral domain, and classification by support vector machine (SVM) using Matlab 2019b. The system evidenced an accuracy greater than 96%. Further investigation can be done by analyzing the wheezing sound originates in other pulmonic diseases and exploring its role to identify the pulmonary illness.

Reconfigurable Computing and Hardware Acceleration in Health Informatics

Chapter

Oct 2020

Health informatics connects biomedical engineering with information technology to devise a modern eHealth system which often requires precise biosignal processing. This “biosignal” is essentially an electrophysiological signal from a living organism. In practice, these signals are frequently used to assess patients’ health and to discover bio-physiological anonymities. However, as most of the biosignal processing units are multichannel systems with extensive datasets, conventional computation techniques often fail to offer immediate execution of data processing. Reconfigurable architecture offers a tangible solution to this problem by utilizing fast parallel computation based on the Field Programmable Gate Array (FPGA). This computation technique ensures “Hardware Acceleration” which essentially means the exclusive utilization of hardware resources to expedite computational tasks. This is the technique of designing application-specific circuits rather than using the general-purpose processors to do the signal processing. Because of its low cost and fast computation property, reconfigurable architecture is characteristically suitable for Health Informatics and has become one of the fastest-growing research fields of recent years. In literature, several works are found focusing on the efficient use of FPGAs as the biomedical computation units. Some of these researches involve fundamental spatiotemporal signal analysis like Fourier transform, power spectrum density measurement, and identifying significant signal peaks. In other studies, hardware acceleration is used to compress and predict the signal for data storage, processing, and transmission. Some of the works include digital filter designing for denoising the acquired signal, while a few of the advanced research projects incorporated reconfigurable architectures to develop artificial bio-organs and high-level prosthesis as a part of rehabilitation. In this chapter, these works will be briefly reviewed to find out the state-of-the-art research trends in this research field. https://www.springer.com/gp/book/9783030549312

Parallel Algorithm Design for Audio Feature Extraction

Conference Paper

Full-text available

Jan 2017

Intelligent automobile auxiliary propagation system based on speech recognition and AI driven feature extraction techniques

Article

Full-text available

Feb 2022
Int J Speech Tech

Haidong Xu

In the recent years, with the rapid development of China's national economy, people's living standards have been greatly improved. People's consumption demand is constantly increasing, and the consumption structure is constantly upgrading. In the automotive industry, which is increasingly related to residents' travel, consumers' demand for cars also presents a rising trend. The production and sales of domestic automobile market in China have increased greatly, and it has become the largest automobile consumer market in the world for eight consecutive years. In the information age, the speed of information interaction is faster and faster. Today, 5G technology has entered the commercial era, and the era of interconnection of everything has come. According to the work report of the Chinese government, the future city should be a smart city. In the background of Internet of things, smart city will greatly facilitate people's life in the way of ecosystem. As part of the future smart city ecosystem, intelligent cars must be self driving vehicles that meet the needs of smart city ecosystems. Therefore, the marketing strategy of smart cars is very important. Voice has always been one of the most concerned research contents in the field of human–computer communication and interaction. The main purpose of automatic speech recognition is to enable the computer to "understand" human speech and convert speech waveform signal into text. Speech recognition technology is one of the key technologies to realize intelligent human–computer interaction. The application of voice, the most natural way of interaction between human and machine, can effectively improve the input efficiency, error prone and other shortcomings of traditional interaction methods. This paper studies the intelligent vehicle marketing strategy based on speech recognition and artificial intelligence driven feature extraction technology. Through the modelling and the comparison simulations, the performance of the designed model is verified.

Music genre classification using spatan 6 FPGA and TMS320C6713 DSK

Conference Paper

Jul 2017

Comparison of Power-Efficiency of Asthmatic Wheezing Wearable Sensor Architectures

Conference Paper

Jul 2017

Power-requirements of a wireless wearable sensor for quantification of asthmatic wheezing in respiratory sounds, a typical symptom of chronic asthma, are analysed. Two converse sensor architectures are compared. One featuring processing-intensive on-board respiratory sound classification, and the other performing communication-intensive signal streaming, employing compressive sensing (CS) encoding for data-rate reduction, with signal reconstruction and classification performed on the peer mobile device. It is shown that lower total sensor power, ranging from 216 to 357 µW, may be obtained on the sensor streaming the CS encoded signal, operating at the compression rate higher than 2x. Total power-budget of 328 to 428 µW is shown required in the architecture with on-board processing.

A comparative study of the SVM and K-nn machine learning algorithms for the diagnosis of respiratory pathologies using pulmonary acoustic signals

Article

Full-text available

Jun 2014
BMC BIOINFORMATICS

Background Pulmonary acoustic parameters extracted from recorded respiratory sounds provide valuable information for the detection of respiratory pathologies. The automated analysis of pulmonary acoustic signals can serve as a differential diagnosis tool for medical professionals, a learning tool for medical students, and a self-management tool for patients. In this context, we intend to evaluate and compare the performance of the support vector machine (SVM) and K-nearest neighbour (K-nn) classifiers in diagnosis respiratory pathologies using respiratory sounds from R.A.L.E database. Results The pulmonary acoustic signals used in this study were obtained from the R.A.L.E lung sound database. The pulmonary acoustic signals were manually categorised into three different groups, namely normal, airway obstruction pathology, and parenchymal pathology. The mel-frequency cepstral coefficient (MFCC) features were extracted from the pre-processed pulmonary acoustic signals. The MFCC features were analysed by one-way ANOVA and then fed separately into the SVM and K-nn classifiers. The performances of the classifiers were analysed using the confusion matrix technique. The statistical analysis of the MFCC features using one-way ANOVA showed that the extracted MFCC features are significantly different (p < 0.001). The classification accuracies of the SVM and K-nn classifiers were found to be 92.19% and 98.26%, respectively. Conclusion Although the data used to train and test the classifiers are limited, the classification accuracies found are satisfactory. The K-nn classifier was better than the SVM classifier for the discrimination of pulmonary acoustic signals from pathological and normal subjects obtained from the RALE database.

Hardware implementation of MFCC feature extraction for respiratory sounds analysis

Conference Paper

Full-text available

May 2013

In this paper, an acoustic feature extraction method based on mel frequency cepstral coefficients (MFCC) was implemented on FPGA for real-time respiratory sound analysis. The proposed technique was implemented using Xilinx System Generator (XSG) in MATLAB/SIMULINK environment. The feature vectors obtained with fixed-point XSG implementation is compared to those obtained with on the floating-point MATLAB one using normal and wheezing respiratory sounds.

FPGA Simulation of Linear and Nonlinear Support Vector Machine

Article

Full-text available

Jan 2011

Simple hardware architecture for implementation of pairwise Support Vector Machine (SVM) classifiers on FPGA is presented. Training phase of the SVM is performed offline, and the extracted parameters used to implement testing phase of the SVM on the hardware. In the architecture, vector multiplication operation and classification of pairwise classifiers is designed in parallel and simultaneously. In order to realization, a dataset of Persian handwritten digits in three different classes is used for training and testing of SVM. Graphically simulator, System Generator, has been used to simulate the desired hardware design. Implementation of linear and nonlinear SVM classifier using simple blocks and functions, no limitation in the number of samples, generalized to multiple simultaneous pairwise classifiers, no complexity in hardware design, and simplicity of blocks and functions used in the design are view of the obvious characteristics of this research. According to simulation results, maximum frequency of 202.840 MHz in linear classification , and classification accuracy of 98.67% in nonlinear one has been achieved, which shows outstanding performance of the hardware designed architecture.

Neural classification of lung sounds using wavelet coefficients

Article

Full-text available

Oct 2004
COMPUT BIOL MED

Electronic auscultation is an efficient technique to evaluate the condition of respiratory system using lung sounds. As lung sound signals are non-stationary, the conventional method of frequency analysis is not highly successful in diagnostic classification. This paper deals with a novel method of analysis of lung sound signals using wavelet transform, and classification using artificial neural network (ANN). Lung sound signals were decomposed into the frequency subbands using wavelet transform and a set of statistical features was extracted from the subbands to represent the distribution of wavelet coefficients. An ANN based system, trained using the resilient back propagation algorithm, was implemented to classify the lung sounds to one of the six categories: normal, wheeze, crackle, squawk, stridor, or rhonchus.

Respiratory sounds classification using cepstral analysis and Gaussian mixture models

Article

Full-text available

Feb 2004

The Cepstral analysis is proposed with Gaussian Mixture Models (GMM) method to classify respiratory sounds in two categories: normal and wheezing. The sound signal is divided in overlapped segments, which are characterized by a reduced dimension feature vectors using Mel-Frequency Cepstral Coefficients (MFCC) or subband based Cepstral parameters (SBC). The proposed schema is compared with other classifiers: Vector Quantization (VQ) and Multi-Layer Perceptron (MLP) neural networks. A post processing is proposed to improve the classification results.

LIBSVM: A library for support vector machines

Article

Jan 2011

Two-level coarse-to-fine classification algorithm for asthma wheezing recognition in children's respiratory sounds

Article

Aug 2015
BIOMED SIGNAL PROCES

The paper proposes a two-layer pattern recognition system architecture for asthma wheezing detection in recorded children's respiratory sounds. The first layer consists of two SVM classifiers specifically designed as a cascade stacked in parallel to emphasize the differences among signals with similar acoustic properties, such as wheezes and inspiratory stridors. The second layer is realized using a digital detection threshold, which further upgrades the proposed structure with the aim of improving the process of wheezing detection. The results were experimentally evaluated on the data acquired from the General Hospital of Dubrovnik, Croatia. Classification results obtained on the test data sets revealed that the central frequency of wheezes included in the training data is important for the success of classification.

Machine learning in lung sound analysis: A systematic review

Article

Sep 2013

Machine learning has proven to be an effective technique in recent years and machine learning algorithms have been successfully used in a large number of applications. The development of computerized lung sound analysis has attracted many researchers in recent years, which has led to the implementation of machine learning algorithms for the diagnosis of lung sound. This paper highlights the importance of machine learning in computer-based lung sound analysis. Articles on computer-based lung sound analysis using machine learning techniques were identified through searches of electronic resources, such as the IEEE, Springer, Elsevier, PubMed and ACM digital library databases. A brief description of the types of lung sounds and their characteristics is provided. In this review, we examined specific lung sounds/disorders, the number of subjects, the signal processing and classification methods and the outcome of the analyses of lung sounds using machine learning methods that have been performed by previous researchers. A brief description on the previous works is thus included. In conclusion, the review provides recommendations for further improvements.

LIBSVM: A library for support vector machines

Article

Jul 2007

LIBSVM is a library for support vector machines (SVM). Its goal is to help users to easily use SVM as a tool. In this document, we present all its imple-mentation details. For the use of LIBSVM, the README file included in the package and the LIBSVM FAQ provide the information.

Pattern recognition methods applied to respiratory sounds classification into normal and wheeze classes

Article

Sep 2009

Mohammed Bahoura

In this paper, we present the pattern recognition methods proposed to classify respiratory sounds into normal and wheeze classes. We evaluate and compare the feature extraction techniques based on Fourier transform, linear predictive coding, wavelet transform and Mel-frequency cepstral coefficients (MFCC) in combination with the classification methods based on vector quantization, Gaussian mixture models (GMM) and artificial neural networks, using receiver operating characteristic curves. We propose the use of an optimized threshold to discriminate the wheezing class from the normal one. Also, post-processing filter is employed to considerably improve the classification accuracy. Experimental results show that our approach based on MFCC coefficients combined to GMM is well adapted to classify respiratory sounds in normal and wheeze classes. McNemar's test demonstrated significant difference between results obtained by the presented classifiers (p<0.05).

FPGA implementation of an automatic wheezes detector based on MFCC and SVM

Abstract and Figures

Recommended publications

Compiling Higher Order Functional Programs to Composable Digital Hardware

Support Vector Machines

Stellar Spectral Classification with Locality Preserving Projections and Support Vector Machine

Research on Rapid Identification Method of Buckwheat Varieties by Near-Infrared Spectroscopy Techniq...