ArticlePDF Available

Separation of Heart Sound Signal from Noise in Joint Cycle Frequency–Time–Frequency Domains Based on Fuzzy Detection

November 2010
IEEE transactions on bio-medical engineering 57(10):2438 - 2447

November 2010
57(10):2438 - 2447

DOI:10.1109/TBME.2010.2051225

Source
IEEE Xplore

Authors:

Hong Tang

Dalian University of Technology

Ting Li

Huazhong University of Science and Technology

Noise is generally unavoidable during recordings of heart sound signal. Therefore, noise reduction is one of the important preprocesses in the analysis of heart sound signal. This was achieved in joint cycle frequency-time-frequency domains in this study. Heart sound signal was decomposed into components (called atoms) characterized by time delay, frequency, amplitude, time width, and phase. It was discovered that atoms of heart sound signal congregate in the joint domains. On the other hand, atoms of noise were dispersed. The atoms of heart sound signal could, therefore, be separated from the atoms of noise based on fuzzy detection. In a practical experiment, heart sound signal was successfully separated from lung sounds and disturbances due to chest motion. Computer simulations for various clinical heart sound signals were also used to evaluate the performance of the proposed noise reduction. It was shown that heart sound signal can be reconstructed from simulated complex noise (perhaps non-Gaussian, nonstationary, and colored). The proposed noise reduction can recover variations in the both waveform and time delay of heart sound signal during the reconstruction. Correlation coefficient and normalized residue were used to indicate the closeness of the reconstructed and noise-free heart sound signal. Correlation coefficient may exceed 0.90 and normalized residue may be around 0.10 in 0-dB noise environment, even if the phonocardiogram signal covers only ten cardiac cycles.

…

Distribution of the atoms is shown in joint plane at cycle frequency 1 Hz. This atom congregate provides sufficient evidence that atoms of enhanced heart sound signal are quasi-cyclostationary.

…

Figures - uploaded by Hong Tang

Content may be subject to copyright.

Content uploaded by Hong Tang

Content may be subject to copyright.

2438 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 57, NO. 10, OCTOBER 2010

Separation of Heart Sound Signal from Noise in Joint

Cycle Frequency–Time–Frequency Domains Based

on Fuzzy Detection

Hong Tang∗, Member, IEEE, Ting Li, Yongwan Park, and Tianshuang Qiu, Member, IEEE

Abstract—Noise is generally unavoidable during recordings of

heart sound signal. Therefore, noise reduction is one of the im-

portant preprocesses in the analysis of heart sound signal. This

was achieved in joint cycle frequency–time–frequency domains in

this study. Heart sound signal was decomposed into components

(called atoms) characterized by time delay, frequency, amplitude,

time width, and phase. It was discovered that atoms of heart sound

signal congregate in the joint domains. On the other hand, atoms

of noise were dispersed. The atoms of heart sound signal could,

therefore, be separated from the atoms of noise based on fuzzy

detection. In a practical experiment, heart sound signal was suc-

cessfully separated from lung sounds and disturbances due to chest

motion. Computer simulations for various clinical heart sound sig-

nals were also used to evaluate the performance of the proposed

noise reduction. It was shown that heart sound signal can be re-

constructed from simulated complex noise (perhaps non-Gaussian,

nonstationary, and colored). The proposed noise reduction can re-

cover variations in the both waveform and time delay of heart

sound signal during the reconstruction. Correlation coefﬁcient and

normalized residue were used to indicate the closeness of the recon-

structed and noise-free heart sound signal. Correlation coefﬁcient

may exceed 0.90 and normalized residue may be around 0.10 in 0-

dB noise environment, even if the phonocardiogram signal covers

only ten cardiac cycles.

Index Terms—Fuzzy detection, heart sound signal, joint cycle

frequency–time–frequency domains, noise reduction.

I. INTRODUCTION

PHONOCARDIOGRAPHY is a noninvasive technique that

is used to check the functioning of heart valves. It is, there-

fore, widely used during medical examinations carried out by

physicians. However, ambient noise and disturbances can cor-

rupt the recorded heart sound signal. The sliding movements of

the stethoscope diaphragms in contact with the patient’s skin,

Manuscript received January 25, 2010; revised April 1, 2010; accepted April

30, 2010. Date of publication June 10, 2010; date of current version September

15, 2010. This work was supported in part by the National Natural Science

Foundation of China under Grant 30570475 and Grant 60872122. Asterisk

indicates corresponding author.

∗H. Tang is with the Department of Biomedical Engineering, Faculty of Elec-

tronic Information and Electrical Engineering, Dalian University of Technology,

Dalian 116024, China (e-mail: tanghong@dlut.edu.cn).

T. Li is with the College of Electromechanical and Information En-

gineering, Dalian Nationalities University, Dalian 116600, China (e-mail:

tracyli78@yahoo.com.cn).

Y. Park is with the Department of Information and Communication, Ye-

ungnam University, Gyeongsangbuk-Do 712-749, Korea (e-mail: ywpark@

yu.ac.kr).

T. Qiu is with the Department of Biomedical Engineering, Dalian University

of Technology, Dalian 116024, China (e-mail: qiutsh@dlut.edu.cn).

Digital Object Identiﬁer 10.1109/TBME.2010.2051225

lung sounds, the muscular activity controlling lung movements,

and surround speech sounds represent a few of the sources of

noise and disturbance affecting the accuracy of data being ac-

quired. These noises usually have high amplitude and last for

only a short period of time. Furthermore, there is considerable

overlap in the time–frequency domains of heart sound signal

and noises. As heart sound signals are transient, they are eas-

ily contaminated with the noises. Consequently, it may become

difﬁcult for physicians to obtain the correct diagnostic informa-

tion by auscultation when noise and disturbances are important.

Noise reduction would allow a quantitative analysis of heart

sound signal and lead to a more reliable diagnosis. The distur-

bance addressed in this paper is the impulsive noise interference,

which is typically characterized by noise pulses of short time

duration.

Over the years, various techniques of noise reduction have

been proposed for different purposes. Some techniques like

adaptive noise cancellation and ﬁltering can be applied to re-

duce noise from heart sound recordings [1], [2]. In particular, it

has been found that noise reduction performed in a time and/or

frequency domain may not be effective for non-Gaussian, non-

stationary, and colored noises. More speciﬁcally, it is considered

inappropriate because heart sound signal and noise overlap in

both time and frequency domains. On the contrary, techniques of

cyclostationary signal processing can reduce noise in the cycle

frequency domain. Beyar et al. [3] proposed to divide recordings

of heat sound signal into a sequence of repetitive cycles. Noise

was reduced simply by summations. However, heart sounds and

murmurs tend to occur with different timings from cycle to cycle

and cannot be totally preserved while noise is suppressed. In our

earlier paper [4], the timings of heart sounds and murmurs were

aligned from one cycle to next cycle by the nonlinear time scal-

ing (NTS). Noise and disturbance were subsequently reduced by

averaging. The results were promising, although segmentation

of the heart sound signal into ﬁrst heart sound (S1) and second

heart sound (S2) is fundamentally needed to determine the pa-

rameters for the NTS. This preprocess can have a detrimental

effect on the efﬁciency of the noise reduction. Its performance

degrades, if segmentation is inaccurate, or if the assumption that

heart sounds are consistent in consecutive cycles is not valid.

To avoid these commonly encountered limitations, we propose

a new noise reduction in this paper that is performed in the joint

cycle frequency–time–frequency domains based on fuzzy de-

tection. Comparing with the earlier study [4], one more domain,

frequency domain, is exploited. This proposed noise reduction

can accommodate variations in both time delay and waveform

TANG et al.: SEPARATION OF HEART SOUND SIGNAL FROM NOISE IN JOINT CYCLE FREQUENCY–TIME–FREQUENCY DOMAINS 2439

of heart sounds, murmurs. No segmentation is needed. On the

other hand, it can be operated in a somewhat automated manner.

The paper is organized as follows. Section II out-

lines the decomposition of heart sound signal into atoms.

Section III focuses on the quasi-cyclostationarity of the atoms. In

Section IV, we propose a fuzzy-detection method to detect atoms

of heart sound signal in the joint plane. Practical experiments

and various computer simulations are described in Section V.

In Section VI, we discuss the results, and in Section VII, ob-

servations regarding performance comparisons are presented.

Finally, Section VIII summarizes our main conclusions.

II. REPRESENTATION OF HEART SOUND SIGNAL IN

TIME–FREQUENCY DOMAINS

A. Decomposition

To our knowledge, several signal models can be found in

literature for the decomposition of heart sound signal, such as

the chirp models [5], [6], the damped sinusoidal models [7], [8],

the modiﬁed Prony models [9], and the Gaussian modulation

model [10]. Leung et al. employed the Gaussian modulation

model to decompose the second heart sound for the diagnosis

of pediatric heart diseases. This model is employed to represent

the heart sound signal of one cardiac cycle

hm(t)=



i=1

amie−(t−tmi)2/(2σ2

mi)cos(2πωmit+βmi)(1)

where hm(t)is the heart sound signal of the mth cycle. Namely,

(1) means that hm(t)is the sum of Lmatoms. Every atom is

characterized by ﬁve parameters: tmi is the time delay of the

ith atom with respect to the start of the mth cycle; ami is the

amplitude; ωmi is the frequency; σmi is the time width that

the atom needs support; βmi is the phase. Therefore, the heart

sound signal of this cycle is represented by the set of atoms

{tmi,a

mi,ω

mi,σ

mi,β

mi,1≤i≤Lm}. The number of atoms,

Lm, and the ﬁve parameters for each atom can be obtained using

short-time Fourier transform (STFT) analysis, as described in

[10].

The STFT of the heart sound signal hm(t)is

H(t, f )=hm(t)w(t−τ)e−2πωτdτ (2)

where w(t) is a Gaussian window. First, the atom with the max-

imum amplitude is identiﬁed by searching the magnitude of the

STFT. More speciﬁcally, the atom with maximum amplitude is

located by detecting the peak in the magnitude of the STFT.

Once the time delay of the atom tmi has been identiﬁed, its

amplitude ami, frequency ωmi, and phase βmi can be read di-

rectly from the STFT. σmi is obtained by the following optimal

procedure.

The waveform represented by the ith atom is

smi(t, σmi)=amie−(t−tmi)2/(2σ2

mi)cos(2πωmit+βmi).

(3)

The signal residue after the waveform of the ith atom is

subtracted from the signal h(m−1)(t)is

hmi(t, σmi)=hm(i−1)(t)−smi(t, σmi)(4)

Fig. 1. Heart sound signal of one cardiac cycle was decomposed into atoms.

(a) Recorded waveform of one cardiac cycle. (b) Atoms on the time–frequency

plane. (c) Reconstructed waveform and residue.

where hm0(t)is the original signal of the mth cycle. The nor-

malized residue energy is

ρmi(σmi)=|hmi(t, σmi)|2dt|hm0(t)|2dt. (5)

Obviously, ρmi(σmi)will be minimum, if a perfect “time

width” σmi is found. The minimum of the residue energy

ρmi(σmi)is, therefore, used as a criterion to optimize σmi.

For this purpose, we monitor ρmi(σmi)by varying σmi within

a predeﬁned range. The decomposition stops, if ρmi(σmi)is

sufﬁciently small. The heart sound signal of the mth cycle can

be reconstructed from the sum of all atoms as

hm(t)≈



i=1

smi(t).(6)

B. Data Acquisition

The normal heart sound signal used in this paper was recorded

in the authors’ laboratory. The male subject (33 years of age)

was asked to lie on his back on an examination bed. The sensor

was directed toward the mitral site. ECG signals and heart sound

signal were recorded simultaneously. It is known that the domi-

nant bandwidth of a heart sound signal is approximately 500 Hz.

To avoid higher frequency noise than 500 Hz, the data was pre-

ﬁltered by low-pass ﬁlter with cutoff frequency 500 Hz. The

sampling rate was set to 2 KHz, which is higher than the min-

imum rate required by the sampling theory. Furthermore, the

laboratory provided an environment, where even minor noise

could be controlled. These low-noise heart sound recordings

allowed us quantiﬁcationally evaluate noise reduction in simu-

lated noise.

C. Simulation

The normal heart sound signal of one single cardiac cycle was

decomposed into 16 atoms, as shown in Fig. 1. The placement

2440 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 57, NO. 10, OCTOBER 2010

Fig. 2. Correspondence between ECG and heart sound signal.

of each atom in the time–frequency plane is shown in Fig. 1(b).

The reconstructed waveform is shown in Fig. 1(c), where the

decomposing was stopped when the normalized residue energy

was less than 0.05. The high correlation coefﬁcient between the

reconstructed and the original waveform (up to 0.98) lead us to

the conclusion that atoms can represent accurately heart sound

signal in the time and frequency domains.

III. QUASI-CYCLOSTATIONARITY OF THE ATOMS

Cardiac cycle is a sequence of events repeated cyclically dur-

ing every heartbeat. Heart sounds (S1–S4) are caused by vibra-

tion of heart valves and/or surrounding heart tissues following

the closing or opening of heart valves, while heart murmurs are

caused by turbulence of blood ﬂow. The pace at which a heart

beats does not change abruptly, and therefore, the heart sound

signal is considered quasi-cyclostationary. Based on this approx-

imation, we assumed that atoms representing heart sound signal

in the time–frequency domains are also quasi-cyclostationary,

namely, if the heart sound signal was cyclostationary and the

cycle frequency corresponded to the reciprocal of cardiac cycle

duration, we expected that the atoms for one cardiac cycle will

superpose the atoms of the next cycle.

However, heart cycle duration varies from cycle to cycle. This

phenomenon, widely known as heart rate variability (HRV),

can be identiﬁed easily in the ECG [11]. The correspondence

between the ECG and heart sound signal is demonstrated in

Fig. 2. The occurrence of an R wave indicates the start of a

new cycle. In order to analyze heart sound signal at speciﬁc

cycle frequency, we perform linear time scaling on heart sound

signal of every cardiac cycle to enhance the cyclostationarity.

For example, the heart sound signal of the mth cycle hm(t)is

linearly time scaled as

m(t)=hm(T0t/Tm)(7)

where T0is the cycle duration used as reference and Tmis

the mth cycle’s duration. The cycle frequency for the enhanced

signal is thus 1/T0. We decompose the enhanced signal of the

mth cycle hl

m(t)into atoms

m(t)=



i=1

mie−(t−tl

mi)2/(2(σl

mi)2)cos(2πωl

mit+βl

mi).

(8)

Fig. 3. Distribution of the atoms is shown in joint plane at cycle frequency

1 Hz. This atom congregate provides sufﬁcient evidence that atoms of enhanced

heart sound signal are quasi-cyclostationary.

The ﬁrst three consecutive cardiac cycles shown in Fig. 2 are

processed as given in (7), where T0=1s. The signals hl

m(t)

with m=1, 2, and 3 are then decomposed into atoms according

to (8), as shown in Fig. 3. The atoms of one cycle almost super-

pose the atoms of another cycle. This atom congregate provides

sufﬁcient evidence that atoms are approximately cyclostation-

ary. The reason why atoms of different cycles do not perfectly

superpose on each other may be variations in the waveform

and time delay of heart sounds, murmurs in consecutive cycles.

Noise reduction could be really robust, if these variations can

be accommodated.

We assume that atoms of heart sound signal are quasi-

cyclostationary. On the other hand, atoms for noise and dis-

turbances are random and their cycle frequency spectrum does

not overlap with that of heart sound signal.

On the basis of this assumption, it is safe to say that atoms

of heart sound signal congregate on the joint cycle frequency–

time–frequency plane. Noise and disturbances are generally ran-

dom in nature. The atoms of noise and disturbances must be

dispersed over the joint plane. The atoms of heart sound signal

and the atoms of noise can thus be easily separated. In extreme

cases where noise is cyclostationary, the atoms of heart sound

signal can still be separated from those of noise, if their cycle

frequency spectra do not overlap [4].

IV. SEPARATION OF HEART SOUND SIGNAL FROM NOISE

BASED ON FUZZY DETECTION

A. Scale Match

It is assumed that the recording of heart sound signal has M

cardiac cycles. The start of each cardiac cycle is indicated by an

R wave, while the atoms for the recording are denoted in the joint

plane as the set of atoms {tl

mi,ω

mi,1≤i≤Ll

m,1≤m≤

M}. The frequency components of heart sounds and murmurs

range from several to ﬁve hundreds hertz [12]; however, the

normal duration of a cardiac cycle is about 850 ms and we have

chosen to normalize the cycle frequency to 1 s for improving

comparisons between cycles and between patients. For example,

in Fig. 3, where the vertical axis ranges from 0 to 200 and

the horizontal axis ranges from 0 to 1, the frequency ωl

mi is

TANG et al.: SEPARATION OF HEART SOUND SIGNAL FROM NOISE IN JOINT CYCLE FREQUENCY–TIME–FREQUENCY DOMAINS 2441

Fig. 4. Noisy heart sound signal and atom distribution. (a) Noisy heart sound

signal. (b) Atom distribution in the joint plane at a cycle frequency of 1 Hz. The

vertical axis is the “scaled frequency”. It is obvious that atoms of heart sound

signal congregate, whereas atoms of noise are more dispersed and can easily be

separated based on density of atoms (or the membership function).

a hundred times that of tl

mi. This mismatch will enable prior

evolution on frequency axis when the detection is operated in

the plane. To avoid this scale mismatch, the frequency ωl

mi is

scaled down so that its values are within the same range as time

delay tl

mi as shown in Fig. 4, where the vertical axis is named

“scaled frequency”.

B. Fuzzy Detection

In accordance with the assumption that atoms of heart sound

signal congregate in the joint cycle frequency–time–frequency

plane, the density of an atom provides an indication of whether

this speciﬁc atom represents a heart sound signal or noise. The

density of an atom is referred to the number of atoms found

within a radius ζaround the atom. For example, the density of

the atom (tl

mi,ω

mi)is deﬁned as

dmi =(number of atoms found with in ζ)

=(tl

mi −tl

nj)2+(ωl

mi −ωl

nj)2≤ζ,

1≤j≤Ll

n,1≤n≤M. (9)

For a given number of cardiac cycles, it is obvious that atoms

of higher density probably represent heart sound signal. We

deﬁne a membership function to detect the atoms that represent

heart sound signal as follows:

A(dmi)=0,d

mi <M

1,d

mi ≥M.(10)

Ideally, the density dmi is equal to the total number of car-

diac cycles M, which serves as a threshold in (10). If the density

is greater than or equal to M, then A(dmi)=1and the atom

(tl

mi,ω

mi)probably represents a heart sound signal. By repeat-

ing the same process across all atoms of the mth cycle, we can

identify those atoms synthesize heart sound signal of this cycle.

Fig. 5. Variations in the time delay of heart sounds S1 and S2 for two different

cycles.

C. Heart Sound Signal Reconstruction

Heart sound signal of the mth cycle hs

m(t)are approximated

by the sum

m(t)=



k=1

mke−(t−tl

mk)2/(2(σl

mk)2)cos(2πωl

mkt+βl

mk)

(11)

where Kl

mis the number of atoms, which meet the condition

A(dmk)=1. The heart sound signal of this cycle are subse-

quently reconstructed through reversed linear time scaling

m(t)=hs

m(Tmt/T0).(12)

Possible reconstruction errors are evaluated using the normal-

ized residue

E=M

m=1 Tm

0|hr

m(t)−hm(t)|2dt

M

m=1 Tm

0|hm(t)|2dt(13)

where Tmis the cycle duration of the mth cycle.

D. Parameter ζ

According to our assumption, the atoms of one cycle should

superpose perfectly on the atoms of another cycle, if heart sound

signal was cyclostationary and the radius ζwould be zero. How-

ever, the parameter ζis closely related to variations in the time

delay of heart sounds and murmurs. In order to determine this

parameter for the phonocardiographic signal recorded, we in-

vestigate the variations in the time delay of heart sounds S1 and

S2, as shown in Fig. 5. According to Tang et al. [4], heart sound

waveform is highly consistent between consecutive cardiac cy-

cles. Variations in the time delay of heart sounds of the mth

cycle with respect to the ﬁrst cycle can, therefore, be estimated

based on the minimum mean square error criterion

rS1

m= arg min

rS1

mEs11(t)−sm1(t−ζS1

m)2,m≥2

(14)

and

rS2

m= arg min

rS2

mEs12(t)−sm2(t−ζS2

m)2,m≥2

(15)

2442 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 57, NO. 10, OCTOBER 2010

TAB LE I

STATISTICS OF THE PARAMETER ZETA FOR VARIOUS CLINICAL HEART

SOUND SIGNAL

where rS1

mand rS2

mdenote variations in the time delay of S1 and

S2 of the mth cycle with respect to the ﬁrst cycle, respectively.

The span ζS1of rS1

mcan be calculated as

ζS1= max(rS1

m)−min(rS1

m)(16)

where max(·) and min(·) are the maximum and minimum oper-

ators, respectively. Similarly, the span ζS2of rS2

mcan be calcu-

lated as

ζS2= max(rS2

m)−min(rS2

m).(17)

The values of ζS1and ζS2for various clinical heart sound

signals are listed in Table I, for which it is obvious that ζS1

and ζS2vary for different cases. They also vary for the same

subject at different periods of time. Although there is a great in-

terest in the mechanism underlying these variations, it is beyond

the scope of this paper. The number of cases investigated (see

Table I) is limited. Still, the selected sample provides sufﬁcient

evidence that the variation of time delay of heart sound can

vary from several to dozens of milliseconds between consecu-

tive cardiac cycles. We followed the same approach to calculate

the differences in the time delay of the third heart sound ζS3,the

fourth heart sound ζS4, as well as heart murmurs ζmurmur.For

all heart sounds and murmurs, the parameter ζshould, however,

be chosen as the maximum variation. For example, it may be

chosen equal to ζ=0.039 s (the greater one of ζS1and ζS2)

for a healthy subject (see Table I). It should be noted that the

statistics included in Table I are obtained with low-noise heart

sounds. Parameter ζmay be larger, if the heart sound signal is

contaminated with noise. Without any prior knowledge about

the noise strength, parameter ζmay be chosen experimentally.

From the statistics compiled by the authors for a wide range

of clinical cases, noise reduction to normal rhythm heart sound

signal can be achieved when parameter ζis set in the range

[0.02–0.06]. It may be larger in cases of arrhythmia due to the

high variations in the time delay of heart sounds, murmurs. In

other words, parameter ζis related to the cyclostationarity of

heart sound signal, as discussed in Section VI-A. To some ex-

tent, the proposed noise reduction can accommodate variations

in the time delays. This accommodation leads that the variations

in the waveform can also be recovered in time domain. It is one

of main advantages of the proposed noise reduction.

Fig. 6. Lung sounds, chest motion contaminating heart sound signal. (a) Con-

taminated heart sound signal. (b) Atoms distribution in the joint plane. (c) Re-

constructed heart sounds. (d) Signal residues, which is the sum of lung sounds

and slide sounds (indicated by ellipses). The latter are sounds produced by the

lungs during breathing or by the muscular activity controlling lung movements.

E. Summary

The proposed noise reduction consists of the following steps.

Step 1: ECG signal and heart sound signal are synchronously

recorded.

Step 2: The beginning of individual cardiac cycles is identiﬁed

based on the occurrence of R waves. If the corresponding ECG

is not available, other signals (for example, wrist pulse, carotid

pulse) as well as cycle detection algorithms may be employed

to divide the phonocardiographic signal into cycles.

Step 3: To ensure that every cardiac cycle has the equal cycle

duration, the linear time scaling is carried out, according to

(7). The enhanced heart sound signal of each cardiac cycle is

decomposed into atoms, according to (8).

Step 4: The fuzzy detection is applied to identify on the

joint plane the atoms, which synthesize each heart sounds and

murmurs, as described in Sections IV-A and IV-B.

Step 5: Heart sound signal is reconstructed as in Section IV-C.

V. P RACTICAL EXPERIMENT AND COMPUTER SIMULATIONS

A. Separation of Heart Sound Signal From Lung Sounds and

Chest Motion

The experiment was performed on the same subject climbing

a staircase of 100 steps, who immediately after step climbing

laid on his back on an examination bed. During the recording,

the respiration was fast and, as expected, lung sounds and the

chest motion contaminated the phonocardiographic recordings,

TANG et al.: SEPARATION OF HEART SOUND SIGNAL FROM NOISE IN JOINT CYCLE FREQUENCY–TIME–FREQUENCY DOMAINS 2443

Fig. 7. Simulated noise and disturbances. (a) Simulated noise along with ran-

domly timed disturbances. (b) Spectrum of the simulated noise and disturbance,

which overlaps the frequency spectrum of heart sounds.

as shown in Fig. 6(a). Twelve cycles of heart sound signal were

submitted to the noise-reduction algorithm. Informal listening

by the authors shows that heart sounds can often be buried by

heavy lung sounds. Atoms distribution on the joint plane at a

cycle frequency of 1 Hz is shown in Fig. 6(b), where the atoms

indicated by prisms were identiﬁed for heart sounds. The re-

constructed heart sounds are given in Fig. 6(c). It is seen that

the recovered heart sounds become clear. S1s and S2s are easily

distinguished. The separated noise and disturbance are shown

in Fig. 6(d), which may comprise lung sounds and stethoscope

sliding disturbances (indicated by ellipses) due to chest mo-

tion. These sliding disturbances have high amplitude, short time

duration, and are randomly timed. The main frequency compo-

nents of heart sounds overlap those of lung sounds, as seen in

the joint plane in Fig. 6(b). It can be concluded that the noise

and disturbances in this case are non-Gaussian, nonstationary,

and colored.

B. Simulated Noise and Disturbance

In order to evaluate the proposed noise reduction for more

clinical heart sound signal, we simulate noise and disturbances

by using the model

v(t)=H(v1(t)+v2(t)) (18)

where v1(t)is a white double-side exponential distribution, and

v2(t)are randomly timed disturbances. H(·) is a band-pass ﬁl-

ter, whose frequency response band overlaps that of the heart

sound signal. One realization of the simulated noise is shown in

Fig. 7(a). Its spectrum is shown in Fig. 7(b).

C. Reduction of Simulated Noise From Normal Heart Sounds

A low-noise heart sound signal from the normal subject is

contaminated by the simulated non-Gaussian, nonstationary,

and colored noise. Twelve cycles of heart sound signal were

submitted to the noise-reduction algorithm. The recording plus

Fig. 8. Noise reduction for normal heart sound signal. (a) Low-noise heart

sound signal and the corresponding ECG. (b) Heart sounds contaminated with

simulated noise. Disturbances are indicated by an ellipse. (c) Atoms in the joint

plane at cycle frequency of 1 Hz. (d) Reconstructed heart sounds.

the simulated noise is given in Fig. 8(b). The SNR is 0 dB. We see

that S1s are almost completely buried by noise. The noise and

disturbances are so heavy that heart sounds cannot be identiﬁed

by the human eye. The scatter plot in the joint plane at a cycle

frequency of 1 Hz is shown in Fig. 8(c). The atoms represented

by prisms are found to form heart sounds based on fuzzy detec-

tion and the remaining atoms, represented by circles, are found

to form noise and disturbances. The reconstructed heart sounds

are shown in Fig. 8(d). It is relatively easy for the authors to

identify the heart’s rhythm from the reconstructed signal just by

listening. The correlation coefﬁcient between the reconstructed

and the original signal is 0.92, and the normalized residue is

0.12.

D. Reduction of Simulated Noise Form Heart Sound Signal

With Aortic Valve Stenosis

The noise reduction was also applied on heart sound signal of

patients with aortic valve stenosis. The noise-free data, whose

sampling rate is 22 050 Hz, were downloaded from the website

of the School of Medicine of the University of Dundee. The sam-

pling frequency was scaled down to 2 KHz, where a low-pass

ﬁlter with cutoff frequency 1 Hz was used to prevent frequency

aliasing. The heart sound signal has long, heavy systolic mur-

murs, as shown in Fig. 9(a). The signal was contaminated by

the simulated noise, as shown in Fig. 9(b). Some disturbances,

indicated by an ellipse, overlap the murmurs. The SNR is 0 dB.

The distribution of atoms on the joint plane at a cycle frequency

of 1 Hz is shown in Fig. 9(c). The atoms, indicated by prisms,

2444 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 57, NO. 10, OCTOBER 2010

Fig. 9. Noise reduction for heart sound signal with aortic valve stenosis.

(a) Low-noise recording. (b) Heart sounds contaminated with simulated noise.

Some disturbances indicated by an ellipse overlap heart murmurs in time do-

main. (c) Atoms in the joint plane at cycle frequency of 1 Hz. (d) Reconstructed

heart sound signal.

are identiﬁed to form heart sounds by fuzzy detection. The re-

constructed heart sound signal, shown in Fig. 9(d), seems to

be closer to the noise-free heart sound signal. The correlation

coefﬁcient between the reconstructed and original signal is 0.90

and the normalized residue 0.18.

E. Reduction of Simulated Noise From Heart Sound Signal With

the Fourth Heart Sound (S4)

The fourth example involves the reduction of noise from heart

sound signal with the fourth heart sounds (S4), as shown in

Fig. 10(a). The noise-free data, whose sampling frequency was

22050 Hz, was downloaded from the website of the Medicine

Department of the University of Dundee. We scaled the sampling

frequency down to 2 KHz, where the technology of preventing

frequency aliasing was also used. S4 generally has low ampli-

tude and is easily buried by noise. In this example, we show the

robustness of the proposed noise reduction in recovering S4 in

heavy noise and disturbance environments. The heart sound sig-

nal contaminated by the simulated noise is shown in Fig. 10(b).

The SNR is 0 dB. The S4 sounds can hardly be perceived by

the human eye. The disturbances, indicated by ellipses, are ran-

domly timed and cause the disordering of rhythms. The authors

could hardly observe the occurrence of S4 s. The distribution of

the atoms in the joint plane at a cycle frequency of 1 Hz is shown

in Fig. 10(c). The fuzzy detection identiﬁes the atoms of heart

sound signal, which are represented by prisms. The remaining

atoms, represented by circles, are identiﬁed from noise and dis-

turbances. The reconstructed heart sound signal are shown in

Fig. 10. Noise reduction for heart sound signal with the fourth heart sound.

(a) Low-noise recording. (b) Heart sound signal contaminated with simulated

noise. Disturbances are indicated by an ellipse. (c) Atoms in the joint plane at

cycle frequency of 1 Hz. (d) Reconstructed heart sounds.

Fig. 10(d). It is seen that the reconstructed waveform shows a

high level of approximation to the original waveform. The cor-

relation coefﬁcient between the reconstructed and the original

waveforms is up to 0.98, and the normalized signal residue is as

low as 0.05.

VI. DISCUSSION

A. Cyclostationarity Strength

The atoms of heart sound signal can be easily identiﬁed

by fuzzy detection because heart sound signal are quasi-

cyclostationary. The atoms of heart sound signal congregate

in the joint plane at a speciﬁc cycle frequency. The performance

of noise reduction is thus directly related to the cyclostationar-

ity strength of heart sound signal. We may safely conclude that

as the cyclostationary of heart sound signal increases, the level

of noise reduction increases. Ideally, the atoms of heart sound

signal superpose, if the recording is perfectly cyclostationary.

However, it is impossible for heart sound signal to be perfect

cyclostationary because of variations in the waveform and time

delay of heart sounds, murmurs between different cardiac cy-

cles. One question that arises is how to quantitatively evaluate

the cyclostationarity strength for given heart sound signals.

We introduce cyclic statistics to assess the cyclostationarity

strength of heart sound signals. ρα

x(f)is the cyclic spectral

coherence function, which is deﬁned as

ρα

x(f)= Sα

x(f)

[S0

x(f+α/2)S0

x(f−α/2)]1/2(19)

TANG et al.: SEPARATION OF HEART SOUND SIGNAL FROM NOISE IN JOINT CYCLE FREQUENCY–TIME–FREQUENCY DOMAINS 2445

TAB LE I I

CYCLOSTATIONARITY STRENGTH

where Sα

x(f)is the cyclic spectrum of signal x(t) at the cycle

frequency α.Sα

x(f)is written as

Sα

x(f)=+∞

−∞

Rα

x(τ)e−j2πfτ dτ (20)

and

Rα

x(τ)=

x(t)x(t+τ)e−j2παtdt. (21)

Rα

x(τ)is the cyclic correlation function. Rα

x(τ)degrades to a

traditional correlation when the cycle frequency αis zero.

In (19), ρα

x(f)represents a second-order cyclic statistics that

shows the cyclostationarity strength of a signal at a given cycle

frequency α. The larger the amplitude of ρα

x(f)is, the higher

the degree of cyclostationarity of the signal is. However, a heart

sound signal is not completely cyclostationary. It is partly sta-

tionary. ρ0

x(f)indicates the stationarity strength. In order to

evaluate the relative strength, we calculate the rate

γx(α)= ρα

x(f)df

ρ0

x(f)df +ρα

x(f)df .(22)

It is clear that γx(α)is 0.5, if the cyclostationarity strength is

equal to the stationarity strength. γx(α)is always less than or

equal to 1. In order to study how the cyclostationarity strength

affects noise reduction, we calculate γx(α)for the normal heart

sound signal shown in Fig. 8, heart sound signal with aortic

stenosis shown in Fig. 9, and the heart sound signal with S4

shown in Fig. 10 at a cycle frequency α=1 Hz. The rates are

listed in Table II. It is found that γAS(1) has the lowest value,

γFHS(1) has the highest value, and γNHS(1) has an intermediate

value. This means that the heart sound signal shown in Fig. 9

has the lowest cyclostationarity, the heart sound signal shown

in Fig. 10 has the highest cyclostationarity, and the heart sound

signal shown in Fig. 8 has medium cyclostationarity. These

results imply that the noise reduction shown in Fig. 10 is the

best and the noise reduction shown in Fig. 9 is the worst. These

results are conﬁrmed by comparing them with the results of

our simulations. The correlation coefﬁcients for the heart sound

signals shown in Figs. 8 and 9 are 0.92 and 0.90, respectively,

and the normalized residues are 0.12 and 0.18, respectively.

However, the correlation coefﬁcient and normalized residue for

Fig. 10 are 0.98 and 0.05, respectively.

It is well known that HRV is unavoidable for a live sub-

ject, namely, there are no heart sound signals that are perfectly

cyclostationary. We employ linear time scaling to enhance cy-

clostationarity, as shown in (7). However, this is not sufﬁcient

to align the timings of the heart sounds and murmurs. To further

enhance cyclostationarity, some special techniques may be em-

ployed, such as NTS proposed in our earlier paper [4]. Another

TABLE III

INFLUENCE OF THE NUMBER OF CYCLES

factor that can affect cyclostationarity is the variations in the

waveform. In the past, there have been few studies that have

used waveform improvement techniques to enhance cyclosta-

tionarity.

B. Inﬂuence of the Number of Cycles

As shown in the simulations, the quasi-cyclostationarity of

heart sound signal can be indicated by the density of atoms

in the joint plane at a speciﬁc cycle frequency. The quasi-

cyclostationarity may be sufﬁcient as the number of cycles in-

creases. The density of atoms is thus a suitable indicator for

the detection of heart sound signal. To evaluate the performance

of the noise reduction with respect to the number of cycles,

we perform independent noise reduction simulations for nor-

mal heart sound signal in 0-dB environments when the number

of cycles is 5, 10, 20, and 30. The statistics of performance

indicators are listed in Table III. We see that the correlation co-

efﬁcient between the reconstructed and the noise-free signal is

0.90 and the normalized signal residue is 0.15 when only ﬁve

cardiac cycles are covered. As the number of cycle increases,

the performance improves. However, the performance does not

improve linearly with respect to the number of cycles. It does not

show any improvement as the number of cycles increases from

20 to 30. We drew the following conclusion from this absence of

improvement: cyclostationarity cannot be always improved by

increasing the number of cycles. Therefore, there is a limit to the

degree to which the performance can be improved by increasing

the number of cycles. This proves that noise reduction mainly

depends on the cyclostationarity strength. The statistics show

that an acceptable performance can be achieved, if the number

of cycles is selected to be 10.

VII. PERFORMANCE COMPARISONS

A. Short Description of Our Earlier Study

In our earlier study [4], we made the assumption that heart

sound waveforms are consistent between consecutive cardiac

cycles. NTS was proposed to minimize variations in the timing

of heart sounds, murmurs and ultimately allow the exploita-

tion of the cyclostationarity features of the signal. In the ideal

case that NTS was successful, the enhanced heart sound signal

could be considered to be cyclostationary. Noise was reduced

by obtaining the average of consecutive cardiac cycles, namely,

it operated in joint cycle frequency–time domains. We called

this technique NR-NTS. It is theoretically insensitive to: 1) sta-

tionary noise; 2) zero-mean noise; and 3) cyclostationary noise,

whose cycle frequency does not coincide with that of heart sound

signal. However, ﬁrst, heart sounds need to be segmented into

ﬁrst and second heart sounds in order to determine the parame-

ters for NTS. If segmentation is inaccurate or if the assumption

2446 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 57, NO. 10, OCTOBER 2010

TAB LE I V

PERFORMANCE COMPARISONS

that heart sounds are consistent in consecutive cycles is not

valid, this preprocessing phase can have a detrimental effect on

the efﬁciency of the NR-NTS.

B. Comparison

The noise reduction proposed in this paper is referred to as

NR-FD, since it is based on fuzzy detection. It is performed

in the joint cycle frequency–time–frequency domains, namely,

one more domain, frequency domain, is exploited. To compare

the performance of the two algorithms, we applied them on

the same normal heart sound signal in simulated noise environ-

ments. Correlation coefﬁcients between the reconstructed and

the noise-free are listed in Table IV.

Based on the assumption that heart sound waveforms are con-

sistent between consecutive cardiac cycles [4], the efﬁciency of

NR-NTS is improved by increasing the number of cycles. How-

ever, this assumption holds for ten cardiac cycles or so, and

while noise is reduced, variations in the heart sound waveform

become more prominent. As listed in Table IV, the correla-

tion coefﬁcient decreased where the number of cardiac cycles

increased from 10 to 30 regardless of whether the signal was

contaminated with 0 dB or −5 dB noise.

Unlike NR-NTS, NR-FD can accept, to some degree, the

variations in the waveform and time delay of the heart sounds

because of the use of fuzzy detection. Both these variations

can be recovered during the reconstruction of the heart sound

signal, and no segmentation is needed as preprocess for NT-FD,

leading us to the conclusion that NR-FD is more robust than

NR-NTS. The performance of NR-FD does not degrade when

heart sound signal covers more than ten cardiac cycles. As a

consequence, the correlation coefﬁcients obtained with NR-FD

are signiﬁcantly better than those obtained with NR-NTS in 0

dB noise (see Table IV). These features make NR-FD suitable

for the automated analysis of heart sounds. We note, however,

that the correlation coefﬁcients obtained with NR-FD are not

signiﬁcantly higher than those obtained with NR-NTS in −5dB

noise. The reason behind this performance may be that atoms

of heart sound signal are diluted on the joint plane due to the

presence of excessive noise.

VIII. CONCLUSION

In this study, noise reduction for heart sound signal has been

achieved in the joint cycle frequency–time–frequency domains.

The beginning of cardiac cycles is identiﬁed based on the oc-

currence of R waves, and subsequently, each heart sound signal

of a cardiac cycle is linearly time scaled and decomposed into

atoms by means of a Gaussian modulation model. The atoms of

heart sound signal congregate on the joint plane. In contrast, the

atoms of noise and disturbances are dispersed in the joint plane.

The atoms of heart sound signal can then be easily identiﬁed

based on fuzzy detection. Variations in both the waveform and

time delay of the heart sound signal can be recovered during the

reconstruction. In practical experiment, heart sound signal was

successfully separated from lung sounds and stethoscope sliding

disturbance produced by chest muscular activities during breath-

ing. Computer simulations are used to test the performance of

the proposed noise reduction for various heart sound signals

contaminated with simulated non-Gaussian, nonstationary, and

colored noise. The statistics show that its performance may be

maintained even for short heart sound signal covering only ﬁve

cardiac cycles. Furthermore, the algorithm can be operated in a

rather unsupervised manner.

ACKNOWLEDGMENT

The authors would like to thank the data collectors. The au-

thors would also like to thank the anonymous reviewers for their

valuable comments. The heart sound data used in this paper were

downloaded from the website of the Medicine Department, Uni-

versity of Dundee, Texas Heart Institute at St. Luke’s Episcopal

Hospital.

REFERENCES

[1] Y. Bai and C. Lu, “The embedded digital stethoscope uses the adaptive

noise cancellation ﬁlter and the type I Chebyshev IIR bandpass ﬁlter to

reduce the noise of the heart sound,” in Proc. 7th Int. Workshop Enter-

prise Netw. Comput. Healthcare Ind. Busan, Korea, Jun. 23–25, 2005,

pp. 278–281.

[2] A. S. Paul, E. A. Wan, and A. T. Nelson, “Noise reduction for heart

sounds using a modiﬁed minimum-mean squared error estimator with

ECG gating,” in Proc. 28th Annu. Int. Conf. IEEE Eng. Med. Biol. Soc.

New York, Aug. 30–Sep. 3, 2006, pp. 3385–3390.

[3] R. Beyar, S. Levkovitz, S Braun, and Y Palti, “Heart-sound processing by

average and variance calculation – physiologic basic and clinical impli-

cations,” IEEE Trans. Biomed. Eng., vol. BME-31, no. 9, pp. 591–596,

Sep. 1984.

[4] H. Tang, T. Li, and T. Qiu, “Noise and disturbance reduction in cycle

frequency domain based on non-linear time scaling,” IEEE Trans. Biomed.

Eng., vol. 57, no. 2, pp. 325–333, Feb. 2010.

[5] J. Xu, L. Durand, and P. Pibarot, “Nonlinear transient chirp signal model-

ing of the aortic and pulmonary components of the second heart sound,”

IEEE Trans. Biomed. Eng., vol. 47, no. 7, pp. 1328–1335, Jul. 2000.

[6] J. Xu, L. Durand, and P. Pibarot, “Extraction of the aortic and pulmonary

components of the second heart sound using a nonlinear transient chirp

signal model,” IEEE Trans. Biomed. Eng., vol. 48, no. 3, pp. 277–283,

Mar. 2001.

[7] A. Baykal, Y. Z. Ider, and H. Koymen, “Distribution of aortic mechanical

prosthetic valve closure sound model parameters on the surface of the

chest,” IEEE Trans. Biomed. Eng., vol. 42, no. 4, pp. 358–370, Apr. 1995.

[8] H. Koymen, B. K. Altay, and Y. Z. Ider, “A study of prosthetic heart valve

sounds,” IEEE Trans. Biomed. Eng., vol. BME-34, no. 11, pp. 853–863,

Nov. 1987.

[9] H. P. Sava and J. T. E. McDonnell, “Modiﬁed forward-backward overde-

termined Prony’s method and its application in modeling heart sounds,”

IEE Proc.—Vis. Image Signal Process, vol. 142, no. 6, pp. 375–380, Jun.

1995.

[10] T. S. Leung, P. R. White, W. B. Cook et al., “Analysis of the second heart

sound for diagnosis of paediatric heart disease,” IEEE proc.-Sci. Meas.

Technol., vol. 145, no. 6, pp. 285–290, Jun. 1998.

[11] A. K. Baros and N. Ohnishi, “Heart instantaneous frequency (HIF): An

alternative approach to extract heart rate variability,” IEEE Trans. Biomed.

Eng., vol. 48, no. 8, pp. 850–855, Aug. 2001.

[12] S. M. Debbal and F. Bereksi-Reguig, “Computerized heart sounds analy-

sis,” Comput. Biol. Med., vol. 38, pp. 263–280, 2008.

TANG et al.: SEPARATION OF HEART SOUND SIGNAL FROM NOISE IN JOINT CYCLE FREQUENCY–TIME–FREQUENCY DOMAINS 2447

Hong Tang (M’10) received the B.S. degree

in mechanical manufacture and automation from

Zhongyuan University of Technology, Zhengzhou,

Henan, China, in 2000, the M.S. degree in biomed-

ical engineering from Jilin University, Changchun,

Jilin, China, in 2003, and the Ph.D. degree in sig-

nal processing from the Dalian University of Tech-

nology (DUT), Dalian, Liaoning, China, in 2006,

respectively.

From 2006 to 2007, he was a Research Fellow

in the Regional Innovation Center, Yeungnam Uni-

versity, Korea. He is currently an Assistant Professor at DUT. His research

interests include non-Gaussian signal processing, biomedical signal processing,

and wireless location.

Ting Li received the B.S. degree in electronic en-

gineering, and the M.S. degree in signal process-

ing, both from the Dalian University of Technol-

ogy (DUT), Dalian, China, in 2002 and 2005,

respectively.

She is currently an Assistant Professor at Dalian

Nationalities University, Dalian. Her research in-

terests include non-Gaussian signal processing and

biomedical signal processing.

Yongwan Park received the B.E. and M.E. degrees

in electrical engineering from Kyungpook University,

Daegu, Korea, in 1982 and 1984, respectively, and the

M.S. and Ph.D. degrees in electrical engineering from

the State University of New York, Buffalo, in 1989

and 1992, respectively.

From 1992 to 1993, he was a Research Fellow at

the California Institute of Technology. From 1994 to

1996, he was a Chief Researcher at SK Telecom, Ko-

rea, where he was involved in developing IMT-2000

system. Since September 1996, he has been a Pro-

fessor of information and communication engineering at Yeungnam University,

Gyeongsan, Korea. His current research interests include beyond 3G/4G sys-

tem, orthogonal frequency-division multiplexing system, peak-to-average ratio

reduction, and biomedical signal processing, etc.

Tianshuang Qiu (M’97) received the B.S. degree

from Tianjin University, Tianjin, China, in 1983, the

M.S. degree from Dalian University of Technology,

Dalian, China, in 1993, and the Ph.D. degree from

Southeastern University, Nanjing, China, in 1996, all

in electrical engineering.

He was a Research Scientist at Dalian Institute

of Chemical Physics, Chinese Academy of Sciences,

Dalian during 1983 through 1996. He was with the

faculty of electrical engineering, Dalian Railway In-

stitute during 1996. He was a Postdoctoral Researcher

in the Department of Electrical Engineering, Northern Illinois University,

DeKalb. He is currently a Professor in the Department of Electronic Engineering,

Dalian University of Technology. His research interests include non-Gaussian

and nonstationary signal processing, radio-frequency signal processing, and

biomedical signal processing.

A New Defect Diameter Prediction using Heart Sound and Possibility to Implement as IoT Healthcare

Article

Full-text available

Aug 2023
MOBILE NETW APPL

Healthcare facilities for diagnosing congenital heart defects (CHD) in archipelagic areas such as Indonesia face a tradeoff between the number of instruments and reducing costs. IoT is a perfect solution for remote healthcare due to its potential for low-cost development at scale. This paper describes a breakthrough solution for sizing defects by decoding information from heart sounds. The heart sounds contain information that can be translated into features through exact or heuristic methods used in previous work on CHD. This potential can be further revealed by the heuristic method proposed in this paper. Moreover, heart sound recording technology is well-established and available in the market at low prices. The proposed method decodes the information puzzle in heart sounds by using a new feature extraction process that converts the feature into atoms. Our method is executed in several stages. First, the heart sound signal is divided into two parts according to the systole and diastole intervals in each cardiac cycle. Second, we extract features using correlations among segments, which are the cross-correlation between systole and diastole segments and autocorrelation among diastole segments. Both processes generate eigenvalues for creating atoms in a planar plane. The last step is candidate selection for sizing CHD, which is determined by the Euclidian distance of atoms to the center of gravity (COG). We use a reference size as the baseline and threshold for the smallest Root Mean Square Error (RMSE) to speed up computation. After some training, we found that the Euclidian mean between atoms COG is the best candidate with RMSE < 0.5. Therefore, this method is called Average Distance Scattered Atoms of Eigenvalues (ADSAE). We conducted experiments on 30 samples and compared our results with Space Vector Machine (SVM), Fuzzy Clustering (FC), and Eclipse Method (EM) in terms of accuracy and F1 scores involving small tolerances. We found that ADSAE was superior to SVM, FC, and EM, especially for small defect sizes. However, for large defects, SVM was the most superior, followed by FC, ADSAE, and EM. ADSAE is limited to sizing only and does not have the capability to determine the shape and depth of defects. Nevertheless, ADSAE is a simple method suited for massive IoT devices for CHD healthcare.

An End-to-End Deep Learning Framework for Real-Time Denoising of Heart Sounds for Cardiac Disease Detection in Unseen Noise

Article

Full-text available

Jan 2023

Objective: The heart sound signals captured via a digital stethoscope are often distorted by environmental and physiological noise, altering their salient and critical properties. The problem is exacerbated in crowded low-resource hospital settings with high noise levels which degrades the diagnostic performance. In this study, we present a novel deep encoder-decoder-based denoising architecture (LU-Net) to suppress ambient and internal lung sound noises. Methods: Training is done using a large benchmark PCG dataset mixed with physiological noise, i.e., breathing sounds. Two different noisy datasets were prepared for experimental evaluation by mixing unseen lung sounds and hospital ambient noises with the clean heart sound recordings. We also used the inherently noisy portion of the PASCAL heart sound dataset for evaluation. Results: The proposed framework showed effective suppression of background noises in both unseen real-world data and synthetically generated noisy heart sound recordings, improving the signal-to-noise ratio (SNR) level by 5.575 dB on an average using only 1.32 M parameters. The proposed model outperforms the current state-of-the-art U-Net model with an average SNR improvement of 5.613 dB and 5.537 dB in the presence of lung sound and unseen hospital noise, respectively. LU-Net also outperformed the state-of-the-art Fully Convolutional Network (FCN) by 1.750 dB and 1.748 dB for lung sound and unseen hospital noise conditions, respectively. In addition, the proposed denoising method model improves classification accuracy by 38.93% in the noisy portion of the PASCAL heart sound dataset. Conclusion: The results presented in the paper indicate that our proposed architecture demonstrated a robust denoising performance on different datasets with diverse levels and characteristics of noise. Significance: The proposed deep learning-based PCG denoising approach is a pioneering study that can significantly improve the accuracy of computer-aided auscultation systems for detecting cardiac diseases in noisy, low-resource hospitals and underserved communities.

Monaural cardiopulmonary sound separation via complex-valued deep autoencoder and cyclostationarity

Article

Full-text available

Mar 2023

Objective: Cardiopulmonary auscultation is promising to get smart due to the emerging of electronic stethoscopes. Cardiac and lung sounds often appear mixed at both time and frequency domain, hence deteriorating the auscultation quality and the further diagnosis performance. The conventional cardiopulmonary sound separation methods may be challenged by the diversity in cardiac/lung sounds. In this study, the data-driven feature learning advantage of deep autoencoder and the common quasi-cyclostationarity characteristic are exploited for monaural separation. Approach: Different from most of the existing separation methods that only handle the amplitude of short-time Fourier transform (STFT) spectrum, a complex-valued U-net (CUnet) with deep autoencoder structure, is built to fully exploit both the amplitude and phase information. As a common characteristic of cardiopulmonary sounds, quasi-cyclostationarity of cardiac sound is involved in the loss function for training. Main results: In experiments to separate cardiac/lung sounds for heart valve disorder auscultation, the averaged achieved signal distortion ratio (SDR), signal interference ratio (SIR), and signal artifact ratio (SAR) in cardiac sounds are 7.84 dB, 21.72 dB, and 8.06 dB, respectively. The detection accuracy of aortic stenosis can be raised from 92.21% to 97.90%. Significance: The proposed method can promote the cardiopulmonary sound separation performance, and may improve the detection accuracy for cardiopulmonary diseases.

An End-to-end Deep Learning Framework for Real-Time Denoising of Heart Sounds for Cardiac Disease Detection in Unseen Noise

Preprint

Full-text available

Jan 2023

p>Objective: The heart sound signals captured via a digital stethoscope are often distorted by environmental and physiological noise, altering their salient and critical properties. The problem is exacerbated in crowded low-resource hospital settings with high noise levels which degrades the diagnostic performance. In this study, we present a novel deep encoder-decoder based denoising architecture (LU-Net) to suppress ambient and internal lung sound noises. Methods: Training is done using a large benchmark PCG dataset mixed with physiological noise, i.e., breathing sounds. Two different noisy datasets were prepared for experimental evaluation by mixing unseen lung sounds and hospital ambient noises with the clean heart sound recordings. We also use the inherently noisy portion of the PASCAL heart sound dataset for evaluation. Results: The proposed framework showed effective suppression of background noises in both un?seen real-world data and synthetically generated noisy heart sound recordings, improving the signal-to-noise ratio (SNR) level by 5.575 dB on an average using only 1.32 M parameters. The proposed model outperforms the current state-of-the-art U-Net model with an average SNR improvement of 5.613 dB and 5.537 dB in the presence of lung sound and unseen hospital noise, respectively. LU-Net also outperformed the state-of-the-art Fully Convolutional Network (FCN) by 1.750 dB and 1.748 dB for lung sound and unseen hospital noise conditions, respectively. In addition, the proposed denoising method model improves classification accuracy by 38.93% in the noisy portion of the PASCAL heart sound dataset. Conclusion: The results presented in the paper indicate that our proposed architecture demonstrated a robust denoising performance on different datasets with diverse levels and characteristics of noise. Significance: The proposed deep learning-based PCG denoising approach is a pioneering study that can significantly improve the accuracy of computer-aided auscultation systems for detecting cardiac diseases in noisy, low-resource hospitals and underserved communities. </p

Real-Time Implementation of a Frequency Shifter for Enhancement of Heart Sounds Perception on VLIW DSP Platform

Article

Full-text available

Oct 2023

Auscultation of heart sounds is important to perform cardiovascular assessment. External noises may limit heart sound perception. In addition, heart sound bandwidth is concentrated at very low frequencies, where the human ear has poor sensitivity. Therefore, the acoustic perception of the operator can be significantly improved by shifting the heart sound spectrum toward higher frequencies. This study proposes a real-time frequency shifter based on the Hilbert transform. Key system components are the Hilbert transformer implemented as a Finite Impulse Response (FIR) filter, and a Direct Digital Frequency Synthesizer (DDFS), which allows agile modification of the frequency shift. The frequency shifter has been implemented on a VLIW Digital Signal Processor (DSP) by devising a novel piecewise quadratic approximation technique for efficient DDFS implementation. The performance has been compared with other DDFS implementations both considering piecewise linear technique and sine/cosine standard library functions of the DSP. Piecewise techniques allow a more than 50% reduction in execution time compared to the DSP library. Piecewise quadratic technique also allows a more than 50% reduction in total required memory size in comparison to the piecewise linear. The theoretical analysis of the dynamic power dissipation exhibits a more than 20% reduction using piecewise techniques with respect to the DSP library. The real-time operation has been also verified on the DSK6713 rapid prototyping board by Texas Instruments C6713 DSP. Audiologic tests have also been performed to assess the actual improvement of heart sound perception. To this aim, heart sound recordings were corrupted by additive white Gaussian noise, crowded street noise, and helicopter noise, with different signal-to-noise ratios. All recordings were collected from public databases. Statistical analyses of the audiological test results confirm that the proposed approach provides a clear improvement in heartbeat perception in noisy environments.

Design of Abnormal Heart Sound Recognition System Based on HSMM and Deep Neural Network

Article

Full-text available

Aug 2022

Introduction Heart sound signal is an important physiological signal of human body, and the identification and research of heart sound signal is of great significance. Methods For abnormal heart sound signal recognition, an abnormal heart sound recognition system, combining hidden semi-Markov models (HSMM) with deep neural networks, is proposed. Firstly, HSMM is used to build a heart sound segmentation model to accurately segment the heart sound signal, and then the segmented heart sound signal is subjected to feature extraction. Finally, the trained deep neural network model is used for recognition. Results Compared with other methods, this method has a relatively small amount of input feature data and high accuracy, fast recognition speed. Discussion HSMM combined with deep neural network is expected to be deployed on smart mobile devices for telemedicine detection.

An edge-device-compatible algorithm for valvular heart diseases screening using phonocardiogram signals with a lightweight convolutional neural network and self-supervised learning

Article

Nov 2023
COMPUT METH PROG BIO

PCG Heart Sounds Quality Classification Using Neural Networks and SMOTE Tomek Links for the Think Health Project

Chapter

Mar 2023

Cardiac PCG signal recordings are an important part of cardiology teleconsultations. The main problems related to high-quality recordings are due to less experienced healthcare personnel taking lower-quality samples and ambient noise, and these scenarios can lead to errors in diagnosis by the physician and PCG heart sound classification algorithms. Given this problem, machine learning algorithms were proposed for quality classification of PCG recordings that aid in accurate diagnosis and faster care. One difficulty in the application of these algorithms is the problems related to class imbalance, which is very common in medical applications that affect model performance. In this study, an artificial neural network (ANN) classifier with a SMOTE Tomek Links class imbalance method is used. Public databases containing 7893 recordings with ten features of each PCG signal are used. We use two types of labeling which have different levels of imbalance. The use of SMOTE Tomek Links in combination with neural networks showed better performance compared to SVM classifier. For future work, we intend to perform laboratory tests in remote areas applying the proposed algorithm and we also intend to use the concept of mel spectrograms and convolutional networks for the classification of heart sounds.KeywordsQuality classification PCGAuscultationData imbalanceNeural artificial networksThink health project

Multiscale Kernel Residual Convolutional Neural Network to Detect Heart Valve Diseases

Conference Paper

Nov 2022

Combined empirical mode decomposition and phase space reconstruction based psychologically stressed and non-stressed state classification from cardiac sound signals

Article

Feb 2023
BIOMED SIGNAL PROCES

Mandeep Singh

Noise and Disturbance Reduction for Heart Sounds in Cycle-Frequency Domain Based on Nonlinear Time Scaling

Article

Full-text available

Mar 2010

Through an investigation of various clinical cases, heart sounds are found to be quasi-cyclostationary. Nonlinear time scaling from cycle-to-cycle is proposed to enhance cyclic stationarity, where nonlinear time scaling is approximated by a piecewise linear function. The techniques of cyclostationary signal processing are employed in this paper to reduce noise and disturbance in the cycle-frequency domain. Heart sounds can be theoretically recovered in the presence of additive, zero mean noise, and disturbance (perhaps non-Gaussian, nonstationary, or colored). The experimental tests in various conditions confirm the theoretical results.

Heart instantaneous frequency (HIF): An alternative approach to extract heart rate variability

Article

Full-text available

Sep 2001

Our study focuses on a new method of estimating the heart rate variability (HRV) which does not require the use of electrocardiogram (ECG) R-wave detection. Contrary to the R-wave detection method which requires a sampling frequency higher than 100 Hz, the one proposed here can be used to calculate the HRV from an ECG signal sampled at a frequency of approximately 5 Hz with a relative mean error of 0.03. This new method is based on extracting the instantaneous fundamental frequency from the ECG. The method could be efficiently used to extract the HRV from an ECG measured for healthy subjects performing an exercise in which the HRV increases linearly with time, and for subjects with respiratory and cardiac problems. The overall error decreased as we low-pass filtered the HRV with lower cut-off frequencies. Moreover, it was shown that the method could be efficiently used to calculate the HRV from blood pressure measurements and to be robust to noise.

Noise Reduction for Heart Sounds Using a Modified Minimum-Mean Squared Error Estimator with ECG Gating

Article

Full-text available

Feb 2006

In this paper, we present a method for single channel noise reduction of heart sound recordings. Multiple noise sources, such as lung sounds, muscle contraction, and background noise can contaminate the heart sound collection making subsequent analysis difficult. Our approach is based on a spectral domain minimum-mean squared error (MMSE) estimation, originally introduced by Ephraim and Malah in the context of speech enhancement. This method uses a "decision-directed" approach to estimate the noise spectrum without the need for a separate reference signal. The noise spectrum is used to compute the SNR on-line for adapting the Wiener filter gain applied to the spectral amplitudes. A number of modifications are made to the baseline algorithm to increase the level of noise reduction while simultaneously reducing signal distortion. Enhancements include the use of a "soft" threshold to determine when to update the noise spectrum, a forward-backward filtering implementation (i.e., smoothing), and a "second-pass" iterative estimation scheme in which the residual noise is used to re-estimate the SNR and update the Wiener gains. In addition, ECG analysis is used to provide gating information on when desired heart sounds may be present in order to further guide the noise spectral estimation procedure. The noise reduction algorithm is tested as a front-end to an automatic heart sound analysis system. The sounds are collected through two sensors that act simultaneously as microphones and ECG electrodes. The proposed algorithm demonstrates improvements over existing noise reduction approaches in terms of SNR gain, qualitative evaluations, and automatic detection of abnormalities present in the heart sounds.

Extraction of the aortic and pulmonary components of the second heart sound using a nonlinear transient chirp signal model

Article

Apr 2001

The objective of this paper is to adapt and validate a nonlinear transient chirp signal modeling approach for the analysis and synthesis of overlapping aortic (A2) and pulmonary (P2) components of the second heart sound (S2). The approach is based on the time-frequency representation of multicomponent signals for estimating and reconstructing the instantaneous phase and amplitude functions of each component. To evaluate the accuracy of the approach, a simulated S2 with A2 and P2 components having different overlapping intervals (5-30 ms) was synthesized. The simulation results show that the technique is very effective for extracting the two components, even in the presence of noise (-15 dB). The normalized root-mean-squared error between the original A2 and P2 components and their reconstructed versions varied between 1% and 6%, proportionally to the duration of the overlapping interval, and it increased by less than 2% in the presence of noise. The validated technique was then applied to S2 components recorded in pigs under normal or high pulmonary artery pressures. The results show that this approach can successfully isolate and extract overlapping A2 and P2 components from successive S2 recordings obtained from different heartbeats of the same animal as well from different animals.

Computerized Heart Sound Analysis

Article

Mar 2008
COMPUT BIOL MED

This paper is concerned with a synthesis study of the fast Fourier transform (FFT), the short-time Fourier transform (STFT), the Wigner distribution (WD) and the wavelet transform (WT) in analysing the phonocardiogram signal (PCG). It is shown that these transforms provide enough features of the PCG signals that will help clinics to obtain qualitative and quantitative measurements of the time-frequency (TF) PCG signal characteristics and consequently aid diagnosis. Similarly, it is shown that the frequency content of such a signal can be determined by the FFT without difficulties. The studied techniques (FT, STFT, WD, CWT, DWT and PWT) of analysis can thus be regarded as complementary in the TF analysis of the PCG signal; each will relate to a part distinct from the analysis in question.

The embedded digital stethoscope uses the adaptive noise cancellation filter and the type I Chebyshev IIR bandpass filter to reduce the noise of the heart sound

Conference Paper

Jul 2005

In this paper, we propose a design and implementation of an embedded digital stethoscope that uses the adaptive noise cancellation filter and the Type I Chebyshev IIR bandpass filter to reduce the noise of the heart sound. We integrate a traditional stethoscope, two microphones, an amplifier, an analog to digital conversion, a DSP board, and embedded board. First, the system acquits the heart sound, amplify, digitize, and input into the DSP board for noise reduction by using of the adaptive noise cancellation and the IIR bandpass filter. Then, the preprocessed heart sound signals are send into the embedded board for the LCD displaying and the interface to PC. Overall, we design the digital filter, interface circuit, displaying driver, graphical user interface. The prototype system integration of the hardware and software modules shows the stable operation with a range of adjustments.

Modified forward-backward overdetermined Prony method and its application in modelling heart sounds

Article

Jan 1996
IEE Proc Vis Image Signal Process

Prony's method is found to be a very effective method for the analysis-synthesis of transient data. However, straightforward application of this method can lead to poor performance, especially for short and noisy data records. The authors present a new over-determined forward-backward Prony method (MFBPM) and its application to the analysis of the first and second heart sounds. The accuracy of the method is measured using both cross-correlation and the normalised-mean-square-error (NMRSE) between a real signal and a synthetic one. Results from more than 80 different subjects show that the MFBPM is highly stable and gives very good performance with an average cross-correlation coefficient of 99.62%. Comparison of the results based on the NMRSE criterion show that the MFBPM is more precise than the modified backward Prony method (MBPM) with an accuracy improvement of upto 10%, and upto 20%, when compared with the conventional forward-backward Prony method (FBPM). Furthermore, a new method for dynamic estimation of model order is proposed for the case of heart sounds based on a subset of synthesised heart sounds which best approximates the observed data using NMRSE

Analysis of the second heart sound for diagnosis of paediatric heart disease

Article

Dec 1998

The second heart sound (S2) consists of two major components, one due to the closure of the aortic valve (A2) and the other due to the closure of the pulmonary valve (P2). The aortic valve normally closes before the pulmonary valve and leads to a time delay between the two sounds. This delay is known as the “split” in the medical community and is of significant diagnostic importance. The authors aim to develop an automatic technique to measure the split and to compare two common splitting patterns (i.e. variable splitting and fixed splitting), in a quantitative manner. A signal model for S2 is proposed. Accordingly, S2 is decomposed into a number of components with an algorithm based on the time-frequency distribution of the signal. A2 and P2 are selected from the model components and from these the split can be estimated. Groups of patients with the two splitting patterns have been investigated. For each patient, the splits of 20 successive cardiac cycles are measured, their mean and standard deviation are then calculated and used to characterise the two splitting patterns. It is found that the two simple statistical quantities can be used to identify the splitting patterns and hence offer important diagnostic information

A Study of Prosthetic Heart Valve Sounds

Article

Dec 1987

In this paper a new mechanism is proposed for the generation of phonocardiogram (PCG) sounds from implanted mechanical prosthetic heart valves. The structures in the chest, the heart, its partitions, and major vessels, constitute a frequency selective system excited by the rapidly decelerating valve occluder. It is shown that the source, the rapidly decelerating valve, has a wide and flat power spectrum and hence is an impulsive excitation that couples energy to the resonance modes specified by the structures in the chest. Consequently, the PCG signal is composed of decaying sinusoids. The parameters of the decaying sinusoids are estimated, and it is observed that the power spectra of the PCG signals have two dominant peaks in the frequency band of 200-500 Hz. The energy coupled to these two modes depends on the state of the valve. With thrombus the decelerating occluder slows down and becomes a broader pulse concentrating the energy to the lower resonance mode. This is verified by experiments on 30 patients during postoperative time course. However, no significant change in the resonance frequencies are observed which is an evidence for their anatomical and not valvular dependence.

Heart-Sound Processing by Average and Variance Calculation - Physiologic Basic and Clinical Implications

Article

Oct 1984

A new statistical method for heart-sound processing was developed and tested on normal subjects and on patients suffering from various cardiac pathologies. The method is effective in decreasing noise and in separating heart sounds from murmurs, as well as in deriving new physiological parameters. The theory is based on the assumption that heart sounds can be classified into deterministic and nondeterministic sounds. The processing results in a very significant attenuation of strong murmurs, while the deterministic events, such as SI-S4, are only slightly affected. The method includes dividing the heart-sound signal into a set of repetitive signals (ensemble) according to the trigger selected to be the peak of the ECG R-wave. The variability of the time elapsed from the trigger to the evoked sound is defined as the jitter. The average and variance functions are calculated from the ensemble. Calculation of the heartsound jitter from the average and variance functions shows a jitter of 5.5 ms Â±2.6 ms for S<sub>1</sub>, and 8.2 ms Â±3.3 ms for S<sub>2</sub>. The jitter, which is an objective parameter of the trigger-response linkage, can be used experimentally to clarify some of the cardiac electromechanical mechanisms, and it may have diagnostic value.

Separation of Heart Sound Signal from Noise in Joint Cycle Frequency–Time–Frequency Domains Based on Fuzzy Detection

Abstract and Figures

Recommended publications

Nonlinear analysis of heart murmurs using wavelet-based higher-order spectral parameters

Time-Frequency Analysis of the First Heart Sound

Advances in Time-Frequency Analysis of Biomedical Signals

Time-frequency analysis of the first heart sound. Part 2: An appropriate time-frequency representati...