ArticlePDF Available

Separation of Heart Sound Signal from Noise in Joint Cycle Frequency–Time–Frequency Domains Based on Fuzzy Detection

Authors:

Abstract and Figures

Noise is generally unavoidable during recordings of heart sound signal. Therefore, noise reduction is one of the important preprocesses in the analysis of heart sound signal. This was achieved in joint cycle frequency-time-frequency domains in this study. Heart sound signal was decomposed into components (called atoms) characterized by time delay, frequency, amplitude, time width, and phase. It was discovered that atoms of heart sound signal congregate in the joint domains. On the other hand, atoms of noise were dispersed. The atoms of heart sound signal could, therefore, be separated from the atoms of noise based on fuzzy detection. In a practical experiment, heart sound signal was successfully separated from lung sounds and disturbances due to chest motion. Computer simulations for various clinical heart sound signals were also used to evaluate the performance of the proposed noise reduction. It was shown that heart sound signal can be reconstructed from simulated complex noise (perhaps non-Gaussian, nonstationary, and colored). The proposed noise reduction can recover variations in the both waveform and time delay of heart sound signal during the reconstruction. Correlation coefficient and normalized residue were used to indicate the closeness of the reconstructed and noise-free heart sound signal. Correlation coefficient may exceed 0.90 and normalized residue may be around 0.10 in 0-dB noise environment, even if the phonocardiogram signal covers only ten cardiac cycles.
Content may be subject to copyright.
2438 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 57, NO. 10, OCTOBER 2010
Separation of Heart Sound Signal from Noise in Joint
Cycle Frequency–Time–Frequency Domains Based
on Fuzzy Detection
Hong Tang, Member, IEEE, Ting Li, Yongwan Park, and Tianshuang Qiu, Member, IEEE
Abstract—Noise is generally unavoidable during recordings of
heart sound signal. Therefore, noise reduction is one of the im-
portant preprocesses in the analysis of heart sound signal. This
was achieved in joint cycle frequency–time–frequency domains in
this study. Heart sound signal was decomposed into components
(called atoms) characterized by time delay, frequency, amplitude,
time width, and phase. It was discovered that atoms of heart sound
signal congregate in the joint domains. On the other hand, atoms
of noise were dispersed. The atoms of heart sound signal could,
therefore, be separated from the atoms of noise based on fuzzy
detection. In a practical experiment, heart sound signal was suc-
cessfully separated from lung sounds and disturbances due to chest
motion. Computer simulations for various clinical heart sound sig-
nals were also used to evaluate the performance of the proposed
noise reduction. It was shown that heart sound signal can be re-
constructed from simulated complex noise (perhaps non-Gaussian,
nonstationary, and colored). The proposed noise reduction can re-
cover variations in the both waveform and time delay of heart
sound signal during the reconstruction. Correlation coefficient and
normalized residue were used to indicate the closeness of the recon-
structed and noise-free heart sound signal. Correlation coefficient
may exceed 0.90 and normalized residue may be around 0.10 in 0-
dB noise environment, even if the phonocardiogram signal covers
only ten cardiac cycles.
Index Terms—Fuzzy detection, heart sound signal, joint cycle
frequency–time–frequency domains, noise reduction.
I. INTRODUCTION
PHONOCARDIOGRAPHY is a noninvasive technique that
is used to check the functioning of heart valves. It is, there-
fore, widely used during medical examinations carried out by
physicians. However, ambient noise and disturbances can cor-
rupt the recorded heart sound signal. The sliding movements of
the stethoscope diaphragms in contact with the patient’s skin,
Manuscript received January 25, 2010; revised April 1, 2010; accepted April
30, 2010. Date of publication June 10, 2010; date of current version September
15, 2010. This work was supported in part by the National Natural Science
Foundation of China under Grant 30570475 and Grant 60872122. Asterisk
indicates corresponding author.
H. Tang is with the Department of Biomedical Engineering, Faculty of Elec-
tronic Information and Electrical Engineering, Dalian University of Technology,
Dalian 116024, China (e-mail: tanghong@dlut.edu.cn).
T. Li is with the College of Electromechanical and Information En-
gineering, Dalian Nationalities University, Dalian 116600, China (e-mail:
tracyli78@yahoo.com.cn).
Y. Park is with the Department of Information and Communication, Ye-
ungnam University, Gyeongsangbuk-Do 712-749, Korea (e-mail: ywpark@
yu.ac.kr).
T. Qiu is with the Department of Biomedical Engineering, Dalian University
of Technology, Dalian 116024, China (e-mail: qiutsh@dlut.edu.cn).
Digital Object Identifier 10.1109/TBME.2010.2051225
lung sounds, the muscular activity controlling lung movements,
and surround speech sounds represent a few of the sources of
noise and disturbance affecting the accuracy of data being ac-
quired. These noises usually have high amplitude and last for
only a short period of time. Furthermore, there is considerable
overlap in the time–frequency domains of heart sound signal
and noises. As heart sound signals are transient, they are eas-
ily contaminated with the noises. Consequently, it may become
difficult for physicians to obtain the correct diagnostic informa-
tion by auscultation when noise and disturbances are important.
Noise reduction would allow a quantitative analysis of heart
sound signal and lead to a more reliable diagnosis. The distur-
bance addressed in this paper is the impulsive noise interference,
which is typically characterized by noise pulses of short time
duration.
Over the years, various techniques of noise reduction have
been proposed for different purposes. Some techniques like
adaptive noise cancellation and filtering can be applied to re-
duce noise from heart sound recordings [1], [2]. In particular, it
has been found that noise reduction performed in a time and/or
frequency domain may not be effective for non-Gaussian, non-
stationary, and colored noises. More specifically, it is considered
inappropriate because heart sound signal and noise overlap in
both time and frequency domains. On the contrary, techniques of
cyclostationary signal processing can reduce noise in the cycle
frequency domain. Beyar et al. [3] proposed to divide recordings
of heat sound signal into a sequence of repetitive cycles. Noise
was reduced simply by summations. However, heart sounds and
murmurs tend to occur with different timings from cycle to cycle
and cannot be totally preserved while noise is suppressed. In our
earlier paper [4], the timings of heart sounds and murmurs were
aligned from one cycle to next cycle by the nonlinear time scal-
ing (NTS). Noise and disturbance were subsequently reduced by
averaging. The results were promising, although segmentation
of the heart sound signal into first heart sound (S1) and second
heart sound (S2) is fundamentally needed to determine the pa-
rameters for the NTS. This preprocess can have a detrimental
effect on the efficiency of the noise reduction. Its performance
degrades, if segmentation is inaccurate, or if the assumption that
heart sounds are consistent in consecutive cycles is not valid.
To avoid these commonly encountered limitations, we propose
a new noise reduction in this paper that is performed in the joint
cycle frequency–time–frequency domains based on fuzzy de-
tection. Comparing with the earlier study [4], one more domain,
frequency domain, is exploited. This proposed noise reduction
can accommodate variations in both time delay and waveform
0018-9294/$26.00 © 2010 IEEE
TANG et al.: SEPARATION OF HEART SOUND SIGNAL FROM NOISE IN JOINT CYCLE FREQUENCY–TIME–FREQUENCY DOMAINS 2439
of heart sounds, murmurs. No segmentation is needed. On the
other hand, it can be operated in a somewhat automated manner.
The paper is organized as follows. Section II out-
lines the decomposition of heart sound signal into atoms.
Section III focuses on the quasi-cyclostationarity of the atoms. In
Section IV, we propose a fuzzy-detection method to detect atoms
of heart sound signal in the joint plane. Practical experiments
and various computer simulations are described in Section V.
In Section VI, we discuss the results, and in Section VII, ob-
servations regarding performance comparisons are presented.
Finally, Section VIII summarizes our main conclusions.
II. REPRESENTATION OF HEART SOUND SIGNAL IN
TIME–FREQUENCY DOMAINS
A. Decomposition
To our knowledge, several signal models can be found in
literature for the decomposition of heart sound signal, such as
the chirp models [5], [6], the damped sinusoidal models [7], [8],
the modified Prony models [9], and the Gaussian modulation
model [10]. Leung et al. employed the Gaussian modulation
model to decompose the second heart sound for the diagnosis
of pediatric heart diseases. This model is employed to represent
the heart sound signal of one cardiac cycle
hm(t)=
Lm
i=1
amie(ttmi)2/(2σ2
mi)cos(2πωmit+βmi)(1)
where hm(t)is the heart sound signal of the mth cycle. Namely,
(1) means that hm(t)is the sum of Lmatoms. Every atom is
characterized by five parameters: tmi is the time delay of the
ith atom with respect to the start of the mth cycle; ami is the
amplitude; ωmi is the frequency; σmi is the time width that
the atom needs support; βmi is the phase. Therefore, the heart
sound signal of this cycle is represented by the set of atoms
{tmi,a
mi
mi
mi
mi,1iLm}. The number of atoms,
Lm, and the five parameters for each atom can be obtained using
short-time Fourier transform (STFT) analysis, as described in
[10].
The STFT of the heart sound signal hm(t)is
H(t, f )=hm(t)w(tτ)e2πωτ (2)
where w(t) is a Gaussian window. First, the atom with the max-
imum amplitude is identified by searching the magnitude of the
STFT. More specifically, the atom with maximum amplitude is
located by detecting the peak in the magnitude of the STFT.
Once the time delay of the atom tmi has been identified, its
amplitude ami, frequency ωmi, and phase βmi can be read di-
rectly from the STFT. σmi is obtained by the following optimal
procedure.
The waveform represented by the ith atom is
smi(t, σmi)=amie(ttmi)2/(2σ2
mi)cos(2πωmit+βmi).
(3)
The signal residue after the waveform of the ith atom is
subtracted from the signal h(m1)(t)is
hmi(t, σmi)=hm(i1)(t)smi(t, σmi)(4)
Fig. 1. Heart sound signal of one cardiac cycle was decomposed into atoms.
(a) Recorded waveform of one cardiac cycle. (b) Atoms on the time–frequency
plane. (c) Reconstructed waveform and residue.
where hm0(t)is the original signal of the mth cycle. The nor-
malized residue energy is
ρmi(σmi)=|hmi(t, σmi)|2dt|hm0(t)|2dt. (5)
Obviously, ρmi(σmi)will be minimum, if a perfect “time
width” σmi is found. The minimum of the residue energy
ρmi(σmi)is, therefore, used as a criterion to optimize σmi.
For this purpose, we monitor ρmi(σmi)by varying σmi within
a predefined range. The decomposition stops, if ρmi(σmi)is
sufficiently small. The heart sound signal of the mth cycle can
be reconstructed from the sum of all atoms as
hm(t)
Lm
i=1
smi(t).(6)
B. Data Acquisition
The normal heart sound signal used in this paper was recorded
in the authors’ laboratory. The male subject (33 years of age)
was asked to lie on his back on an examination bed. The sensor
was directed toward the mitral site. ECG signals and heart sound
signal were recorded simultaneously. It is known that the domi-
nant bandwidth of a heart sound signal is approximately 500 Hz.
To avoid higher frequency noise than 500 Hz, the data was pre-
filtered by low-pass filter with cutoff frequency 500 Hz. The
sampling rate was set to 2 KHz, which is higher than the min-
imum rate required by the sampling theory. Furthermore, the
laboratory provided an environment, where even minor noise
could be controlled. These low-noise heart sound recordings
allowed us quantificationally evaluate noise reduction in simu-
lated noise.
C. Simulation
The normal heart sound signal of one single cardiac cycle was
decomposed into 16 atoms, as shown in Fig. 1. The placement
2440 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 57, NO. 10, OCTOBER 2010
Fig. 2. Correspondence between ECG and heart sound signal.
of each atom in the time–frequency plane is shown in Fig. 1(b).
The reconstructed waveform is shown in Fig. 1(c), where the
decomposing was stopped when the normalized residue energy
was less than 0.05. The high correlation coefficient between the
reconstructed and the original waveform (up to 0.98) lead us to
the conclusion that atoms can represent accurately heart sound
signal in the time and frequency domains.
III. QUASI-CYCLOSTATIONARITY OF THE ATOMS
Cardiac cycle is a sequence of events repeated cyclically dur-
ing every heartbeat. Heart sounds (S1–S4) are caused by vibra-
tion of heart valves and/or surrounding heart tissues following
the closing or opening of heart valves, while heart murmurs are
caused by turbulence of blood flow. The pace at which a heart
beats does not change abruptly, and therefore, the heart sound
signal is considered quasi-cyclostationary. Based on this approx-
imation, we assumed that atoms representing heart sound signal
in the time–frequency domains are also quasi-cyclostationary,
namely, if the heart sound signal was cyclostationary and the
cycle frequency corresponded to the reciprocal of cardiac cycle
duration, we expected that the atoms for one cardiac cycle will
superpose the atoms of the next cycle.
However, heart cycle duration varies from cycle to cycle. This
phenomenon, widely known as heart rate variability (HRV),
can be identified easily in the ECG [11]. The correspondence
between the ECG and heart sound signal is demonstrated in
Fig. 2. The occurrence of an R wave indicates the start of a
new cycle. In order to analyze heart sound signal at specific
cycle frequency, we perform linear time scaling on heart sound
signal of every cardiac cycle to enhance the cyclostationarity.
For example, the heart sound signal of the mth cycle hm(t)is
linearly time scaled as
hl
m(t)=hm(T0t/Tm)(7)
where T0is the cycle duration used as reference and Tmis
the mth cycle’s duration. The cycle frequency for the enhanced
signal is thus 1/T0. We decompose the enhanced signal of the
mth cycle hl
m(t)into atoms
hl
m(t)=
Ll
m
i=1
al
mie(ttl
mi)2/(2(σl
mi)2)cos(2πωl
mit+βl
mi).
(8)
Fig. 3. Distribution of the atoms is shown in joint plane at cycle frequency
1 Hz. This atom congregate provides sufficient evidence that atoms of enhanced
heart sound signal are quasi-cyclostationary.
The first three consecutive cardiac cycles shown in Fig. 2 are
processed as given in (7), where T0=1s. The signals hl
m(t)
with m=1, 2, and 3 are then decomposed into atoms according
to (8), as shown in Fig. 3. The atoms of one cycle almost super-
pose the atoms of another cycle. This atom congregate provides
sufficient evidence that atoms are approximately cyclostation-
ary. The reason why atoms of different cycles do not perfectly
superpose on each other may be variations in the waveform
and time delay of heart sounds, murmurs in consecutive cycles.
Noise reduction could be really robust, if these variations can
be accommodated.
We assume that atoms of heart sound signal are quasi-
cyclostationary. On the other hand, atoms for noise and dis-
turbances are random and their cycle frequency spectrum does
not overlap with that of heart sound signal.
On the basis of this assumption, it is safe to say that atoms
of heart sound signal congregate on the joint cycle frequency–
time–frequency plane. Noise and disturbances are generally ran-
dom in nature. The atoms of noise and disturbances must be
dispersed over the joint plane. The atoms of heart sound signal
and the atoms of noise can thus be easily separated. In extreme
cases where noise is cyclostationary, the atoms of heart sound
signal can still be separated from those of noise, if their cycle
frequency spectra do not overlap [4].
IV. SEPARATION OF HEART SOUND SIGNAL FROM NOISE
BASED ON FUZZY DETECTION
A. Scale Match
It is assumed that the recording of heart sound signal has M
cardiac cycles. The start of each cardiac cycle is indicated by an
R wave, while the atoms for the recording are denoted in the joint
plane as the set of atoms {tl
mi
l
mi,1iLl
m,1m
M}. The frequency components of heart sounds and murmurs
range from several to five hundreds hertz [12]; however, the
normal duration of a cardiac cycle is about 850 ms and we have
chosen to normalize the cycle frequency to 1 s for improving
comparisons between cycles and between patients. For example,
in Fig. 3, where the vertical axis ranges from 0 to 200 and
the horizontal axis ranges from 0 to 1, the frequency ωl
mi is
TANG et al.: SEPARATION OF HEART SOUND SIGNAL FROM NOISE IN JOINT CYCLE FREQUENCY–TIME–FREQUENCY DOMAINS 2441
Fig. 4. Noisy heart sound signal and atom distribution. (a) Noisy heart sound
signal. (b) Atom distribution in the joint plane at a cycle frequency of 1 Hz. The
vertical axis is the “scaled frequency”. It is obvious that atoms of heart sound
signal congregate, whereas atoms of noise are more dispersed and can easily be
separated based on density of atoms (or the membership function).
a hundred times that of tl
mi. This mismatch will enable prior
evolution on frequency axis when the detection is operated in
the plane. To avoid this scale mismatch, the frequency ωl
mi is
scaled down so that its values are within the same range as time
delay tl
mi as shown in Fig. 4, where the vertical axis is named
“scaled frequency”.
B. Fuzzy Detection
In accordance with the assumption that atoms of heart sound
signal congregate in the joint cycle frequency–time–frequency
plane, the density of an atom provides an indication of whether
this specific atom represents a heart sound signal or noise. The
density of an atom is referred to the number of atoms found
within a radius ζaround the atom. For example, the density of
the atom (tl
mi
l
mi)is defined as
dmi =(number of atoms found with in ζ)
=(tl
mi tl
nj)2+(ωl
mi ωl
nj)2ζ,
1jLl
n,1nM. (9)
For a given number of cardiac cycles, it is obvious that atoms
of higher density probably represent heart sound signal. We
define a membership function to detect the atoms that represent
heart sound signal as follows:
A(dmi)=0,d
mi <M
1,d
mi M.(10)
Ideally, the density dmi is equal to the total number of car-
diac cycles M, which serves as a threshold in (10). If the density
is greater than or equal to M, then A(dmi)=1and the atom
(tl
mi
l
mi)probably represents a heart sound signal. By repeat-
ing the same process across all atoms of the mth cycle, we can
identify those atoms synthesize heart sound signal of this cycle.
Fig. 5. Variations in the time delay of heart sounds S1 and S2 for two different
cycles.
C. Heart Sound Signal Reconstruction
Heart sound signal of the mth cycle hs
m(t)are approximated
by the sum
hs
m(t)=
Kl
m
k=1
al
mke(ttl
mk)2/(2(σl
mk)2)cos(2πωl
mkt+βl
mk)
(11)
where Kl
mis the number of atoms, which meet the condition
A(dmk)=1. The heart sound signal of this cycle are subse-
quently reconstructed through reversed linear time scaling
hr
m(t)=hs
m(Tmt/T0).(12)
Possible reconstruction errors are evaluated using the normal-
ized residue
E=M
m=1 Tm
0|hr
m(t)hm(t)|2dt
M
m=1 Tm
0|hm(t)|2dt(13)
where Tmis the cycle duration of the mth cycle.
D. Parameter ζ
According to our assumption, the atoms of one cycle should
superpose perfectly on the atoms of another cycle, if heart sound
signal was cyclostationary and the radius ζwould be zero. How-
ever, the parameter ζis closely related to variations in the time
delay of heart sounds and murmurs. In order to determine this
parameter for the phonocardiographic signal recorded, we in-
vestigate the variations in the time delay of heart sounds S1 and
S2, as shown in Fig. 5. According to Tang et al. [4], heart sound
waveform is highly consistent between consecutive cardiac cy-
cles. Variations in the time delay of heart sounds of the mth
cycle with respect to the first cycle can, therefore, be estimated
based on the minimum mean square error criterion
rS1
m= arg min
rS1
mEs11(t)sm1(tζS1
m)2,m2
(14)
and
rS2
m= arg min
rS2
mEs12(t)sm2(tζS2
m)2,m2
(15)
2442 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 57, NO. 10, OCTOBER 2010
TAB LE I
STATISTICS OF THE PARAMETER ZETA FOR VARIOUS CLINICAL HEART
SOUND SIGNAL
where rS1
mand rS2
mdenote variations in the time delay of S1 and
S2 of the mth cycle with respect to the first cycle, respectively.
The span ζS1of rS1
mcan be calculated as
ζS1= max(rS1
m)min(rS1
m)(16)
where max(·) and min(·) are the maximum and minimum oper-
ators, respectively. Similarly, the span ζS2of rS2
mcan be calcu-
lated as
ζS2= max(rS2
m)min(rS2
m).(17)
The values of ζS1and ζS2for various clinical heart sound
signals are listed in Table I, for which it is obvious that ζS1
and ζS2vary for different cases. They also vary for the same
subject at different periods of time. Although there is a great in-
terest in the mechanism underlying these variations, it is beyond
the scope of this paper. The number of cases investigated (see
Table I) is limited. Still, the selected sample provides sufficient
evidence that the variation of time delay of heart sound can
vary from several to dozens of milliseconds between consecu-
tive cardiac cycles. We followed the same approach to calculate
the differences in the time delay of the third heart sound ζS3,the
fourth heart sound ζS4, as well as heart murmurs ζmurmur.For
all heart sounds and murmurs, the parameter ζshould, however,
be chosen as the maximum variation. For example, it may be
chosen equal to ζ=0.039 s (the greater one of ζS1and ζS2)
for a healthy subject (see Table I). It should be noted that the
statistics included in Table I are obtained with low-noise heart
sounds. Parameter ζmay be larger, if the heart sound signal is
contaminated with noise. Without any prior knowledge about
the noise strength, parameter ζmay be chosen experimentally.
From the statistics compiled by the authors for a wide range
of clinical cases, noise reduction to normal rhythm heart sound
signal can be achieved when parameter ζis set in the range
[0.02–0.06]. It may be larger in cases of arrhythmia due to the
high variations in the time delay of heart sounds, murmurs. In
other words, parameter ζis related to the cyclostationarity of
heart sound signal, as discussed in Section VI-A. To some ex-
tent, the proposed noise reduction can accommodate variations
in the time delays. This accommodation leads that the variations
in the waveform can also be recovered in time domain. It is one
of main advantages of the proposed noise reduction.
Fig. 6. Lung sounds, chest motion contaminating heart sound signal. (a) Con-
taminated heart sound signal. (b) Atoms distribution in the joint plane. (c) Re-
constructed heart sounds. (d) Signal residues, which is the sum of lung sounds
and slide sounds (indicated by ellipses). The latter are sounds produced by the
lungs during breathing or by the muscular activity controlling lung movements.
E. Summary
The proposed noise reduction consists of the following steps.
Step 1: ECG signal and heart sound signal are synchronously
recorded.
Step 2: The beginning of individual cardiac cycles is identified
based on the occurrence of R waves. If the corresponding ECG
is not available, other signals (for example, wrist pulse, carotid
pulse) as well as cycle detection algorithms may be employed
to divide the phonocardiographic signal into cycles.
Step 3: To ensure that every cardiac cycle has the equal cycle
duration, the linear time scaling is carried out, according to
(7). The enhanced heart sound signal of each cardiac cycle is
decomposed into atoms, according to (8).
Step 4: The fuzzy detection is applied to identify on the
joint plane the atoms, which synthesize each heart sounds and
murmurs, as described in Sections IV-A and IV-B.
Step 5: Heart sound signal is reconstructed as in Section IV-C.
V. P RACTICAL EXPERIMENT AND COMPUTER SIMULATIONS
A. Separation of Heart Sound Signal From Lung Sounds and
Chest Motion
The experiment was performed on the same subject climbing
a staircase of 100 steps, who immediately after step climbing
laid on his back on an examination bed. During the recording,
the respiration was fast and, as expected, lung sounds and the
chest motion contaminated the phonocardiographic recordings,
TANG et al.: SEPARATION OF HEART SOUND SIGNAL FROM NOISE IN JOINT CYCLE FREQUENCY–TIME–FREQUENCY DOMAINS 2443
Fig. 7. Simulated noise and disturbances. (a) Simulated noise along with ran-
domly timed disturbances. (b) Spectrum of the simulated noise and disturbance,
which overlaps the frequency spectrum of heart sounds.
as shown in Fig. 6(a). Twelve cycles of heart sound signal were
submitted to the noise-reduction algorithm. Informal listening
by the authors shows that heart sounds can often be buried by
heavy lung sounds. Atoms distribution on the joint plane at a
cycle frequency of 1 Hz is shown in Fig. 6(b), where the atoms
indicated by prisms were identified for heart sounds. The re-
constructed heart sounds are given in Fig. 6(c). It is seen that
the recovered heart sounds become clear. S1s and S2s are easily
distinguished. The separated noise and disturbance are shown
in Fig. 6(d), which may comprise lung sounds and stethoscope
sliding disturbances (indicated by ellipses) due to chest mo-
tion. These sliding disturbances have high amplitude, short time
duration, and are randomly timed. The main frequency compo-
nents of heart sounds overlap those of lung sounds, as seen in
the joint plane in Fig. 6(b). It can be concluded that the noise
and disturbances in this case are non-Gaussian, nonstationary,
and colored.
B. Simulated Noise and Disturbance
In order to evaluate the proposed noise reduction for more
clinical heart sound signal, we simulate noise and disturbances
by using the model
v(t)=H(v1(t)+v2(t)) (18)
where v1(t)is a white double-side exponential distribution, and
v2(t)are randomly timed disturbances. H(·) is a band-pass fil-
ter, whose frequency response band overlaps that of the heart
sound signal. One realization of the simulated noise is shown in
Fig. 7(a). Its spectrum is shown in Fig. 7(b).
C. Reduction of Simulated Noise From Normal Heart Sounds
A low-noise heart sound signal from the normal subject is
contaminated by the simulated non-Gaussian, nonstationary,
and colored noise. Twelve cycles of heart sound signal were
submitted to the noise-reduction algorithm. The recording plus
Fig. 8. Noise reduction for normal heart sound signal. (a) Low-noise heart
sound signal and the corresponding ECG. (b) Heart sounds contaminated with
simulated noise. Disturbances are indicated by an ellipse. (c) Atoms in the joint
plane at cycle frequency of 1 Hz. (d) Reconstructed heart sounds.
the simulated noise is given in Fig. 8(b). The SNR is 0 dB. We see
that S1s are almost completely buried by noise. The noise and
disturbances are so heavy that heart sounds cannot be identified
by the human eye. The scatter plot in the joint plane at a cycle
frequency of 1 Hz is shown in Fig. 8(c). The atoms represented
by prisms are found to form heart sounds based on fuzzy detec-
tion and the remaining atoms, represented by circles, are found
to form noise and disturbances. The reconstructed heart sounds
are shown in Fig. 8(d). It is relatively easy for the authors to
identify the heart’s rhythm from the reconstructed signal just by
listening. The correlation coefficient between the reconstructed
and the original signal is 0.92, and the normalized residue is
0.12.
D. Reduction of Simulated Noise Form Heart Sound Signal
With Aortic Valve Stenosis
The noise reduction was also applied on heart sound signal of
patients with aortic valve stenosis. The noise-free data, whose
sampling rate is 22 050 Hz, were downloaded from the website
of the School of Medicine of the University of Dundee. The sam-
pling frequency was scaled down to 2 KHz, where a low-pass
filter with cutoff frequency 1 Hz was used to prevent frequency
aliasing. The heart sound signal has long, heavy systolic mur-
murs, as shown in Fig. 9(a). The signal was contaminated by
the simulated noise, as shown in Fig. 9(b). Some disturbances,
indicated by an ellipse, overlap the murmurs. The SNR is 0 dB.
The distribution of atoms on the joint plane at a cycle frequency
of 1 Hz is shown in Fig. 9(c). The atoms, indicated by prisms,
2444 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 57, NO. 10, OCTOBER 2010
Fig. 9. Noise reduction for heart sound signal with aortic valve stenosis.
(a) Low-noise recording. (b) Heart sounds contaminated with simulated noise.
Some disturbances indicated by an ellipse overlap heart murmurs in time do-
main. (c) Atoms in the joint plane at cycle frequency of 1 Hz. (d) Reconstructed
heart sound signal.
are identified to form heart sounds by fuzzy detection. The re-
constructed heart sound signal, shown in Fig. 9(d), seems to
be closer to the noise-free heart sound signal. The correlation
coefficient between the reconstructed and original signal is 0.90
and the normalized residue 0.18.
E. Reduction of Simulated Noise From Heart Sound Signal With
the Fourth Heart Sound (S4)
The fourth example involves the reduction of noise from heart
sound signal with the fourth heart sounds (S4), as shown in
Fig. 10(a). The noise-free data, whose sampling frequency was
22050 Hz, was downloaded from the website of the Medicine
Department of the University of Dundee. We scaled the sampling
frequency down to 2 KHz, where the technology of preventing
frequency aliasing was also used. S4 generally has low ampli-
tude and is easily buried by noise. In this example, we show the
robustness of the proposed noise reduction in recovering S4 in
heavy noise and disturbance environments. The heart sound sig-
nal contaminated by the simulated noise is shown in Fig. 10(b).
The SNR is 0 dB. The S4 sounds can hardly be perceived by
the human eye. The disturbances, indicated by ellipses, are ran-
domly timed and cause the disordering of rhythms. The authors
could hardly observe the occurrence of S4 s. The distribution of
the atoms in the joint plane at a cycle frequency of 1 Hz is shown
in Fig. 10(c). The fuzzy detection identifies the atoms of heart
sound signal, which are represented by prisms. The remaining
atoms, represented by circles, are identified from noise and dis-
turbances. The reconstructed heart sound signal are shown in
Fig. 10. Noise reduction for heart sound signal with the fourth heart sound.
(a) Low-noise recording. (b) Heart sound signal contaminated with simulated
noise. Disturbances are indicated by an ellipse. (c) Atoms in the joint plane at
cycle frequency of 1 Hz. (d) Reconstructed heart sounds.
Fig. 10(d). It is seen that the reconstructed waveform shows a
high level of approximation to the original waveform. The cor-
relation coefficient between the reconstructed and the original
waveforms is up to 0.98, and the normalized signal residue is as
low as 0.05.
VI. DISCUSSION
A. Cyclostationarity Strength
The atoms of heart sound signal can be easily identified
by fuzzy detection because heart sound signal are quasi-
cyclostationary. The atoms of heart sound signal congregate
in the joint plane at a specific cycle frequency. The performance
of noise reduction is thus directly related to the cyclostationar-
ity strength of heart sound signal. We may safely conclude that
as the cyclostationary of heart sound signal increases, the level
of noise reduction increases. Ideally, the atoms of heart sound
signal superpose, if the recording is perfectly cyclostationary.
However, it is impossible for heart sound signal to be perfect
cyclostationary because of variations in the waveform and time
delay of heart sounds, murmurs between different cardiac cy-
cles. One question that arises is how to quantitatively evaluate
the cyclostationarity strength for given heart sound signals.
We introduce cyclic statistics to assess the cyclostationarity
strength of heart sound signals. ρα
x(f)is the cyclic spectral
coherence function, which is defined as
ρα
x(f)= Sα
x(f)
[S0
x(f+α/2)S0
x(fα/2)]1/2(19)
TANG et al.: SEPARATION OF HEART SOUND SIGNAL FROM NOISE IN JOINT CYCLE FREQUENCY–TIME–FREQUENCY DOMAINS 2445
TAB LE I I
CYCLOSTATIONARITY STRENGTH
where Sα
x(f)is the cyclic spectrum of signal x(t) at the cycle
frequency α.Sα
x(f)is written as
Sα
x(f)=+
−∞
Rα
x(τ)ej2πfτ (20)
and
Rα
x(τ)=
t
x(t)x(t+τ)ej2παtdt. (21)
Rα
x(τ)is the cyclic correlation function. Rα
x(τ)degrades to a
traditional correlation when the cycle frequency αis zero.
In (19), ρα
x(f)represents a second-order cyclic statistics that
shows the cyclostationarity strength of a signal at a given cycle
frequency α. The larger the amplitude of ρα
x(f)is, the higher
the degree of cyclostationarity of the signal is. However, a heart
sound signal is not completely cyclostationary. It is partly sta-
tionary. ρ0
x(f)indicates the stationarity strength. In order to
evaluate the relative strength, we calculate the rate
γx(α)= ρα
x(f)df
ρ0
x(f)df +ρα
x(f)df .(22)
It is clear that γx(α)is 0.5, if the cyclostationarity strength is
equal to the stationarity strength. γx(α)is always less than or
equal to 1. In order to study how the cyclostationarity strength
affects noise reduction, we calculate γx(α)for the normal heart
sound signal shown in Fig. 8, heart sound signal with aortic
stenosis shown in Fig. 9, and the heart sound signal with S4
shown in Fig. 10 at a cycle frequency α=1 Hz. The rates are
listed in Table II. It is found that γAS(1) has the lowest value,
γFHS(1) has the highest value, and γNHS(1) has an intermediate
value. This means that the heart sound signal shown in Fig. 9
has the lowest cyclostationarity, the heart sound signal shown
in Fig. 10 has the highest cyclostationarity, and the heart sound
signal shown in Fig. 8 has medium cyclostationarity. These
results imply that the noise reduction shown in Fig. 10 is the
best and the noise reduction shown in Fig. 9 is the worst. These
results are confirmed by comparing them with the results of
our simulations. The correlation coefficients for the heart sound
signals shown in Figs. 8 and 9 are 0.92 and 0.90, respectively,
and the normalized residues are 0.12 and 0.18, respectively.
However, the correlation coefficient and normalized residue for
Fig. 10 are 0.98 and 0.05, respectively.
It is well known that HRV is unavoidable for a live sub-
ject, namely, there are no heart sound signals that are perfectly
cyclostationary. We employ linear time scaling to enhance cy-
clostationarity, as shown in (7). However, this is not sufficient
to align the timings of the heart sounds and murmurs. To further
enhance cyclostationarity, some special techniques may be em-
ployed, such as NTS proposed in our earlier paper [4]. Another
TABLE III
INFLUENCE OF THE NUMBER OF CYCLES
factor that can affect cyclostationarity is the variations in the
waveform. In the past, there have been few studies that have
used waveform improvement techniques to enhance cyclosta-
tionarity.
B. Influence of the Number of Cycles
As shown in the simulations, the quasi-cyclostationarity of
heart sound signal can be indicated by the density of atoms
in the joint plane at a specific cycle frequency. The quasi-
cyclostationarity may be sufficient as the number of cycles in-
creases. The density of atoms is thus a suitable indicator for
the detection of heart sound signal. To evaluate the performance
of the noise reduction with respect to the number of cycles,
we perform independent noise reduction simulations for nor-
mal heart sound signal in 0-dB environments when the number
of cycles is 5, 10, 20, and 30. The statistics of performance
indicators are listed in Table III. We see that the correlation co-
efficient between the reconstructed and the noise-free signal is
0.90 and the normalized signal residue is 0.15 when only five
cardiac cycles are covered. As the number of cycle increases,
the performance improves. However, the performance does not
improve linearly with respect to the number of cycles. It does not
show any improvement as the number of cycles increases from
20 to 30. We drew the following conclusion from this absence of
improvement: cyclostationarity cannot be always improved by
increasing the number of cycles. Therefore, there is a limit to the
degree to which the performance can be improved by increasing
the number of cycles. This proves that noise reduction mainly
depends on the cyclostationarity strength. The statistics show
that an acceptable performance can be achieved, if the number
of cycles is selected to be 10.
VII. PERFORMANCE COMPARISONS
A. Short Description of Our Earlier Study
In our earlier study [4], we made the assumption that heart
sound waveforms are consistent between consecutive cardiac
cycles. NTS was proposed to minimize variations in the timing
of heart sounds, murmurs and ultimately allow the exploita-
tion of the cyclostationarity features of the signal. In the ideal
case that NTS was successful, the enhanced heart sound signal
could be considered to be cyclostationary. Noise was reduced
by obtaining the average of consecutive cardiac cycles, namely,
it operated in joint cycle frequency–time domains. We called
this technique NR-NTS. It is theoretically insensitive to: 1) sta-
tionary noise; 2) zero-mean noise; and 3) cyclostationary noise,
whose cycle frequency does not coincide with that of heart sound
signal. However, first, heart sounds need to be segmented into
first and second heart sounds in order to determine the parame-
ters for NTS. If segmentation is inaccurate or if the assumption
2446 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 57, NO. 10, OCTOBER 2010
TAB LE I V
PERFORMANCE COMPARISONS
that heart sounds are consistent in consecutive cycles is not
valid, this preprocessing phase can have a detrimental effect on
the efficiency of the NR-NTS.
B. Comparison
The noise reduction proposed in this paper is referred to as
NR-FD, since it is based on fuzzy detection. It is performed
in the joint cycle frequency–time–frequency domains, namely,
one more domain, frequency domain, is exploited. To compare
the performance of the two algorithms, we applied them on
the same normal heart sound signal in simulated noise environ-
ments. Correlation coefficients between the reconstructed and
the noise-free are listed in Table IV.
Based on the assumption that heart sound waveforms are con-
sistent between consecutive cardiac cycles [4], the efficiency of
NR-NTS is improved by increasing the number of cycles. How-
ever, this assumption holds for ten cardiac cycles or so, and
while noise is reduced, variations in the heart sound waveform
become more prominent. As listed in Table IV, the correla-
tion coefficient decreased where the number of cardiac cycles
increased from 10 to 30 regardless of whether the signal was
contaminated with 0 dB or 5 dB noise.
Unlike NR-NTS, NR-FD can accept, to some degree, the
variations in the waveform and time delay of the heart sounds
because of the use of fuzzy detection. Both these variations
can be recovered during the reconstruction of the heart sound
signal, and no segmentation is needed as preprocess for NT-FD,
leading us to the conclusion that NR-FD is more robust than
NR-NTS. The performance of NR-FD does not degrade when
heart sound signal covers more than ten cardiac cycles. As a
consequence, the correlation coefficients obtained with NR-FD
are significantly better than those obtained with NR-NTS in 0
dB noise (see Table IV). These features make NR-FD suitable
for the automated analysis of heart sounds. We note, however,
that the correlation coefficients obtained with NR-FD are not
significantly higher than those obtained with NR-NTS in 5dB
noise. The reason behind this performance may be that atoms
of heart sound signal are diluted on the joint plane due to the
presence of excessive noise.
VIII. CONCLUSION
In this study, noise reduction for heart sound signal has been
achieved in the joint cycle frequency–time–frequency domains.
The beginning of cardiac cycles is identified based on the oc-
currence of R waves, and subsequently, each heart sound signal
of a cardiac cycle is linearly time scaled and decomposed into
atoms by means of a Gaussian modulation model. The atoms of
heart sound signal congregate on the joint plane. In contrast, the
atoms of noise and disturbances are dispersed in the joint plane.
The atoms of heart sound signal can then be easily identified
based on fuzzy detection. Variations in both the waveform and
time delay of the heart sound signal can be recovered during the
reconstruction. In practical experiment, heart sound signal was
successfully separated from lung sounds and stethoscope sliding
disturbance produced by chest muscular activities during breath-
ing. Computer simulations are used to test the performance of
the proposed noise reduction for various heart sound signals
contaminated with simulated non-Gaussian, nonstationary, and
colored noise. The statistics show that its performance may be
maintained even for short heart sound signal covering only five
cardiac cycles. Furthermore, the algorithm can be operated in a
rather unsupervised manner.
ACKNOWLEDGMENT
The authors would like to thank the data collectors. The au-
thors would also like to thank the anonymous reviewers for their
valuable comments. The heart sound data used in this paper were
downloaded from the website of the Medicine Department, Uni-
versity of Dundee, Texas Heart Institute at St. Luke’s Episcopal
Hospital.
REFERENCES
[1] Y. Bai and C. Lu, “The embedded digital stethoscope uses the adaptive
noise cancellation filter and the type I Chebyshev IIR bandpass filter to
reduce the noise of the heart sound,” in Proc. 7th Int. Workshop Enter-
prise Netw. Comput. Healthcare Ind. Busan, Korea, Jun. 23–25, 2005,
pp. 278–281.
[2] A. S. Paul, E. A. Wan, and A. T. Nelson, “Noise reduction for heart
sounds using a modified minimum-mean squared error estimator with
ECG gating,” in Proc. 28th Annu. Int. Conf. IEEE Eng. Med. Biol. Soc.
New York, Aug. 30–Sep. 3, 2006, pp. 3385–3390.
[3] R. Beyar, S. Levkovitz, S Braun, and Y Palti, “Heart-sound processing by
average and variance calculation physiologic basic and clinical impli-
cations,” IEEE Trans. Biomed. Eng., vol. BME-31, no. 9, pp. 591–596,
Sep. 1984.
[4] H. Tang, T. Li, and T. Qiu, “Noise and disturbance reduction in cycle
frequency domain based on non-linear time scaling,” IEEE Trans. Biomed.
Eng., vol. 57, no. 2, pp. 325–333, Feb. 2010.
[5] J. Xu, L. Durand, and P. Pibarot, “Nonlinear transient chirp signal model-
ing of the aortic and pulmonary components of the second heart sound,”
IEEE Trans. Biomed. Eng., vol. 47, no. 7, pp. 1328–1335, Jul. 2000.
[6] J. Xu, L. Durand, and P. Pibarot, “Extraction of the aortic and pulmonary
components of the second heart sound using a nonlinear transient chirp
signal model,” IEEE Trans. Biomed. Eng., vol. 48, no. 3, pp. 277–283,
Mar. 2001.
[7] A. Baykal, Y. Z. Ider, and H. Koymen, “Distribution of aortic mechanical
prosthetic valve closure sound model parameters on the surface of the
chest,” IEEE Trans. Biomed. Eng., vol. 42, no. 4, pp. 358–370, Apr. 1995.
[8] H. Koymen, B. K. Altay, and Y. Z. Ider, “A study of prosthetic heart valve
sounds,” IEEE Trans. Biomed. Eng., vol. BME-34, no. 11, pp. 853–863,
Nov. 1987.
[9] H. P. Sava and J. T. E. McDonnell, “Modified forward-backward overde-
termined Prony’s method and its application in modeling heart sounds,”
IEE Proc.—Vis. Image Signal Process, vol. 142, no. 6, pp. 375–380, Jun.
1995.
[10] T. S. Leung, P. R. White, W. B. Cook et al., “Analysis of the second heart
sound for diagnosis of paediatric heart disease,” IEEE proc.-Sci. Meas.
Technol., vol. 145, no. 6, pp. 285–290, Jun. 1998.
[11] A. K. Baros and N. Ohnishi, “Heart instantaneous frequency (HIF): An
alternative approach to extract heart rate variability,” IEEE Trans. Biomed.
Eng., vol. 48, no. 8, pp. 850–855, Aug. 2001.
[12] S. M. Debbal and F. Bereksi-Reguig, “Computerized heart sounds analy-
sis,” Comput. Biol. Med., vol. 38, pp. 263–280, 2008.
TANG et al.: SEPARATION OF HEART SOUND SIGNAL FROM NOISE IN JOINT CYCLE FREQUENCY–TIME–FREQUENCY DOMAINS 2447
Hong Tang (M’10) received the B.S. degree
in mechanical manufacture and automation from
Zhongyuan University of Technology, Zhengzhou,
Henan, China, in 2000, the M.S. degree in biomed-
ical engineering from Jilin University, Changchun,
Jilin, China, in 2003, and the Ph.D. degree in sig-
nal processing from the Dalian University of Tech-
nology (DUT), Dalian, Liaoning, China, in 2006,
respectively.
From 2006 to 2007, he was a Research Fellow
in the Regional Innovation Center, Yeungnam Uni-
versity, Korea. He is currently an Assistant Professor at DUT. His research
interests include non-Gaussian signal processing, biomedical signal processing,
and wireless location.
Ting Li received the B.S. degree in electronic en-
gineering, and the M.S. degree in signal process-
ing, both from the Dalian University of Technol-
ogy (DUT), Dalian, China, in 2002 and 2005,
respectively.
She is currently an Assistant Professor at Dalian
Nationalities University, Dalian. Her research in-
terests include non-Gaussian signal processing and
biomedical signal processing.
Yongwan Park received the B.E. and M.E. degrees
in electrical engineering from Kyungpook University,
Daegu, Korea, in 1982 and 1984, respectively, and the
M.S. and Ph.D. degrees in electrical engineering from
the State University of New York, Buffalo, in 1989
and 1992, respectively.
From 1992 to 1993, he was a Research Fellow at
the California Institute of Technology. From 1994 to
1996, he was a Chief Researcher at SK Telecom, Ko-
rea, where he was involved in developing IMT-2000
system. Since September 1996, he has been a Pro-
fessor of information and communication engineering at Yeungnam University,
Gyeongsan, Korea. His current research interests include beyond 3G/4G sys-
tem, orthogonal frequency-division multiplexing system, peak-to-average ratio
reduction, and biomedical signal processing, etc.
Tianshuang Qiu (M’97) received the B.S. degree
from Tianjin University, Tianjin, China, in 1983, the
M.S. degree from Dalian University of Technology,
Dalian, China, in 1993, and the Ph.D. degree from
Southeastern University, Nanjing, China, in 1996, all
in electrical engineering.
He was a Research Scientist at Dalian Institute
of Chemical Physics, Chinese Academy of Sciences,
Dalian during 1983 through 1996. He was with the
faculty of electrical engineering, Dalian Railway In-
stitute during 1996. He was a Postdoctoral Researcher
in the Department of Electrical Engineering, Northern Illinois University,
DeKalb. He is currently a Professor in the Department of Electronic Engineering,
Dalian University of Technology. His research interests include non-Gaussian
and nonstationary signal processing, radio-frequency signal processing, and
biomedical signal processing.
... The method used is the time-frequency domain time-frequency characteristics using Fourier Transform and its derivative [8][9][10][11][12][13] or wavelet [10,[14][15][16]. In case of CHD, the researchers are aimed to repeal the information from heart sounds, namely detection [1, 2, 8-10, 12, 14, 15, 17-20]; segmentation [15,16,[19][20][21][22][23][24][25][26][27][28][29]; feature extraction [8-12, 14-16, 21-36]; classification [30][31][32][33][34][35][36] and estimation [16,[26][27][28]36]. The development of research on portable or mobile heart sounds monitoring devices [1,37] or wearable devices Content courtesy of Springer Nature, terms of use apply. ...
... • We validated ADSAE with the CHD size provided by the dataset. We also expand our simulation to test the performance of the proposed algorithm by comparing our results equitably with SVM [15], EM and FC [29] in terms of accuracy and F1 score in terms of type 2 error. • We also discuss the future direction of this research and its possible use in IoT healthcare. ...
... Previous works were concerned with feature extraction of heart sound signals. The process may or may not involve segmentation [15,16,[19][20][21][22][23][24][25][26][27][28][29]. Segmentation is a process to separate heart sounds signals into four major parts, namely: S1 and S2, or systole and diastole [3]. ...
Article
Full-text available
Healthcare facilities for diagnosing congenital heart defects (CHD) in archipelagic areas such as Indonesia face a tradeoff between the number of instruments and reducing costs. IoT is a perfect solution for remote healthcare due to its potential for low-cost development at scale. This paper describes a breakthrough solution for sizing defects by decoding information from heart sounds. The heart sounds contain information that can be translated into features through exact or heuristic methods used in previous work on CHD. This potential can be further revealed by the heuristic method proposed in this paper. Moreover, heart sound recording technology is well-established and available in the market at low prices. The proposed method decodes the information puzzle in heart sounds by using a new feature extraction process that converts the feature into atoms. Our method is executed in several stages. First, the heart sound signal is divided into two parts according to the systole and diastole intervals in each cardiac cycle. Second, we extract features using correlations among segments, which are the cross-correlation between systole and diastole segments and autocorrelation among diastole segments. Both processes generate eigenvalues for creating atoms in a planar plane. The last step is candidate selection for sizing CHD, which is determined by the Euclidian distance of atoms to the center of gravity (COG). We use a reference size as the baseline and threshold for the smallest Root Mean Square Error (RMSE) to speed up computation. After some training, we found that the Euclidian mean between atoms COG is the best candidate with RMSE < 0.5. Therefore, this method is called Average Distance Scattered Atoms of Eigenvalues (ADSAE). We conducted experiments on 30 samples and compared our results with Space Vector Machine (SVM), Fuzzy Clustering (FC), and Eclipse Method (EM) in terms of accuracy and F1 scores involving small tolerances. We found that ADSAE was superior to SVM, FC, and EM, especially for small defect sizes. However, for large defects, SVM was the most superior, followed by FC, ADSAE, and EM. ADSAE is limited to sizing only and does not have the capability to determine the shape and depth of defects. Nevertheless, ADSAE is a simple method suited for massive IoT devices for CHD healthcare.
... Pure cardiac auscultation signals are usually not accessible due to the fact that both signals originate from proximal anatomical sites [37]. Thus, the measured signals are often a mixture of heart and lung sounds, and they are characterized by intrusive quasi-periodic interference in both the temporal and spectral domains [42]. Even experienced cardiologists are sometimes perplexed by the diastolic murmur-shaped breathing sounds present in a typical HS recording, coupled with low pitched wheeze and bronchial breathing [35]. ...
... Due to the transitory nature of heart sound signals, they are susceptible to irregular ambient noise components that are randomly distributed in the time and frequency domain, such as children wailing in the background, people talking, hospital activity sounds, street announcement sounds, etc. [42]. Another potential source of noise is the sliding motions of the stethoscope diaphragms as a result of the physician relocating the recording spot, the patient becoming agitated, or accidental displacement [43]. ...
... Another potential source of noise is the sliding motions of the stethoscope diaphragms as a result of the physician relocating the recording spot, the patient becoming agitated, or accidental displacement [43]. These noise components are often of high amplitude, last for a short duration, and introduce substantial time-frequency overlap between heart sound signals [42]. These effects add to the complexity of the analysis and make it more cumbersome for the physicians to extract relevant diagnostic information. ...
Article
Full-text available
Objective: The heart sound signals captured via a digital stethoscope are often distorted by environmental and physiological noise, altering their salient and critical properties. The problem is exacerbated in crowded low-resource hospital settings with high noise levels which degrades the diagnostic performance. In this study, we present a novel deep encoder-decoder-based denoising architecture (LU-Net) to suppress ambient and internal lung sound noises. Methods: Training is done using a large benchmark PCG dataset mixed with physiological noise, i.e., breathing sounds. Two different noisy datasets were prepared for experimental evaluation by mixing unseen lung sounds and hospital ambient noises with the clean heart sound recordings. We also used the inherently noisy portion of the PASCAL heart sound dataset for evaluation. Results: The proposed framework showed effective suppression of background noises in both unseen real-world data and synthetically generated noisy heart sound recordings, improving the signal-to-noise ratio (SNR) level by 5.575 dB on an average using only 1.32 M parameters. The proposed model outperforms the current state-of-the-art U-Net model with an average SNR improvement of 5.613 dB and 5.537 dB in the presence of lung sound and unseen hospital noise, respectively. LU-Net also outperformed the state-of-the-art Fully Convolutional Network (FCN) by 1.750 dB and 1.748 dB for lung sound and unseen hospital noise conditions, respectively. In addition, the proposed denoising method model improves classification accuracy by 38.93% in the noisy portion of the PASCAL heart sound dataset. Conclusion: The results presented in the paper indicate that our proposed architecture demonstrated a robust denoising performance on different datasets with diverse levels and characteristics of noise. Significance: The proposed deep learning-based PCG denoising approach is a pioneering study that can significantly improve the accuracy of computer-aided auscultation systems for detecting cardiac diseases in noisy, low-resource hospitals and underserved communities.
... Empirical mode decomposition (EMD) technique and prediction algorithm was used in [13] for reduction of heart sound interference from the recording of lung sound signals. Based on fuzzy detection, a cardiac sound extraction algorithm was developed in the joint cycle frequency-time-frequency domains [14]. In [15], the extraction of cardiac sound was studied using ensemble EMD and Kurtosis features. ...
... In PC-DAE, the periodicity of heart and lung sound was analyzed by the modulation frequency analysis (MFA) to separate feature submatrices derived by a DAE. The quasi-cyclostationarity measurement was first introduced to the separation of heart sound signal from noise in [14]. It was later exploited in the task of heart sound cancellation from lung sound [11]. ...
... As aforementioned, there are also many methods aiming at deriving cardiac sounds or lung sounds only. We chose a cardiac sound extraction method in [14] and a lung sound extraction method in [13] for comparison. The normal and AS testing datasets were used, and the results are shown in table 5. ...
Article
Full-text available
Objective: Cardiopulmonary auscultation is promising to get smart due to the emerging of electronic stethoscopes. Cardiac and lung sounds often appear mixed at both time and frequency domain, hence deteriorating the auscultation quality and the further diagnosis performance. The conventional cardiopulmonary sound separation methods may be challenged by the diversity in cardiac/lung sounds. In this study, the data-driven feature learning advantage of deep autoencoder and the common quasi-cyclostationarity characteristic are exploited for monaural separation. Approach: Different from most of the existing separation methods that only handle the amplitude of short-time Fourier transform (STFT) spectrum, a complex-valued U-net (CUnet) with deep autoencoder structure, is built to fully exploit both the amplitude and phase information. As a common characteristic of cardiopulmonary sounds, quasi-cyclostationarity of cardiac sound is involved in the loss function for training. Main results: In experiments to separate cardiac/lung sounds for heart valve disorder auscultation, the averaged achieved signal distortion ratio (SDR), signal interference ratio (SIR), and signal artifact ratio (SAR) in cardiac sounds are 7.84 dB, 21.72 dB, and 8.06 dB, respectively. The detection accuracy of aortic stenosis can be raised from 92.21% to 97.90%. Significance: The proposed method can promote the cardiopulmonary sound separation performance, and may improve the detection accuracy for cardiopulmonary diseases.
... Pure cardiac auscultation signals are usually not accessible due to the fact that both signals originate from proximal anatomical sites [25]. Thus, the measured signals are often a mixture of heart and lung sounds, and they are characterized by intrusive quasi-periodic interference in both the temporal and spectral domains [30]. Even experienced cardiologists are sometimes perplexed by the diastolic murmurshaped breathing sounds present in a typical HS recording, coupled with low pitched wheeze and bronchial breathing [23]. ...
... Due to the transitory nature of heart sound signals, they are susceptible to irregular ambient noise components that are randomly distributed in the time and frequency domain, such as children wailing in the background, people talking, hospital activity sounds, street announcement sounds, etc. [30]. Another potential source of noise is the sliding motions of the stethoscope diaphragms as a result of the physician relocating the recording spot, the patient becoming agitated, or accidental displacement [31]. ...
... Another potential source of noise is the sliding motions of the stethoscope diaphragms as a result of the physician relocating the recording spot, the patient becoming agitated, or accidental displacement [31]. These noise components are often of high amplitude, last for a short duration, and introduce substantial time-frequency overlap between heart sound signals [30]. These effects add to the complexity of the analysis and make it more cumbersome for the physicians to extract relevant diagnostic information. ...
Preprint
Full-text available
p>Objective: The heart sound signals captured via a digital stethoscope are often distorted by environmental and physiological noise, altering their salient and critical properties. The problem is exacerbated in crowded low-resource hospital settings with high noise levels which degrades the diagnostic performance. In this study, we present a novel deep encoder-decoder based denoising architecture (LU-Net) to suppress ambient and internal lung sound noises. Methods: Training is done using a large benchmark PCG dataset mixed with physiological noise, i.e., breathing sounds. Two different noisy datasets were prepared for experimental evaluation by mixing unseen lung sounds and hospital ambient noises with the clean heart sound recordings. We also use the inherently noisy portion of the PASCAL heart sound dataset for evaluation. Results: The proposed framework showed effective suppression of background noises in both un?seen real-world data and synthetically generated noisy heart sound recordings, improving the signal-to-noise ratio (SNR) level by 5.575 dB on an average using only 1.32 M parameters. The proposed model outperforms the current state-of-the-art U-Net model with an average SNR improvement of 5.613 dB and 5.537 dB in the presence of lung sound and unseen hospital noise, respectively. LU-Net also outperformed the state-of-the-art Fully Convolutional Network (FCN) by 1.750 dB and 1.748 dB for lung sound and unseen hospital noise conditions, respectively. In addition, the proposed denoising method model improves classification accuracy by 38.93% in the noisy portion of the PASCAL heart sound dataset. Conclusion: The results presented in the paper indicate that our proposed architecture demonstrated a robust denoising performance on different datasets with diverse levels and characteristics of noise. Significance: The proposed deep learning-based PCG denoising approach is a pioneering study that can significantly improve the accuracy of computer-aided auscultation systems for detecting cardiac diseases in noisy, low-resource hospitals and underserved communities. </p
... Many researchers have proposed a variety of methods [18] for heart sound denoising, either based on hardware or software solutions. Among software solutions, a first class of algorithms is based on thresholding applied in transform domains, including Short Time Fourier Transform [19][20][21][22], Fast Fourier Transform [23][24][25], Wavelet transform [19,[26][27][28][29][30][31][32][33][34][35], and Convolution Wavelet Packet Transform [36][37][38][39]. However, the choice of the threshold function [35] strongly affects the denoising effect. ...
... Since every Chebyshev Polynomial T j (ω) in contained in the interval [−1, +1], the maximum errors are given by (see also (21) and (25)): ...
Article
Full-text available
Auscultation of heart sounds is important to perform cardiovascular assessment. External noises may limit heart sound perception. In addition, heart sound bandwidth is concentrated at very low frequencies, where the human ear has poor sensitivity. Therefore, the acoustic perception of the operator can be significantly improved by shifting the heart sound spectrum toward higher frequencies. This study proposes a real-time frequency shifter based on the Hilbert transform. Key system components are the Hilbert transformer implemented as a Finite Impulse Response (FIR) filter, and a Direct Digital Frequency Synthesizer (DDFS), which allows agile modification of the frequency shift. The frequency shifter has been implemented on a VLIW Digital Signal Processor (DSP) by devising a novel piecewise quadratic approximation technique for efficient DDFS implementation. The performance has been compared with other DDFS implementations both considering piecewise linear technique and sine/cosine standard library functions of the DSP. Piecewise techniques allow a more than 50% reduction in execution time compared to the DSP library. Piecewise quadratic technique also allows a more than 50% reduction in total required memory size in comparison to the piecewise linear. The theoretical analysis of the dynamic power dissipation exhibits a more than 20% reduction using piecewise techniques with respect to the DSP library. The real-time operation has been also verified on the DSK6713 rapid prototyping board by Texas Instruments C6713 DSP. Audiologic tests have also been performed to assess the actual improvement of heart sound perception. To this aim, heart sound recordings were corrupted by additive white Gaussian noise, crowded street noise, and helicopter noise, with different signal-to-noise ratios. All recordings were collected from public databases. Statistical analyses of the audiological test results confirm that the proposed approach provides a clear improvement in heartbeat perception in noisy environments.
... The heart sound signal is generated by the vibration of the whole heart structure caused by the opening and closing of the heart valves. 1 It is an important physiological signal in the human body and has an important role in the prevention and diagnosis of cardiovascular diseases. 2 At present, domestic and international hospitals still use manual auscultation in the diagnosis of heart sound, which is susceptible to both the interference of the external environment and personal experience. ...
Article
Full-text available
Introduction Heart sound signal is an important physiological signal of human body, and the identification and research of heart sound signal is of great significance. Methods For abnormal heart sound signal recognition, an abnormal heart sound recognition system, combining hidden semi-Markov models (HSMM) with deep neural networks, is proposed. Firstly, HSMM is used to build a heart sound segmentation model to accurately segment the heart sound signal, and then the segmented heart sound signal is subjected to feature extraction. Finally, the trained deep neural network model is used for recognition. Results Compared with other methods, this method has a relatively small amount of input feature data and high accuracy, fast recognition speed. Discussion HSMM combined with deep neural network is expected to be deployed on smart mobile devices for telemedicine detection.
Chapter
Cardiac PCG signal recordings are an important part of cardiology teleconsultations. The main problems related to high-quality recordings are due to less experienced healthcare personnel taking lower-quality samples and ambient noise, and these scenarios can lead to errors in diagnosis by the physician and PCG heart sound classification algorithms. Given this problem, machine learning algorithms were proposed for quality classification of PCG recordings that aid in accurate diagnosis and faster care. One difficulty in the application of these algorithms is the problems related to class imbalance, which is very common in medical applications that affect model performance. In this study, an artificial neural network (ANN) classifier with a SMOTE Tomek Links class imbalance method is used. Public databases containing 7893 recordings with ten features of each PCG signal are used. We use two types of labeling which have different levels of imbalance. The use of SMOTE Tomek Links in combination with neural networks showed better performance compared to SVM classifier. For future work, we intend to perform laboratory tests in remote areas applying the proposed algorithm and we also intend to use the concept of mel spectrograms and convolutional networks for the classification of heart sounds.KeywordsQuality classification PCGAuscultationData imbalanceNeural artificial networksThink health project
Article
Full-text available
Through an investigation of various clinical cases, heart sounds are found to be quasi-cyclostationary. Nonlinear time scaling from cycle-to-cycle is proposed to enhance cyclic stationarity, where nonlinear time scaling is approximated by a piecewise linear function. The techniques of cyclostationary signal processing are employed in this paper to reduce noise and disturbance in the cycle-frequency domain. Heart sounds can be theoretically recovered in the presence of additive, zero mean noise, and disturbance (perhaps non-Gaussian, nonstationary, or colored). The experimental tests in various conditions confirm the theoretical results.
Article
Full-text available
Our study focuses on a new method of estimating the heart rate variability (HRV) which does not require the use of electrocardiogram (ECG) R-wave detection. Contrary to the R-wave detection method which requires a sampling frequency higher than 100 Hz, the one proposed here can be used to calculate the HRV from an ECG signal sampled at a frequency of approximately 5 Hz with a relative mean error of 0.03. This new method is based on extracting the instantaneous fundamental frequency from the ECG. The method could be efficiently used to extract the HRV from an ECG measured for healthy subjects performing an exercise in which the HRV increases linearly with time, and for subjects with respiratory and cardiac problems. The overall error decreased as we low-pass filtered the HRV with lower cut-off frequencies. Moreover, it was shown that the method could be efficiently used to calculate the HRV from blood pressure measurements and to be robust to noise.
Article
Full-text available
In this paper, we present a method for single channel noise reduction of heart sound recordings. Multiple noise sources, such as lung sounds, muscle contraction, and background noise can contaminate the heart sound collection making subsequent analysis difficult. Our approach is based on a spectral domain minimum-mean squared error (MMSE) estimation, originally introduced by Ephraim and Malah in the context of speech enhancement. This method uses a "decision-directed" approach to estimate the noise spectrum without the need for a separate reference signal. The noise spectrum is used to compute the SNR on-line for adapting the Wiener filter gain applied to the spectral amplitudes. A number of modifications are made to the baseline algorithm to increase the level of noise reduction while simultaneously reducing signal distortion. Enhancements include the use of a "soft" threshold to determine when to update the noise spectrum, a forward-backward filtering implementation (i.e., smoothing), and a "second-pass" iterative estimation scheme in which the residual noise is used to re-estimate the SNR and update the Wiener gains. In addition, ECG analysis is used to provide gating information on when desired heart sounds may be present in order to further guide the noise spectral estimation procedure. The noise reduction algorithm is tested as a front-end to an automatic heart sound analysis system. The sounds are collected through two sensors that act simultaneously as microphones and ECG electrodes. The proposed algorithm demonstrates improvements over existing noise reduction approaches in terms of SNR gain, qualitative evaluations, and automatic detection of abnormalities present in the heart sounds.
Article
The objective of this paper is to adapt and validate a nonlinear transient chirp signal modeling approach for the analysis and synthesis of overlapping aortic (A2) and pulmonary (P2) components of the second heart sound (S2). The approach is based on the time-frequency representation of multicomponent signals for estimating and reconstructing the instantaneous phase and amplitude functions of each component. To evaluate the accuracy of the approach, a simulated S2 with A2 and P2 components having different overlapping intervals (5-30 ms) was synthesized. The simulation results show that the technique is very effective for extracting the two components, even in the presence of noise (-15 dB). The normalized root-mean-squared error between the original A2 and P2 components and their reconstructed versions varied between 1% and 6%, proportionally to the duration of the overlapping interval, and it increased by less than 2% in the presence of noise. The validated technique was then applied to S2 components recorded in pigs under normal or high pulmonary artery pressures. The results show that this approach can successfully isolate and extract overlapping A2 and P2 components from successive S2 recordings obtained from different heartbeats of the same animal as well from different animals.
Article
This paper is concerned with a synthesis study of the fast Fourier transform (FFT), the short-time Fourier transform (STFT), the Wigner distribution (WD) and the wavelet transform (WT) in analysing the phonocardiogram signal (PCG). It is shown that these transforms provide enough features of the PCG signals that will help clinics to obtain qualitative and quantitative measurements of the time-frequency (TF) PCG signal characteristics and consequently aid diagnosis. Similarly, it is shown that the frequency content of such a signal can be determined by the FFT without difficulties. The studied techniques (FT, STFT, WD, CWT, DWT and PWT) of analysis can thus be regarded as complementary in the TF analysis of the PCG signal; each will relate to a part distinct from the analysis in question.
Conference Paper
In this paper, we propose a design and implementation of an embedded digital stethoscope that uses the adaptive noise cancellation filter and the Type I Chebyshev IIR bandpass filter to reduce the noise of the heart sound. We integrate a traditional stethoscope, two microphones, an amplifier, an analog to digital conversion, a DSP board, and embedded board. First, the system acquits the heart sound, amplify, digitize, and input into the DSP board for noise reduction by using of the adaptive noise cancellation and the IIR bandpass filter. Then, the preprocessed heart sound signals are send into the embedded board for the LCD displaying and the interface to PC. Overall, we design the digital filter, interface circuit, displaying driver, graphical user interface. The prototype system integration of the hardware and software modules shows the stable operation with a range of adjustments.
Article
Prony's method is found to be a very effective method for the analysis-synthesis of transient data. However, straightforward application of this method can lead to poor performance, especially for short and noisy data records. The authors present a new over-determined forward-backward Prony method (MFBPM) and its application to the analysis of the first and second heart sounds. The accuracy of the method is measured using both cross-correlation and the normalised-mean-square-error (NMRSE) between a real signal and a synthetic one. Results from more than 80 different subjects show that the MFBPM is highly stable and gives very good performance with an average cross-correlation coefficient of 99.62%. Comparison of the results based on the NMRSE criterion show that the MFBPM is more precise than the modified backward Prony method (MBPM) with an accuracy improvement of upto 10%, and upto 20%, when compared with the conventional forward-backward Prony method (FBPM). Furthermore, a new method for dynamic estimation of model order is proposed for the case of heart sounds based on a subset of synthesised heart sounds which best approximates the observed data using NMRSE
Article
The second heart sound (S2) consists of two major components, one due to the closure of the aortic valve (A2) and the other due to the closure of the pulmonary valve (P2). The aortic valve normally closes before the pulmonary valve and leads to a time delay between the two sounds. This delay is known as the “split” in the medical community and is of significant diagnostic importance. The authors aim to develop an automatic technique to measure the split and to compare two common splitting patterns (i.e. variable splitting and fixed splitting), in a quantitative manner. A signal model for S2 is proposed. Accordingly, S2 is decomposed into a number of components with an algorithm based on the time-frequency distribution of the signal. A2 and P2 are selected from the model components and from these the split can be estimated. Groups of patients with the two splitting patterns have been investigated. For each patient, the splits of 20 successive cardiac cycles are measured, their mean and standard deviation are then calculated and used to characterise the two splitting patterns. It is found that the two simple statistical quantities can be used to identify the splitting patterns and hence offer important diagnostic information
Article
In this paper a new mechanism is proposed for the generation of phonocardiogram (PCG) sounds from implanted mechanical prosthetic heart valves. The structures in the chest, the heart, its partitions, and major vessels, constitute a frequency selective system excited by the rapidly decelerating valve occluder. It is shown that the source, the rapidly decelerating valve, has a wide and flat power spectrum and hence is an impulsive excitation that couples energy to the resonance modes specified by the structures in the chest. Consequently, the PCG signal is composed of decaying sinusoids. The parameters of the decaying sinusoids are estimated, and it is observed that the power spectra of the PCG signals have two dominant peaks in the frequency band of 200-500 Hz. The energy coupled to these two modes depends on the state of the valve. With thrombus the decelerating occluder slows down and becomes a broader pulse concentrating the energy to the lower resonance mode. This is verified by experiments on 30 patients during postoperative time course. However, no significant change in the resonance frequencies are observed which is an evidence for their anatomical and not valvular dependence.
Article
A new statistical method for heart-sound processing was developed and tested on normal subjects and on patients suffering from various cardiac pathologies. The method is effective in decreasing noise and in separating heart sounds from murmurs, as well as in deriving new physiological parameters. The theory is based on the assumption that heart sounds can be classified into deterministic and nondeterministic sounds. The processing results in a very significant attenuation of strong murmurs, while the deterministic events, such as SI-S4, are only slightly affected. The method includes dividing the heart-sound signal into a set of repetitive signals (ensemble) according to the trigger selected to be the peak of the ECG R-wave. The variability of the time elapsed from the trigger to the evoked sound is defined as the jitter. The average and variance functions are calculated from the ensemble. Calculation of the heartsound jitter from the average and variance functions shows a jitter of 5.5 ms ±2.6 ms for S<sub>1</sub>, and 8.2 ms ±3.3 ms for S<sub>2</sub>. The jitter, which is an objective parameter of the trigger-response linkage, can be used experimentally to clarify some of the cardiac electromechanical mechanisms, and it may have diagnostic value.