ArticlePDF Available

HIGH RESOLUTION NEARLY-ML ESTIMATION OF SINUSOIDS IN NOISE USING A FAST FREQUENCY DOMAIN APPROACH

January 1998

January 1998

Authors:

University of Strathclyde

Estimating the frequencies, amplitudes and phases of sinusoids in noise is a problem which arises in many applications. The aim of the methods in this paper is to achieve computational eciency and near-ML perfor-mance (i.e. low bias, variance and threshold SNR), in problems such as vibration or audio analysis where the number of tones may be large (e.g. > 20). An approach has recently been published for resolved tones [4]. This paper extends that frequency domain approach t o the high-resolution problem.

Two tone case. Bound on frequency estimate rms error (i.e. p CRB) normalised to single-tone CRB, plotted against frequency separation in`binsin`bins'. Solid line: CRB of full ML estimator; Dashed line: CRB of frequency domain estimator using only 9 DFS samples.

…

Two tone case. Bound on frequency estimate rms error (i.e. p CRB) normalised to single-tone CRB, plotted against frequency separation in`binsin`bins'. A: = =2; B: = =4; C: = =8; D: = =16; E: = 0.

…

Figures - uploaded by Malcolm David Macleod

Content may be subject to copyright.

Content uploaded by Malcolm David Macleod

Content may be subject to copyright.

HIGH RESOLUTION NEARLY-ML ESTIMATION OF

SINUSOIDS IN NOISE USING A FAST FREQUENCY

DOMAIN APPROACH

Dr Malcolm D. Macleod

Cambridge University Engineering Department, Trumpington Street,

CAMBRIDGE, CB2 1PZ, UK Tel: +44 1223 332671; fax: +44 1223 332662

e-mail: mdm@eng.cam.ac.uk

ABSTRACT

Estimating the frequencies, amplitudes and phases of

sinusoids in noise is a problem which arises in many

applications. The aim of the metho ds in this paper is

to achieve computational eciency and near-ML perfor-

mance (i.e. low bias, variance and threshold SNR), in

problems such as vibration or audio analysis where the

number of tones may be large (e.g.

20). An approach

has recently been published for resolved tones [4]. This

paper extends that frequency domain approachtothe

high-resolution problem.

1 INTRODUCTION

The estimation of the frequencies, amplitudes and

phases of sinusoids in noise is important in many ap-

plications, including radar, sonar, instrumentation, and

audio analysis. In such applications, many of the tones

may not be resolved (i.e. their frequency separations

may be

=N

rad/sample, where

is the block-

length). In cases like these, esp ecially where the number

of tones is large, Maximum Likelihood (ML) estimation

is usually rejected [1] because it requires computation-

ally expensive non-linear optimisation. A recent algo-

rithm [3,4] uses frequency-domain interpolators, cou-

pled with a simple non-linear optimisation strategy,

to obtain nearly-ML estimates in the case of resolved

multiple tones (frequencies separated by at least 4

=N

rad/sample). This pap er extends these results to the

high-resolution case, giving very nearly ML estimates

with much reduced computation.

1.1 Problem denition

The observed discrete signal

is modelled as

, where

is the sum of

cisoids

(complex sinu-

soids) and

is zero-mean complex noise [1] of variance



, with independent real and imaginary parts, eachof

variance



2. If the cisoids have amplitudes

, phases



, and frequencies

radians per sample,

can be

written as

exp(

)

;

(1)

where

exp(



) is the

complex amplitude

of the

cisoid. Let the

-sample data blocks be written

as column vectors,

; :::; y



]

, etc., and de-

ne parameter vectors

= [

; :::; b

]

and

[

; :::; !

]

. The problem, assuming

is known,

is to estimate

and

, given

. (Pure real sig-

nals, and the problem of estimating

, are considered

later).

Let matrix

have columns which are the cisoid basis

functions at frequencies

:::!

(

) = [

(

)

;

(

)

; :::;

(

)], where

(

) =

;

exp(

)

; :::;

exp(

(



)]

. Then the signal model

(1) can be written

(

)

. When the noise

white (i.i.d.) Gaussian, the Maximum Likelihood (ML)

estimate of the parameters (

;

) is the one [1] which

minimises



)

, the sum of squared errors

(SSE) between the estimated signal ^

)

and the

observed signal

. For any given estimate ^

, the ML

estimate of

is given by [1]

)=(

(

)

(

))



)

(2)

and the joint ML estimate of

and

is found by max-

imising



(

)(

)

))



)

(3)

by searching over the

-dimensional ^

. For the single-

tone case (

=1),

), and the ML estimate

of the scalar ^

is obtained [1] by maximising the

peri-

odogram

of the signal

) =

)(

)

))



)

=(1

)

;

since

)

. From (2), the ML estimate of b

) = (1

)

(

), where

(

) =

)

is the

DTFT of

. The DTFT of a single cisoid at frequency

(

)

(



), where

(

) =



exp(



jk!

)

=exp(



((



2))

sin(

!N=

sin(

(4)

is a form of the Dirichlet kernel. It has the properties:

(0) =

;

(

)=0if

= 0; and

(

)





. A traditional way to estimate ^

is perform

a coarse search for the periodogram peak, using a zero-

padded DFT, and then rene the estimate by optimisa-

tion. A more ecient approach [3,4] is to lo cate the peak

in the standard DFT and estimate ^

using a closed-form

interpolator in the discrete frequency domain.

For

M >

1 the non-linear searchover ^

is in general

computationally intensive. The elements of the matrix

(

)

(

), whose inverse appears in (3), are

(

)

(



)

(5)

From (4), the diagonal elements

, and



,so

is Hermitian.

1.2 Low resolution multiple tone ML analysis





(where

=N

), the o-diagonal

elements

are much smaller than the diagonal el-

ements, and the

and

tones produce resolved

peaks in the periodogram. Simple application of a

single-tone estimator to each peak gives biased estimates

[2], caused by the non-zero o-diagonal elements of

Provided the tone frequencies are separated by at least

(2 `bins'), the bias may be removed [3,4] by a compu-

tationally simple iterative optimisation procedure which

converges rapidly.

2 HIGH RESOLUTION ANALYSIS

The key to the new high resolution approach is to recog-

nise that in typical multi-tone high-resolution problems,

some

of the tones will be resolved, while others will be in

`clusters' with frequency separations

=N

.Assume

that the frequencies are indexed so that

< !

:::<!

.Dene a `cluster' of

tones, with frequencies

::: !



,by the property that the frequency sepa-

ration between any tone in the cluster and any tone not

in the cluster is much greater than

. That is, for any

; j





1, and

;n<j

n>j



wehave



j

, hence

(



)

j

Assume that in a given case there are

clusters. If

matrix elements with magnitudes



are regarded

as negligible, the matrix

has approximately the fol-

lowing structure (illustrated for the example of

'clusters'):

0 0

0 A

0 0 A

(6)

in which the square sub-matrices

correspond

to the clusters, and have non-negligible o-diagonal el-

ements. The overall maximisation in (3) can then be

achieved by independently maximising, for each cluster

; :::; K

, the function



(

)(

(

)

(

))



(

)

;

(7)

where

; :::; !



]

contains the frequencies of

the L tones in cluster k. Typically many of the `clusters'

will be single isolated tones, so the maximisation (7)

associated with the corresponding submatrix of size 1x1

will be achieved by fast single tone estimation [4].

This reduces the number of

parameters

in each min-

imisation, but computation of

(

)

in (7) still re-

quires

multiplications and additions. A substantial

further improvement can be obtained by extending the

frequency domain approach proposed in [3,4].

2.1 Frequency domain computation

Since the DFT is a linear transform, the ML estimation

task can be formulated equivalently in the discrete fre-

quency domain. Sp ecically,



) in (3) can be shown

to be equal to



)=(1

)



(

)(



)



))



)

(8)

where

is the DFT of

and



) is the column-by-

column DFT of

). Similarly,



) in (7) has a

frequency domain equivalent of the form of (8). The

column of



) in (8), being the DFT,

(

), of the

cisoid

(

), is of the form

(



). The ma jor-

ity of the \energy" (sum squared modulus) of

(

)is

contained in only a few samples centred around the fre-

quency

. We showed in [3,4] that for single tones, the

use of only 5 DFT samples gives estimates very close to

the true ML estimates; this reduces computation in the

ratio 5

, whichisvery signicant for large

. The size

of window required for multi-tone clusters is discussed

below.

Computation of

(

) is /em not carried out by com-

puting the DFT of

(

), but by the much more ecient

direct evaluation of

(



) using (4). Other ad-

vantages of the frequency domain approach [4] are that

it remains near-optimal in non-Gaussian input noise

and/or coloured noise, for typical large values of

Pure real (as opposed to complex) signals are handled by

a simple extension of the above procedure [4]. Only the

parameters of positive frequency tones are estimated,

and corresponding negative frequencies are inferred.

2.2 Size of frequency domain window

For multi-tone clusters, the number of terms of

(

)

needed to achieve accurate estimates can be

determined by extending the Cramer-Rao bound (CRB)

calculation approach outlined in [4]. For example, con-

sider the case of two tones. The solid line in Figure 1

shows the CRB for frequency estimation of one of the

tones, normalised to the single-tone CRB for that tone,

and plotted against the frequency dierence between the

two tones, for the worst case relative phase between the

two tones (as shown in [2]).

10−2 10−1 100101

100

101

102

103

RMS ERROR RATIO

FREQUENCY SEPARATION (DFS BINS)

Fig 1. Two tone case. Bound on frequency estimate

rms error (i.e.

CRB) normalised to single-tone CRB,

plotted against frequency separation in `bins'. Solid

line: CRB of full ML estimator; Dashed line: CRB of

frequency domain estimator using only 9 DFS samples.

The dashed line in Fig. 1 shows the CRB of the

frequency domain estimator using only 9 terms of

(

)

, centred on the frequency of one of the two

tones. The estimator variance is increased by only 1.3

dB at a frequency separation of 0.0625 bins, falling to

less than 0.75 dB for frequency separations of 2 bins or

more. If for example 11 terms are used, these impair-

ments are reduced to 1.06 dB and 0.63 dB respectively.

2.3 Full algorithm

The full algorithm is :



Compute the DFT



Repeatedly detect the largest local peak with am-

plitudes above a detection threshold, and apply

single-tone estimation to the new peak (as in [4]).



Apply a single tone bias estimation heuristic [4]

and, for close tones, re-estimate the frequencies by

iteration [4].



Test the residual error over the 5 samples centred

on each peak. If this is suciently small for all

peaks, nish.



For all p eaks with large residual errors, increase the

cluster size L by 1 and re-estimate the L frequencies

and amplitudes of the cluster.



If there are other tones or clusters close enough to

be aected, re-estimate their frequencies.



Test the residual errors for each cluster. If they are

now all small, nish; otherwise continue to increase

the cluster size L (up to a suitable limit) for clusters

with large residual errors, and repeat.

Model order estimation is an intrinsic part of this algo-

rithm. The initial estimate of model order (number of

tones) is simply the number of detected peaks. This is

then increased whenever a cluster size is increased.

3 CONTINUOUS ESTIMATION

The approach described in this paper is being used for

musical audio analysis, where typical blocklengths are

=2048 with

= 20-50 tones. In applications suchas

this a further requirementistocombine estimates from

sequential (perhaps overlapping) blocks optimally. This

requires knowledge of the estimate variance which, for

a nearly-ML estimator, is approximately equal to the

CRB. However, the CRBs depend strongly on the rel-

ative phase of the tones, and only the CRBs for worst

case phase were published in [2]. A closed-form expres-

sion for the CRBs is desirable. We will consider the two

tone case because it is the most commonly occurring,

and in any case estimator variance increases rapidly as

further close tones are added.

The CRB for frequency estimation of tone

can be ap-

proximated by three asymptotes. The rst is the single-

tone CRB,

var











(9)

This is an absolute lower bound. The second is

var











(

F

)

(10)

where

F

is the frequency separation of the tones in

bins:

F

= (



)



). This asymptote meets

the single tone bound at

F



6bins. The third

asymptote depends on relative phase. Dene  =







F

(



; this equals the phase dierence

at the block centre (half waybetween sample



and sample

2). The third asymptote is

var











sin

()(

F

)

(11)

Note that this becomes innite as 

0 or



The complete estimate for the CRB is as follows; it is

max[ (9), min[ (10), (11)]]. Hence max[(9), (10)] is the

bound for worst case phase ( = 0 or



), as rst shown

in [2].

To conrm the above model, Fig. 2 shows the actual

CRBs and the above asymptotic t for two tone esti-

mation, for  = 0

;=

2, as functions of

frequency.

10−2 10−1 100101

100

101

102

103

104

RMS ERROR RATIO

FREQUENCY SEPARATION (DFS BINS)

Fig 2. Two tone case. Bound on frequency estimate

rms error (i.e.

CRB) normalised to single-tone CRB,

plotted against frequency separation in `bins'. A:

 =

=

2; B:  =

=

4; C:  =

=

8; D:

 =

=

16; E:  = 0.

This closed-form expression makes it possible to com-

bine the estimates from successive blocks with the ap-

propriate weighting to reect the (potentially very dif-

ferent) variances of the estimates from the dierent

blocks.

4 CONCLUSIONS

The frequency domain approach described in section 2

achieves high resolution estimation of sinusoids in white

or coloured noise, with performance very close to ML.

It is computationally ecient, particularly for problems

such as audio analysis where there maybemany tones,

many of them resolved.

The CRB model described in section 3 illustrates the

nature of the two-tone CRB more fully than in [2], and

permits fast approximate calculation of the two-tone

CRB. This is of value in continuous estimation of fre-

quencies from successive blocks.

REFERENCES

1. Kay, S.M.,

Modern Spectral Estimation

, Prentice-

Hall, Englewood Clis, NJ, 1988.

2. Rife, D.C., Boorstyn, R.R., "Multiple Tone Pa-

rameter Estimation from Discrete-Time Observations",

BSTJ

,Vol 55, No 9, Nov 1976, pp.1389 - 1410.

3. Macleo d, M. D., "Fast high accuracy estimation of

multiple cisoids in noise",

Signal Processing V

, (Pro c.

Eusipco 90), Elsevier, 1990, pp. 333-336.

4. Macleod, M. D., "Fast Nearly-ML Estimation of the

Parameters of Real or Complex Single Tones or Resolved

Multiple Tones",

IEEE Trans SP

, 1998, Vol.46, No. 1,

pp.141-148 (Jan 98).

Signal separation of musical instruments: simulation-based methods for musical signal decomposition and transcription

Thesis

May 2001

Paul Jospeh Walmsley

This thesis presents techniques for the modelling of musical signals, with particular regard to monophonic and polyphonic pitch estimation. Musical signals are modelled as a set of notes, each comprising of a set of harmonically-related sinusoids. An hierarchical model is presented that is very general and applicable to any signal that can be decomposed as the sum of basis functions. Parameter estimation is posed within a Bayesian framework, allowing for the incorporation of prior information about model parameters. The resulting posterior distribution is of variable dimension and so reversible jump MCMC simulation techniques are employed for the parameter estimation task. The extension of the model to time-varying signals with high posterior correlations between model parameters is described. The parameters and hyperparameters of several frames of data are estimated jointly to achieve a more robust detection. A general model for the description of time-varying homogeneous and heterogeneous multiple component signals is developed, and then applied to the analysis of musical signals. The importance of high level musical and perceptual psychological knowledge in the formulation of the model is highlighted, and attention is drawn to the limitation of pure signal processing techniques for dealing with musical signals. Gestalt psychological grouping principles motivate the hierarchical signal model, and component identifiability is considered in terms of perceptual streaming where each component establishes its own context. A major emphasis of this thesis is the practical application of MCMC techniques, which are generally deemed to be too slow for many applications. Through the design of efficient transition kernels highly optimised for harmonic models, and by careful choice of assumptions and approximations, implementations approaching the order of realtime are viable.

Signal separation of musical instruments: simulation-based methods for musical signal decomposition and transcription

Article

Aug 2013

Paul Jospeh Walmsley

Signal Separation of Musical Instruments

Article

Full-text available

Paul Walmsley

Onset Detection in Musical Audio Signals

Article

Full-text available

Aug 2003

This paper presents work on changepoint detection in musical audio signals, focusing on the case where there are note changes with low associated energy variation. Several methods are described and results of the best are presented.

Time Frequency Reassignment: A Review and Analysis

Article

Full-text available

Oct 2003

Time-frequency reassignment is a relatively old but under-explored method for time frequency analysis. This report reviews previous research on reassignment and relates it to instantaneous frequency in an explicit and novel way. New measures based on reassignment of the amplitude spectrum, as opposed to the traditional phase spectrum, are proposed and analysed. The statistical properties of reassignment as a sinusoidal estimator are compared with the Cramer-Rao bound and finally some applications of reassignment in the field of musical analysis are proposed.

Region of Interest Based Adaptive High Resolution Parameter Estimation with Applications in Automotive Radar

Conference Paper

Jun 2018

Solution of the general harmonic estimation problem. (High-resolution sinusoid parameter estimation)

Article

Full-text available

Dec 2002

Malcolm David Macleod

This paper presents a method for the analysis of harmonic processes with discrete or mixed spectra, that is, signals which consist of a sum of sinusoids, or complex sinusoids, and additive white or coloured noise. This is a much studied problem, with many important applications. Nevertheless, existing approaches have significant limitations. In many cases, the model order (number of sinusoids) is assumed known, and in most cases Additive White Gaussian Noise (AWGN) is assumed. We present a new method for jointly determining the model order and estimating the sinusoid parameters in white or coloured noise. It is an iterative detection and estimation algorithm. At each iteration deflation based on orthogonal projection (also known as the “notch periodogram ” is used to remove already-detected tones; a detection test is applied and if a new tone is detected the model order is increased and Maximum Likelihood tone frequency re-estimation is carried out. In the mixed spectrum case the detection test requires an estimate of the noise PSD, which is obtained by smoothing the logarithm of the notch periodogram.

A fast frequency domain notch periodogram algorithm

Article

Jul 2001
SIGNAL PROCESS

Malcolm David Macleod

The notch periodogram is an algorithm which may be used iteratively for detection and super-resolution frequency estimation of multiple sinusoids in noise (harmonic analysis). A general notch periodogram algorithm has been described for an arbitrary number of notch frequencies; there is also an approximate algorithm for well separated notch frequencies or well separated clusters of notches, which has reduced computation load. However, the computation load of these algorithms is high, especially because almost all the computation is repeated for each iteration. This paper describes frequency domain notch periodogram algorithms which greatly reduce the computation load. After a single FFT of the data, iterated notch periodograms are computed by operations in the frequency domain. When refining notch frequencies, the notch periodogram only has to be computed over a narrow frequency range, and the new algorithms do this efficiently. Further speedups are achieved using new DFT and periodogram interpolation techniques which may be used to reduce the required zero-padding factor, and by an algorithm for fast approximation of the denominator to any specified accuracy. Representative speedup factors from 10 to over 100 are achieved.

Joint detection and high resolution ML estimation of multiple sinusoids in noise

Conference Paper

Feb 2001
Acoust Speech Signal Process

Malcolm David Macleod

Harmonic analysis, the analysis of signals which consist of a sum of sinusoids (or complex sinusoids) with additive white or colored noise, is a much studied problem, with many important applications. Nevertheless, existing approaches have significant limitations. In many, the model order (number of sinusoids) is assumed known, and in most cases additive white Gaussian noise (AWGN) is assumed. We present a method for jointly determining the model order and estimating the sinusoid parameters in white or colored noise. It uses the notch periodogram in an iterative detection and estimation algorithm. It uses an explicit detection test based on an estimate of the noise power density spectrum (PDS), which is obtained by smoothing the logarithm of the notch periodogram

Fast nearly ML estimation of the parameters of real or complex single tones or resolved multiple tones

Article

Full-text available

Feb 1998

Malcolm David Macleod

This paper presents new computationally efficient algorithms for estimating the parameters (frequency, amplitude, and phase) of one or more real tones (sinusoids) or complex tones (cisoids) in noise from a block of N uniformly spaced samples. The first algorithm is an interpolator that uses the peak sample in the discrete Fourier spectrum (DFS) of the data and its two neighbors. We derive Cramer-Rao bounds (CRBs) for such interpolators and show that they are very close to the CRB's for the maximum likelihood (ML) estimator. The new algorithm almost reaches these bounds. A second algorithm uses the five DFS samples centered on the peak to produce estimates even closer to ML. Enhancements are presented that maintain nearly ML performance for small values of N. For multiple complex tones with frequency separations of at least 4π/N rad/sample, unbiased estimates are obtained by incorporating the new single-tone estimators into an iterative “cyclic descent” algorithm, which is a computationally cheap nonlinear optimization. Single or multiple real tones are handled in the same way. The new algorithms are immune to nonzero mean signals and (provided N is large) remain near-optimal in colored and non-Gaussian noise

Multiple Tone Parameter Estimation From Discrete‐Time Observations

Article

Nov 1976

Estimation of the parameters of a single-frequency complex tone from a finite number of noisy discrete-time observations is discussed. The appropriate Cramer-Rao bounds and maximum-likelihood (ML) estimation algorithms are derived. Some properties of the ML estimators are proved. The relationship of ML estimation to the discrete Fourier transform is exploited to obtain practical algorithms. The threshold effect of one algorithm is analyzed and compared to simulation results. Other simulation results verify other aspects of the analysis.

Modern Spectral Estimation

Book

Jan 1988

S M Kay

Fast high accuracy estimation of multiple cisoids in noise

Jan 1990
333-336

M D Macleod

Macleod, M. D., "Fast high accuracy estimation of multiple cisoids in noise", Signal Processing V, (Proc. Eusipco 90), Elsevier, 1990, pp. 333-336.

HIGH RESOLUTION NEARLY-ML ESTIMATION OF SINUSOIDS IN NOISE USING A FAST FREQUENCY DOMAIN APPROACH

Abstract and Figures

Recommended publications

Consistent Nonparametric Spectrum Estimation Via Cepstrum Thresholding

Channel and Noise Variance Estimation Improvement for PCP-SC System

Statistical analysis of Pisarenko's method for sinusoidal frequency estimation

Parameter Estimation for Sinusoidal Signals with Deterministic Amplitude Modulation