Académique Documents
Professionnel Documents
Culture Documents
Kari Roth and Ismo Kauppinen Paulo A. A. Esquef and Vesa Välimäki
An auto-regressive (AR) model [1] is defined by equation From Eq. (1) it is easily seen that the residual en can be calculated
from the signal yn by
p
yn = − am yn−m + en (1) p p
m=1
en = yn + am yn−m = am yn−m (2)
where yn are the signal samples, p is the model order, am are m=1 m=0
the model coefficients, and en is the residual. The model coef-
ficients am are calculated by minimizing the total energy of the where a0 = 1. If the signal frame consists of N samples y0 ,
residual E = 2
n en . There exist several methods for estimat- y1 , . . . , yN −1 , the residual samples ep , ep+1 , . . . , eN −1 can be
ing the AR parameters. The least squares method (also known as regarded as the output of a finite impulse response (FIR) prediction
the covariance method) and the Yule-Walker method (also known error filter. This FIR filter can be implemented through the lattice
as the autocorrelation method) are the mostly used approaches for structure shown in Fig. 1. The equations of the lattice filter are
historical reasons [2]. Burg’s method is considered preferable for
applications which require models of high accuracy, e.g., signal (l−1)
fn(l) = fn(l−1) + kl bn−1
extrapolation [3] and detection [2].
(l−1)
n = l, l + 1, . . . , N − 1 (3)
There is a long tradition in performing signal analysis and pro- n = bn−1 + kl fn
b(l) (l−1)
k1 k2 kM
yn
k1 k2 kM
z -1
Σ z -1
Σ z -1
Σ
b n(0) b n(1) b n(2) b n(p-1) b n(p)
Figure 2: Lattice form of the warped prediction-error filter. The unit delays in Fig. 1 have been replaced by first-order allpass filters.
Minimizing El with respect to the reflection coefficient kl yields the employed filter structures are replaced with first-order allpass
filters, D(z). These allpass filters can be regarded as frequency-
N −1
∂El (l−1) (l−1) dependent delay elements and are defined by
=2 fn(l−1) + kl bn−1 bn−1
∂kl
n=l (5) z −1 − λ
z̃ −1 = D(z) = . (9)
(l−1)
+ bn−1 + kl fn(l−1) fn(l−1) = 0, 1 − λz −1
Conversely to the linear phase response of an ordinary unit-
from which the reflection coefficients can be solved, i.e., delay, the phase response of D(z) can be made non-linear by ad-
−1 (l−1) (l−1) justing the warping factor parameter λ. Indeed, the mapping from
−2 N n=l fn bn−1 the uniform to the warped frequency scale is governed by the phase
kl = 2 2 . (6)
N −1 (l−1)
fn
(l−1)
+ bn−1 response of D(z), which is given by [5]
n=l
(1 − λ2 ) sin(ω)
The AR coefficients am can be obtained from the reflection ω̃ = arctan , (10)
(1 + λ2 ) cos(ω) − 2λ
coefficients kl via the Levinson-Durbin algorithm. The recursion
(0)
is initialized with a0 = 1 and where ω = 2πf /fs and fs is the sampling frequency. Figure 3
shows the attained mapping for several values of λ. For positive
(l−1)
a(l) (l−1)
m = am + kl al−m m = 1, 2, . . . , l − 1 (7) values of λ, the resolution at low frequencies is increased. On the
(l) contrary, negative values of λ yield a higher resolution at high fre-
al = kl (8) quencies. Suitable values of λ can be chosen depending on the
(p) application. For instance, in [9] it is shown that an approxima-
is repeated for l = 1, 2, . . . , p. At the end of the iterations, am tion of the frequency resolution of the human auditory system is
gives the desired prediction error filter coefficients am of Eq. (2). attained by setting λ = 0.723.
Equation (6) ensures that |kl | < 1 and therefore Burg’s method is Warped linear predictive coding can be carried out similarly to
guaranteed to provide a stable model. standard methods. For instance, the coefficients ãm of a warped
prediction filter can be estimated via the warped autocorrelation
3. FREQUENCY WARPING normal equations. In these equations, the conventional autocorre-
∗
lation function rk = E{yn yn−k } is replaced with
The time-domain representation of a signal relates to its spectrum
via the Fourier transform. The frequency-resolution of the result- r̃k = E{δ̃0 [yn ] δ̃k [yn∗ ]}, (11)
ing spectrum is uniform along the frequency axis. Signal analy-
where E is the expectation operator and δ̃k [·] is a generalized shift
sis on non-uniform frequency-resolutions or on frequency-warped
operator defined by [6]
scales can be achieved by means of a frequency mapping operator.
In this paper, frequency warping is restricted to a conformal bi- δ̃k [yn ] = dn ∗ dn ∗ · · · ∗ dn ∗yn , (12)
linear mapping. This basically means that the unit-delays, z −1 , of k fold convolutions
2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 19-22, 2003, New Paltz, NY
BURG’S METHOD
3
0.9 20
Amplitude [dB]
0.7 0
Phase [rad]
0.4 −20
2
−40
0.0
−60
−0.4
100 200 500 1k 2k 5k 10k
1 −0.7 frequency [Hz]
−0.9 WARPED YULE−WALKER METHOD
20
Amplitude [dB]
0
0 0.5 1 0
Normalized Frequency −20
−40
Figure 3: Phase response of D(z) for several values of λ. −60
100 200 500 1k 2k 5k 10k
frequency [Hz]
WARPED BURG’S METHOD
with dn being the impulse response of the allpass filter. Yet, the
equation system can be solved efficiently via the Levinson-Durbin 20
Amplitude [dB]
algorithm.
p Finally, the prediction error filter is given by A(z) = 0
m −20
m=1 ãm D(z) . See [6] for more detailed information.
−40
−60
4. WARPED BURG’S ALGORITHM 100 200 500 1k 2k 5k 10k
frequency [Hz]
The warped Burg’s method is based on warping the lattice filter Figure 4: Comparison among the magnitude responses of the all-
depicted in Fig. 1. This is done by replacing the delay elements pole filters obtained using Burg’s method, the warped Yule-Walker
with warping allpass filters. The warped prediction error filter is method, and the warped Burg’s method. The employed setup was
shown in Fig. 2. To calculate the warped prediction error in stage p = 50 and λ = 0.723. The thinner curves relate to the signal
l we need the allpass filtered backward residual whereas the thicker (shifted upwards for clarity) to the models.
(l−1) (l)
n = bn−1 − λ bn
b̃(l) (l−1)
− b̃n−1
(13)
n = l, l + 1, . . . , N − 1, The comparison is made using the same model order p = 50 in all
three methods and warping factor λ = 0.723 in both of the warped
where λ is the warping factor. Because this is a recursive filter methods to provide frequency-warping close to the auditory fre-
(l)
the initial condition (i.e. the value of b̃l−1 ) has to be set. Using quency scale [9]. Figure 4 shows the magnitude spectrum of a tar-
(l)
b̃l−1 = 0 is the most obvious choice. get signal (2048 samples of a guitar tone sampled at 44.1 kHz) and
Warping also changes the lattice equations of Eq. (3) to those of the modeled signals, which were obtained using different
methods. Figures 5 and 6 compare the same models in the z-plane
(l)
fn = fn
(l−1) (l)
+ k̃l b̃n by showing the pole locations of the models. In the frequency do-
n = l, l + 1, . . . , N − 1 (14) main, the ordinary Burg’s method models the signal with equal
(l) (l) (l−1)
bn = b̃n + k̃l fn . emphasis on all the frequencies. In the example, the target sig-
nal has most of its energy concentrated in the lower quarter of the
The resulting equation for the reflection coefficient is (linear) frequency range but the actual model has its poles equally
−1 (l−1) (l) concentrated across the whole frequency range. Burg’s method
−2 N fn b̃n
k̃l = n=l 2 2 . (15)
N −1 (l−1) (l)
n=l fn + b̃n
0.4
POLES OF THE MODELS
From Eq. (13) it can be seen that parameter value λ = 0 reduces 0.3
1
the algorithm to ordinary Burg’s method. 0.2
0.5
Imaginary part
Imaginary part
0.1
5. EXPERIMENTS
0 0
Modeling signals with all-pole infinite impulse resonse (IIR) filter −0.1
−0.5
is a powerful application of AR modeling. One example of this
−0.2
is linear predictive coding (LPC) which is usually used to model −1
only the spectral envelope of the signal. In this section the model- −0.3
−1 −0.5 0 0.5 1
ing performance of different methods is compared with emphasis Real part
Burg −0.4
0.7 0.8 0.9 1 1.1
on high perceptual accuracy. The target is to obtain a model that Warped Burg Real part
extracts the strongest signal frequencies. This is an important is-
sue, e.g., in model-based signal extrapolation. Figure 5: The poles of the AR-models obtained using Burg’s
Figures 4–7 present a comparison among the ordinary Burg’s method and the warped Burg’s method. The employed setup was
method, its warped version, and the warped Yule-Walker method. p = 50 and λ = 0.723.
2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 19-22, 2003, New Paltz, NY
Amplitude [dB]
0.3 −20
1
−40
0.2
−60
0.5 −80
Imaginary part
Imaginary part
0.1
−100
0 0 −120
100 200 500 1k 2k 5k 10k 20k
−0.1 frequency [Hz]
−0.5
WARPED YULE−WALKER METHOD
−0.2 0
Amplitude [dB]
−1 −20
−0.3
−1 −0.5 0 0.5 1 −40
Real part −0.4 −60
Warped Yule−Walker 0.7 0.8 0.9 1 1.1
Warped Burg
−80
Real part
−100
−120
Figure 6: The poles of the AR-models obtained using the warped 100 200 500 1k 2k 5k 10k 20k
frequency [Hz]
Burg’s method and the warped Yule-Walker method. The em-
ployed setup was p = 50 and λ = 0.723. 0
WARPED BURG’S METHOD
Amplitude [dB]
−20
−40
−60
”wastes” its poles by modeling the high frequency noise peaks. −80
By using positive warping factor λ, the emphasis of the modeling −100
procedure is shifted to lower frequencies, thus resulting in a more −120
100 200 500 1k 2k 5k 10k 20k
accurate model in a perceptual sense with the same model order. If frequency [Hz]
the same accuracy in lower frequency region were to be achieved
with the ordinary Burg’s method the model order should be about Figure 7: Comparison among the magnitude spectra (thicker
three times higher. For equal comparison in terms of computa- curves) of the prediction residuals obtained using Burg’s method,
tional cost the order should be only about 33 % higher to compen- the warped Yule-Walker method, and the warped Burg’s method.
sate for the more expensive modeling. The warped Yule-Walker The employed setup was p = 50 and λ = 0.723. The thinner
method captures the most significant spectral components of the curves represent the magnitude spectrum of the signal.
signal but the model obtained is still less accurate than the one
given by the warped Burg’s method. The warped Burg’s method
places the poles closer to the unit circle and this makes the peaks
[2] M. J. L. de Hoon, T. H. J. J. van der Hagen, H. Schoonewelle,
more pronounced in the frequency response of the all-pole filters.
and H. van Dam, “Why Yule-Walker Should not be Used
Figure 7 compares the spectra of the residuals obtained with for Autoregressive Modelling,” Annals of Nuclear Energy,
different methods. In the lower frequencies, the ordinary Burg’s vol. 23, pp. 1219–1228, 1996.
method models only the spectral envelope of the signal. As a re-
sult the ratio of the height of the spectral peaks to the background is [3] I. Kauppinen, J. Kauppinen, and P. Saarinen, “A Method for
preserved in the residual. The warped Yule-Walker method mod- Long Extrapolation of Audio Signals,” J. Audio Eng. Soc.,
els also some of the strongest spectral peaks and in the residual vol. 49, no. 12, pp. 1167–1180, Dec. 2001.
the peaks rise lower above the background. The residual obtained [4] A. V. Oppenheim, D. H. Johnson, and K. Steiglitz, “Com-
with the warped Burg’s method has a more flat spectrum in a log- putation of Spectra with Unequal Resolution Using the Fast
arithmic scale than the ones obtained with the other methods. Fourier Transform,” Proc. IEEE, vol. 59, pp. 299–301, 1971.
[5] H. W. Strube, “Linear Prediction on a Warped Frequency
6. CONCLUSIONS Scale,” J. Acoust. Soc. Am., vol. 68, no. 4, pp. 1071–1076,
Oct. 1980.
In this paper we have presented the warped Burg’s algorithm for [6] A. Härmä, M. Karjalainen, V. Välimäki, L. Savioja, U. Laine,
calculating the AR model parameters. The warping is achieved and J. Huopaniemi, “Frequency-Warped Signal Processing for
by replacing the delay elements of the lattice filter by first-order Audio Applications,” J. Audio Eng. Soc., vol. 48, no. 11, Nov.
allpass warping elements. 2000.
We have demonstrated the improving effect of warping in mod- [7] G. Evangelista and S. Cavaliere, “Discrete Frequency Warped
eling of audio signals whose energy distribution is concentrated in Wavelets: Theory and Applications,” IEEE Trans. Signal Pro-
the lower part of the spectral range. The improved AR modeling of cessing, vol. 46, no. 4, pp. 874–885, Apr. 1998.
the warped Burg’s method is compared to the conventional Burg’s
method and the warped Yule-Walker method. It can be concluded [8] J. P. Burg, “A New Analysis Technique for Time Series Data,”
that, for a given model order, the warped Burg’s method yields NATO Advanced Study Institute on Signal Processing with
more accurate models from the perceptual point of view. Emphasis on Underwater Acoustics, Enschede, The Nether-
lands, Aug. 1968, reprinted in Modern Spectrum Analysis,
D. G. Childers, ed., IEEE Press, New York, 1978.
7. REFERENCES [9] J. O. Smith and J. S. Abel, “Bark and ERB Bilinear Trans-
forms,” IEEE Trans. Speech Audio Processing, vol. 7, no. 6,
[1] S. Haykin, Ed., Nonlinear Methods of Spectral Analysis, pp. 697–708, Nov. 1999.
2nd ed. Berlin: Springer-Verlag, 1983.