Académique Documents
Professionnel Documents
Culture Documents
ENHANCEMENT
ABSTRACT
This paper proposes the application of the Wiener filter in an adaptive manner in
speech enhancement. The proposed adaptive Wiener filter depends on the adaptation of the
filter transfer function from sample to sample based on the speech signal statistics(mean
and variance). The adaptive Wiener filter is implemented in time domain rather than in
frequency domain to accommodate for the varying nature of the speech signal. The
proposed method is compared to the traditional Wiener filter and spectral subtraction
methods and the results reveal its superiority.
1 INTRODUCTION
Speech enhancement is one of the most is removed first. Decomposition of the vector space
important topics in speech signal processing. of the noisy signal is performed by applying an
Several techniques have been proposed for this eigenvalue or singular value decomposition or by
purpose like the spectral subtraction approach, the applying the Karhunen-Loeve transform (KLT)[8].
signal subspace approach, adaptive noise canceling Mi. et. al. have proposed the signal / noise KLT
and the iterative Wiener filter[1-5] . The based approach for colored noise removal[9]. The
performances of these techniques depend on idea of this approach is that noisy speech frames
quality and intelligibility of the processed speech are classified into speech-dominated frames and
signal. The improvement of the speech signal-to- noise-dominated frames. In the speech-dominated
noise ratio (SNR) is the target of most techniques. frames, the signal KLT matrix is used and in the
noise-dominated frames, the noise KLT matrix is
Spectral subtraction is the earliest method for used.
enhancing speech degraded by additive noise[1]. In this paper, we present a new technique to
This technique estimates the spectrum of the clean improve the signal-to-noise ratio in the enhanced
(noise-free) signal by the subtraction of the speech signal by using an adaptive implementation
estimated noise magnitude spectrum from the noisy of the Wiener filter. This implementation is
signal magnitude spectrum while keeping the phase performed in time domain to accommodate for the
spectrum of the noisy signal. The drawback of this varying nature of the signal.
technique is the residual noise.
The paper is organized as follows: in section
Another technique is a signal subspace II, a review of the spectral subtraction technique is
approach [3]. It is used for enhancing a speech presented. In section III, the traditional Wiener
signal degraded by uncorrelated additive noise or filter in frequency domain is revisited. Section IV,
colored noise [6,7]. The idea of this algorithm is proposes the adaptive Wiener filtering approach for
based on the fact that the vector space of the noisy speech enhancement. In section V, a comparative
signal can be decomposed into a signal plus noise study between the proposed adaptive Wiener filter,
subspace and an orthogonal noise subspace. the Wiener filter in frequency domain and the
Processing is performed on the vectors in the signal spectral subtraction approach is presented.
plus noise subspace only, while the noise subspace
σs
2
A priori knowledge sˆ(n) = mx + ( x(n) - mx ) ∗ δ ( n)
σs + σv
2 2
Space-
σs
2
Degraded speech variant Enhanced
x(n) speech
= mx + ( x(n) − mx )
σs + σv
2 2
h(n)
signal sˆ( n) (14)
If it is assumed that mx and σs are updated at
each sample, we can say:
Measure of
σs (n) ( x(n) − mx(n))
2
approximated by:
performed.
σˆs (n) = ⎨
2 60
⎩0, otherwise
O u tp u t P S N R (d B )
50
(17.a)
Where 40
30
n+ M
1
σˆx (n) =
2
(2 M + 1)
∑ ( x(k ) − mˆ (n))
k =n−M
x
2
20
Spectral Subtraction
(17.b)
10 Wiener Filter
By this proposed method, we guarantee that Adaptive Wiener Filter
the filter transfer function is adapted from sample
to sample based on the speech signal statistics. 0
-10 -5 0 5 10 15 20 25 30 35
Input SNR (dB)
5 EXPERIMENTAL RESULTS
For evaluation purposes, we use different Figure 2: PSNR results for white noise
speech signals like the handel, laughter and gong case at-10 dB to +35 dB SNR levels for Handle signal
signals. White Gaussian noise is added to each
speech signal with different SNRs. The different
speech enhancement algorithms such as the
spectral subtraction method, the Weiner filter in
frequency domain and the proposed adaptive
Wiener filter are carried out on the noisy speech
signals. The peak signal to noise ratio (PSNR)
results for each enhancement algorithm are
compared.
1 1
50
A m p lit u d e
A m p lit u d e
0 0
40 -1 -1
O u tp u t P S N R (d B )
30 1 1
A m p lit u d e
A m p lit u d e
0 0
20
-1 -1
0 2000 4000 6000 8000 0 2000 4000 6000 8000
Spectral Subtraction (c) (d)
10 Wiener Filter 1
A m p lit u d e
Adaptive Wiener Filter 0
0
-10 -5 0 5 10 15 20 25 30 35 -1
0 2000 4000 6000 8000
Input SNR (dB) (e) Time(msec)
70
Amplitude (dB)
Amplitude (dB)
0 0
60 -20 -20
-40
O u tp u t P S N R (d B )
50 -40
0 1000 2000 3000 4000 0 1000 2000 3000 4000
(a) (b)
40
Amplitude (dB)
Amplitude (dB)
0 0
30 -20
-20
20 Spectral Subtraction -40 -40
0 1000 2000 3000 4000 0 1000 2000 3000 4000
Wiener Filter (c) (d)
10
Adaptive Wiener Filter
Amplitude (dB)
0
0
-10 -5 0 5 10 15 20 25 30 35 -20
Input SNR (dB) -40
0 1000 2000 3000 4000
Figure 4: PSNR results for white noise case at -10 dB (e) Freq.(Hz)
to +35 dB SNR levels for Gong signal
The results of the different enhancement Figure 6:The spectrum of the Handel sig. in Fig.(5) (a)
original sig. (b) noisy sig. (c) spectral subtraction. (d)
algorithms for the handle signal with SNRs of 5,
Wiener filtering. (e) adaptive Wiener filtering.
10,15 and 20 dB in the both time and frequency
domain are given in Figs. (5) to (12). These results
A m p lit u d e
1 1
0 0
A m p lit u d e
A m p lit u d e
0 0
-1 -1
0 2000 4000 6000 8000 0 2000 4000 6000 8000
-1 -1
(a) (b) 0 2000 4000 6000 8000 0 2000 4000 6000 8000
(a) (b)
1 1
A m p lit u d e
A m p lit u d e
1 1
0 0
A m p lit u d e
A m p lit u d e
0 0
-1 -1
0 2000 4000 6000 8000 0 2000 4000 6000 8000 -1 -1
(c) (d) 0 2000 4000 6000 8000 0 2000 4000 6000 8000
(c) (d)
1
1
A m p lit u d e
A m p lit u d e
0 0
-1 -1
0 2000 4000 6000 8000 0 2000 4000 6000 8000
(e) Time (msec) (e) Time(msec)
0
Amplitude (dB)
0
-20
-20
-40
-40
0 1000 2000 3000 4000 0 1000 2000 3000 4000
(e)Freq. (Hz) (e)Freq. (Hz)
Figure 8: The spectrum of the Handel sig. in Fig.(7) Figure 10: The spectrum of the Handel sig. in Fig.(9)
(a) original sig. (b) noisy sig. (c) spectral subtraction. (d) (a) original sig. (b) noisy sig. (c) spectral subtraction. (d)
Wiener filtering. (e) adaptive Wiener filtering. Wiener filtering. (e) adaptive Wiener filtering.
A m p lit u d e
A m p lit u d e
An adaptive Wiener filter approach for
0 0 speech enhancement is proposed in this papaper.
This approach depends on the adaptation of the
-1 -1
0 2000 4000 6000 8000 0 2000 4000 6000 8000 filter transfer function from sample to sample
(a) (b) based on the speech signal statistics(mean and
variance). This results indicates that the proposed
1 1 approach provides the best SNR improvement
A m p lit u d e
A m p lit u d e
0 0 among the spectral subtraction approach and the
traditional Wiener filter approach in frequency
-1 -1 domain. The results also indicate that the proposed
0 2000 4000 6000 8000 0 2000 4000 6000 8000 approach can treat musical noise better than the
(c) (d) spectral subtraction approach and it can avoid the
1 drawbacks of Wiener filter in frequency domain .
A m p lit u d e
0 REFERENCES
0
[3] Y. Ephriam and H. L. Van Trees: A signal
0
subspace approach for speech enhancement, in
-20 -20 Proc. International Conference on Acoustic,
-40 Speech and Signal Processing, vol. II, Detroit,
-40
0 1000 2000 3000 4000 0 1000 2000 3000 4000 MI, U.S.A., pp. 355-358, May (1993).
(a) (b) [4] Simon Haykin: Adaptive Filter Theory,
Prentice-Hall, ISBN 0-13-322760-X, (1996).
Amplitude (dB)
Amplitude (dB)