Vous êtes sur la page 1sur 16

LOGO

MULTI-BAND
EXCITATION SPEECH
CODER
PRESENTED BY ::
SHAVEL GUPTA (07)
KANIKA TAYAL (03)

ABSTRACT
The s/m is based on MBE (Multi-Band
Excitation) speech model ~ developed by
Griffin.
The s/m divides speech into 20ms. Segments
& for each speech segment a set of MBE
speech model parameters are estimated.
The parameters are then quantized,
transmitted & finally used to synthesize a
speech s/g.
Primary advantage this modelling process
reduces the amount of information needed to
represent speech with higher speech quality.

CONTD.
Here, an algorithm is proposed for segment
based coding of MBE model parameter at fixed
bit rate below 1kbps.
This paper explores the hypothesis that the
use of more complex quantization methods
can allow a major reduction in bit rate while
maintaining high speech quality.

INTRODUCTION:
In typical speech analysis/synthesis s/m
(vocoder), a model is used to characterize the
speech s/g.
In speech analysis these parameters is used
to generate a synthetic speech s/g.

CONTD. (CONVENTIONAL
SPEECH MODEL)
Here, the vocoder analyses each segment by
estimating the parameters which characterize
linear filter & the excitation sequence for that
segment.
This is facilated by dividing speech segments
into 2 classes :
- VOICED SPEECH-> Excitation is modelled as
a Periodic Impulse Train.
- UNVOICED SPEECH -> A White Noise
sequence is used as
excitation.
But this s/m doesnt produce HIGH QUALITY
SPEECH.

MBE (MULTI-BAND SPEECH


MODEL)
MBE Model provides more flexibility.
A new Algo. For estimating pitch & spectral
envelope has been developed which offers
superior performance over previous methods.
In MBE Speech:
-VOICED represented by Harmonics of
fundamental freq.
-UNVOICED by spectrally shaped random noise.
Voice Info. allows mixing of Harmonic
spectrum with random spectrum in a freq.
gives Synthesized O/P.

CONVENTIONSL V/S MBE


MODEL

SEGMENTATION OF SPEECH
The speech spectrum is typically slowly varying
with the relatively abrupt spectral transitions.
Segmentation of speech into homogenous regions
of variable length is based on detecting spectral
transitions.
Slow variation of spectral parameters across the
frames belonging to a single segment facilitates a
simple parametric representation of the segment.
Here, the resultant bit rate is variable, due to the
varying durations of segments.
Therefore, in order to achieve fixed bit rate , it is
necessary to introduce constraints on the
segment based coding. (by fixing no of segments
over large block of frame.)

CONTD.
An important design involves the choice or
rate of the block duration & the segments per
block.
Longer the block durations improve the
efficiency but adversely impact the code
delay.
We choose a block duration of 400ms or
20frames.
The no. of per block can be based on the
expected average segment rate.

PARAMETER QUANTISATION
BLOCK QUANTISATION refers to the fact that
the data which is to be quantized is first
grouped into fixed length block.
The block is then quantized, transmitted, &
then reconstructed at the receiver.
Vector quantization & transform coding are
both block quantization algorithms.
ADVANTAGE: Provides a convenient manner
for accessing the redundancies in the data.

Company Logo

LSF VECTOR & GAIN

Company Logo

ENCODER BLOCK DIAGRAM

Company Logo

POST FILTERING
In low bit rate speech coders, it is common to
use a short term postfilter at the decoder to
improve the precieved quality of
reconstructed speech.

Company Logo

CONCLUSION
A 1KBPS Speech coding s/m has been presented
which offers high quality speech capability.
S/M can be implemented as a cascade of several
algorithms.
The 1st element of s/m estimates MBE
Parameters.
These parameters are quantized & Tx across
channel.
A decoder then reconstructs the quantized
parameter values, then synthesis algo. Produces
synthesized speech.
Thus, the low bit rate coder operating at 940 bps
achieves acceptable speech quality &
intelligibility.

SUGGESTIONS FOR IMPROVED


QUANTIZATION

Company Logo

Lower the bit rate -----> Better Quantization


may profit the use of algo.
Efficiency.
One interesting possibility is to use a treestructure vector quantizer.
Efficiency also improved by allocating bits
over multiple speech segments, rather than
allocating bits to each frame, while keeping
the total no of bits constant.
Another way is to use higher order

LOGO

Vous aimerez peut-être aussi