Académique Documents
Professionnel Documents
Culture Documents
MULTI-BAND
EXCITATION SPEECH
CODER
PRESENTED BY ::
SHAVEL GUPTA (07)
KANIKA TAYAL (03)
ABSTRACT
The s/m is based on MBE (Multi-Band
Excitation) speech model ~ developed by
Griffin.
The s/m divides speech into 20ms. Segments
& for each speech segment a set of MBE
speech model parameters are estimated.
The parameters are then quantized,
transmitted & finally used to synthesize a
speech s/g.
Primary advantage this modelling process
reduces the amount of information needed to
represent speech with higher speech quality.
CONTD.
Here, an algorithm is proposed for segment
based coding of MBE model parameter at fixed
bit rate below 1kbps.
This paper explores the hypothesis that the
use of more complex quantization methods
can allow a major reduction in bit rate while
maintaining high speech quality.
INTRODUCTION:
In typical speech analysis/synthesis s/m
(vocoder), a model is used to characterize the
speech s/g.
In speech analysis these parameters is used
to generate a synthetic speech s/g.
CONTD. (CONVENTIONAL
SPEECH MODEL)
Here, the vocoder analyses each segment by
estimating the parameters which characterize
linear filter & the excitation sequence for that
segment.
This is facilated by dividing speech segments
into 2 classes :
- VOICED SPEECH-> Excitation is modelled as
a Periodic Impulse Train.
- UNVOICED SPEECH -> A White Noise
sequence is used as
excitation.
But this s/m doesnt produce HIGH QUALITY
SPEECH.
SEGMENTATION OF SPEECH
The speech spectrum is typically slowly varying
with the relatively abrupt spectral transitions.
Segmentation of speech into homogenous regions
of variable length is based on detecting spectral
transitions.
Slow variation of spectral parameters across the
frames belonging to a single segment facilitates a
simple parametric representation of the segment.
Here, the resultant bit rate is variable, due to the
varying durations of segments.
Therefore, in order to achieve fixed bit rate , it is
necessary to introduce constraints on the
segment based coding. (by fixing no of segments
over large block of frame.)
CONTD.
An important design involves the choice or
rate of the block duration & the segments per
block.
Longer the block durations improve the
efficiency but adversely impact the code
delay.
We choose a block duration of 400ms or
20frames.
The no. of per block can be based on the
expected average segment rate.
PARAMETER QUANTISATION
BLOCK QUANTISATION refers to the fact that
the data which is to be quantized is first
grouped into fixed length block.
The block is then quantized, transmitted, &
then reconstructed at the receiver.
Vector quantization & transform coding are
both block quantization algorithms.
ADVANTAGE: Provides a convenient manner
for accessing the redundancies in the data.
Company Logo
Company Logo
Company Logo
POST FILTERING
In low bit rate speech coders, it is common to
use a short term postfilter at the decoder to
improve the precieved quality of
reconstructed speech.
Company Logo
CONCLUSION
A 1KBPS Speech coding s/m has been presented
which offers high quality speech capability.
S/M can be implemented as a cascade of several
algorithms.
The 1st element of s/m estimates MBE
Parameters.
These parameters are quantized & Tx across
channel.
A decoder then reconstructs the quantized
parameter values, then synthesis algo. Produces
synthesized speech.
Thus, the low bit rate coder operating at 940 bps
achieves acceptable speech quality &
intelligibility.
Company Logo
LOGO