Speech Coder

LOGO
MULTI-BAND
EXCITATION SPEECH
CODER
PRESENTED BY ::
SHAVEL GUPTA (07)
KANIKA TAYAL (03)
ABSTRACT
The s/m is based on MBE (Multi-Band
Excitation) speech model ~ developed by
Griffin.
The s/m divides speech into 20ms. Segments
& for each speech segment a set of MBE
speech model parameters are estimated.
The parameters are then quantized,
transmitted & finally used to synthesize a
speech s/g.
Primary advantage this modelling process
reduces the amount of information needed to
represent speech with higher speech quality.
CONTD.
Here, an algorithm is proposed for segment
based coding of MBE model parameter at fixed
bit rate below 1kbps.
This paper explores the hypothesis that the
use of more complex quantization methods
can allow a major reduction in bit rate while
maintaining high speech quality.
INTRODUCTION:
In typical speech analysis/synthesis s/m
(vocoder), a model is used to characterize the
speech s/g.
In speech analysis these parameters is used
to generate a synthetic speech s/g.
CONTD. (CONVENTIONAL
SPEECH MODEL)
Here, the vocoder analyses each segment by
estimating the parameters which characterize
linear filter & the excitation sequence for that
segment.
This is facilated by dividing speech segments
into 2 classes :
- VOICED SPEECH-> Excitation is modelled as
a Periodic Impulse Train.
- UNVOICED SPEECH -> A White Noise
sequence is used as
excitation.
But this s/m doesnt produce HIGH QUALITY
SPEECH.
MBE (MULTI-BAND SPEECH

MODEL)
MBE Model provides more flexibility.
A new Algo. For estimating pitch & spectral
envelope has been developed which offers
superior performance over previous methods.
In MBE Speech:
-VOICED represented by Harmonics of
fundamental freq.
-UNVOICED by spectrally shaped random noise.
Voice Info. allows mixing of Harmonic
spectrum with random spectrum in a freq.
gives Synthesized O/P.
CONVENTIONSL V/S MBE

MODEL
SEGMENTATION OF SPEECH
The speech spectrum is typically slowly varying
with the relatively abrupt spectral transitions.
Segmentation of speech into homogenous regions
of variable length is based on detecting spectral
transitions.
Slow variation of spectral parameters across the
frames belonging to a single segment facilitates a
simple parametric representation of the segment.
Here, the resultant bit rate is variable, due to the
varying durations of segments.
Therefore, in order to achieve fixed bit rate , it is
necessary to introduce constraints on the
segment based coding. (by fixing no of segments
over large block of frame.)
CONTD.
An important design involves the choice or
rate of the block duration & the segments per
block.
Longer the block durations improve the
efficiency but adversely impact the code
delay.
We choose a block duration of 400ms or
20frames.
The no. of per block can be based on the
expected average segment rate.
PARAMETER QUANTISATION
BLOCK QUANTISATION refers to the fact that
the data which is to be quantized is first
grouped into fixed length block.
The block is then quantized, transmitted, &
then reconstructed at the receiver.
Vector quantization & transform coding are
both block quantization algorithms.
ADVANTAGE: Provides a convenient manner
for accessing the redundancies in the data.
Company Logo
LSF VECTOR & GAIN
Company Logo
ENCODER BLOCK DIAGRAM
Company Logo
POST FILTERING
In low bit rate speech coders, it is common to
use a short term postfilter at the decoder to
improve the precieved quality of
reconstructed speech.
Company Logo
CONCLUSION
A 1KBPS Speech coding s/m has been presented
which offers high quality speech capability.
S/M can be implemented as a cascade of several
algorithms.
The 1st element of s/m estimates MBE
Parameters.
These parameters are quantized & Tx across
channel.
A decoder then reconstructs the quantized
parameter values, then synthesis algo. Produces
synthesized speech.
Thus, the low bit rate coder operating at 940 bps
achieves acceptable speech quality &
intelligibility.
SUGGESTIONS FOR IMPROVED

QUANTIZATION
Company Logo
Lower the bit rate -----> Better Quantization

may profit the use of algo.
Efficiency.
One interesting possibility is to use a treestructure vector quantizer.
Efficiency also improved by allocating bits
over multiple speech segments, rather than
allocating bits to each frame, while keeping
the total no of bits constant.
Another way is to use higher order
LOGO

Speech Coder

Transféré par

Informations du document

Titre original

Copyright

Formats disponibles

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Droits d'auteur :

Formats disponibles

Speech Coder

Transféré par

Droits d'auteur :

Formats disponibles

LOGO

MBE (MULTI-BAND SPEECH

CONVENTIONSL V/S MBE

LSF VECTOR & GAIN

ENCODER BLOCK DIAGRAM

SUGGESTIONS FOR IMPROVED

Lower the bit rate -----> Better Quantization

Vous aimerez peut-être aussi