Académique Documents
Professionnel Documents
Culture Documents
Receive
d
Receive
info.
User
r
Transmitter
Formatter
Source
encoder
Channel
encoder
Modulator
Receiver
Formatter
Lecture 1
Source
decoder
Channel
decoder
4
Demodulator
Source
SOURCE
Nois
Transmitted e
Receive
signal
d
Transmitter
Channe signal
l
Design goals
Trade-off between various parameters
5
Utilized techniques
Digital Fundamentals:
The Sampling Theorem, Nyquist
Sampling
Spread Spectrum:
DS-SS, FH-SS, Coding Gain,
Fading Margins
Amplitude Modulation
SSB Modulators
Product Detection
VCO circuit
Wideband FM generation
FM Demod circuit
Gaussian pulse-shapes
BPSK constellation
DPSK modulation
DPSK receiver
QPSK modulation
QPSK receiver
IF Differential Detection
FM Discriminator detector
Noncoherent FSK
MSK modulation
MSK reception
GMSK Demodulator
QAM efficiencies
PN Sequence Generator
Effects of Fading
Outline
Cyclic Code
CRC Code
BCH Code
RS Code
Term paper
Diversity
Linear Code
Deep Fading
Channel Coding
2002 Pearson Education, Inc. Commercial use, distribution, or sale prohibited.
Overcoming Channel
Impairments
Diversity Techniques
EQUALIZATION,
DIVERSITY &
CODING
2002 Pearson Education, Inc. Commercial use, distribution, or sale prohibited.
DIVERSITY
I. Introduction
79
MRC Impairments:
1) Equalization
2) Diversity
3) Channel Coding
Used independently or together
We will consider Diversity and Channel Coding
80
81
83
85
Instantaneous SNR i
1
p( i ) e
Eb
SNR
N0
, the pdf of i
i 0 (6.155)
1
Pr i p( i )d i e
0
0
d i 1 e
86
Pr i 1 PM ( ) 1 (1 e / ) M
The pdf of :
d
M
pM ( )
PM ( ) 1 e
d
M 1
pM ( )d Mx 1 e
x M 1
e x dx, x
k 1 k
87
Pr 1 ,... M (1 e / ) M PM ( )
88
1) Selection diversity
2) Feedback diversity
3) Maximal radio combining
4) Equal gain diversity
89
90
91
Average SNR is 20 dB
Acceptable SNR is 10 dB
Assume four branch diversity
Determine that the probability that one signal
has SNR less than 10 dB
92
Example:
93
94
2) Scanning Diversity
rM Gi ri
i 1
NT N Gi2
i 1
rM2
2 NT
96
( rM is maximized when Gi ri / N
98
i 1
i 1
M i i M
99
100
101
103
104
RAKE Receiver
105
106
107
108
109
In outdoor environments
In indoor environments
Diversity Motivation
Multiple branches
Low correlation between branches
Similar mean powers:
Efficient combiner
Diversity Example
Spatial Diversity
Multiple input multiple out system (MIMO)
Beamforming, smart antenna
Space time coding
Horizontal and Vertical Combining
Frequency diversity
Frequency diversity transmits information on more than one
carrier frequency
Frequencies separated by more than the coherence bandwidth of
the channel will not experience the same fads
Time diversity
Time diversity repeatedly transmits information at time spacings
that exceed the coherence time of the channel
Polarization diversity
Multi-user diversity
Different Diversity
Space Diversity
Time Diversity
Wideband Channel
Simultaneous Transmission
Wastes power and bandwidth
Equalizers
Channel
Spectrum
Frequency
Frequency Diversity
Performance
Example 7.4
Interleaving
Block interleaver where source bits are read into columns and out as n-bit rows
Hamming Code
Combining Techniques
Selection diversity
Feedback diversity
Maximal ration combining
Equal gain diversity
H(7,4)
Generator matrix G: first 4-by-4 identical matrix
Message information vector p
Transmission vector x
Received vector r
and error vector e
Parity check matrix H
Error Correction
Exercise
ReedMuller code
Cyclic code
They posses rich algebraic structure that can be utilized in a
variety of ways.
They have extremely concise specifications.
They can be efficiently implemented using simple shift register
Many practically important codes are cyclic
EXAMPLE
Divider
Example of CRC
Capability of CRC
All single-bit errors if G(x) has more than one nonzero term
All double-bit errors if G(x) has a factor with three terms
Any odd number of errors, if P(x) contain a factor x+1
Any burst with length less or equal to n-k
A fraction of error burst of length n-k+1; the fraction is 1-2^(-(-nk-1)).
A fraction of error burst of length greater than n-k+1; the fraction
is 1-2^(-(n-k)).
BCH Code
Bose, Ray-Chaudhuri, Hocquenghem
Industry standards
(511, 493) BCH code in ITU-T. Rec. H.261 video codec for
audiovisual service at kbit/s a video coding a standard used for
video conferencing and video phone.
(40, 32) BCH code in ATM (Asynchronous Transfer Mode)
BCH Performance
Reed-Solomon Codes
1971: Mariner 9
Mariner 9 used a [32,6,16] Reed-Muller
code to transmit its grey images of Mars.
camera rate:
100,000 bits/second
transmission speed:
16,000 bits/second
More recently
Turbo codes
were invented,
which are used in
3G cell phones,
(future) satellites,
and in the CassiniHuygens space
probe [1997].
Other modern codes: Fountain, Raptor, LT, online codes
Modern Codes
Imperfectness of a given code as the difference between the code's required Eb/No to attain a
given word error probability (Pw), and the minimum possible Eb/No required to attain the same Pw,
as implied by the sphere-packing bound for codes with the same block size k and code rate r.
Speech-Coding Techniques
Introduction
Advantages for VoIP
Digital streams of ones and zeros
The lower the bandwidth, the lower the quality
Voice Quality
Bandwidth is easily quantified
MOS, Mean Opinion Score
ITU-T Recommendation P.800
Excellent 5
Good 4
Fair 3
Poor 2
Bad 1
A minimum of 30 people
Listen to voice samples or in conversations
3-147
Toll quality
A MOS of 4.0 or higher
3-148
P.800 recommendations
ITU-T P.861
faithfully represent human judgement and perception
algorithmic comparison between the output signal
and a know input
type of speaker, loudness, delay, active/silence
frames, clipping, environmental noise
3-149
Speech
Speech sounds
The vocal cords vibrate open and close
Interrupt the air flow
Quasi-periodic pluses of air
The rate of the opening and closing the
pitch
A high degree of periodicity at the pitch period
2-20 ms
3-151
Voiced sound
3-152
2002 Pearson Education, Inc. Commercial use, distribution, or sale prohibited.
Voiced speech
Power spectrum density
3-153
Unvoiced sounds
3-154
2002 Pearson Education, Inc. Commercial use, distribution, or sale prohibited.
unvoiced speech
Power spectrum density
Plosive sounds
Voice Sampling
discrete samples of the waveform and
represent each sample by some number of
bits
A signal can be reconstructed if it is sampled
at a minimum of twice the maximum freq.
Human speech
300-3800 Hz
8000 samples per second
A-to-D
3-156
Quantization
3-158
Non-uniform quantization
Waveform codecs
3-161
Hybrid codecs
G.711
The most commonplace codec
If uniform quantization
12 bits * 8 k/sec = 96 kbps
Non-uniform quantization
64 kbps DS0 rate
mu-law
North America
A-law
Other countries, a little friendlier to lower signal levels
DPCM
Only transmit the difference between the predicated value and
the actual value
Voice changes relatively slowly
It is possible to predict the value of a sample base on the
values of previous samples
The receiver perform the same prediction
The simplest form
No prediction
No algorithmic delay
3-163
ADPCM
ADPCM, Adaptive DPCM
Past samples
Factoring in some knowledge of how speech varies
over time
G.721
32 kbps
G.726
A-law/mu-law PCM -> 16, 24, 32, 40 kbps
An MOS of about 4.0 at 32 kbps
3-164
Analysis-by-Synthesis (AbS)
Codecs
Fill the gap between waveform and source codecs
The most successful and commonly used
Hybrid codec
G.728 LD-CELP
CELP codecs
A vector = a set of elements representing various char. of the
excitation
Transmit
Filter coefficients, gain, a pointer to the vector chosen
LD-CELP encoder
Minimize a frequency-weighted mean-square
error
3-167
LD-CELP decoder
G.723.1 ACELP
6.3 or 5.3 kbps
The coder
A band-limited input speech signal
Sampled at 8 KHz, 16-bit uniform PCM quantization
Operate on blocks of 240 samples at a time
A look-ahead of 7.5 ms
A total algorithmic delay of 37.5 ms + other delays
A high-pass filter to remove any DC component
3-169
Both mandatory
Can change from one to another during a
conversation
6.3kbps
24 octets/frame
5.3kbps
20
SID frame 4
G.723.1 Annex A
8 kbps
Input frames of 10 ms, 80 samples for 8 KHz
sampling rate
5 ms look-ahead
Algorithmic delay of 15 ms
G.729
G.729.B
Based on analysis of several parameters of the input
The current frames plus two preceding frames
a lower-rate extension
6.4 kbps; 10 ms speech samples, 64 bits/frame
MOS 6.3 kbps G.723.1
G.729 Annex E
a higher bit rate enhancement
the linear prediction filter of G.729 has 10 coef.
that of G.729 Annex E has 30 coef.
the codebook of G.729 has 35 bits
that of G.729 Annex E has 44 bits
118 bits/frame; 11.8 kbps
3-174
G.729 Annex D
Other Codecs
Variable-rate coder
Two most common rates
The high rate, 13.3 kbps
A lower rate, 6.2 kbps
Silence suppression
For use with RTP, RFC 2658
3-175
GSM 06.60
An enhanced version of GSM Full-Rate
ACELP-based codec
The same bit rate and the same overall
packing structure
12.2 kbps
20 ms coding delay
Eight different modes
4.75 kbps to 12.2 kbps
12.2 kbps, GSM EFR
7.4 kbps, IS-641 (TDMA cellular systems)
Change the mode at any time
Offer discontinuous transmission
The SID (Silence Descriptor) is sent in every 8 th frame and is 5
bytes in size
Processing Power
G.728 or G.729, 40 MIPS
G.726 10 MIPS
3-178
3-179
iLBC
Speex
Cascaded Codecs
3-182
2002 Pearson Education, Inc. Commercial use, distribution, or sale prohibited.
Effects of packetization
G.711 is OK
G.723.1 and G.729 can be unintelligible
The ingress gateway needs to intercept
The tones and DTMF digits
Use an external signaling system
3-183
An Internet Draft
Both methods described before
A large number of tones and events
DTMF digits, a busy tone, a congestion tone, a
ringing tone, etc.
3-186
2002 Pearson Education, Inc. Commercial use, distribution, or sale prohibited.
Payload format
message
linguistic code (~ 50 b/s)
motor control
speech production
8000 x 11 = 88 kb/s
Simple and universal but not very efficient
OUT
2002 Pearson Education, Inc. Commercial use, distribution, or sale prohibited.
Better quantization ?
A - law
- law
Differential coding
time
current sample
DPCM
a single predictor
reflecting global
predictability of speech
predictor order up to 4-5
delta modulation - gross
quantization of prediction
error into 1 bit (typically
requires up-sampling well
over the Nyquist rate)
adaptive DPCM
new predictor for every
new speech block
predictor needs to be
transmitted together with
the prediction error
Speech Coders
source
filter
speech
changes slowly
2002 Pearson Education, Inc. Commercial use, distribution, or sale prohibited.
long-term prediction
short-term prediction
time
short-term
long-term
current
sample
LPC vocoder
Receiver
re-introduce high
frequencies in the
simplified residual
(nonlinear
distortion)
Transmitter:
Analysis-by-synthesis
Automatic recognition of
speech
electric signal
(more than 50 kb/s)
prior knowledge
( textbook )
acquired knowledge
( data )
linguistic message
phoneme string
(below 50 b/s)
Auditory perception
how do biological systems
respond to acoustic stimuli
Knowledge of auditory
perception ?
knowledge
160 Hz
hi
hi
pm
hi
pf
The model
pm-f
p1m
170 Hz
200 Hz
hi
hi
hi
hi
hi
P(sound|gender)
sequence of male and
female groups?
f
f0
pf-m
hi
m
f
f0=160 Hz 170 Hz
x
units of speech
(phonemes)
Speech signal ?
histogram
speech signal
correlations
/u/ /o/
/a/
// /iy/
beer
/
uw//ao//ah//eh//ih/
/iy/
it is in the
spectrum !!
Isaac Newton
Inertia in engineering
Steam Engine (1769)
frequency
Short-term Spectrum
time
10-20 ms
histogram
histogram
histogram
correlations
cosine transform of
logarithmic short-term speech
spectral envelope
(cepstrum)
frequency
auditory-like
modifications
auditory-like
spectrum
short-term spectrum
Frequency
resolution of
human ear
decreases with
frequency
FFT
critical-band energy
Perceptual Linear
Prediction (PLP)
Auditory-like modifications
of short-term speech
spectrum prior to its
approximation by all-pole
autoregressive model
critical-band spectral
resolution
equal-loundness
sensitivity
intensity-loudness
nonlinearity
Today applied in virtually
all state-of-the-art
experimental ASR systems
[Hermansky 1990]
/j/
/u/
/ar/
/j/
/o/
/j/
/o/
time
frequency
63 %
12 %
16 %
2%
Spectral resolution of
LDA-derived
spectral basis is
higher at low
frequencies
Critical bands of
human hearing are
narrower at lower
frequencies
Sensitivity to Spectral
Change
(Malayath 1999)
LDA-derived bases
Critical-band fi
2002 Pearson Education, Inc. Commercial use, distribution, or sale prohibited.
Cosine basis
energy of the
signal
resource space
energy of the
signal
resource space
spectrum
Simultaneous Masking
Nonlinear
frequency
resolution of
hearing
Critical bands
up to ~600 Hz
constant
bandwidth
above 1 kHz
constant Q
tone at f
critical
bandwidth
threshold of
perception
of the tone
noise bandwidth
band-pass filtered
noise centered at f
Replace spectral
vector by a
matrix of
posterior
probabilities of
{p(f)} acoustic events
S ( frequency )
pf4
pf5
pf6
frequency
coarticulation
d
2002 Pearson Education, Inc. Commercial use, distribution, or sale prohibited.
Masking in Time
masker
signal
stronger masker
200 ms
time
increase
in threshold
data x
time
Short-term Features?
~10 ms
time
processing
time-frequency distribution of
the linear component of the
most efficient stimulus that
excites the given auditory
neuron
FREQUENCY
TIME [s]
time
200-1000 ms
10-20 ms
time
data x
1-3 Bark
200-1000 ms
200-1000 ms
1-3 Bark
DCT
of
the signal
conventional LP
the signal
2002 Pearson Education, Inc. Commercial use, distribution, or sale prohibited.
spectral domain LP
Hilbert
envelope
of the signal
signal power
spectrum
all-pole
model of
the Hilbert
envelope
all-pole
model of
the power
spectrum
low frequency
signal
prediction
all-pole model
of lowfrequency
Hilbert
envelope
discrete
cosine
transform
prediction
high frequency
all-pole model
of highfrequency
Hilbert envelope
tonality
tonality
time
Critical-band Spectrum From All-pole Models
Of Hilbert Envelopes in Critical Bands
time
class posteriors
processing
( trained NN )
data
processing
( trained NN )
data
time
processing
( trained NN )
some function
of phoneme
posteriors
frequency