
www.jntuworld.com
Audio Coding
Introduction
S. R. M. Prasanna

Dept of ECE,
IIT Guwahati,
prasanna@iitg.ernet.in

Audio Coding p. 1/1


Goal of Audio Coding
Terms Coding and Compression are used
interchangeably.
Goal of audio coding is to develop methods for compact
digital representation of audio signals.
Efficient transmission or storage.
Minimum number of bits with transparent perceptual
quality.

First Generation Audio Coders
Digital representation of audio signals.
Compact Disc (CD) is the digital storage medium.
Sampling frequency is 44.1 kHz and bit resolution is 16
bits/sample.
20 kHz audio spectrum + 2.05 kHz guard band = 22.05
kHz.
Sampling freq = 22.05 kHz x 2 = 44.1 kHz.
Data rate:
44100 x 16 = 705.6 kb/s for mono
705.6 x 2 = 1.41 Mb/s for stereo
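
The arithmetic above can be checked with a few lines of Python. This is only an illustrative sketch; the function name `pcm_bit_rate` is an assumption made here and does not come from the slides.

```python
# CD audio parameters as stated on the slide
SAMPLE_RATE_HZ = 44_100
BITS_PER_SAMPLE = 16
CHANNELS = 2


def pcm_bit_rate(sample_rate_hz, bits_per_sample, channels=1):
    """Return the uncompressed PCM bit rate in bits per second."""
    return sample_rate_hz * bits_per_sample * channels


mono = pcm_bit_rate(SAMPLE_RATE_HZ, BITS_PER_SAMPLE)
stereo = pcm_bit_rate(SAMPLE_RATE_HZ, BITS_PER_SAMPLE, CHANNELS)

print(f"mono:   {mono / 1e3:.1f} kb/s")    # 705.6 kb/s
print(f"stereo: {stereo / 1e6:.2f} Mb/s")  # 1.41 Mb/s
```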

Second Generation Audio Coders
For network and wireless multimedia digital audio.
Bandwidth is the severe constraint.
At the same time, end-users need CD quality.
Conflicting requirements.
Goal is to reduce data rate without compromising on
the perceptual quality.
Led to several audio compression algorithms.
Exploit both perceptual irrelevancies and statistical
redundancies.

Third Generation Audio Coders
Lossless audio
Spatial audio
Real-time source localization
Head related transfer function (HRTF)
Immersive audio

Audio Coding Methods
PCM (1.41 Mb/s).
DPCM (0.75 x PCM data rate).
ADPCM (0.5 x PCM data rate).
Not much data rate reduction.
Need for high compression methods driven by potential
applications.
New approaches for audio coding based on the
principles of psychoacoustics.
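
To see why coding sample-to-sample differences can need fewer bits than plain PCM, here is a minimal first-order DPCM sketch using NumPy. It rests on assumptions made for illustration (previous reconstructed sample as predictor, a fixed uniform step, hypothetical function names) and is not the specific DPCM or ADPCM scheme behind the 0.75 and 0.5 figures above.

```python
import numpy as np


def dpcm_encode(x, step):
    """Closed-loop first-order DPCM: quantize the difference between each
    sample and the previous reconstructed sample."""
    codes = np.empty(len(x), dtype=np.int64)
    prediction = 0.0  # decoder starts from the same value
    for n, sample in enumerate(x):
        codes[n] = int(np.round((sample - prediction) / step))
        prediction += codes[n] * step  # track what the decoder will rebuild
    return codes


def dpcm_decode(codes, step):
    """Rebuild the signal by accumulating the quantized differences."""
    return np.cumsum(codes) * step


fs = 44_100
n = np.arange(441)
x = 1000.0 * np.sin(2 * np.pi * 100.0 * n / fs)  # slowly varying test tone

codes = dpcm_encode(x, step=1.0)
x_hat = dpcm_decode(codes, step=1.0)

# Largest PCM value versus largest DPCM code for the same step size
print(np.max(np.abs(np.round(x))), np.max(np.abs(codes)))
```

For a slowly varying signal such as this test tone, the difference codes span a much smaller range than the samples themselves, which is where the bit saving comes from.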

Psychoacoustics
Characterizing human auditory perception.
Time-frequency analysis capabilities of the inner ear.
Perceptually irrelevant audio signal information.
Contributions from psychoacoustics:
Perceptual entropy
Auditory filter bank
Perceptual entropy estimates the fundamental limit of
transparent audio signal compression.
Auditory filter bank based on the time-frequency
analysis capabilities of the inner ear.

Some Audio Coding Standards
MPEG-1 Audio (1992).
MPEG-2 Audio (1996).
MPEG-4 Audio v1 (1999).
MPEG-4 Audio v2 (2000)

Block Diagram of Generic Audio Coder

Principle of Generic Audio Coder
Segment input signals into quasi-stationary frames of
2-50 ms.
Time-frequency analysis (TFA) estimates the temporal and
spectral components of each frame.
The TFA approach employed is based on the human auditory
system.
Objective is to extract a set of time-frequency
parameters that are robust to quantization according to
a perceptual distortion metric.
Perceptual distortion control is achieved by a
psychoacoustic signal analysis section that estimates
signal masking power based on psychoacoustic
principles.
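
The first two steps above (framing and time-frequency analysis) can be sketched with NumPy as follows. This is a simplified illustration under stated assumptions: non-overlapping 20 ms frames, a Hann window and a plain DFT stand in for the auditory-motivated analysis a real coder would use, and the function name `frame_spectra` is hypothetical.

```python
import numpy as np


def frame_spectra(x, fs, frame_ms=20.0):
    """Split x into non-overlapping frames and return the DFT magnitude
    spectrum of each windowed frame."""
    frame_len = int(round(fs * frame_ms / 1000.0))
    n_frames = len(x) // frame_len
    # Drop trailing samples that do not fill a whole frame
    frames = np.reshape(x[: n_frames * frame_len], (n_frames, frame_len))
    window = np.hanning(frame_len)
    spectra = np.abs(np.fft.rfft(frames * window, axis=1))
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / fs)
    return freqs, spectra


fs = 44_100
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * 1000.0 * t)  # 1 s test tone at 1 kHz

freqs, spectra = frame_spectra(x, fs, frame_ms=20.0)
print(spectra.shape)                  # (frames, frequency bins)
print(freqs[np.argmax(spectra[0])])   # strongest frequency in the first frame
```

The frame length of 20 ms is one choice inside the 2-50 ms range given above; shorter frames follow transients better, longer frames give finer frequency resolution.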

Principle of AC (contd.)
Psychoacoustic model delivers masking thresholds that
quantify the maximum amount of distortion at each
point in the time-frequency plane such that quantization
of the time-frequency parameters does not introduce
audible artifacts.
Psychoacoustic model allows the quantization section
to exploit perceptual irrelevancies.
Final redundancy removal based on the perceptual
entropy coding scheme.
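
One way to picture how a masking threshold can steer quantization: a uniform quantizer with step size d has an average noise power of roughly d^2/12 (under the usual fine-quantization assumption), so the step can be chosen per coefficient to keep that noise at the threshold. The sketch below assumes this simple model; the threshold values are hypothetical numbers, no psychoacoustic model is implemented, and the function names are made up for illustration.

```python
import numpy as np


def quantize_to_threshold(coeffs, mask_power):
    """Quantize each coefficient with the step whose average quantization
    noise power (step**2 / 12) equals the given masking threshold."""
    step = np.sqrt(12.0 * np.asarray(mask_power, dtype=float))
    indices = np.round(np.asarray(coeffs, dtype=float) / step).astype(int)
    return indices, step


def dequantize(indices, step):
    """Map quantizer indices back to coefficient values."""
    return indices * step


coeffs = np.array([120.0, -45.5, 8.2, -0.7])   # example time-frequency parameters
mask_power = np.array([4.0, 1.0, 0.25, 0.01])  # hypothetical masking thresholds

indices, step = quantize_to_threshold(coeffs, mask_power)
recon = dequantize(indices, step)
print(indices)
print(np.abs(coeffs - recon))
```

Where the threshold is high the step is coarse and the indices are small, so fewer bits are spent on components whose distortion would be masked anyway.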

Audio Coder Attributes
Audio reproduction quality.
Operating bit rates.
Computational complexity.
Codec delay.
Channel error robustness.
High quality audio at low bit rates (< 32 kb/s) with an
acceptable algorithm delay (5-20 ms), and with low
computational complexity (1-10 MIPS).

Types of Audio Coders
Based on the signal model or analysis-synthesis
technique.
Linear prediction (LP)
Transform
Subband
Sinusoidal

AC-Expt.1
Effect of Sampling Frequency and Bit Resolution

Objective is to analyze the effect of sampling frequency
and bit resolution on the perceptual quality of audio.
Take a CD quality music signal of 1 sec, sampled at
44.1 kHz with 16 bits/sample and perform the following.
Change its sampling frequency to 16, 8 and 4 kHz.
Keep bit resolution constant at 16 bits/sample.
Consider a segment of about 50 ms in a high energy
region.
Plot the time domain and DFT spectra for all the
four cases.
Comment on the effect of different sampling
frequency.
Comment also on the perceptual quality of the
audio.
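
The sampling-frequency part of the experiment could be started along the following lines. This is a sketch under assumptions: a synthetic two-tone signal stands in for the CD-quality music, resampling is done by truncating the DFT spectrum (one of several possible methods, chosen here because it needs only NumPy), and `fft_resample` is a hypothetical helper name.

```python
import numpy as np


def fft_resample(x, fs_in, fs_out):
    """Downsample x from fs_in to fs_out (fs_out <= fs_in) by keeping only
    the DFT bins below the new Nyquist frequency."""
    n_in = len(x)
    n_out = int(round(n_in * fs_out / fs_in))
    spectrum = np.fft.rfft(x)
    kept = spectrum[: n_out // 2 + 1]
    # Rescale so that signal amplitudes are preserved
    return np.fft.irfft(kept, n=n_out) * (n_out / n_in)


fs = 44_100
t = np.arange(fs) / fs  # 1 s of signal
# Stand-in for the music signal: tones at 1 kHz and 6 kHz
x = np.sin(2 * np.pi * 1000.0 * t) + np.sin(2 * np.pi * 6000.0 * t)

results = {fs: x}
for fs_out in (16_000, 8_000, 4_000):
    results[fs_out] = fft_resample(x, fs, fs_out)

for rate, y in results.items():
    segment = y[: int(0.05 * rate)]            # 50 ms segment
    magnitude = np.abs(np.fft.rfft(segment))   # DFT magnitude spectrum
    print(rate, len(y), len(segment), len(magnitude))
```

The `segment` and `magnitude` arrays are what would be passed to a plotting library for the time-domain and DFT plots. In this synthetic example the 6 kHz tone survives at 16 kHz but is removed at 8 kHz and 4 kHz, since it lies above the new Nyquist frequency.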

AC-Expt.1 (contd.)

Change its bit resolution to 8, 4 and 1 bits/sample.
Keep sampling frequency constant at 44.1 kHz.
Consider the same 50 ms segment in a high energy
region.
Plot the time domain and DFT spectra for all the
four cases.
Comment on the effect of different bit resolutions.
Comment also on the perceptual quality of the
audio.
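
The bit-resolution part could be sketched as below. The assumptions: samples are 16-bit signed integers, the resolution is reduced by discarding the low-order bits (truncation rather than rounding), a synthetic tone stands in for the music signal, and `reduce_bit_depth` is a hypothetical helper name.

```python
import numpy as np


def reduce_bit_depth(x16, bits):
    """Requantize 16-bit integer samples to `bits` bits by discarding the
    low-order bits; the result stays on the 16-bit amplitude scale."""
    shift = 16 - bits
    x = np.asarray(x16, dtype=np.int32)
    return ((x >> shift) << shift).astype(np.int16)


fs = 44_100
t = np.arange(fs) / fs
# Stand-in for the music signal: a 440 Hz tone stored as 16-bit samples
x16 = np.round(30000.0 * np.sin(2 * np.pi * 440.0 * t)).astype(np.int16)

for bits in (16, 8, 4, 1):
    y = reduce_bit_depth(x16, bits)
    segment = y[: int(0.05 * fs)]              # same 50 ms segment each time
    magnitude = np.abs(np.fft.rfft(segment))   # DFT magnitude spectrum
    print(bits, len(np.unique(y)), len(magnitude))
```

With 1 bit/sample only the sign of each sample survives, so the waveform collapses to two levels; comparing `magnitude` across the four cases shows the quantization noise rising as bits are removed.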
