www.jntuworld.com Goal of Audio Coding Terms Coding and Compression are used interchangeably. Goal of audio coding is to develop methods for compact digital representation of audio signals. Efficient transmission or storage. Minimum number of bits with transparent perceptual quality.
Audio Coding p. 2/1
www.jntuworld.com First Generation Audio Coders Digital representation of audio signals. Compact Disc (CD) is the digital storage medium. Sampling frequency is 44.1 kHz and Bit rate 16 bits/sample 20 kHz audio spectrum + 2.05 guard band = 22.05 kHz Sampling freq = 22.05 2 = 44.1kHz . Data rate: 44100 16 = 705.6 kb/s for mono 705.6 2 = 1.41 Mb/s for stereo
Audio Coding p. 3/1
www.jntuworld.com Second Generation Audio Coders For network and wireless multimedia digital audio. Bandwidth is the severe constraint. At the same time, end-users need CD quality. Conflicting requirements. Goal is to reduce data rate without compromising on the perceptual quality. Led to several audio compression algorithms. Exploit both perceptual irrelevancies and statistical redundancies.
Audio Coding p. 4/1
www.jntuworld.com Third Generation Audio Coders Lossless audio Spatial audio Real-time source localization Head related transfer function (HRTF) Immersive audio
Audio Coding p. 5/1
www.jntuworld.com Audio Coding Methods PCM (1.41 Mb/s). DPCM (0.75 x PCM data rate). ADPCM (0.5 x PCM data rate). Not much data rate reduction. Need for high compression methods driven by potential applications. New approaches for audio coding based on the principles of psychoacoustics.
Audio Coding p. 6/1
www.jntuworld.com Psychoacoustics Characterizing human auditory perception. Time-frequency analysis capabilities of the inner ear. Perceptually irrelevant audio signal information. Contributions from psychoacoustics: Perceptual entropy Auditory filter bank Perceptual entropy deals with estimate of the fundamental limit of transparent audio signal compression. Auditory filter bank based on the time-frequency analysis capabilities of the inner ear.
www.jntuworld.com Block Diagram of Generic Audio Coder
Audio Coding p. 9/1
www.jntuworld.com Principle of Generic Audio Coder Segment input signals into quasi-stationary frames of 2-50 ms. Time-frequency analysis estimates the temporal and spectral components of each frame. TFA approach employed is based on human auditory system. Objective is to extract a set of time-frequency parameters that are robust to quantization according to a perceptual distortion metric. Perceptual distortion control is achieved by a psychoacoustic signal analysis section that estimates signal masking power based on psychoacoustic principles.
Audio Coding p. 10/1
www.jntuworld.com Principle of AC (contd.) Psychoacoustic model delivers masking thresholds that quantify the maximum amount of distortion at each point in the time-frequency plane such that quantization of the time-frequency parameters does not introduce audible artifacts. Psychoacoustic model allows the quantization section to exploit perceptual irrelevancies. Final redundancy removal based on the perceptual entropy coding scheme.
Audio Coding p. 11/1
www.jntuworld.com Audio Coder Attributes Audio reproduction quality. Operating bit rates. Computational complexity. Codec delay. Channel error robustness. High quality audio at low bit rates (< 32 kb/s) with an acceptable algorithm delay (5-20 ms), and with low computational complexity (1-10 MIPS).
Audio Coding p. 12/1
www.jntuworld.com Types of Audio Coders Based on the signal model or analysis-synthesis technique. LP Transform Subband Sinusoidal
Audio Coding p. 13/1
www.jntuworld.com AC-Expt.1 Effect of Sampling Frequency and Bit Resolution
Objective is to analyze the effect of sampling frequency
and bit resolution on the perceptual quality of audio. Take a CD quality music signal of 1 sec, sampled at 44.1 kHz with 16 bits/sample and perform the following. Change its sampling frequency to 16, 8 and 4 kHz. Keep bit resolution constant at 16 bits/sample. Consider about 50 ms segment in a high energy region. Plot the time domain and DFT spectra for all the four cases. Comment on the effect of different sampling frequency. Comment also on the perceptual quality of the audio. Audio Coding p. 14/1 www.jntuworld.com AC-Expt.1 Effect of Sampling Frequency and Bit Resolution
Change its bit resolution to 8, 4 and 1 bits/sample.
Keep sampling frequency constant at 44.1 kHz. Consider the same 50 ms segment in a high energy region. Plot the time domain and DFT spectra for all the four cases. Comment on the effect of different bit resolutions. Comment also on the perceptual quality of the audio.