Académique Documents
Professionnel Documents
Culture Documents
2.1 Introduction
2.2 Digitization principles 2.3 Text 2.4 Images 2.5 Audio 2.6 Video
Introduction
Analog data, such as the sounds made by a human
voice, take on continuous values. When someone speaks, an analog wave is created in the air. This can be captured by a microphone and converted to an analog signal or sampled and converted to a digital signal.
composite. A simple periodic analog signal, a sine wave, cannot be decomposed into simpler signals. A composite periodic analog signal is composed of multiple sine waves.
Parameters
A sine wave can be represented by three parameters:
Amplitude
The peak amplitude of a signal is the absolute value of
its highest intensity, proportionalto the energy it carries. For electric signals, peak amplitude is normally measured in volts.
Amplitude
Frequency
Period refers to the amount of time, in seconds, a
signal needs to complete 1 cycle. Frequency refers to the number of periods in I s. Note that period and frequency are just one characteristic defined in two ways. Period is the inverse of frequency, and frequency is the inverse of period,
Frequency
Phase
The term phase describes the position of the waveform
relative to time 0. If we think of the wave as something that can be shifted backward or forward along the time axis,phase describes the amount of that shift.
Phase
The conversion of an analog signal into a digital form . is carried out by a an electric circuit called Signal encoder Sampling the amplitude of analog signal at repetitive time intervals and then converting amplitude of each sample into corresponding digital value The conversion of stored digitized samples into their corresponding time- varying analog form is carried out by an electric circuit called Signal decoder .
Process
Process
1. The analog signal is sampled.
2. The sampled signal is quantized. 3. The quantized values are encoded as streams of bits
Multimedia Information Representation Multimedia Information is stored and processed within a computer in a digital form
Codeword: Combination of a fixed number of bits that represents each character, in the case of textual information analogue signal: Signal whose amplitude (magnitude of the sound/image intensity) varies continuously with time Signal encoder: Electrical circuit used for the conversion of an analog signal into a digital form Signal decoder: Electrical circuit that converts stored digitized samples into time-varying analog form
Analog Signals
As mentioned earlier the amplitude of the signal varies continuously with time The Fourier analysis can be used to show that any time varying signal is made up of infinite number of single-frequency sinusoidal components
Analog Signals
The range of frequencies of the sinusoidal components that make up the signal is called the signal bandwidth
Speech bandwidth: 50Hz 10kHz Music Bandwidth: 15Hz 20kHz
Continued
To transmit an analog signal through a network the bandwidth of the transmission channel should be equal to or greater than the signal bandwidth
If the bandwidth of the channel is less than the signal bandwidth than channel is called the bandlimiting channel
Encoder Design
Encoder Design
Encoder Design
The Encoder consists of bandlimiting filter and an
signal at regular intervals and holds the sampled amplitudes between samples
Quantizer: Converts the samples into their corresponding
binary form
Sampling Rate
Nyquist sampling theorem: To obtain an accurate
representation of a time-varying analog signal, its amplitude must be sampled at a minimum that is equal to or greater than twice the highest sinusoidal frequency component that is present in the signal
Nyquist rate is represented in samples per seconds (sps) Antialiasing filter: Another name for bandlimiting
filter. Since it passes frequencies that are within the Nyquist rate
Quantization Intervals
Representation of the analogue samples require an infinite number of digits
Quantization Intervals
Quantization error is the difference between the actual signal amplitude and the corresponding nominal amplitude (also known as quantization noise since values vary randomly)
Dynamic Range
With high-fidelity music it is important to be able to hear very quiet passages without any distortion created by quantization noise Dynamic range is defined as the ratio of the maximum signal amplitude to the minimum. D = 20 log10 (Vmax/Vmin) dB
Decoder Design
A signal decoder is an electronic circuit that performs the conversion prior to their output back again into their analog form through a digital-to-analogue converter and a low pass filter
Encoder+decode= Codec
Low-pass filter: Only passes those frequency components that were filtered through the bandlimiting filter in the encoder
2.3 Text
Three types of text Unformatted text Formatted text hypertext
Text
Unformatted text: Known as plain text; enables
pages to be created which comprise strings of fixed-sized characters from a limited character set Formatted Text: Known as richtext; enables pages to be created which comprise of strings of characters of different styles, sizes and shape with tables, graphics, and images inserted at appropriate points
(Each comprising formatted text) to be created which have defined linkages between them
Control characters
The American Standard Code for Information Interchange is one of the most widely used character sets and the table includes the binary codewords used to represent each character (7 bit
The characters in columns 010/011 and 110/111 are replaced with the set of mosaic characters; and then used, together with the various uppercase characters illustrated, to
Although in practice the total page is made up of a matrix of symbols and characters which all have the same size, some simple graphical symbols and text of larger sizes can be constructed by the use of groups of the basic symbols
Formatted Text
It is produced by most word processing packages and used extensively in the publishing sector for the preparation of papers, books, magazines, journals and so on.. Documents of mixed type (characters, different styles, fonts, shape etc) possible.
Format control characters are used
Hypertext can be used to create an electronic version of documents with the index, descriptions of departments, courses on offer, library, and other facilities all written in hypertext as pages with various defined hyperlinks.
An example of a hypertext language is HTML used to describe how the contents of a document are presented on a printer or a display; other mark-up languages are: Postscript, SGML (Standard Generalized Mark-up language, Tex, Latex.
2.4 Images
Image are displayed in the form of a two-dimensional
Graphics
Shape can be drawn using pencil or paint brush or
mouse. Can change shape ,color , size can create gallery known as clip art. Also add shadows in case of three dimensional. VGA-video graphics array. Images has attributes. Color fill and rendering.
2.4.1 Graphics
Two forms of representation of a computer graphic: a high-level version (similar to the source code of a high-level program) and the actual pixel-image of the graphic (similar to the byte-string corresponding to the low-level machine codebitmap format) Standardized forms of representation such as GIF (graphical interchange format) and TIFF (tagged
over complete screen Discrete horizontal lines from top left corner to botom right corner. Progressive scanning Each complete set of horizontal scan is called a frame. Picture tube coated with light sensitive phosphor emits light when energized by electron beam. Brightness depends on power in electron beam.
an image sensor Silicon chip consists of a two-dimensional grid of lightsensitive cells called photosites Stores the level of intensity of light falls on it. A widely-used image sensor is a charge-coupled device (CCD) Fig 2.16
2.5 Audio
Typical Audio Types
Speech signal for interpersonal application such as (video) telephony Music-quality audio such as CD synthesizer is an electronic instrument that utilizes multiple sound generators to create complex waveforms microphone Basics on Audio Signals
1. Human speech: 50Hz -10KHz (4Khz in a plain-old-telephone system) - 2 x 10K - ideally, 12 bits/sample 2. Human audible music: 15Hz - 20KHz - 2 x 20K - ideally, 16 bits/sample
53
2.5 Audio
The bandwidth of a typical speech signal is from 50Hz
through to 10kHz; music signal from 15 Hz through to 20kHz The sampling rate: 20ksps (2*10kHz) for speech and 40ksps (2*20kHz) for music Music stereophonic (stereo) results in a bit rate double that of a monaural(mono) signal
PCM Speech(1)
Companding (compressing/expanding)
Compander (compressor/expander)
Enhanced PCM signals Non-linear (unequal) interval quantization & narrower intervals for smaller amplitude signals
Equal (linear) interval quantization & same level of quantization error Irrespective of the magnitude of the input signal , the same error level for both low (quiet) signals and high (loud) signals is produced
Why companding ?
Because the human ears are more sensitive to noise on quiet signals than it is on loud signals. Hence the effect of quantization noise (error) can be reduced with companding
56
CD-Quality Audio
Human audible bandwidth: 15Hz-20Khz 40Ksps In CD-ROMs, more higher, say, 44.1Ksps & 16-bit/sample used bit rate for channel = sampling rate x bits per sample = 44.1 x 103 x 16 = 705.6 Kbps total rate required for stereophonic music = 2 x 705.6 = 1.411 Mbps
57
Synthesized audio
Used in multimedia applications which required to
store digitized waveform of lesser magnitude. Midi-music interface digital interface. Three components-computer, keyboard and set of sound generators. Takes input from keyboard and generate digital from analog .
times per second to avoid flicker A refresh rate of 25 times per second is sufficient Field: the first comprising only the odd scan lines and the second the even scan lines . Each frame is refreshed at 30/25 frames per second. In PAL 625 lines and 50 refresh rate In NTSC 525 lines and 60 refresh rate.
television receiver using a technique known as interlaced scanning Fig 2.19 The three main properties of a color source
Brightness: represents the amount of energy varies on
grayscale from black to white Hue: this represents the actual color of the source Saturation: this represents the strength or vividness of the color.
a source .amount of white light it contains The hue and saturation are referred to as its chrominance Ys 0.299 Rs 0.587 Gs 0.144 Bs
Where Ys is the amplitude of the luminance signal and Rs,Gs and Bs are the magnitudes of the three color
component signals
Cb Bs Ys
Cr Rs Ys
Digital Video
For luminance we have 6MHz signal bandwidth and
for two chrominance 3MHz. So sampling rate of 12 ksps and 6ksps is required.
A line sampling rate of 13.5MHz for luminance and
and for 11.56 microseconds beam is turned off . so 52 seconds in actual. Therefore scanning rate =702 samples per line.
The number of samples per line is increased to 720
samples The numbers 480 and 576 being the number of active (visible) lines in the respective system Each line sampled at constant rate with fixed number of samples per line called as orthogonal and orthogonal sampling.
applications Interlaced scanning is used and the absence of chrominance samples in alternative lines The same luminance resolution but half the chrominance resolution Fig2.22
The source intermediate format (SIF) give a picture quality comparable with video recorders(VCRs)
2.6.3 PC video