MPEG-4
MPEG-4 addresses the need for
Mixing of natural and synthetic audiovisual information
High interactivity in the presentation of multimedia content
Deployment of communication systems for real-time or broadcast delivery of coded data streams
A new approach for describing, coding and presenting a scene
MPEG-4 combines different coding tools for
4/20/2012
MPEG-4 Objects
The audio/video components of MPEG-4
Objects are coded and transmitted separately, then composed at the decoder site
They can exist independently
Multiple objects can be grouped together to form complex objects
Video and audio can be easily manipulated
Permits choosing appropriate coding tools for audio, video and graphics objects
MPEG-4 Coding
The scene is composed and rendered at the sender site
video frames and audio are coded, multiplexed and transmitted
tools for coding arbitrarily shaped objects
At the receiver the stream is demultiplexed
video and audio are decoded, composed, synchronized and presented as defined at the sender's site
Object Coding
Objects are described mathematically (e.g. by their positions)
similarly for audio and graphics objects
an object need only be defined once
the viewer can change the object's position
transmit calculations to update the scene at the receiver
this is a critical feature when the response has to be fast and bit-rate is limited
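The idea of defining an object once and then moving it with small update commands can be sketched as follows. This is only an illustration under stated assumptions: the `SceneObject` class, the `apply_update` function and the `(name, dx, dy)` update format are hypothetical and are not taken from the MPEG-4 scene description.

```python
from dataclasses import dataclass


@dataclass
class SceneObject:
    """Hypothetical scene object: the heavy payload is sent once, the position changes later."""
    name: str
    payload: bytes   # stands in for the coded shape/texture data
    x: float = 0.0
    y: float = 0.0


def apply_update(scene, update):
    """Apply a small (name, dx, dy) update to the receiver's copy of the scene."""
    name, dx, dy = update
    obj = scene[name]
    obj.x += dx
    obj.y += dy


# The object is transmitted once (10,000 bytes in this toy example) ...
scene = {"ball": SceneObject("ball", payload=b"\x00" * 10_000, x=10.0, y=20.0)}

# ... and afterwards only tiny position updates are sent.
updates = [("ball", 5.0, 0.0), ("ball", 5.0, -2.0)]
for update in updates:
    apply_update(scene, update)

print(scene["ball"].x, scene["ball"].y)
```

Each update carries three values instead of the full payload, which is why this style of coding helps when the bit-rate is limited.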
scene graph
MPEG-4 Profiles
Profiles: MPEG-4's definitions of tool subsets for audio, visual and graphics information
Levels: define the computational complexity of a profile's tool subset
Certain combinations of profiles fit well together
Alpha shape (gray scale) coding: each pixel is assigned a value for its transparency
objects can be smoothly blended into a background or with other objects
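Blending with a gray-scale alpha shape can be sketched as below. The 8-bit alpha range (0 = fully transparent, 255 = fully opaque) and the function names `blend_pixel` and `blend_row` are assumptions made for this example, not definitions from the standard.

```python
def blend_pixel(fg, bg, alpha):
    """Blend one 8-bit gray value fg over bg; alpha runs from 0 (transparent) to 255 (opaque)."""
    # Integer arithmetic with rounding to the nearest value.
    return (alpha * fg + (255 - alpha) * bg + 127) // 255


def blend_row(fg_row, bg_row, alpha_row):
    """Blend a row of object pixels over a row of background pixels."""
    return [blend_pixel(f, b, a) for f, b, a in zip(fg_row, bg_row, alpha_row)]


foreground = [200, 200, 200, 200]
background = [50, 50, 50, 50]
alpha = [0, 85, 170, 255]   # a soft edge: transparency decreases from left to right

print(blend_row(foreground, background, alpha))  # prints [50, 100, 150, 200]
```

The gradual alpha values give a smooth transition between object and background, whereas a binary shape mask would produce a hard edge.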
Visual Objects
Rectangular natural images and scenes are coded using MPEG-1 or MPEG-2
Texture is coded separately by a block-based DCT coding scheme or by wavelets
E.g., weather reports: the weatherman appears to be standing in front of a map which is actually generated elsewhere
Object Segmentation
MPEG does not specify how objects are extracted
video object segmentation is difficult
e.g., record the weatherman's image in front of a color background
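Recording in front of a uniform color background makes segmentation simple enough to sketch in a few lines. The key color, the tolerance and the function name `chroma_key_mask` are assumptions chosen for illustration; MPEG does not prescribe this or any other extraction method.

```python
def chroma_key_mask(image, key=(0, 0, 255), tolerance=60):
    """Return a binary mask: 1 where a pixel belongs to the object, 0 where it is background."""
    mask = []
    for row in image:
        mask_row = []
        for r, g, b in row:
            # Sum of absolute channel differences to the key (background) color.
            distance = abs(r - key[0]) + abs(g - key[1]) + abs(b - key[2])
            mask_row.append(0 if distance <= tolerance else 1)
        mask.append(mask_row)
    return mask


BLUE = (0, 0, 255)       # background color of the studio wall
SKIN = (220, 180, 150)   # a pixel of the recorded person

frame = [
    [BLUE, BLUE, BLUE, BLUE],
    [BLUE, SKIN, SKIN, BLUE],
    [(10, 5, 240), SKIN, SKIN, BLUE],   # slightly noisy background pixel
]

mask = chroma_key_mask(frame)
for mask_row in mask:
    print(mask_row)
```

The resulting mask is a binary shape for the object; real footage would need a more careful color model and edge handling.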
MPEG-4 Applications
MPEG-4 makes video possible even at very low bit-rates (e.g., 10 kb/s)
mobile devices, internet
Scalable objects for low bit-rates
a base layer conveys all the information in some basic quality
one or more enhancement layers can be sent to get better quality
send only the most important objects
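The base layer plus enhancement layer idea can be illustrated with a toy quantizer. This is a simplified sketch, not the scalability tools of MPEG-4: the step size, the non-negative integer samples and the names `encode_layers` and `decode` are assumptions.

```python
def encode_layers(samples, coarse_step=16):
    """Split non-negative integer samples into a coarse base layer and a residual enhancement layer."""
    base = [s // coarse_step for s in samples]
    enhancement = [s - b * coarse_step for s, b in zip(samples, base)]
    return base, enhancement


def decode(base, enhancement=None, coarse_step=16):
    """Reconstruct from the base layer alone, or exactly if the enhancement layer arrived too."""
    if enhancement is None:
        # Base layer only: reconstruct at the middle of each quantization interval.
        return [b * coarse_step + coarse_step // 2 for b in base]
    return [b * coarse_step + e for b, e in zip(base, enhancement)]


samples = [12, 100, 201, 255]
base, enhancement = encode_layers(samples)

print(decode(base))                # basic quality: [8, 104, 200, 248]
print(decode(base, enhancement))   # full quality:  [12, 100, 201, 255]
```

A receiver on a slow link can stop after the base layer and still show every sample in reduced quality.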
Sprites
For coding unchanged backgrounds
The background is defined and coded only once
Must be updated for each change (e.g., when the viewing angle changes)
The sprite is sent only once
New views are created by sending the new positions
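Creating new views from a sprite that was sent once can be sketched as a simple crop. The sprite here is a small grid of numbers standing in for pixels, and `crop_view` with its position format is a hypothetical helper; actual sprite coding also supports warping, which this example leaves out.

```python
def crop_view(sprite, x, y, width, height):
    """Cut the visible window out of a background sprite that the receiver already stores."""
    return [row[x:x + width] for row in sprite[y:y + height]]


# Toy background sprite, 4 rows by 8 columns, transmitted only once.
sprite = [[10 * r + c for c in range(8)] for r in range(4)]

# Afterwards only the window position is sent for each new view.
camera_positions = [(0, 0), (2, 1), (4, 2)]
views = [crop_view(sprite, x, y, width=4, height=2) for x, y in camera_positions]

for view in views:
    print(view)
```

Each new view costs two numbers on the channel instead of a freshly coded background.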
Advanced Features
Map images into computer generated shapes
a 2D or 3D mesh may have an image mapped onto it
a few parameters that deform the mesh generate the impression of a moving picture
rather than sending new images for each change, send commands and parameters to the viewer
pre-defined faces are particularly interesting meshes
the appearance of a face may be left to the decoder (e.g., custom facial models can be downloaded)
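How a handful of parameters can move a whole mesh is sketched below for the 2D case. The parameter set (rotation angle, scale, shift) and the function `deform_mesh` are assumptions for illustration; the standard's own mesh and face animation parameters are not reproduced here.

```python
import math


def deform_mesh(vertices, angle_deg=0.0, scale=1.0, shift=(0.0, 0.0)):
    """Move every 2D mesh vertex using one rotation, one scale factor and one shift."""
    angle = math.radians(angle_deg)
    cos_a, sin_a = math.cos(angle), math.sin(angle)
    moved = []
    for x, y in vertices:
        new_x = scale * (cos_a * x - sin_a * y) + shift[0]
        new_y = scale * (sin_a * x + cos_a * y) + shift[1]
        moved.append((new_x, new_y))
    return moved


# A unit square standing in for a mesh with an image mapped onto it.
mesh = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]

# Four numbers (angle, scale, shift x, shift y) describe the new frame.
new_mesh = deform_mesh(mesh, angle_deg=90, scale=2.0, shift=(5.0, 0.0))
for vertex in new_mesh:
    print(tuple(round(coordinate, 6) for coordinate in vertex))
```

The receiver re-renders the mapped image on the moved vertices, so no new picture data has to be sent for this change.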
MPEG-4 Faces
Images laid over a wire-frame face
Send wire-frame plus parameters
Image reconstruction at the receiver's site
Speech is generated from text in step with motions of the mouth, eyes and lips
MPEG-7
MPEG-7 (2002) focuses on description of multimedia content
modalities: image, speech, video, graphics and their combinations
MPEG-7 complements existing MPEG standards and is applicable even to non-MPEG formats (compressed or uncompressed)
MPEG-7 is driven by trends in technology, market and user needs
Applications: video on demand, news on demand, interactive TV, multimedia information systems etc.
MPEG-7 Elements
1. Descriptors (D): define syntax and semantics of features of audio-visual content
Application independent
Low level: shape, motion, color, camera motion; harmonicity, timbre for audio ...
Semantic level: events, concepts ...
MPEG-7 Descriptions
MPEG-7 allows descriptions at different levels of abstraction
low-level features are extracted automatically
semantic features require human interaction or textual annotation
MPEG-7 does not specify how features are extracted or used (e.g., filtering, retrieval)
their representation must conform to the MPEG-7 standard
MPEG-7 Parts
Systems: specifies functionality at system level
Preparation of descriptions for efficient transport and storage
synchronization of content and descriptors
development of decoders
Description Definition Language (DDL): language for specifying new Ds and DSs
extension of XML schema
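Because the DDL builds on XML schema, a description is an XML document that can be written and read with ordinary XML tools. The sketch below shows this with Python's standard `xml.etree.ElementTree` module. All element and attribute names (`Description`, `Segment`, `Time`, `Color`, `Annotation`) are hypothetical and chosen for readability; they do not follow the actual MPEG-7 schema.

```python
import xml.etree.ElementTree as ET

# Build a small description. The names below are illustrative only
# and are NOT taken from the MPEG-7 schema.
description = ET.Element("Description")
segment = ET.SubElement(description, "Segment", id="shot1")
ET.SubElement(segment, "Time", start="00:00:05", duration="00:00:12")
color = ET.SubElement(segment, "Color")
color.text = "0 0 255"
annotation = ET.SubElement(segment, "Annotation")
annotation.text = "weather report"

# Serialize for transport or storage ...
xml_text = ET.tostring(description, encoding="unicode")
print(xml_text)

# ... and read the description back on the receiving side.
parsed = ET.fromstring(xml_text)
print(parsed.find("Segment/Annotation").text)
```

A real description would additionally be validated against the schema that defines its descriptors, a step this example omits.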
MPEG-7 Visual
Specifies a set of standardized visual Ds and DSs
Color descriptors: color space, quantization
Texture descriptors: homogeneous texture, texture browsing, edge histogram ...
Shape descriptors: for regions or contours
Motion descriptors: camera motion, trajectories, motion activity ...
Face recognition
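To give a feel for a color feature built on a color space and a quantization, the sketch below computes a coarse RGB histogram and compares two images by it. It is a simplified illustration, not one of the normative MPEG-7 color descriptors; the bin layout, the L1 distance and the function names are assumptions.

```python
def color_histogram(pixels, levels=2):
    """Quantize each RGB channel to `levels` values and return the normalized bin counts."""
    step = 256 / levels
    histogram = [0] * (levels ** 3)
    for r, g, b in pixels:
        # Quantized channel indices, clamped to the valid range.
        qr = min(int(r / step), levels - 1)
        qg = min(int(g / step), levels - 1)
        qb = min(int(b / step), levels - 1)
        histogram[(qr * levels + qg) * levels + qb] += 1
    total = len(pixels)
    return [count / total for count in histogram]


def l1_distance(h1, h2):
    """Sum of absolute bin differences; 0 means identical histograms."""
    return sum(abs(a - b) for a, b in zip(h1, h2))


sky = [(10, 20, 250)] * 3 + [(250, 250, 250)]   # mostly blue, one white pixel
sunset = [(250, 20, 10)] * 4                    # all red

sky_hist = color_histogram(sky)
sunset_hist = color_histogram(sunset)

print(sky_hist)
print(l1_distance(sky_hist, sunset_hist))
```

How such features are compared is left to the application; the distance used here is just one common choice.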
MPEG-7 Audio
Specifies standardized audio descriptors and description schemes for pure music, pure speech, sound effects, soundtracks
silence descriptor
spoken content descriptors
sound effects descriptors
melody contour descriptors
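A feature such as silence has to be extracted by the application before it can be described. One simple way, sketched here under assumptions, is a frame-energy threshold. The frame size, the threshold and the function `silent_segments` are illustrative choices and are not prescribed by MPEG-7.

```python
def silent_segments(samples, frame_size=4, threshold=0.01):
    """Return (start, end) sample index ranges whose mean frame energy is below the threshold."""
    segments = []
    start = None
    for i in range(0, len(samples), frame_size):
        frame = samples[i:i + frame_size]
        energy = sum(s * s for s in frame) / len(frame)
        if energy < threshold:
            if start is None:
                start = i            # a silent run begins
        elif start is not None:
            segments.append((start, i))   # the silent run ended at this frame
            start = None
    if start is not None:
        segments.append((start, len(samples)))
    return segments


signal = [
    0.5, -0.4, 0.6, -0.5,     # loud
    0.0, 0.01, -0.01, 0.0,    # near silence
    0.0, 0.0, 0.0, 0.0,       # silence
    0.3, -0.3, 0.2, -0.2,     # loud again
]

print(silent_segments(signal))  # prints [(4, 12)]
```

The resulting index ranges are the kind of information a silence description would then carry alongside the audio.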
basic elements: data types, structures, Ds
content management: content from several viewpoints (creation, usage etc.)
organization of content by collections, classification
navigation and access
user interaction