Académique Documents
Professionnel Documents
Culture Documents
snellwilcox.com
snellwilcox.com
Abstract The aspect ratio of the display may be anything from nearly
square (or even a “portrait” format) to the 16:9 shape now
The proliferation of video source formats, deliver y mechanisms common in home TV displays. Projection systems have greater
and display devices has significantly increased broadcasters’ flexibility to match ver y widescreen cinema film formats.
needs for aspect ratio conversion. We have taken this as an
opportunity to develop aspect processing, a framework for The multiplicity of deliver y and display platforms means that
producing an arbitrar y shape intelligently and flexibly. aspect ratio conversion has become an important part of the
Aspect processing extends aspect ratio conversion to the broadcast chain.
definition of optimum viewing for a given screen size and
shape. This paper presents new aspect processing technologies,
In this paper, we first review conventional approaches to
which introduce a set of efficient criteria to automatically
aspect ratio conversion and discuss their advantages and
identify regions of interest in the picture and to process the
disadvantages. To address the shortcomings, the wider concept
picture accordingly.
of aspect processing is introduced, in which we attempt to
optimize the viewing experience for a given display shape
This paper also introduces a unique and novel technology – and size. Two sections then introduce new content-aware
video seam carving, which works by altering the technologies for aspect processing: dynamic reframing and
perspective of the image sequence to change between video seam car ving.
aspect ratios.
Figure 3: Cropping
Figure 4: Letterboxing
Aspect Processing
Aspect processing is a generalization of aspect ratio
A compromise approach sometimes used successfully in
conversion. Aspect ratio conversion may be thought of as
consumer TV sets is to combine all three of the above
using a map function that links pixel locations in the input
techniques, “sharing the pain” between them. In our example,
space to pixel locations in the output space. In the examples
the required 25% reduction in aspect ratio could be achieved
above, this map is more or less complicated but is fixed. In
by a 9.14% reduction from each of the three techniques.
aspect processing, we allow the map function to var y smoothly
This would produce the result shown in Figure 5.
from frame to frame in dependence on variations in content.
Aspect processing is therefore a content aware technology.
snellwilcox.com
One important feature of aspect processing is that the map A further refinement of the approach is to include in the
approach provides a convenient framework for these content distance measure a suitably normalized measure of the
aware methods to be applied together and, if desired, in badness of fit of the pixel to a motion model calculated for
conjunction with the fixed aspect ratio conversion techniques each segment. As the segment membership is updated, the
described above. motion models can be refined using an iterative gradient
approach such as that described in [2].
Two approaches to aspect processing will now be presented. In
dynamic reframing, the map function defines a rectangular Foreground-background segmentation from motion
window of variable position and size that attempts to estimation
encompass the region of greatest interest in the scene. In Another approach to foreground-background segmentation is
seam carving, the map function is much more complex and based more directly on motion information. Motion estimation
attempts to remove (or to expand) areas all over the picture can be applied to many image processing tasks, for example
that have least interest while maintaining the integrity of the standards conversion, image restoration or compression. In
areas of greatest interest. many cases, a motion estimator is therefore already available.
Dynamic Reframing The use of motion estimation for segmentation is based on the
assumption that the background motion can be modelled by a
In order to do dynamic reframing, the first problem is to
single, global, vector. The foreground is then an object or a set
segment the image into a region of greatest interest, or
of objects that move in a markedly different way from the
foreground, and the rest of the picture, the background.
background. The larger the difference between the movement
We have developed three methods of generic foreground -
of the object and that of the background, the more likely the
background segmentation, which can be combined and further
object is to be interesting and to be part of the foreground.
modified by genre-specific enhancements.
Dynamic pre-warping
One disadvantage of dynamic reframing is that it removes all
control of the framing from the display. Some viewers may
prefer the full reframing suggested by the segmentation
process, while others may prefer a milder reframing or none at
all. To give flexibility back to the display, it is necessar y to
transmit the whole picture (or at least a full-height, dynamic
pan-scan picture), but if this is done at the display resolution
and the display tries to do its own reframing, the result will
Figure 8 Dynamic pan-scan have unacceptably low resolution.
A novel solution to this problem is to pre-warp the picture [4], The minimum-energy seam can be found using a recursive
assigning more pixels and hence higher resolution to the technique in which we calculate best partial seams leading to
region of interest, at the expense of fewer pixels outside the each pixel on successive rows of the picture until we have a
region of interest. If the display chooses to zoom into the minimum-energy seam leading to each pixel on the bottom
region of interest, sufficient resolution will then be available, row. We simply remove all pixels belonging to this seam from
while if it chooses to display the whole picture, the only the picture, shifting the rest of the picture into the gap to make
penalty is a softening of the picture outside the region of a new picture one pixel narrower than before. This is the
interest. Pre-warping is an attractive idea but it does produce a process of “car ving” a seam from the picture.
non-standard transmitted picture, so the display has to include
an inverse warping function which is controlled by metadata Video seam car ving
and by user input. In our application of seam car ving to aspect processing of
moving sequences, we have made three powerful improvements
Figure 10 shows a pre-warped transmitted picture and to the original algorithm.
examples of zoomed-in and full-picture displays, both derived
from the same transmitted picture. The first is that an element of motion compensated recursion
is introduced, in which the energy function is weighted to
favour placing a seam where we would have expected the
corresponding seam in the previous picture to have moved
to [6].
References
The improvements that we have made to seam car ving for
1. Knee, M.J. Image segmentation algorithms for video re-
video sequences allow a great deal of flexibility. We can
purposing. Paper presented at CVMP, London, November 2006
perform seam car ving independently in both horizontal and
2. Vlachos, T. and Hill, L. Optimal search in Hough parameter
vertical dimensions and combine the resulting maps as
hyperspace for estimation of complex motion in image
described in [6]. We can use seam car ving to expand or
sequences. IEE Proc. Vision, Image and Signal Processing, vol
contract the picture in either or both dimensions. For example,
149, issue 2, Apr 2002, pp 63 - 71
Figure 12 shows the result of seam car ving both horizontally
and vertically, the aim this time being to retain the 16:9 aspect 3. Knee, M.J. International HDTV content exchange. Proc. IBC
ratio but to produce a smaller picture in which objects of 2006.
interest retain their original size and shape. In this example, 4. Knee, M.J. Video transmission. International patent application
the smooth seam car ving approach is not used, so the first PVT/GB2008/050158 filed 5 March 2008 – to be published.
pictures show the “hard” horizontal and vertical seams that are 5. Avidan, S and Shamir, A. 2006. Seam car ving for content-
removed from the picture. aware image resizing. ACM Transactions on Graphics, vol. 26,
no. 3
6. Knee, M.J. Seam car ving for video. Proc. NAB 2008.
Acknowledgements
The authors would like to thank the Directors of
Snell & Wilcox Ltd. for their permission to publish this paper.
There are parallels here with pre-warping, except that here the
“warped” picture is designed to be viewed directly and does
not require an inverse warping operation in the display.
snellwilcox.com