Académique Documents
Professionnel Documents
Culture Documents
discussions, stats, and author profiles for this publication at: http://www.researchgate.net/publication/221415478
CITATION
READS
19
3 AUTHORS, INCLUDING:
Fan Chen
Japan Advanced Institute of Science and T
35 PUBLICATIONS 122 CITATIONS
SEE PROFILE
Fabien Lavigne
TCTS laboratory, University of Mons, Belgium
Fabien.Lavigne@fpms.ac.be
Fan Chen
Xavier Desurmont
Keywords:
sport team video broadcasting, clip segmentation, view type detection, ball detection, region of interest focus
Abstract:
This paper presents a general framework to adapt the size of a sport team video extracted from TV to a
small device screen. We use a soccer game context to describe the four main steps of our video processing
framework: (1) A view type detector helps to decide whether the current frame of the video has to be resized
or not. (2) If the camera point of view is far, a ball detector localizes the interesting area of the scene. (3)
Then, the current frame is resized and centred on the ball, taking into account some parameters, such as the
ball position and its speed. (4) At the end of the process, the score banner is detected and removed by an
inpainting method.
INTRODUCTION
Figure 1: Overview of the system. Process (a) splits video into a set of clips. Each clip is described by its camera point of view
(far, medium, close) and whether it is a replay or not. Processes (b) and (c) can be ran in parallel. Process (b) detects the ball
and estimates parameters of zoom (position and factor). Process (c) detects the banner and applies an inpainting algorithm to
remove it. The last process (d) generates a new video from inpainting video and zoomming parameters.
BALL DETECTION
(a)
(b)
(c)
(d)
Figure 5: Ball detection process: (a) Input image. (b) Soccer field segmentation. (c) Label map after closure. (d) Ball
bounding box in red.
ZOOMING FRAME
Zi = Zi1 +
Ci = Ci1 +
i
1
(Bk Bk1 )
.n k=in+1
(1)
(2)
ADDITIONNAL PROCESSES
RESULTS
(a)
(b)
compromise between zooming on the ball and including most of the players in the zooming frame. It also
could be very interesting in a near future to evaluate
the performance of our system with a subjective assessment method proposed by International Telecommunication Union (ITU) in the report (ITU-R, 2009).
Such evaluation will allow us to measure the impact
of our system on human perception but also to select optimal input parameters. To conclude, we are
about to integrate this system in a real-time streaming
server/client architecture based on the one proposed
by (Bomcke and Vleeschouwer, 2009).
(c)
Figure 8: (a) Original score banner. (b) Banner is inpainted.
(c) Rectified incoherent image.
CONCLUSION AND
PERSPECTIVES
ACKNOWLEDGEMENTS
This work has been funded by the Wallon region in the framework of WALCOMO and 3DMEDIA projects.
REFERENCES
Ariki, Y., Kubota, S., and Kumano, M. (2006). Automatic
production system of soccer sports video by digital
camera work based on situation recognition. In ISM
06: Proc.s of the Eighth IEEE International Symposium on Multimedia, pages 851860, Washington,
DC, USA. IEEE Computer Society.
Bomcke, E. and Vleeschouwer, C. D. (2009). An interactive
video streaming architecture fo h.264/avc compliant
players. In Multimedia and Expo, 2009. ICME 2009.
IEEE International.
Delannay, D., Roover, C. D., and Macq, B. (2003). Temporal alignment of video sequences for watermarking
systems. pages 481492, Santa Clara, USA. SPIE.
Ekin, A., Tekalp, A. M., and Mehrotra, R. (2003). Automatic soccer video analysis and summarization. Image Processing, IEEE Trans. on, 12(7):796807.
ITU-R (2009). Recommandation bt.500-11 : Methodology
for the subjective assessment of the quality of television pictures. Technical report.
Knoche, H., Papaleo, M., Sasse, M. A., and Vanelli-Coralli,
A. (2007). The kindest cut: enhancing the user experience of mobile tv through adequate zooming. In
MULTIMEDIA 07: Proc.s of the 15th international
conference on Multimedia, pages 8796, New York,
NY, USA. ACM.
Pan, H., van Beek, P., and Sezan, M. I. (2001). Detection
of slow-motion replay segments in sports video for
highlights generation. In ICASSP 01: Proc.s of the
Acoustics, Speech, and Signal Processing, 2001. on
IEEE International Conf., pages 16491652, Washington, DC, USA. IEEE Computer Society.
Takahashi, Y., Nitta, N., and Babaguchi, N. (2005). Video
summarization for large sports video archives. Mul-
Figure 9: Left column shows processings (ball detection, banner inpainting and zooming box). Middle column shows original
images resized to the target screen. Right column shows images resized to the target screen.
(a)
(b)
Figure 10: Two problematic situations for our algorithm. (a) The ball is followed at the right of the original image but the
ROI is located at left where a player has been hurt. (b) The ball is lost because of a sudden big shot.
Tschumperle, D. (2006).
Fast anisotropic smoothing
of multi-valued images using curvature-preserving
pdes. Int. J. Comput. Vision, 68(1):6582.
Yu, X., Xu, C., Leong, H. W., Tian, Q., Tang, Q., and
Wan, K. W. (2003). Trajectory-based ball detection
and tracking with applications to semantic analysis of
broadcast soccer video. In MULTIMEDIA 03: Proc.s
of the eleventh ACM international conference on Multimedia, pages 1120, New York, NY, USA. ACM.