Académique Documents
Professionnel Documents
Culture Documents
Video sequence
Segmentation
1030 sitting
Static posture recognition
Low Level data: person segmentation
Adaptive background removal algorithm :
consecutive frame differences + adaptive reference image
A. Caplier, L. Bonnaud and J. M. Chassery « Robust fast extraction of video objects combining frame
differences and reference image » in Proc. IEEE International Conference on Image Processing, pp. 785-
788, September 2001.
Cb
Method :
thresholding in CbCr plane
with initial thresholds :
Y
Cb Є [86, 140] Cr Є [139, 175]
Low Level data: temporal tracking
Computation of SRBBs overlap :
forward backward result
T-1
V. Girondel, A. Caplier, and L. Bonnaud « Hands Detection and Tracking for Interactive Multimedia
Applications » in Proc. International Conference on Computer Vision and Graphics, pp. 282-287,
September 2002.
System overview
1. Low level data extraction :
- segmentation
- temporal tracking
- skin detection and face localization
2. Static posture recognition :
- low-level data
- belief theory, definitions, models, data fusion, decision
3. Results :
- training and test sets
- recognition results
- video sequence
Posture recognition: measures
Distance measurements
D1, D2, D3, D4
ideas : person height and shape compactness
Reference posture
Normalization : ri=Di/Diref « Da Vinci Vitruvian man posture »
standing, arms stretched horizontally
i
860 112
SRBB
SPAB
SPAB
SRBB
Posture recognition:belief theory
Advantages :
- use of imprecise, conflictous data
- not computationally expensive (HMMs, NNs…)
Universe : Ω = {Hi} i=1…n 2N subsets A of Ω
frame
Posture recognition: measures modeling
Belief mass distributions : mri , measurements imprecision
Ω=
Example :
Test set :
mean
recognition
rate :
80.8%
Results: video example
Outline
Is it possible to recognize
the facial expression ?
Facial expressions analysis
1 Assumptions and Applications
2 Facial features segmentation: low level data
3 Facial expressions recognition: high level
interpretation
4 Facial expression recognition based on audio: a
step towards a multimodal system
Facial features extraction:
models choice
P6
P3 P7
P5
P1 P2
P4
P2 P3 P4
x
x x
P1 P5
x x x
P6
E
p cercle
I ( p ).n ( p )
n(p) I(p)
Facial features extraction:
models deformations (2)
Joy :
D2
D1
{open mouth} => D3 >Dn3 and D4 >Dn4 ,
{mouth corners backwards} => D5 < Dn5 ,
{slakened brows} => D2 no modification
D5
D3
Surprise :
{raising up brows} => D2 >Dn2,
{stared eyes} => D1 >Dn1,
D4 {open mouth} => D3<Dn3 and D4>Dn4.
Facial expressions recognition:
distances discretisation
D2
Doubt modelling :
SC+ : Di > Dni (state S C+)
SC- : Di < Dni (state S C-)
D5
D5 evolution (sourire)
Facial expressions recognition:
basis of rules
D3 D5
D1 D2
D4
joy C- S / C- C+ C+ C-
Surprise C+ C+ C- C+ C+
disgust C- C- S / C+ C+ S/C-
anger C+ C- S S/C- S
sadness C- C+ S S S
fear S / C+ S / C+ S / C- S / C+ S
neutral S S S S S
Facial expressions recognition:
evidence mass distribution and modelling
To each Di is associated the following mass of evidence :
m Di:
2 [0,1]
A m Di
(A )
Modelling mDi
Di
Hammal-Caplier database
Cohn-Kanade database
Facial expressions recognition:
results (2)
E2 Surprise 0 12% 0 0
E3 disgust 0 0 43,10% 0
E1 E3 10,90% 0 8,62% 0
E2 E6 0 72,44% 0 0