45

Real Time Pedestrian Detection,
Tracking and Distance Estimation

Keywords: HOG, Lukas Kanade, Pinehole Camera, OpenCV
Omid.Asudeh@mavs.uta.edu # of slides : 30
University of Texas At Arlington
Problem definition (Goal):

In this project, given a stream of video, we want to
detect people, track them, and find their distance in a
real-time manner.
Issues:
Pedestrian Detection
Pedestrian Tracking
Distance Detection
People Detection
What is HOG (Histograms of Oriented Gradients)?:
Answer: It is a kind of feature
descriptor.
Then what is a feature descriptor?
The goal of a feature descriptor is to generalize an object in

a way that same objects (e.g. pedestrians) make as close as
possible descriptors. That way the task of classification
becomes easier.
Descriptors
Objects
Classifier (SVM)
HOG
Yes
Is Pedestrian
No
Is Not Pedestrian
Pedestrian detection Process

4
Challenges of HOG and SVM Classifier:

1. HOG is Computationally Complex
2. It needs the whole body to be detected
3. Classifiers are not exact (it may classify wrongly)
HOG in Open CV :
Input : an image
Output : a list of rectangles each bounding a pedestrian
Idea : Read a video stream frame by frame and do HOG
on each frame and show the results
Slow
Lets try the idea! To make sure
Feature point Tracking

What Does a feature tracker do ?
Answer: Suppose we are given two images. We have the
position of some points on the first image. In the second
image the scene has changed a little. The goal of the tracker
is to find the position of the points in the second image.
Benefit : It is not computationally complex

6
Tracking in OpenCV: OpenCV has implemented the

Lucas-Kanade (LK) traking algorithm
Input: Image A, list of feature points in image A, Image B
Output : List of tracked feature points in the Image B
Idea : Forget about the HOG. Read the video stream

frame by frame, do corner detection, track corners using
LK algorithm from the current frame to the next in a loop.
Not that easy!
seems that you forgot about the pedestrian. Using this algorithm, the
program will track all the feature points (corners) of the image and not
specially those related to the people.
Lets Not try the idea To make sure!

7
HOG detects people but is not fast enough to be used in real-time applications
Tracking corners is a relatively fast operation but it is not specified to people
Idea :
1)Read the video stream frame by frame
2)Do people detection using HOG until finding people.
3)Do corner detection for each people
4)Track corners using LK from the current frame to the
next.
But Why??
Good idea, but it still have some issues!!
ImgA = Get Current frame
Initial Idea
People-list =
Find People
(HOG) in ImgA
No
People found?
Yes
ppeoplelist
ImgB =
Get Next frame
1. Tracked-feature-list =
Track (ImgA,ImgB,Feature-list)
2. Feature-list = Tracked-feature-list
3. ImgA = ImgB
4. Show (Feature-list)
Feature-list + =
findFeaturePoints(p)
Detection
Tracking loop
What if some feature points lost during tracking loop?
10
Developing Idea...

People-list =
Find People
(HOG) in ImgA
No
People found?
ImgB =
Get Next frame
3. ImgA = ImgB
Yes
ppeoplelist
# of tracked features
<Threshold
Feature-list + =
Detection
No
yes
Tracking loop
11
Developing Idea...
People-list =
Find People
(HOG) in ImgA
No
People found?
Yes
ppeoplelist
ImgB =
Get Next frame
3. ImgA = ImgB
# of tracked features No
<Threshold
yes
Tracking loop
Feature-list + =
Detection
This part is not clear,

how do you deal with
finding features of each
person?
12
Okay, Let's talk about more details
What does HOG give us in openCV?

HOG
A list of rectangles
13
What is a rectangle?
So, HOG gives us :
<X,Y,W,H>
<(x1,y1,w1,h1);(x2,y2,w2,h2);(x3,y3,w3,h3),...>
14
Crop each rectangle and do feature detection for it

Then map features
of each rectangle to
the original image
15
Coordinates in
original image
X = X1+x'
Y = Y1+y'
16
It seems that you tackled the complexity issue of

HOG so far, and the algorithm can detect and
track people relatively fast. Also it can track
people even if their full body is not in the screen.
Yes, I did.
You know what? I think there is
something badly wrong with your
algorithm
What??????
17
Challenge :
HOG is not a perfect algorithm. It may consider

wrong objects (like a tree ) as people.
The tree is not moving, so it rarely loses its
feature points during tracking loop.
Therefore, the algorithm recur tracking

loop infinite times (trap), and it will never
go to the detection phase again!!
You may say this is the end of

the story, but there is always
hope!
18
The wrongly recognized object, is not moving,
so it
rarely loses its feature points.
This is the key!

Idea:
If the number of the feature points are not

changing during a while, say in 200 frames,
then the algorithm should suspect a trap state
and break the tracking loop.
Therefore, we can tackle this
problem by adding a simple
condition to the previous
algorithm
19
Dealing with wrong object detection challenge

People-list =
Find People
(HOG) in ImgA
No
People found?
Yes
ImgB =
Get Next frame
3. ImgA = ImgB
No
ppeoplelist
Feature-list + =
(# of tracked features
<Threshold)
OR
(# of features are not
changing for a while )
yes
Detection
Tracking loop
20
Distance recognition
21
Problem definition: Using a pinhole camera,

we want an strategy to estimate the distance
of the pedestrians
Challenge: that is impossible to get the distance of
objects using just a pinhole camera!!
Yes, but what if we have a piece of information?
What kind of information are

you talking about?
Like the height of the camera from the ground!
22
Image plane
v0
It can be proved that:
v
h
f k vh
Z=
vv 0
Where f and k_v are the focal length and

the pixel density(pixel/meter) of the
camera.
23
Challenge: We should find V which is the

height of the base point (where feet of the
pedestrian touch the ground) in the image
plane
But How to find the bottom

point?
24
Let's define the bottom of the rectangle

obtained from detection phase as the
base line.
h_i
h_j
Idea: the average height of the

feature points from the base
line remains unchanged in the
video stream.
Base line
25
Example for 2 points
Base line Estimation
j
V_ i'
Goal
i
d_i
d_ j
V'_b
V_ j'
j'
i'
Detection Rectangle
Keep <d_i, d_j>
The whole image while tracking
V'_b = Average(V_ j'+d_ j , V_ I' +d_ i)
26
Scaling
The closer the pedestrian to the camera the more difference
between the points.
V_ i'
Scaling factor:
V_ j'
j'
i'
= Average
i, j people[ p]
V j' V i'
d id j
V b =V ' b
27
Let's see some videos of the results

of this work!
28
References:
Learning opencv, 1st edition , O'Reilly Media, Inc. 2008
ISBN: 9780596516130, Dr. Gary Rost Bradski, Adrian Kaehler
http://en.wikipedia.org/wiki/Histogram_of_oriented_gradients
https://chrisjmccormick.wordpress.com/2013/05/09/hog-person-detectortutorial/
http://en.wikipedia.org/wiki/Lucas%E2%80%93Kanade_method
29
Omid.Asudeh@mavs.uta.edu
30

45

Transféré par

Informations du document

Copyright

Formats disponibles

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Droits d'auteur :

Formats disponibles

45

Transféré par

Droits d'auteur :

Formats disponibles

Real Time Pedestrian Detection,

Tracking and Distance Estimation

Problem definition (Goal):

The goal of a feature descriptor is to generalize an object in

Pedestrian detection Process

Challenges of HOG and SVM Classifier:

Lets try the idea! To make sure

Feature point Tracking

Benefit : It is not computationally complex

Tracking in OpenCV: OpenCV has implemented the

Idea : Forget about the HOG. Read the video stream

Not that easy!

Lets Not try the idea To make sure!

Tracking corners is a relatively fast operation but it is not specified to people

ImgA = Get Current frame

What if some feature points lost during tracking loop?

ImgA = Get Current frame

This part is not clear,

Okay, Let's talk about more details

What does HOG give us in openCV?

So, HOG gives us :

Crop each rectangle and do feature detection for it

It seems that you tackled the complexity issue of

HOG is not a perfect algorithm. It may consider

Therefore, the algorithm recur tracking

You may say this is the end of

The wrongly recognized object, is not moving,

rarely loses its feature points.

This is the key!

If the number of the feature points are not

Dealing with wrong object detection challenge

Problem definition: Using a pinhole camera,

What kind of information are

Like the height of the camera from the ground!

It can be proved that:

Where f and k_v are the focal length and

Challenge: We should find V which is the

But How to find the bottom

Let's define the bottom of the rectangle

Idea: the average height of the

Example for 2 points

Base line Estimation

Keep <d_i, d_j>

The whole image while tracking

V'_b = Average(V_ j'+d_ j , V_ I' +d_ i)

Let's see some videos of the results

Vous aimerez peut-être aussi