Face Detection by Image Discriminating: Master Thesis Intelligent Software Systems Thesis No: MCS-2006:08 August 2006

Muhammad Tariq Mahmood
Department of Systems and Software Engineering Blekinge Institute of Technology Box 520 SE 372 25 Ronneby Sweden
This thesis is submitted to the School of Engineering at Blekinge Institute of Technology in partial fulfillment of the requirements for the degree of Master of Science in Intelligent Software Systems. The thesis is equivalent to 20 weeks of full time studies.
Author Muhammad Tariq Mahmood
E-mail: mtariqmr@hotmail.com
Advisors:
1. Prof. Rune Gustavsson, E-mail: rune.gustavsson@bth.se
2. Niklas Lavesson, E-mail: niklas.lavesson@bth.se
School of Engineering Blekinge Institute of Technology Box 520 SE 372 25 Ronneby Sweden
Internet: www.bth.se/tek
Phone: +46 457 385000
Fax: +46 457 27125

Acknowledgment
I want to express my gratitude to my supervisors. Prof. Rune Gustavsson's knowledge and experience have impressed me throughout my stay at BTH. Discussions with him have always broadened my vision and have been a source of learning. Without his constant guidance, support and encouragement this thesis would not have been completed. I would also like to thank Niklas Lavesson for his support and help during this work. Discussions with him have always been informative as well as thought provoking. He also helped me to manage this document.
Abstract
Human face recognition systems have gained considerable attention during the last few years. There are many applications with respect to security, sensitivity and secrecy. Face detection is the first and most important step of a recognition system. The human face is non-rigid and varies widely with image conditions, size, resolution, pose and rotation. Its accurate and robust detection has been a challenge for researchers. A number of methods and techniques have been proposed, but because of the huge number of variations no single technique is successful for all kinds of faces and images. Some methods give good results under certain conditions, while others work well with different kinds of images. Image discriminating techniques are widely used for pattern and image analysis. Common discriminating methods are discussed.
Key Words: face detection, Principal Component Analysis, Linear Discriminating Analysis
TABLE OF CONTENTS
1 INTRODUCTION
1.1 OVERVIEW
1.2 MOTIVATION
1.3 AIMS AND OBJECTIVES
1.4 EXPECTED OUTCOME
2 BACKGROUND
2.1 FACE DETECTION
2.2 CHALLENGES
2.3 FACE DETECTION METHODS
2.3.1 Knowledge Based Methods
2.3.2 Template based methods
2.3.3 Feature based methods
2.3.4 Appearance based methods
2.4 CONCLUSION
3 CASE STUDY
3.1 IMAGE DATABASE
3.2 IMAGE DISCRIMINATION TECHNIQUES
3.2.1 Principal Component Analysis
3.2.2 Linear Discriminating Analysis
3.3 PROPOSED SYSTEM
4 KERNEL METHODS
4.1 OVERVIEW
4.2 KERNEL FUNCTIONS
4.3 SUPPORT VECTOR MACHINES
4.3.1 Polynomial
4.3.2 Radial Based
4.3.3 Sigmoid
4.4 KERNEL PRINCIPAL COMPONENT ANALYSIS (KPCA)
4.5 KERNEL FISHER DISCRIMINANT ANALYSIS
4.6 COMPARISON OF PCA AND LDA
5 CONCLUSION
5.1 DISCUSSION
5.2 FUTURE WORK
6 REFERENCES
7 FIGURES & TABLES
1 INTRODUCTION
This chapter presents an overview of the thesis. A few paragraphs describe the purpose and motivation of the work. The objectives of the thesis are presented, and the plan for accomplishing the task is also discussed.
1.1 OVERVIEW
Face detection has gained significant importance during the last few years. It has many applications in the field of computer vision: law enforcement agencies, security organizations, personal identification systems and monitoring applications can all use this technology. Human face detection has traditionally been a challenging task because of factors such as face size, image size, image type and other conditions. Still pictures containing faces may show different poses and angles, and images differ in resolution, camera lighting and contrast. Gray scale and coloured images also have different properties and digital structure. The task becomes even more difficult when faces are occluded by moustaches, beards, glasses, masks and similar objects. Much work remains before optimal accuracy, robustness and computational efficiency in detecting faces are achieved.
A number of approaches have been adopted for face detection, and they can be categorized in different ways. Knowledge based, invariant based and template based methods have been developed. Because the face is non-rigid and may appear with different poses and gestures, most of these methods do not work well and fail to provide sufficient accuracy. During the last few years significant progress has been made with the appearance based approach, which relies on machine learning techniques and algorithms. There are a number of such algorithms, for example back propagation Neural Networks, Naive Bayes, Support Vector Machines and ensemble techniques such as AdaBoost. Support Vector Machine based face detection methods have gained considerable attention during the last few years. Some experiments have been done on the classification of face and non-face classes based on kernel methods.
1.2 MOTIVATION
The Federal Board of Intermediate and Secondary Education (FBISE), Islamabad, Pakistan [30] is an organization responsible for conducting and processing Secondary School Certificate and Higher Secondary School Certificate examinations within its jurisdiction. The examination system is highly sensitive with respect to security and secrecy. A huge number of candidates take exams each year. Each candidate has to submit an application form for each exam. Moreover, candidates may re-submit an application form for a subsequent examination. There is a tendency towards impersonation in the examination centers.
As the number of candidates increases every year, the probability of impersonation rises. Management sometimes faces many problems while investigating such cases. The board is also seeking systems that can automate the process of detecting and recognizing faces in images already captured from the admission forms and stored in its database. For such a system, face detection is the first and an important step. My long association with this organization motivated me to take this project as a case study. A study of the literature and of earlier experiments done using the image database of this organization will lead to the development of a comprehensive system for detecting and recognizing the faces of candidates.
1.3 AIMS AND OBJECTIVES
As a first step, a survey of the different face detection methods proposed by researchers will be done. It will help to build a deep understanding of the face detection methods, their merits and demerits, and the problems faced.
After the study of the different methods, a suitable system is to be proposed, keeping in view the requirements and conditions of the organization.
A few techniques for image discriminating are evaluated. A detailed study of Principal Component Analysis and Fisher's Discriminating Analysis is done.
1.4 EXPECTED OUTCOME
The study will reveal the important aspects of different methods with respect to accuracy, speed and suitability for the specific input conditions at FBISE. The most suitable method will be recommended for the face detection and recognition system for FBISE. The study will also provide guidelines for improving the accuracy and speed of face detection systems, and it will help to generalize the concept to detecting other objects in still images.
2 BACKGROUND
This chapter describes the work done in the area of face detection and recognition. A number of researchers have proposed different methods and approaches. A survey is done to gain a deep understanding of the different techniques, which are then classified according to the basic idea used to solve the problem. A concluding discussion summarizes the advantages and limitations of the different methods.
2.1 FACE DETECTION
Face detection is the first step of a human recognition and identification system. Images are usually used to represent objects. A digital image is a graphic stored in computer memory and consists of pixels. The number of pixels per unit area is known as the resolution, and each pixel has its own characteristics. Detection of the human face has been a challenging task because the face has so many variations and characteristics.
2.2 CHALLENGES
Face localization, detection and then identification or verification has been a challenging task because of a number of factors. The face is non-rigid and has so many variations that no single technique can cope with all of them. That is why, in spite of a large number of algorithms and techniques, a robust system is still far from real implementation. Some of the factors are described here.
POSE: Faces in images may have different poses. The position of the face may vary while the image is captured by a camera or other device. This variation makes face detection difficult because the nose and eyes appear at different angles. Image orientation directly affects the angles of the faces. In addition to face detection, techniques for pose estimation are needed when people are moving and there is a large variation in poses.
ILLUMINATION: Illumination is another big challenge for accurate face detection. When faces are compared, illumination can make a larger difference between images of the same face than the difference between different faces [5, pp 378]. The illumination problem is basically due to lighting. The background and bad light affect the values taken by the eigenvectors.
OCCLUSION: Faces in images are sometimes occluded by other objects. Beards, moustaches, optical lenses and other types of objects make it a challenge to find the accurate face behind them.
FACIAL EXPRESSIONS: The facial expression of a person at the time of imaging makes a wide difference to the face detection process. The same person may have different expressions at different times.
IMAGE CONDITIONS: Image conditions also play an important role in the face detection process. The camera lighting, the background, the distance between camera and person, and the intensity and resolution of the image are important factors. The characteristics of image capturing devices affect the conditions of images and faces as well.
FACE SIZE: The size of faces also makes it difficult to automate a system for face detection and recognition.
2.3 FACE DETECTION METHODS
Researchers have suggested a large number of techniques and algorithms for face detection. The methods can be categorized in different ways, and many methods are combinations of two or more algorithms and techniques. The following categorization is taken from Yang's book [1, pp 10] for images containing a single face.
2.3.1 KNOWLEDGE BASED METHODS
These methods are based upon simple rules that describe the features of the face. The rules are normally relationships between the features of the face. For example, one rule may state that the intensity in the central region of the face is uniform, and another that the eyes on the face are symmetric. Search algorithms guided by the rules are applied to find the target face [1]. The technique is also called top down. Yang and Huang [] introduced a knowledge based method in 1994. It detects faces in three stages or levels. At the first stage high level rules are applied. In the second stage histogram equalization and edge detection are used, and finally eye and mouth features are found. Kotropoulos and Pitas [21] also introduced a knowledge based method.
Human knowledge is difficult to translate into rules in general. It is even more difficult for faces, where someone has to describe a face according to his or her own knowledge. Although in theory it seems simple to develop rule based face detection, in practice these methods are not very useful. If detailed rules are defined, a large number of faces may fail to satisfy them, while a few rules are unable to describe the face exactly. Generalising these methods to moving images and to faces with different poses also decreases accuracy and performance.
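As an illustration of how such rules might be coded, the following minimal sketch checks the two example rules mentioned above on a gray scale window. The function names, the region boundaries and the thresholds are hypothetical choices made for this sketch; they are not taken from [1] or from the methods cited above.

```python
def region_values(img, top, bottom, left, right):
    """Collect gray levels from rows top..bottom-1 and columns left..right-1."""
    return [img[r][c] for r in range(top, bottom) for c in range(left, right)]


def mean(values):
    return sum(values) / len(values)


def looks_like_face(img, max_center_range=30, max_eye_difference=20):
    """Apply two simple hand-written rules to a 2-D list of gray levels."""
    height, width = len(img), len(img[0])

    # Rule 1 (assumed form): the central region has nearly uniform intensity.
    center = region_values(img, height // 3, 2 * height // 3,
                           width // 3, 2 * width // 3)
    center_is_uniform = max(center) - min(center) <= max_center_range

    # Rule 2 (assumed form): the left and right halves of the eye band
    # have a similar average intensity, i.e. the eyes are symmetric.
    eye_top, eye_bottom = height // 4, height // 2
    left_eye = region_values(img, eye_top, eye_bottom, 0, width // 2)
    right_eye = region_values(img, eye_top, eye_bottom, width // 2, width)
    eyes_are_symmetric = abs(mean(left_eye) - mean(right_eye)) <= max_eye_difference

    return center_is_uniform and eyes_are_symmetric


# Hypothetical 12x12 test windows.
flat_window = [[100] * 12 for _ in range(12)]
lopsided_window = [[0] * 6 + [200] * 6 for _ in range(12)]
```

In this sketch the completely flat window satisfies both rules although it contains no face, which illustrates why a few general rules cannot describe a face exactly.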
2.3.2 TEMPLATE BASED METHODS
A large number of face templates are coded into the system. The input face is matched against the stored templates, and faces are located by correlation. Models are also required to map the problem. The approach is simple to implement but is not successful for face detection, as it cannot deal with variations in scale, pose and shape.
A number of methods have been proposed to overcome variations in pose and illumination. P. Sinha [39] used a set of image invariants to describe the face pattern. The brightness of different parts of the face, such as the eyes and nose, changes with the illumination conditions. The method calculates the pairwise ratios of brightness and their directions. A face is localised when all conditions for dark and bright regions are satisfied. The following figure shows the enhanced template, which has 23 defined relations.
Figure 2.3.2.1 Face template defined by P. Sinha for face localization [39].
It is hard to define templates for all faces. These methods are not successful with pose and illumination problems.
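The idea of checking pairwise brightness relations can be sketched as follows. This is only a minimal illustration under stated assumptions: the region names, the four relations and the `min_ratio` threshold are hypothetical, whereas the template in [39] defines 23 relations.

```python
def satisfies_ratio_template(region_brightness, relations, min_ratio=1.1):
    """Return True if every (darker, brighter) pair has the required brightness ratio."""
    return all(
        region_brightness[brighter] >= min_ratio * region_brightness[darker]
        for darker, brighter in relations
    )


# Hypothetical relations: each pair names a darker and a brighter region.
relations = [
    ("left_eye", "forehead"),
    ("right_eye", "forehead"),
    ("left_eye", "left_cheek"),
    ("right_eye", "right_cheek"),
]

# Hypothetical average brightness values per region.
face_like = {"left_eye": 60, "right_eye": 62, "forehead": 150,
             "left_cheek": 140, "right_cheek": 138}
# Halving the illumination leaves every ratio unchanged.
dimmed = {name: value * 0.5 for name, value in face_like.items()}
flat = {name: 100 for name in face_like}
```

Because only ratios are compared, the dimmed window is accepted exactly like the original one, which is the kind of illumination invariance the method aims for.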
2.3.3 FEATURE BASED METHODS
This is also known as the bottom up approach. The main focus is to find the invariant features of the face in a first step; the results are then collected by integrating them. Researchers have proposed a number of methods based on facial features, texture, skin colour and combinations of different features [1, pp 14]. Edge detection, segmentation and histograms are commonly used for extracting facial features, textures, skin colour and other features from images. Leung [33] proposed face detection by matching labelled graphs in 1995, using Gaussian filters and a distribution distance. The initial step is to localise the facial features. Five facial features (two eyes, two nostrils and the nose-lip junction) are considered necessary to form a face. Feature based methods are not successful when faces have different poses and illumination problems. Faces occluded by other objects are also difficult to detect.
2.3.4 APPEARANCE BASED METHODS
These methods have gained considerable attention during the last few years because of their accuracy and efficiency. In contrast with template based methods, the templates are learned by the computer system. Machine learning techniques are widely used for learning the face and non-face classes. There is no need for a separate model to represent the image. All appearance based methods share common properties such as classification into face and non-face classes, pre-processing, learning and post-processing of the images. A brief history of different appearance based techniques is given here.
Figure 2.3.4.1 Face and non-face classes are separated by using a density function for patterns in image data [23].
Sung and Poggio [23] proposed classification of face and non-face classes based upon a distribution distance in 1997. The system is trained on positive and negative examples of face and non-face patterns. The method can be divided into two parts: first a distribution-based model for separating face and non-face patterns, and second a multilayer perceptron classifier.
Figure 2.3.4.2 The distance measures used by Sung and Poggio [23]
Neural network based systems have also been proposed for face detection. Some of them produce better results because the architecture of a neural network can be trained to capture the complex class-conditional density function of face patterns. Computational effort is required for tuning a number of parameters such as the number of nodes, the number of hidden layers, the learning rate and the training time.
Figure 2.3.4.3 System diagram of Rowley's method [1, pp78].
Rowley [24][25] proposed a method based upon a multilayer neural network. The face and non-face classes are learned by the network. The different stages involved in the method are shown in figure 2.3.4.3.
Peichung and Liu [7] suggested a method for face detection using a support vector machine and discriminating feature analysis. By applying discriminating feature analysis, a vector is found by combining the 1-D Haar wavelet representation and amplitude projections. Face class modelling uses the support vector machine for classification of face and non-face classes.
2.4 CONCLUSION
This chapter discussed a number of approaches to face detection. Figure 2.4.1 summarizes them. The pros and cons of each type of method were discussed. The different methods produce good results under certain conditions. Variations in face and image properties affect the results considerably.
[Figure: tree diagram of face detection methods with the branches Knowledge Based, Template Based, Invariant Based and Appearance Based; the diagram also lists Pre Processing, PCA, FDA, Post Processing, Support Vector Machine, Neural Networks, Hidden Markov Model, AdaBoost, Decision Tree (C4.5) and K-Nearest Neighbour.]
Figure 2.4.1 Face detection methods summary
Here different experimental results are analysed for the conclusion. The experiments were conducted using different image databases which are well known in the computer vision research community. Some of them are described here.
MIT Database: It contains faces of 16 people, with 27 face images per person. The images were taken under various conditions, and the faces exhibit illumination and orientation problems [1, pp 44].
UMIST Database: The images in this database show faces in different poses as well as frontal views. The database contains 564 images.
University of Bern: There are 300 images in the database. Photographs of 30 people were taken, with 10 images per person. The images are frontal and were taken under different conditions.
A comparison of different methods is given here. The accuracy is calculated as the number of faces detected by a method divided by the total number of faces. The figures come from different research papers collected in [1, pp 49].
SNo.  Method                           Faces Tested  False Detection  Accuracy %
1.    Distribution based               136           13               81.9
2.    Neural Network                   483           862              92.5
3.    Naive Bayes Classifier           483           88               93.0
4.    Kullback relative information    483           12758            98
5.    Support Vector Machine           136           20               74
6.    Mixture of factor Analyzers      483           82               92.3
                                       136           3                89.4
7.    Fisher Linear Discriminating     483           74               93.6
                                       136           3                91.5
8.    Snow w/ multi-scale features     483           84               94.2
                                       136           3                93.6
9.    Inductive learning               483           N/A              90
Table 2.4.1 Comparison of different face detection techniques
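The accuracy measure described above (faces detected divided by total faces, expressed as a percentage) can be written as a small helper. This is a minimal sketch; the function name and the example counts are hypothetical and are not taken from the table.

```python
def detection_accuracy(faces_detected, total_faces):
    """Percentage of the faces in the test set that a method detected."""
    if total_faces <= 0:
        raise ValueError("total_faces must be positive")
    return 100.0 * faces_detected / total_faces


# Hypothetical example: 120 faces found in a test set containing 136 faces.
example = detection_accuracy(120, 136)
```

False detections are not part of this ratio, which is why the table reports them in a separate column.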
From the above results it can be concluded that a fair evaluation of different methods is very difficult. The methods were tested on different images with different types of variations. The tests normally take a long time, because learning algorithms require more time as the amount of data grows, so this aspect is normally ignored in these experiments. In real world applications, however, it is important and cannot be ignored. The number of images and faces used is also very limited. Systems involving huge image databases require more efficient and robust algorithms.
3 CASE STUDY
This chapter contains the work done for the FBISE organization. On the basis of the study in the previous chapter, a suitable method is proposed. The architecture of the system is discussed, and the design and prototypes of the face recognition system for FBISE are explained. Before images can be used for learning and training, some preparatory work on them is necessary. Because an image is high-dimensional data, it has to be transformed into a suitable lower dimension. There are a number of algorithms for this, such as Principal Component Analysis, Discriminating Feature Analysis, Fisher Analysis and many more. The proposed system also uses the Support Vector Machine algorithm, which is based upon kernel functions. This chapter therefore also contains information about representing the image with kernel functions so that the data can be processed by a Support Vector Machine, and it explains how this algorithm classifies the face and non-face classes in image data.
3.1 IMAGE DATABASE
The Federal Board of Intermediate and Secondary Education scans the photographs from the application forms for each examination. Flatbed scanners are used to capture the images. The face size, angle and pose in each image may vary slightly because of different photographers and scanner operators. The photograph may be coloured or black and white before scanning, but the images are saved as gray scale .bmp images. The resolution is kept between 100 dpi and 150 dpi, which is very low. The purpose is to save memory, as 300,000 images are captured and processed each year. The following figure illustrates the image types. The coloured path leads to the specific images that are stored in the database and will be used for experiments.
[Figure: classification tree of image types with the nodes Images, Videos, Still, Color, B&W, Gray scale, Multi face, Single face, Downward, Upward, Resolution, Lighting, Size, Poses and Occluded.]
Figure 3.1.1 Image type classification
3.2 IMAGE DISCRIMINATION TECHNIQUES
Image discriminating is a primary area when dealing with face detection or other biometric systems such as iris, ear and fingerprint analysis. These techniques play an important role in the analysis of image data and in feature extraction [4, pp22]. The primary objective of using these techniques in image analysis is to reduce the dimensions of the data. A number of techniques are available, but the current study focuses on two widely used techniques, Principal Component Analysis and Discriminant Feature Analysis. Figure 3.2.1 illustrates the discriminative methods.
Appearance based methods use machine learning techniques. There are a number of statistical approaches to classify the data before training. Statistical pattern recognition approaches can be divided into two categories, generative and discriminative [1, pp98]. Generative methods are based upon estimating a higher-order probability distribution over the examples, and for this purpose maximum likelihood (ML) or maximum a posteriori (MAP) estimation is used. Hidden Markov Models (HMM), Markov Random Fields (MRF) and Naive Bayes algorithms are examples of generative methods.
[Figure: tree diagram. Appearance based methods lead to Statistical methods, which divide into Generative methods (Hidden Markov Model, Naive Bayes Classifier) and Discriminative methods (Support Vector Machine, Principal Component Analysis, Fisher's Linear Discriminant).]
Figure 3.2.1 Image discrimination methods
Discriminative methods are based upon finding a decision surface between face and non-face patterns. These methods need face (positive) and non-face (negative) examples to find the decision surface [1, pp 98]. There are advantages and disadvantages to each approach. The discriminative approach is discussed further and will be used for the development of the system for FBISE.
3.2.1 PRINCIPAL COMPONENT ANALYSIS
Principal Component Analysis (PCA) is a statistical technique used to transform a space from a higher dimension to a lower one. It helps to find patterns in data. It uses the standard deviation, mean, variance, covariance and matrix algebra concepts from statistics and mathematics. An image is high-dimensional data, so working with images is resource demanding and complex; this is why Principal Component Analysis is very useful and widely used. All the image data is represented in the form of long vectors. Principal Component Analysis has two variants, based on covariance and on correlation. Before explaining the PCA algorithm, it is worth explaining some terms and definitions used by PCA and other image discriminating techniques.
Some mathematical and statistical terminology is reviewed first. Let $S = \{s_1, s_2, s_3, \ldots, s_n\}$ be a data set. Its mean, denoted by $S_M$, is

\[ S_M = \frac{1}{n} \sum_{i=1}^{n} s_i \tag{3.2.1.1} \]

The standard deviation $SD$ is

\[ SD = \sqrt{\frac{\sum_{i=1}^{n} (s_i - S_M)^2}{n-1}} \tag{3.2.1.2} \]

Variance is very similar to the standard deviation; for the data set $S$ it is calculated as

\[ \mathrm{Var}(S) = \frac{\sum_{i=1}^{n} (s_i - S_M)^2}{n-1} \tag{3.2.1.3} \]

Covariance is another term used in statistics. Standard deviation and variance deal with one-dimensional data, whereas covariance is a similar measure between two dimensions. Consider two data sets $S$ and $L$ with means $S_M$ and $L_M$; the covariance is

\[ \mathrm{Cov}(S, L) = \frac{\sum_{i=1}^{n} (s_i - S_M)(l_i - L_M)}{n-1} \]

If the covariance of a dimension with itself is calculated, it is equal to the variance.

Eigenvectors: For a square matrix $M$, an eigenvector $x$ and its eigenvalue $\lambda$ satisfy

\[ M x = \lambda x \tag{3.3.1.1} \]
\[ (M - \lambda I) x = 0 \tag{3.3.1.2} \]

where $I$ is the identity matrix and $0$ is the zero vector. The non-zero solutions of this equation are the eigenvectors. The number of eigenvectors depends upon the dimension of the matrix.

Eigenvalues: The corresponding values $\lambda_1$ to $\lambda_d$ are known as eigenvalues.
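The definitions above can be checked numerically with a short NumPy sketch. The data sets `S` and `L` and the helper names are hypothetical and chosen only for illustration; `np.linalg.eigh` is used because a covariance matrix is symmetric.

```python
import numpy as np

# Hypothetical one-dimensional data sets.
S = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
L = np.array([2.0, 4.0, 6.0, 8.0, 10.0])


def sample_mean(x):
    return x.sum() / x.size


def sample_variance(x):
    # Sum of squared deviations from the mean, divided by n - 1.
    return ((x - sample_mean(x)) ** 2).sum() / (x.size - 1)


def sample_std(x):
    return np.sqrt(sample_variance(x))


def sample_covariance(x, y):
    return ((x - sample_mean(x)) * (y - sample_mean(y))).sum() / (x.size - 1)


# 2x2 covariance matrix of the two dimensions S and L.
M = np.array([[sample_covariance(S, S), sample_covariance(S, L)],
              [sample_covariance(L, S), sample_covariance(L, L)]])

# eigh returns the eigenvalues in ascending order and the
# eigenvectors as the columns of the second result.
eigenvalues, eigenvectors = np.linalg.eigh(M)
```

For each column `v` of `eigenvectors` and the matching entry `lam` of `eigenvalues`, `M @ v` equals `lam * v` up to rounding, which is the eigenvector equation given above.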
The steps of the algorithm are as follows.
1. Data set: The first step is to choose the dataset for processing. In the case of an image, the dataset is a two-dimensional matrix of order L x M, where L is the number of pixel rows and M is the number of columns. Every pixel can be represented by (x, y) coordinates.
2. Mean subtraction: The mean values x1 and y1 of the x and y coordinates are subtracted from x and y respectively. The new dataset has zero mean.
3. Covariance matrix: The covariance matrix is calculated for the given dataset.
4. Eigenvectors and eigenvalues: The eigenvectors and the corresponding eigenvalues are calculated.
5. Feature vector: The feature vector is formed from the selected eigenvectors.
6. New data set: The new data set is ready for further processing.
The algorithm is repeated until the desired results are obtained.
Figure 3.2.1.1 Principal Component Analysis Algorithm
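The steps of the algorithm can be sketched in NumPy as follows. This is a minimal illustration rather than the implementation used in the thesis; the function name `pca`, the choice to keep the eigenvectors with the largest eigenvalues, and the sample points are assumptions made for this sketch.

```python
import numpy as np


def pca(data, n_components):
    """data has shape (n_samples, n_features); returns the new data set."""
    # Mean subtraction: the centered data has zero mean in every dimension.
    centered = data - data.mean(axis=0)
    # Covariance matrix of the features (columns).
    covariance = np.cov(centered, rowvar=False)
    # Eigenvectors and eigenvalues of the symmetric covariance matrix.
    eigenvalues, eigenvectors = np.linalg.eigh(covariance)
    # Sort from largest to smallest eigenvalue.
    order = np.argsort(eigenvalues)[::-1]
    eigenvalues = eigenvalues[order]
    eigenvectors = eigenvectors[:, order]
    # Feature vector: keep the leading eigenvectors as columns.
    feature_vector = eigenvectors[:, :n_components]
    # New data set: project the centered data onto the feature vector.
    projected = centered @ feature_vector
    return projected, feature_vector, eigenvalues


# Hypothetical two-dimensional data (x, y), reduced to one dimension.
points = np.array([[2.5, 2.4], [0.5, 0.7], [2.2, 2.9], [1.9, 2.2], [3.1, 3.0],
                   [2.3, 2.7], [2.0, 1.6], [1.0, 1.1], [1.5, 1.6], [1.1, 0.9]])
projected, feature_vector, eigenvalues = pca(points, n_components=1)
```

The variance of the projected data along the first component equals the largest eigenvalue, which is why the eigenvectors with the largest eigenvalues are usually the ones kept.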
Principal Component Analysis is limited to linear transformations. In many cases a non-linear transformation produces better and faster results. Kernel Principal Component Analysis (KPCA) provides a non-linear transformation of dimensions. The Support Vector Machine uses the image data transformed by KPCA for classification. KPCA has all the capabilities of PCA with some extensions.
3.2.2 LINEAR DISCRIMINATING ANALYSIS
Linear Discriminating Analysis (commonly called Linear Discriminant Analysis, LDA) finds the linear combination of features that best separates two or more classes. The algorithm can be used for classification as well as for reducing the dimensions of the data. One example of Linear Discriminating Analysis is Fisher's Linear Discriminant. It finds the optimal projection for class separation from the dataset, rather than the projection that captures the most variance. Two terms are normally discussed with Fisher's Linear Discriminant: between-class scatter and within-class scatter. The algorithm finds the projection that maximizes the ratio between these two, i.e. max (between-class scatter / within-class scatter). The output of the algorithm with respect to face detection is known as Fisherfaces. According to [1, pp 112], the between-class scatter matrix is

\[ S_B = \sum_{i=1}^{c} N_i (\mu_i - \mu)(\mu_i - \mu)^T \tag{3.2.2.1} \]

and the within-class scatter matrix is

\[ S_W = \sum_{i=1}^{c} S_{W_i} \tag{3.2.2.2} \]
\[ S_{W_i} = \sum_{x_k \in X_i} (x_k - \mu_i)(x_k - \mu_i)^T, \quad i = 1, 2, 3, \ldots, c \tag{3.2.2.3} \]

where $N_i$ is the number of samples in class $X_i$, $\mu_i$ is the mean of class $X_i$, $\mu$ is the mean of all samples and $c$ is the number of classes. $S_{W_i}$ is the covariance (scatter) matrix of class $X_i$. The optimal projection $W_{opt}$ is the matrix that maximizes the ratio of the determinant of the between-class scatter matrix to the determinant of the within-class scatter matrix, and it is written as

\[ W_{opt} = \arg\max_{W} \frac{|W^T S_B W|}{|W^T S_W W|} = [w_1, w_2, w_3, \ldots, w_m] \tag{3.2.2.4} \]

After this ratio has been maximized, the training set is projected into the (c-1)-dimensional feature space, i.e. $X' = W_{opt}^T X$, and a Gaussian distribution can then be used to model each class-conditional density function. Whether a face is present in an image window or not is decided by the following maximum likelihood rule:

\[ X^{*} = \arg\max_{X_i} P(X' \mid X_i), \quad i = 1, 2, 3, \ldots, c \tag{3.2.2.5} \]
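The scatter matrices, the projection and the Gaussian decision rule can be sketched in NumPy for a small two-class problem. This is a minimal illustration under stated assumptions: the function names and the sample data are hypothetical, the projection is obtained from the eigenvectors of the pseudo-inverse of the within-class scatter matrix times the between-class scatter matrix, and `classify` handles only the one-dimensional projection that results from two classes.

```python
import numpy as np


def fisher_projection(X, y):
    """Return W with c - 1 columns that maximizes between/within class scatter."""
    classes = np.unique(y)
    overall_mean = X.mean(axis=0)
    d = X.shape[1]
    S_b = np.zeros((d, d))
    S_w = np.zeros((d, d))
    for c in classes:
        X_c = X[y == c]
        mu_c = X_c.mean(axis=0)
        diff = (mu_c - overall_mean).reshape(-1, 1)
        S_b += X_c.shape[0] * (diff @ diff.T)      # between-class scatter
        centered = X_c - mu_c
        S_w += centered.T @ centered               # within-class scatter
    # Generalized eigenproblem S_b w = lambda S_w w, solved via the pseudo-inverse.
    eigvals, eigvecs = np.linalg.eig(np.linalg.pinv(S_w) @ S_b)
    order = np.argsort(eigvals.real)[::-1]
    return eigvecs[:, order[:len(classes) - 1]].real


def gaussian_log_likelihood(z, mean, var):
    return -0.5 * (np.log(2 * np.pi * var) + (z - mean) ** 2 / var)


def classify(x, W, X_train, y_train):
    """Pick the class whose 1-D Gaussian gives the projected sample the highest likelihood."""
    z_train = (X_train @ W)[:, 0]
    z = (np.asarray(x, dtype=float) @ W)[0]
    best_class, best_score = None, -np.inf
    for c in np.unique(y_train):
        z_c = z_train[y_train == c]
        score = gaussian_log_likelihood(z, z_c.mean(), z_c.var(ddof=1))
        if score > best_score:
            best_class, best_score = c, score
    return int(best_class)


# Hypothetical training data: class 0 and class 1 in two dimensions.
X_train = np.array([[1.0, 1.0], [2.0, 1.0], [1.0, 2.0], [2.0, 2.0],
                    [5.0, 5.0], [6.0, 5.0], [5.0, 6.0], [6.0, 6.0]])
y_train = np.array([0, 0, 0, 0, 1, 1, 1, 1])
W = fisher_projection(X_train, y_train)
```

With these data the projection direction is proportional to (1, 1), and the two classes no longer overlap after projection.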
3.3 PROPOSED SYSTEM
The previous chapters illustrate different techniques and methods of face detection and recognition. Each category of methods performs well under certain conditions and also has drawbacks. The experiments are carried out using a small number of images (200 to 500). Systems with robustness and a certain level of accuracy are still far away. Keeping the case study in view, the following architecture is proposed for the detection and recognition system.
[Figure: system architecture diagram with the blocks Image Scanned, Image Database, Image Selected, Face Detection and Face Recognition, and the output Recognized / Not recognized.]
Figure 3.3.1 Face Recognition System Architecture
As discussed in the previous chapter, a robust system that meets the needs of a real-world situation such as the one in the case study is a challenging task. The images are scanned with a flatbed scanner and stored in the database. A candidate may interact with the organization again later for subsequent examination(s); the new image is again scanned and stored, so that two images of the same candidate are now in the database. The first step is to select the desired images from the database. To compare them, the next step is to detect the face in each image. The final step is to recognise whether or not the images belong to the same candidate. The abstract architecture of the face detection module is as follows.
[Diagram boxes: Image discriminating Techniques, SVM, Face Class, Non-face Class]
Figure 3.3.2 Face Detection method
The face detection method proposed above is inspired by the method suggested by Peichung Shih and Chengjun Liu [7]. It is a combination of Support Vector Machine and Discriminating Feature Analysis. The method has shown a detection rate of 98.2% and takes comparatively little computation time. Classification into face and non-face takes place in two stages. First, the discriminating feature vector is calculated from the image data, its 1-D Haar wavelet representation and its amplitude projections. A distribution-based measure defined on this vector then assigns each pattern to the face, non-face or undecided class. Patterns in the undecided class are processed again by the Support Vector Machine. In this way the entire image is classified into face and non-face classes.
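The two-stage decision flow can be sketched as follows. This is only an illustration of the control flow, not the method of [7]: the distance-to-class-mean test stands in for the distribution-based measure, the `margin` threshold is hypothetical, and `linear_second_stage` is a placeholder for a trained Support Vector Machine.

```python
import numpy as np

def two_stage_classify(x, face_mean, nonface_mean, margin, second_stage):
    """Label x as 'face' or 'non-face'; ambiguous patterns go to second_stage."""
    d_face = np.linalg.norm(x - face_mean)
    d_nonface = np.linalg.norm(x - nonface_mean)
    if d_face + margin < d_nonface:
        return "face"                 # clearly closer to the face class
    if d_nonface + margin < d_face:
        return "non-face"             # clearly closer to the non-face class
    return second_stage(x)            # undecided: defer to the second classifier

def linear_second_stage(x):
    """Placeholder for a trained SVM decision function (hypothetical)."""
    return "face" if float(np.sum(x)) >= 0.0 else "non-face"

face_mean = np.array([1.0, 1.0])
nonface_mean = np.array([-1.0, -1.0])
points = ([1.0, 1.0], [-1.0, -1.0], [0.1, 0.0])
labels = [two_stage_classify(np.array(p), face_mean, nonface_mean, 0.5,
                             linear_second_stage)
          for p in points]
```

Only the third point falls inside the margin, so it is the only one that reaches the second stage. Keeping the expensive classifier for the undecided patterns is what makes this kind of cascade cheap on average.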
KERNEL METHODS
The previous chapter discussed the use of image discriminating techniques for face detection in the case study under consideration. This chapter presents the kernel versions of Principal Component Analysis and Fisher's Discriminant Analysis. These are advanced forms of the two techniques already discussed, and they fit into the modular framework of kernel methods.
4.1
OVERVIEW
Pattern analysis is the study of the problem of detecting and characterising relations in data. It is needed in many different contexts and problems. In machine learning, patterns are analysed in terms of classification rules, regression functions and cluster structure in the data. When patterns are analysed algorithmically, efficiency, robustness and stability are the main concerns. Kernel methods [23] are used to detect and analyse patterns in data in a modular way.
[Diagram boxes: Kernel Function, Algorithm]
Figure 4.1.1 Kernel methods modular approach
In the first step the data is processed to form a kernel matrix using a kernel function. Any kind of kernel function may be used, depending upon the nature and type of the data, which may be image, text or heterogeneous. The second step is to apply a kernel algorithm to analyse the data. The algorithm takes its information from the kernel matrix only. This approach separates the kernel function from the kernel algorithm, so that any kernel function can be used with any kernel algorithm according to the problem and the data.
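The separation between kernel function and kernel algorithm can be made concrete with a short sketch. The names `kernel_matrix`, `linear_kernel` and `pairwise_feature_distances` are hypothetical, and the distance computation is just one simple example of an algorithm that reads nothing but the kernel matrix.

```python
import numpy as np

def kernel_matrix(kernel, data):
    """Step 1: build the m x m kernel matrix for any kernel function."""
    m = len(data)
    K = np.empty((m, m))
    for i in range(m):
        for j in range(m):
            K[i, j] = kernel(data[i], data[j])
    return K

def linear_kernel(x, z):
    """One possible kernel function: the ordinary dot product."""
    return float(np.dot(x, z))

def pairwise_feature_distances(K):
    """Step 2: an algorithm that only sees K.

    Squared distance in feature space:
    ||phi(x_i) - phi(x_j)||^2 = K_ii + K_jj - 2 K_ij
    """
    diag = np.diag(K)
    return diag[:, None] + diag[None, :] - 2.0 * K

data = np.array([[0.0, 0.0], [3.0, 4.0]])
K = kernel_matrix(linear_kernel, data)
D = pairwise_feature_distances(K)
```

Swapping `linear_kernel` for any other kernel function changes the geometry being measured without touching `pairwise_feature_distances`, which is the modularity the figure describes.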
4.2
KERNEL FUNCTIONS
Kernel functions are used to build the kernel matrix in the modular approach described above. A kernel function makes it possible to apply a linear algorithm to a non-linear mapping of the data without computing that mapping explicitly. This property is what enables support vector machines to classify non-linearly as well. Commonly used kernel functions are polynomial, radial basis function (RBF) and sigmoid kernels. A number of operations can be performed on kernel functions. Some of them are listed here.
1. $K(x,z) = k_1(x,z) + k_2(x,z)$
2. $K(x,z) = a\,k_1(x,z)$
3. $K(x,z) = k_1(x,z)\,k_2(x,z)$
4. $K(x,z) = f(x)\,f(z)$
5. $K(x,z) = k_3(\phi(x), \phi(z))$
Here $k_1$ and $k_2$ are kernels over $X \times X$, $a$ is a non-negative real number, $f$ is a real-valued function on $X$, and $k_3$ is a kernel on the space into which $\phi$ maps $X$ [35, pp 75]. Each operation yields a valid kernel, so complex kernels can be constructed by applying simple operations. Operations on kernels are used for transforming data, centering data, subspace projection, whitening, and sculpting the feature space.
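The five operations listed above can be written as small higher-order functions. The sketch below is illustrative only: the helper names are hypothetical, the base kernel is the plain dot product, and the check at the end simply confirms numerically that the combined kernel matrices stay symmetric and positive semi-definite on random data.

```python
import numpy as np

def add(k1, k2):
    return lambda x, z: k1(x, z) + k2(x, z)        # operation 1

def scale(a, k1):
    if a < 0:
        raise ValueError("a must be non-negative")
    return lambda x, z: a * k1(x, z)               # operation 2

def multiply(k1, k2):
    return lambda x, z: k1(x, z) * k2(x, z)        # operation 3

def from_function(f):
    return lambda x, z: f(x) * f(z)                # operation 4

def compose(k3, phi):
    return lambda x, z: k3(phi(x), phi(z))         # operation 5

def gram(kernel, data):
    """Kernel matrix of a data set under the given kernel."""
    return np.array([[kernel(x, z) for z in data] for x in data])

def linear(x, z):
    return float(np.dot(x, z))

# Two kernels built from simple pieces.
combined = add(scale(2.0, linear), multiply(linear, from_function(np.sum)))
squared = compose(linear, np.square)

rng = np.random.default_rng(1)
data = rng.normal(size=(6, 3))
G1 = gram(combined, data)
G2 = gram(squared, data)
smallest = min(np.linalg.eigvalsh(G1).min(), np.linalg.eigvalsh(G2).min())
```

A value of `smallest` that is zero up to rounding error is what a valid kernel should produce; a clearly negative eigenvalue would indicate that the constructed function is not a kernel.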
4.3
SUPPORT VECTOR MACHINES

A support vector machine is a learning algorithm that classifies data by separating the classes with a maximum-margin hyperplane. The idea was first introduced by Vladimir Vapnik and is based upon statistical learning theory. Support vector machines can also be used for non-linear classification by using kernel functions. The main purpose is to find a plane that lies between the clusters of different kinds of data and separates them clearly. The choice of kernel function is an important decision when using the support vector machine algorithm. Three types of kernel function are commonly used: polynomial, sigmoid and radial basis function kernels. A short description of each is given below.
4.3.1
POLYNOMIAL
Polynomial kernel functions are constructed by raising the dot product of two data vectors to a power. Let A and B be two vectors; then a polynomial kernel may be written as $P(A,B) = (A \cdot B)^n$. The dimensionality of the feature space implied by this function depends on the power $n$ to which the dot product is raised. Polynomial kernels are more suitable for normalised training data sets.
4.3.2
RADIAL BASED
The radial basis function kernel maps data non-linearly into a higher-dimensional space, so it can handle cases where the attributes of different data sets have non-linear relations between them. Radial basis functions have been a popular choice for support vector machines because they involve relatively little numerical computation. For vectors A and B the radial basis function may be written as $RBF(A,B) = e^{-\|A-B\|^2 / 2\sigma^2}$, where $\sigma$ controls the width of the kernel.
4.3.3
SIGMOID
Sigmoid kernel functions take the hyperbolic tangent of the dot product of two vectors. For vectors A and B the sigmoid kernel is $Sig(A,B) = \tanh(A \cdot B + \text{coefficient})$.
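The three kernel functions can be written down directly. This is a minimal sketch rather than a reference implementation: the function names and default parameter values are illustrative, and the radial basis function is assumed to take the standard Gaussian form, with a negative exponent and a width parameter `sigma`.

```python
import numpy as np

def polynomial_kernel(a, b, n=2):
    """(A . B)^n"""
    return float(np.dot(a, b)) ** n

def rbf_kernel(a, b, sigma=1.0):
    """exp(-||A - B||^2 / (2 sigma^2)), assumed standard Gaussian form."""
    diff = np.asarray(a, dtype=float) - np.asarray(b, dtype=float)
    return float(np.exp(-np.dot(diff, diff) / (2.0 * sigma ** 2)))

def sigmoid_kernel(a, b, coefficient=0.0):
    """tanh(A . B + coefficient)"""
    return float(np.tanh(np.dot(a, b) + coefficient))

A = np.array([1.0, 2.0])
B = np.array([3.0, 4.0])

p = polynomial_kernel(A, B)      # dot product is 11, so p is 11 squared
r_same = rbf_kernel(A, A)        # identical vectors give the maximum value 1
r_diff = rbf_kernel(A, B)        # squared distance is 8, so r_diff = exp(-4)
s = sigmoid_kernel(A, B)         # always lies between -1 and 1
```

The RBF value depends only on the distance between the two vectors, which is why it decays from 1 towards 0 as the vectors move apart, while the polynomial and sigmoid kernels depend on the dot product.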
4.4
KERNEL PRINCIPAL COMPONENT ANALYSIS (KPCA)

Kernel Principal Component Analysis is the extension and application of PCA in a kernel-defined feature space. Standard Principal Component Analysis is formulated in the input space and has no facility for working with a dual representation; KPCA does, and so provides a non-linear function for feature analysis. Projections onto the eigenvectors of the feature space are computed through the dual representation, from the eigenvalues and eigenvectors of the kernel matrix. Two things are therefore required in addition to ordinary Principal Component Analysis: computing the kernel matrix, and solving the eigenvalue problem

$$m \lambda \alpha = K \alpha \tag{4.2.1}$$

where $K$ is the kernel matrix of the $m$ training samples, $\lambda$ is an eigenvalue and $\alpha$ is the corresponding vector of dual coefficients.
The KPCA algorithm [35, pp 151] is illustrated here.
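Input: data set $S = \{x_1, x_2, \dots, x_m\}$, number of components $d$
Process:
$K_{ij} = k(x_i, x_j)$, $i, j = 1, 2, \dots, m$
$K \leftarrow K - \frac{1}{m}\, j j^T K - \frac{1}{m}\, K j j^T + \frac{1}{m^2} (j^T K j)\, j j^T$, where the vector $j$ is the all-ones vector
$[V, \Lambda] = \mathrm{eig}(K)$
$\alpha^j = \frac{1}{\sqrt{\lambda_j}}\, v_j$, $j = 1, 2, \dots, d$
$\tilde{x}_i = \left( \sum_{l=1}^{m} \alpha^j_l \, k(x_l, x_i) \right)_{j=1}^{d}$
Output: transformed data set $\tilde{S} = \{\tilde{x}_1, \tilde{x}_2, \dots, \tilde{x}_m\}$
Figure 4.4.1 KPCA algorithm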
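The steps of the KPCA listing translate almost line by line into NumPy. The sketch below is an illustration under stated assumptions, not the implementation from [35]: the function name `kernel_pca` is hypothetical, the centring uses the standard form with a positive last term, and the retained eigenvalues are assumed to be strictly positive.

```python
import numpy as np

def kernel_pca(X, kernel, n_components):
    """Project the training samples onto the leading kernel principal components."""
    m = len(X)
    K = np.array([[kernel(a, b) for b in X] for a in X])   # kernel matrix
    J = np.ones((m, m)) / m                                 # (1/m) j j^T
    Kc = K - J @ K - K @ J + J @ K @ J                      # centring in feature space
    eigvals, eigvecs = np.linalg.eigh(Kc)                   # ascending eigenvalues
    order = np.argsort(eigvals)[::-1][:n_components]        # keep the largest ones
    lam, V = eigvals[order], eigvecs[:, order]
    alpha = V / np.sqrt(lam)                                # alpha^j = v_j / sqrt(lambda_j)
    return Kc @ alpha                                       # row i is the transformed x_i

def linear(x, z):
    return float(np.dot(x, z))

rng = np.random.default_rng(2)
X = rng.normal(size=(20, 3))
Z = kernel_pca(X, linear, 2)

# With a linear kernel the result should match ordinary PCA up to sign.
Xc = X - X.mean(axis=0)
_, _, Vt = np.linalg.svd(Xc, full_matrices=False)
P = Xc @ Vt[:2].T
```

Comparing `Z` with `P` is a useful sanity check: with the linear kernel the two agree column by column apart from the sign of each component, and replacing `linear` with a non-linear kernel gives the non-linear features the section describes.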
4.5
KERNEL FISHER DISCRIMINANT ANALYSIS

As said in earlier sections, the kernel approach has gained considerable attention during the last few years. It has helped to overcome the computational problems of dealing with high-dimensional data. Kernel Fisher Discriminant Analysis (KFDA) is the advanced, kernel version of Fisher Discriminant Analysis. The algorithm is described here. For a given non-linear mapping $\Phi$, the input data space $\mathbb{R}^n$ can be mapped into a feature space $H$ [4, pp 237]. Mathematically this can be written as

$$\Phi : \mathbb{R}^n \to H, \quad x \mapsto \Phi(x)$$

where $H$ is a Hilbert space of much higher, possibly infinite, dimension. The motivation behind Kernel Fisher Discriminant Analysis is to perform Linear Discriminant Analysis or Principal Component Analysis in the feature space $H$. Without the kernel trick this is very difficult [4, pp 237]. Consider a set of samples $\{x_1, x_2, x_3, \dots, x_m\} \subset \mathbb{R}^n$. The covariance operator over the feature space $H$ can then be calculated as

$$S_v = \frac{1}{m} \sum_{j=1}^{m} (\Phi(x_j) - m_0)(\Phi(x_j) - m_0)^T \tag{4.5.1}$$

where $m_0 = \frac{1}{m} \sum_{j=1}^{m} \Phi(x_j)$ is the mean of the mapped samples and $S_v$ is the covariance operator in the feature space $H$. The covariance operator is bounded, compact, positive and self-adjoint [4, pp 237].
4.6
COMPARISON OF PCA AND LDA

As stated in earlier sections, Principal Component Analysis and Linear Discriminant Analysis are two techniques used to reduce the high dimensionality of image data so that processing can be fast and robust. Linear Discriminant Analysis is often considered more efficient than Principal Component Analysis [36]. Principal Component Analysis focuses on finding components in the data regardless of class membership, whereas Linear Discriminant Analysis focuses on finding a linear separation between the classes in the data. Theoretically, Linear Discriminant Analysis therefore seems more suitable than Principal Component Analysis. Experimentally, however, PCA has been found more accurate and has been more commonly implemented in face detection and recognition systems. In [38] Principal Component Analysis and Linear Discriminant Analysis are evaluated and compared using more than 640 FERET face images, with the conclusion that LDA performs uniformly worse than PCA. When PCA and LDA are implemented, a number of other factors, such as the training setup and the computational effort, also affect the results. The kernel versions of PCA and Fisher's Discriminant Analysis have reduced the computational effort considerably. At present it is therefore very difficult to say which of the two methods is more efficient and accurate.
CONCLUSION
This chapter concludes the work done and the study carried out during this thesis. A summary of all chapters is given under the heading Discussion. The section on future work describes intended enhancements of the work and the areas in which this study may be helpful.
5.1
DISCUSSION
Face detection using image discriminating methods has shown reasonable advancement in the area of computer vision and biometric image analysis. Combining these methods with machine learning techniques has produced more accurate and efficient results. Some of the discriminating techniques have been discussed for face detection. The current study explored the area of face detection. A comprehensive survey was carried out to understand the problems and opportunities in the context of computer vision and machine learning. The area demands a lot of research to achieve a higher level of accuracy and to gain computational efficiency. Different approaches and techniques were discussed. Their pros and cons helped to understand the problem and lead to the selection of a proper method according to the nature of the image database. The case study provides a suitable image database for experimenting with the different techniques and methods discussed in chapter 2. In chapter 3 a face detection and recognition system is suggested for the case study, based upon the study carried out in the previous chapters. It was concluded that the method proposed in [7], which combines support vector machines and discriminating feature analysis, has provided good results so far. Kernel methods provide a modular way of analysing patterns in data. Especially when the data is high-dimensional, as images are, the computational effort is considerably reduced by the use of kernels. Chapter 4 discussed the application of discriminating techniques through kernel functions.
5.2
FUTURE WORK
The goal of robust face detection and recognition is still far away. The case study discussed in the current work is interesting for further research and development activities. Because of its variations and its huge number of images, it is a perfect platform for experimenting with algorithms. The case study involves many related problems and areas where the current study may help in the analysis and development of systems. One of these problems is reading handwritten roll numbers, and the image discriminating techniques may be useful for such a project. Considerable work remains, however, to overcome variations when detecting faces in images. The image discriminating techniques studied during the thesis work can be applied to other biometric systems. For example, systems involving the detection and recognition of iris, ear, palm, signature and fingerprints need comprehensive image analysis. These techniques are very helpful for analysing patterns and for classification. The face detection methods based upon image discriminating techniques and machine learning can be extended to detect other objects. For example, in a parking system vehicles can be detected and monitored accordingly. As discussed in chapter two, face detection has been a challenging task due to a number of factors such as pose and illumination. Most of the existing methods and algorithms are unable to cater for these variations. Considerable attention is required to develop efficient methods that can work with and overcome such problems. Research in this area is very active, and the current study will help to lay a solid foundation for it. Kernel methods and support vector machine algorithms have shown better results in overcoming the problems mentioned above. Recently more and more researchers have been performing experiments using image discriminating techniques and support vector machines in the hope of better results.
In the case study under discussion the images show almost all of the variations discussed in chapter two. To develop an automated recognition system, all of these aspects have to be considered. A comprehensive system is suggested in chapter three. The accuracy and efficiency of the system are still a big question mark. Further research and development in the area of face detection and recognition will help to improve the efficiency and accuracy of the system. The system also involves a large image database, and it would be interesting to deal with such a huge amount of data. One of the problems faced is that there is no comprehensive platform on which students and young researchers can perform experiments. Everyone has to rely on MATLAB or develop code from scratch in a language such as C, C++ or Java. The complex mathematics and statistical equations demand a huge amount of time and effort. A few C++ libraries are provided free of cost by some research groups for further research in the area of image processing, but they are still insufficient for building a platform for experiments in face detection and recognition. It is therefore suggested that research groups in the area of face detection and recognition should be encouraged to provide their work online, so that newcomers to the field are encouraged and can perform experiments and reach conclusions without wasting time on the drudgery of coding.
6
REFERENCES
[1] Yang, Ming-Hsuan, Face Detection and Gesture Recognition for Human-Computer Interaction, Boston, Mass.; London: Kluwer Academic, 2001.
[2] Jolliffe, Ian T., Principal Component Analysis, New York: Springer-Verlag, 2002.
[3] Chen, Yixin, Machine Learning and Statistical Approaches to Image Retrieval, Secaucus, NJ, USA: Kluwer Academic Publishers, 2004.
[4] Zhang, David, Biometric Image Discrimination Technologies, Hershey, PA, USA: Idea Group Publishing, 2006.
[5] Javidi, Bahram, Image Recognition and Classification: Algorithms, Systems, and Applications, New York, NY, USA: Marcel Dekker Incorporated, 2002.
[6] Hoiem, D.; Efros, A.A.; Hebert, M., Geometric context from a single image, Tenth IEEE International Conference on Computer Vision (ICCV 2005), Volume 1, 17-21 Oct. 2005, pp. 654-661.
[7] Shih, P.; Liu, C., Face detection using discriminating feature analysis and Support Vector Machine, Pattern Recognition, Volume 39, Issue 2, 2006, pp. 260-276.
[8] Peichung Shih; Chengjun Liu, Face Detection Using Distribution-based Distance and Support Vector Machine, Proceedings of the Sixth International Conference on Computational Intelligence and Multimedia Applications (ICCIMA'05), 0-7695-2358-7/05.
[9] Hyun-Chul Choi; Se-Young Oh, Face Detection in Static Images using Bayesian Discriminating Feature and Particle Attractive Genetic Algorithm, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2005), 2-6 Aug. 2005, pp. 1072-1077.
[10] Gottumukkal, R.; Asari, V.K., Real Time Face Detection from Color Video Stream Based on PCA Method, Proceedings of the 32nd Applied Imagery Pattern Recognition Workshop, 15-17 Oct. 2003, pp. 146-150.
[11] Hongliang Jin; Qingshan Liu; Xiaoou Tang; Hanqing Lu, Learning Local Descriptors for Face Detection, IEEE International Conference on Multimedia and Expo (ICME 2005), 6 July 2005, pp. 928-931.
[12] Mita, T.; Kaneko, T.; Hori, O., Joint Haar-like features for face detection, Tenth IEEE International Conference on Computer Vision (ICCV 2005), Volume 2, 17-21 Oct. 2005, pp. 1619-1626.
[13] R-Qiong Xu; Bi-Cheng Li; Bo Wang, Face Detection And Recognition Using Neural Network And Hidden Markov Models, 0-7803-7702-8/03.
[14] Emad Barsoum; Tamer Mostafa; AbdelMonem A. Wahdan, A New Image Comparing Technique for Content-Based Image Retrieval, 0-7803-8575-6/04.
[15] KES 2001 (Conference), Knowledge-based Intelligent Information Engineering Systems & Allied Technologies: KES'2001, Part 2, Amsterdam: IOS Press; Tokyo: Ohmsha, 2001.
[16] Model-based Coding: Extraction, Coding, and Evaluation of Face Model Parameters, Linköping: Univ., 2002.
[17] Mike James, Pattern Recognition, John Wiley, 1988.
[18] Biometrics: Personal Identification in Networked Society, Boston; London: Kluwer, 1999.
[19] Lindsay I. Smith, A Tutorial on Principal Components Analysis, February 26, 2002.
[20] London, Justin, Modeling Derivatives in C++, Hoboken, NJ, USA: John Wiley, 2005.
[21] Kotropoulos, C.; Pitas, I., Rule-based face detection in frontal views, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-97), Volume 4, 21-24 April 1997, pp. 2537-2540.
[22] Xiaofeng Lu; Nanning Zheng; Songfeng Zheng, Linear sparse feature based face detection in gray images, 2003 International Conference on Image Processing (ICIP 2003), Volume 3, 14-17 Sept. 2003, pp. III-889-92 vol.2.
[23] Sung, K.-K.; Poggio, T., Example-based learning for view-based human face detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 20, Issue 1, Jan. 1998, pp. 39-51.
[24] Rowley, H.A.; Baluja, S.; Kanade, T., Neural Network-Based Face Detection, Proceedings of the 1996 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'96), 18-20 June 1996, pp. 203-208.
[25] Rowley, H.A.; Baluja, S.; Kanade, T., Rotation Invariant Neural Network-Based Face Detection, Proceedings of the 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 23-25 June 1998, p. 963.
[26] Feraud, R.; Bernier, O.; Viallet, J.E.; Collobert, M., A fast and accurate face detector for indexation of face images, Proceedings of the Fourth IEEE International Conference on Automatic Face and Gesture Recognition, 28-30 March 2000, pp. 77-82.
[27] Viola, P.; Jones, M., Robust real-time face detection, Proceedings of the Eighth IEEE International Conference on Computer Vision (ICCV 2001), Volume 2, 7-14 July 2001, p. 747.
[28] Cootes, T.F.; Edwards, G.J.; Taylor, C.J., Active appearance models, IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 23, Issue 6, June 2001, pp. 681-685.
[29] Cherkassky, V., The Nature of Statistical Learning Theory, IEEE Transactions on Neural Networks, Volume 8, Issue 6, Nov. 1997, p. 1564.
[30] www.fbise.edu.pk, dated 12-06-2006.
[31] Waring, C.A.; Xiuwen Liu, Face detection using spectral histograms and SVMs, IEEE Transactions on Systems, Man and Cybernetics, Part B, Volume 35, Issue 3, June 2005, pp. 467-476.
[32] Yang, M.-H.; Ahuja, N.; Kriegman, D., Face recognition using kernel eigenfaces, Proceedings of the 2000 International Conference on Image Processing, Volume 1, 10-13 Sept. 2000, pp. 37-40.
[33] Leung, T.K.; Burl, M.C.; Perona, P., Finding faces in cluttered scenes using random labeled graph matching, Proceedings of the Fifth International Conference on Computer Vision, 20-23 June 1995, pp. 637-644.
[34] Kin Choong Yow; Cipolla, R., A probabilistic framework for perceptual grouping of features for human face detection, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition, 14-16 Oct. 1996, pp. 16-21.
[35] John Shawe-Taylor; Nello Cristianini, Kernel Methods for Pattern Analysis, Cambridge University Press, 2004. ISBN 0 521 81397 2.
[36] A.M. Martinez; A.C. Kak, PCA versus LDA, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 23, No. 2, 2001, pp. 228-233.
[37] B. Scholkopf; A. Smola; K.R. Muller, Kernel Principal Component Analysis, Advances in Kernel Methods - Support Vector Learning, 1999.
[38] J.R. Beveridge; K. She; B. Draper; G.H. Givens, A Nonparametric Statistical Comparison of Principal Component and Linear Discriminant Subspaces for Face Recognition, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, December 2001, Kauai, HI, USA, pp. 535-542.
[39] P. Sinha, Object Recognition via Image Invariants: A Case Study, Investigative Ophthalmology and Visual Science, 35, pp. 1.735-1.740, 1994.
FIGURES & TABLES
Figure 2.3.2.1 P. Sinha defined face template for face localization [39]
Figure 2.3.4.1 Face and non-face classes are separated by using density function for patterns in image data [23]
Figure 2.3.4.2 The distance measures used by Sung and Poggio [23]
Figure 2.3.4.3 System diagram of Rowley's methods [1, pp 78]
Figure 2.4.1 Face detection methods summary
Table 2.4.1 Comparison of different face detection techniques
Figure 3.1.1 Image type classification
Figure 3.2.1 Image discrimination methods
Figure 3.2.1.1 Principal Component Analysis algorithm
Figure 3.3.1 Face Recognition System Architecture
Figure 3.3.2 Face Detection method
Figure 4.1.1 Kernel methods modular approach
Figure 4.4.1 KPCA algorithm
