Face demographics for age estimation using multi-task deep...

Post on 28-May-2020

4 views 0 download

Transcript of Face demographics for age estimation using multi-task deep...

Face demographics for age estimation using multi-task deep neural networks.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Structure of today’s talk:

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

• Brief history of face recognition.• Various nets and their uses.• Image feature hierarchy.• The difficulty of age estimation.• Weaknesses of current age architectures.• Potential solutions to weaknesses• The proposed architectures.• Questions.

Brief history of automated facial recognition.• Semi automated facial recognition system W. Bledsoe et al 1966A, 1966B, for shadowy

organization, required assistance with picking face features.

• First linear algebra detection algorithms come into existence in the late 1980s using PCA analysis.

• Followed by multi-linear PCA analysis, Linear discriminant analysis and Independent component analysis

• In the early to mid 1990s methods based on elastic graph algorithms began to be used reaching accuracies well above 96% when tested against images in a control environment.

• During this same period work based on support vector machines was published. These used labelled training data, and achieved higher accuracies than elastic graph algorithms.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Brief history of automated facial recognition.• 1986 advent of back propagation algorithm designed by David Rumelhart on 2 layer

networks.

• First recorded use of Convolution Neural Networks for temporal signal processing, and digit classification was the Neo-cognitron in the 1988, LeNet-5 first hand written digit classifier in 2003.

• Earliest paper I could find on CNN’s for face recognition is from 1997 published by the IEEE Computational Society.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Various neural network topologies and their uses.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Various neural network topologies and their uses.

Siamese network.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Recurrent neural network.

Cascade network.

Various neural network topologies.

Convolution Neural network

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Neocognitronconceptual network.

Deep ArchitecturesAlexNet.

Basic Multi-task classification architecture date, WACVI 2014.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Difference between Multi-task vs Single task classification loss functions.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Standard loss function minimized by gradient descent.

Difference between Multi-task vs Single task classification loss functions.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

VS.

The problem of face age detection.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Depends on a myriad of inter related subtle factors:• Ethnicity.• Gender.• Life style.• Position of the eyes, nose and mouth.• Texture of skin

Best age classifiers.

JFLDNN: 2014

Hyperface: 2016, DLA architectureSimilar, 2015

DEX, 2015

Best age classifiers: weaknesses.

JFLDNN: 2014 Hyperface: 2016

DEX, 2015

Though promising when it comes to Gender, it fails at age classification. However it highlights the potential use of concatenating intermediate Layers.

This points out the problem of over regularization and extended training time of AlexNet.

DEX didn’t take advantage of face landmarks.

The problem of regularization.

Multi-task task wise dropout classification architecture.Dropout for auxiliary tasks:Problem: Each task has different loss functions.Possible solution: Covariance matrix learning is not an option, as tasks have the same loss function.Task-wise early stopping: This stops overfitting but different from weight regularization

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

The problem of regularization.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

The problem of limited training data

The problem of limited training data

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Sequential training independent of data set.

The problem of long training times.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

+

Squeeze Net, 2016

Proposed Architectures, A.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Proposed Architectures, B.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Proposed data sets, FGNET and MORPH Data sets.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Morph:labels: D.O.BGender labels: Yes

Race: yes

FG Net:labels: D.O.BGender labels: YesIn the wild: No

Proposed: Evaluation metrics.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Proposed: Hardware and Software.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Proposed: Hardware and Software.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Future proposed project Goals• Implement architecture in Matlab and Matconvnet.

• Test against DLA and DEX architectures using FGNET and MORPH data sets.

• Interface with single page web app, to make interactive website.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Further work.• Future work will see these same architectures used to characterize

other features such as ethnicity and emotion.

• The networks main image hierarchy can be switched to a newer leaner and/or deeper convolution network.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

References:1. Zhanpeng Zhang, Ping Luo, Chen Change Loy, and Xiaoou, Facial Landmark detection by deep multi-task learning, Tang Dept. of

Information Engineering, The Chinese University of Hong Kong, 2014.

2. Gil Levi and Tal Hassner, Age and Gender Classification using Convolutional Neural Networks, Department of Mathematics and Computer Science, The Open University of Israel, 2015

3. Rajeev Ranjan, Vishal M. Patel and Rama Chellappa, HyperFace: A deep multi-task learning framework for face detection, landmark localization, pose estimation an gender recognition, 2016

4. Yuxin Jiang, Songbin Li, Peng Liu and Qiongxing Dai, Multi-feature deep learning for face gender recognition, 2014, IEEE 7th

Joint International Information Technology and Artificial Intelligence Conference, 2014

5. R. Rothe, Timofter R. and L. Gool, DEX: Deep expectation of apparent age from a single image, Proceedings of the IEEE International Conference on Computer Vision Workshops, pages 10-15, 2015

6. Zafeiriou Stefanos, Cha Zhang and Zhengyou Zhang, A survey on face detection in the wild: past, present and future. Computer Vision and image understanding. Computer Vision and Image understanding, 4(138):1-24, 2015

7. Forrest N Iandola, Matthew W Moskewicz, Khalid Ashraf, Song Han, William J Dally and Kurt Keutxer, Squeezenet: AlexNet-level accuracy with 50x fewer parameters and 0.5mb model size, Arvix preprint: arvix:1602.00360, 2016

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Questions?

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa