Face demographics for age estimation using multi-task deep...

Face demographics for age estimation using multi-task deep neural networks.

Supervisor: Dr. Jose AlvarezStudent: Kiarie Ndegwa

Structure of today’s talk:

• Brief history of face recognition.• Various nets and their uses.• Image feature hierarchy.• The difficulty of age estimation.• Weaknesses of current age architectures.• Potential solutions to weaknesses• The proposed architectures.• Questions.

Brief history of automated facial recognition.• Semi automated facial recognition system W. Bledsoe et al 1966A, 1966B, for shadowy

organization, required assistance with picking face features.

• First linear algebra detection algorithms come into existence in the late 1980s using PCA analysis.

• Followed by multi-linear PCA analysis, Linear discriminant analysis and Independent component analysis

• In the early to mid 1990s methods based on elastic graph algorithms began to be used reaching accuracies well above 96% when tested against images in a control environment.

• During this same period work based on support vector machines was published. These used labelled training data, and achieved higher accuracies than elastic graph algorithms.

Brief history of automated facial recognition.• 1986 advent of back propagation algorithm designed by David Rumelhart on 2 layer

networks.

• First recorded use of Convolution Neural Networks for temporal signal processing, and digit classification was the Neo-cognitron in the 1988, LeNet-5 first hand written digit classifier in 2003.

• Earliest paper I could find on CNN’s for face recognition is from 1997 published by the IEEE Computational Society.

Various neural network topologies and their uses.

Siamese network.

Recurrent neural network.

Cascade network.

Various neural network topologies.

Convolution Neural network

Neocognitronconceptual network.

Deep ArchitecturesAlexNet.

Basic Multi-task classification architecture date, WACVI 2014.

Difference between Multi-task vs Single task classification loss functions.

Standard loss function minimized by gradient descent.

Difference between Multi-task vs Single task classification loss functions.

The problem of face age detection.

Depends on a myriad of inter related subtle factors:• Ethnicity.• Gender.• Life style.• Position of the eyes, nose and mouth.• Texture of skin

Best age classifiers.

JFLDNN: 2014

Hyperface: 2016, DLA architectureSimilar, 2015

DEX, 2015

Best age classifiers: weaknesses.

JFLDNN: 2014 Hyperface: 2016

DEX, 2015

Though promising when it comes to Gender, it fails at age classification. However it highlights the potential use of concatenating intermediate Layers.

This points out the problem of over regularization and extended training time of AlexNet.

DEX didn’t take advantage of face landmarks.

The problem of regularization.

Multi-task task wise dropout classification architecture.Dropout for auxiliary tasks:Problem: Each task has different loss functions.Possible solution: Covariance matrix learning is not an option, as tasks have the same loss function.Task-wise early stopping: This stops overfitting but different from weight regularization

The problem of regularization.

The problem of limited training data

Sequential training independent of data set.

The problem of long training times.

Squeeze Net, 2016

Proposed Architectures, A.

Proposed Architectures, B.

Proposed data sets, FGNET and MORPH Data sets.

Morph:labels: D.O.BGender labels: Yes

Race: yes

FG Net:labels: D.O.BGender labels: YesIn the wild: No

Proposed: Evaluation metrics.

Proposed: Hardware and Software.

Future proposed project Goals• Implement architecture in Matlab and Matconvnet.

• Test against DLA and DEX architectures using FGNET and MORPH data sets.

• Interface with single page web app, to make interactive website.

Further work.• Future work will see these same architectures used to characterize

other features such as ethnicity and emotion.

• The networks main image hierarchy can be switched to a newer leaner and/or deeper convolution network.

References:1. Zhanpeng Zhang, Ping Luo, Chen Change Loy, and Xiaoou, Facial Landmark detection by deep multi-task learning, Tang Dept. of

Information Engineering, The Chinese University of Hong Kong, 2014.

2. Gil Levi and Tal Hassner, Age and Gender Classification using Convolutional Neural Networks, Department of Mathematics and Computer Science, The Open University of Israel, 2015

3. Rajeev Ranjan, Vishal M. Patel and Rama Chellappa, HyperFace: A deep multi-task learning framework for face detection, landmark localization, pose estimation an gender recognition, 2016

4. Yuxin Jiang, Songbin Li, Peng Liu and Qiongxing Dai, Multi-feature deep learning for face gender recognition, 2014, IEEE 7th

Joint International Information Technology and Artificial Intelligence Conference, 2014

5. R. Rothe, Timofter R. and L. Gool, DEX: Deep expectation of apparent age from a single image, Proceedings of the IEEE International Conference on Computer Vision Workshops, pages 10-15, 2015

6. Zafeiriou Stefanos, Cha Zhang and Zhengyou Zhang, A survey on face detection in the wild: past, present and future. Computer Vision and image understanding. Computer Vision and Image understanding, 4(138):1-24, 2015

7. Forrest N Iandola, Matthew W Moskewicz, Khalid Ashraf, Song Han, William J Dally and Kurt Keutxer, Squeezenet: AlexNet-level accuracy with 50x fewer parameters and 0.5mb model size, Arvix preprint: arvix:1602.00360, 2016

Questions?

Face demographics for age estimation using multi-task deep...

Documents

Transcript of Face demographics for age estimation using multi-task deep...

COMP9315 DBMS Implementationcs9315/16s1/lectures/week01/all... · COMP9315 DBMS Implementation ... Ass Description Due Marks ... Constraints are an important aspect of data deﬁnition:

COMP8790 - Software Engineering Projectcourses.cecs.anu.edu.au/courses/CSPROJECTS/14S2/Reports/...COMP8790 - Software Engineering Project Exploring the Use of SCRUM for Administrative

Automated Reasoning for Artificial Intelligencecourses.cecs.anu.edu.au/courses/CSPROJECTS/18S1/reports/u5453384.pdf · Chapter1 Introduction Givenanarbitrarystatement,therearepotentiallymanyreasonsastowhywemight

An Augmented Reality Software Architecture for Information ...courses.cecs.anu.edu.au/courses/CSPROJECTS/19S1/reports/...While Virtual Reality (VR) aims to create an immersive virtual

Web Services, DevOps, and Deploymentcs9243/16s1/lectures/DevOps-slides.pdf · Web Services, DevOps, and Deployment ... –Without regard to how those applications were built, what

Leap motion UI for EcoVR - Australian National Universitycourses.cecs.anu.edu.au/courses/CSPROJECTS/16S1...Developed by Leap Motion Inc. The Leap Motion controller is a USB peripheral

Comparison of CPU and GPGPU performance as …courses.cecs.anu.edu.au/courses/CSPROJECTS/15S2/Reports/...Comparison of CPU and GPGPU performance as applied to a procedurally generated

Multi-word Expression Recognition with Few Shot Learningcourses.cecs.anu.edu.au/courses/CSPROJECTS/18S1/reports/u6133… · Multi-word Expression Recognition with Few Shot Learning

Quantifying User Inﬂuence In Social Network Using Retweet …courses.cecs.anu.edu.au/courses/CSPROJECTS/17S1/Reports/Yifei_… · strength of social network as a marketing and advertising

Lecture 2: System Architecture & Communication S D Bcs9243/16s1/lectures/comm-slides.pdf · Control-oriented communication Associates a transfer of control with communication Active

Plain of Jars Archaeology Websitecourses.cecs.anu.edu.au/courses/CSPROJECTS/19S1/reports/u6432443_report.pdfpossible ways of displaying similarly organised archaeological data. In

Extracting emerging knowledge from social mediacourses.cecs.anu.edu.au/courses/CSPROJECTS/18S1/final...Background: OpenIE Extraction of relation tuples from text by leveraging the

The Growing Importance of Concurrent and Distributed Systems …courses.cecs.anu.edu.au/courses/COMP2310/lectures2013/... · 2013-07-22 · The Growing Importance of Concurrent and

ENGN3213 Digital Systems & Microprocessors Reverse Polish Calculator …courses.cecs.anu.edu.au/courses/ENGN3213/Projects/RPC... · 2010-06-03 · Figure 5: Reverse polish calculator

Visualization of the patent claims structure to improve ...courses.cecs.anu.edu.au/courses/CSPROJECTS/14S2/Final_presenta… · Technology, Monash University and other university

COMP9321 Web Application Engineeringcs9321/16s1/lectures/lec10/Lec-10.pdf · 2016-05-12 · Architectural Considerations - Network COMP9321, 16s1, Week 10 7

Implementation of Multi-Representation Mondrian Drawer Toolcourses.cecs.anu.edu.au/courses/CSPROJECTS/16S2/Reports/Zikai_Zhao... · Abstract Mondrian-Style images are a typical and

Project Final Presentationcourses.cecs.anu.edu.au/courses/CSPROJECTS/17S1/Initial_present… · Google France Matchs Afril matchs afrique matchs africain en direct matchs afrique

Phenotype to Genotype Matching and Epigenetics in Evolutionary Algorithmscourses.cecs.anu.edu.au/courses/CSPROJECTS/17S1/Final... · 2017-05-21 · Phenotype to Genotype Matching

Unsupervised Machine Learning in Materials Design › courses › CSPROJECTS › 20S1 › reports › u631… · Unsupervised Machine Learning in Materials Design Yuyuan Liang u6319242