Learning deep representation from coarse to fine for face alignment

{shaozhiwen, feiben, yiru.zhao, qinchuan.zhang}@sjtu.edu.cn, ma-lz@cs.sjtu.edu.cn

Zhiwen Shao, Shouhong Ding, Yiru Zhao, Qinchuan Zhang, and Lizhuang Ma

Department of Computer Science and Engineering

Shanghai Jiao Tong University, China

Problem

Face alignment is the task of locating facial landmarks

Motivation & Challenges

Face alignment has many applications

• Face animation

• Face beautification

• Face preprocessing

There are some challenges

• Large pose, illumination and expression variations

• Partial occlusion

• Low quality

We need an effective method to represent highly complex faces

Ours vs. Others

Conventional methods

• Their results depend heavily on the initial shape

• Our network takes raw faces as input without any initialization

Deep learning methods

• They use cascaded networks or multitask learning

• Our method uses a single network and does not require extra facial attributes

Coarse-to-fine Training Algorithm

Comparison with other methods

Detecting dense landmarks is difficult because each face carries a large number of labels

A few key landmarks coarsely determine the face shape

The given landmarks can be split into a principal subset and an elaborate subset

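Principal subset and elaborate subset (set definitions not recoverable from the poster)

Loss function (equation not recoverable from the poster): a weight parameter controls the relative weight of the principal subset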

Predicting the locations of the principal subset lets the network extract the intrinsic facial structure

We further fine-tune the learned model by adjusting the relative weight of the principal subset
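The weighting idea above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' formulation: the poster's loss equation did not survive extraction, so the squared-error form, the names `weighted_landmark_loss`, `principal_idx`, `lam` and `coarse_to_fine_weights`, and the decreasing schedule are all hypothetical. The poster only says that the relative weight of the principal subset is adjusted during fine-tuning.

```python
# Hypothetical sketch: function and parameter names are illustrative only.

def weighted_landmark_loss(pred, target, principal_idx, lam):
    """Squared-error loss with principal landmarks weighted by `lam`.

    pred, target: flat lists [x1, y1, x2, y2, ...] of length n = 2 * landmarks.
    principal_idx: set of landmark indices forming the principal subset.
    lam: relative weight of the principal subset (elaborate subset weight is 1).
    """
    assert len(pred) == len(target) and len(pred) % 2 == 0
    loss = 0.0
    for i in range(len(pred) // 2):
        dx = pred[2 * i] - target[2 * i]
        dy = pred[2 * i + 1] - target[2 * i + 1]
        weight = lam if i in principal_idx else 1.0
        loss += weight * (dx * dx + dy * dy)
    return 0.5 * loss


def coarse_to_fine_weights(start=10.0, end=1.0, stages=3):
    """Assumed schedule: move the principal weight linearly from start to end."""
    if stages == 1:
        return [end]
    step = (start - end) / (stages - 1)
    return [start - k * step for k in range(stages)]
```

In such a setup, training would begin with the first weight in the schedule, which emphasizes the principal subset, and each later fine-tuning stage would reuse the learned model with the next weight.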

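Deep convolutional network (figure)

Convolutional layer 3×3/1/1 (kernel/stride/padding)

Max-pooling layer 2×2/2/0 (kernel/stride/padding)

Feature map sizes, ordered by decreasing spatial size: 50×50×3 (input), 50×50×64, 25×25×64, 25×25×128, 13×13×128, 13×13×192, 7×7×192, 7×7×256, 4×4×256

Other figure labels: Fully-connected unit, Principal unit, Elaborate unit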

Algorithm discussions

The input is a 50×50×3 color face patch. n equals twice the total number of landmarks
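The feature map sizes in the network figure can be checked with simple shape arithmetic. The sketch below assumes one shape-changing 3×3/1/1 convolution followed by one 2×2/2/0 max-pooling per stage, and ceiling rounding in the pooling layers (needed to get 25 → 13 and 7 → 4). The poster states neither detail, so the helper names and the per-stage layout are hypothetical.

```python
import math


def conv_out(size, kernel=3, stride=1, pad=1):
    # A 3x3 convolution with stride 1 and padding 1 keeps the spatial size.
    return (size + 2 * pad - kernel) // stride + 1


def pool_out(size, kernel=2, stride=2, pad=0):
    # Ceiling rounding is an assumption; it reproduces 25 -> 13 and 7 -> 4.
    return math.ceil((size + 2 * pad - kernel) / stride) + 1


def trace_shapes(size=50, channels=(64, 128, 192, 256)):
    """Return the (height, width, channels) shape after each conv and pool."""
    shapes = [(size, size, 3)]  # 50x50x3 color face patch
    for c in channels:
        size = conv_out(size)
        shapes.append((size, size, c))
        size = pool_out(size)
        shapes.append((size, size, c))
    return shapes


def output_dim(num_landmarks):
    # n is twice the number of landmarks: one x and one y per landmark.
    return 2 * num_landmarks
```

Under these assumptions `trace_shapes()` visits every size listed in the figure, and a 68-landmark annotation gives an output dimension of 136.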

Three face alignment benchmarks

• Helen, 300-W, COFW

Direct training algorithm (DT)

Coarse-to-fine training algorithm (CFT)

Results of RCPR and CFT on several images from COFW

Results of CFT on several images from Helen and IBUG
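The comparison with other methods is reported as mean errors (%). A minimal sketch of such a metric follows, assuming the error is the average point-to-point distance divided by a normalizing distance such as the inter-ocular distance. The poster does not state the normalization, so `norm_dist` and the function name are hypothetical.

```python
import math


def mean_error_percent(pred, target, norm_dist):
    """Mean landmark error as a percentage of a normalizing distance.

    pred, target: lists of (x, y) landmark coordinates of equal length.
    norm_dist: normalizing distance (assumed here to be inter-ocular).
    """
    dists = [
        math.hypot(px - tx, py - ty)
        for (px, py), (tx, ty) in zip(pred, target)
    ]
    return 100.0 * sum(dists) / (len(dists) * norm_dist)
```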

Comparison of mean errors (%) with other methods (table)

Conclusion

We propose a novel coarse-to-fine algorithm for training a deep convolutional network for facial landmark detection

Our method directly predicts the landmark coordinates with a single network and no additional operations, while significantly improving face alignment accuracy under severe occlusion

We believe that the proposed algorithm can be applied to other problems that use deep convolutional networks
