Deep Residual Learning for Image Recognition
Kaiming He et al. (Microsoft Research)
By Zana Rashidi (MSc student, York University)
Introduction
ILSVRC & COCO 2015 Competitions
1st place in all five main tracks:
• ImageNet Classification
• ImageNet Detection
• ImageNet Localization
• COCO Detection
• COCO Segmentation
Datasets
ImageNet
• 14,197,122 images
• 27 high-level categories
• 21,841 synsets (subcategories)
• 1,034,908 images with bounding box annotations
COCO
• 330K images
• 80 object categories
• 1.5M object instances
• 5 captions per image
Tasks
Image from cs231n (Stanford University) Winter 2016
Revolution of Depth
Image from author’s slides, ICML 2016
Example
Image from author’s slides, ICML 2016
Background
Deep Convolutional Neural Networks
• Breakthrough in image classification
• Integrate low/mid/high-level features in a multi-layer fashion
• Levels of features can be enriched by the number of stacked layers
• Network depth is very important
Features (filters)
Deep CNNs
• Is learning better networks as easy as stacking more layers?
• Degradation problem
− As depth increases, accuracy saturates and then degrades rapidly
− Not caused by overfitting: the deeper network has higher training error
Degradation of Deep CNNs
Deep Residual Networks
Address Degradation
• Consider a shallower architecture and its deeper counterpart
• Solution by construction:
− Copy the learned shallower model and add identity layers on top to build the deeper model
• The existence of this constructed solution indicates that the deeper model should have no higher training error, yet experiments show:
− Deeper networks are unable to find a solution that is comparable to or better than the constructed one
Address Degradation (continued)
• So deeper networks are difficult to optimize
• Deep residual learning framework
− Instead of hoping each few stacked layers directly fit a desired underlying mapping, let them fit a residual mapping
− Instead of finding the underlying mapping H(x), let the stacked nonlinear layers fit F(x) = H(x) − x, so the original mapping is recast as F(x) + x
• The residual mapping is easier to optimize than the original one
Residual Learning
• If the identity mapping were optimal
− It is easier to push the residual F(x) to zero
− Than to fit the identity mapping with a stack of nonlinear layers
• Identity shortcut connections
− Added to the output of the stacked layers
− No extra parameters
− No extra computational complexity
Details
• Apply residual learning to every few stacked layers
• A building block (sketched below):
− y = F(x, {Wi}) + x
− x and y are the input and output of the block
− F(x, {Wi}) is the residual mapping to be learned
− ReLU nonlinearity is applied after the addition
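Below is a minimal PyTorch sketch of this two-layer building block. It is only an illustration of the formula above, not the authors' original Caffe code; the class and variable names are my own.

```python
import torch.nn as nn

class BasicBlock(nn.Module):
    """Two-layer residual block: y = F(x, {Wi}) + x, with ReLU after the addition."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))  # first stacked layer
        out = self.bn2(self.conv2(out))           # second stacked layer (no ReLU yet)
        return self.relu(out + x)                 # identity shortcut, then nonlinearity
```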
Details (continued)
• Dimensions of x and F(x) must be equal for the addition
− If not, perform a linear projection Ws on the shortcut: y = F(x, {Wi}) + Ws·x (see the sketch below)
− F can have 2 or 3 layers
− The addition is element-wise
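When the dimensions do differ, the shortcut becomes the linear projection Ws from the formula above. Here is a sketch of a stride-2 block whose shortcut is a 1×1 convolution; the names and the use of batch normalization are my assumptions, not the authors' exact configuration.

```python
import torch.nn as nn

class ProjectionBlock(nn.Module):
    """Residual block whose shortcut is a linear projection Ws (a 1x1 conv),
    used when x and F(x) have different dimensions."""
    def __init__(self, in_channels, out_channels, stride=2):
        super().__init__()
        self.conv1 = nn.Conv2d(in_channels, out_channels, 3, stride=stride, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(out_channels)
        self.conv2 = nn.Conv2d(out_channels, out_channels, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(out_channels)
        self.relu = nn.ReLU(inplace=True)
        # Ws: matches both the channel count and the spatial size of the main path
        self.proj = nn.Sequential(
            nn.Conv2d(in_channels, out_channels, 1, stride=stride, bias=False),
            nn.BatchNorm2d(out_channels),
        )

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + self.proj(x))  # element-wise addition with the projected shortcut
```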
Experiments
Plain Networks
• 18 and 34 layers
• Degradation problem observed
• The 34-layer network has higher training (thin curves) and validation (bold curves) error than the 18-layer network
Residual Networks
• 18 and 34 layers
• Differ from the plain networks only by shortcut connections added every two layers
• Zero-padding is used for increasing dimensions
• The 34-layer ResNet is better than the 18-layer ResNet
Comparison
• The 34-layer ResNet reduces ImageNet top-1 error by 3.5% compared with its plain counterpart
• Converges faster
Identity vs. Projection Shortcuts
• Recall y = F(x, {Wi}) + Ws·x
A. Zero-padding for increasing dimensions (parameter-free; sketched below)
B. Projection shortcuts for increasing dimensions; the rest are identity
C. All shortcuts are projections
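Option A can be implemented without any new parameters: stride the input spatially and pad the extra channels with zeros. A small sketch follows; the exact striding and padding scheme here is my assumption of how the parameter-free shortcut can be realized.

```python
import torch.nn.functional as F

def zero_pad_shortcut(x, out_channels, stride=2):
    """Option A: parameter-free shortcut for increasing dimensions.
    Downsamples spatially by striding, then pads the new channels with zeros."""
    x = x[:, :, ::stride, ::stride]           # match the smaller spatial size
    extra = out_channels - x.size(1)          # number of zero channels to append
    return F.pad(x, (0, 0, 0, 0, 0, extra))   # pad only the channel dimension
```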
Deeper Bottleneck Architecture
• Motivated by training-time concerns
• Replace 2-layer residual blocks with 3-layer bottleneck blocks (sketched below)
• 1✕1 convolutions for reducing and then restoring dimensions
• A 3✕3 convolution in between, a bottleneck with smaller input/output dimensions
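A PyTorch sketch of this three-layer bottleneck block; Bottleneck(256, 64) mirrors the 256→64→64→256 example from the paper, while the class and attribute names are my own.

```python
import torch.nn as nn

class Bottleneck(nn.Module):
    """Three-layer bottleneck block: 1x1 reduce -> 3x3 -> 1x1 restore, plus identity shortcut."""
    def __init__(self, channels, bottleneck_channels):
        super().__init__()
        self.layers = nn.Sequential(
            nn.Conv2d(channels, bottleneck_channels, 1, bias=False),  # 1x1: reduce dimensions
            nn.BatchNorm2d(bottleneck_channels),
            nn.ReLU(inplace=True),
            nn.Conv2d(bottleneck_channels, bottleneck_channels, 3, padding=1, bias=False),  # 3x3 on the smaller tensor
            nn.BatchNorm2d(bottleneck_channels),
            nn.ReLU(inplace=True),
            nn.Conv2d(bottleneck_channels, channels, 1, bias=False),  # 1x1: restore dimensions
            nn.BatchNorm2d(channels),
        )
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        return self.relu(self.layers(x) + x)  # identity shortcut, then nonlinearity
```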
50-layer ResNet
• Replace each 2-layer residual block with this 3-layer bottleneck block, resulting in 50 layers
• Use option B for increasing dimensions
• 3.8 billion FLOPs
101-layer and 152-layer ResNets
• Add more bottleneck blocks
• The 152-layer ResNet has 11.3 billion FLOPs
• The deeper, the better
• No degradation
• Compared with the state of the art
Results
Object Detection on COCO
Image from author’s slides, ICML 2016
Object Detection in the Wild
https://youtu.be/WZmSMkK9VuA
Conclusion
• Deep residual learning
− Ultra-deep networks could be easy to train
− Ultra-deep networks can gain accuracy from depth
Applications of ResNet
• Visual Recognition
• Image Generation
• Natural Language Processing
• Speech Recognition
• Advertising
• User Prediction
Resources
• Code written in Caffe is available on GitHub
• Third-party implementations exist in other frameworks (see the usage sketch below):
− Torch
− TensorFlow
− Lasagne
− ...
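For example, torchvision (a third-party PyTorch implementation) ships the 50/101/152-layer models; a minimal usage sketch, assuming torchvision ≥ 0.13 for the weights argument:

```python
import torch
from torchvision import models

# Pretrained ResNet-50: the 3-layer bottleneck design, with projection
# shortcuts only where dimensions change (option B)
model = models.resnet50(weights="IMAGENET1K_V1")
model.eval()

with torch.no_grad():
    logits = model(torch.randn(1, 3, 224, 224))  # one dummy 224x224 RGB image
print(logits.shape)  # torch.Size([1, 1000]) -> 1000 ImageNet classes
```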
Thank you!