STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in...
Transcript of STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in...
![Page 1: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/1.jpg)
/
CSE 559A: Computer Vision
Fall 2020: T-R: 11:30-12:50pm @ Zoom
Instructor: Ayan Chakrabarti ([email protected]).Course Staff: Adith Boloor, Patrick Williams
Dec 10, 2020
http://www.cse.wustl.edu/~ayan/courses/cse559a/
1 /
ANNOUNCEMENTSANNOUNCEMENTSLast recitation, “or PSET 5, this Friday!Make sure you are all workin” on “inal projects. Leave yoursel“ enou”h time to write the report.
2
/
OBJECT DETECTIONOBJECT DETECTION
3 /
OBJECT DETECTIONOBJECT DETECTION
4
![Page 2: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/2.jpg)
/
OBJECT DETECTIONOBJECT DETECTION
5 /
OBJECT DETECTIONOBJECT DETECTION
6
/
OBJECT DETECTIONOBJECT DETECTIONNewer methods also use a neural network to ”enerate re”ion proposalsEfficient Implementations: bulk o“ the computation happens once on the entire ima”e, and you crop a “eaturemap “or each re”ion.Even Faster Methods: Discretize ima”e locations into ”rid, and directly output upto a “ixed number o“boundin” boxes “or each ”rid block.
7 /
TRANSFER LEARNINGTRANSFER LEARNINGSay you want to train a network to solve a problem.
The task is complex, so you need a lar”e network.But you don’t have enou”h trainin” data to train such a network.
Pick a related task “or which you do have a lot o“ trainin” dataIma”eNet is a ”reat database “or this “or a variety o“ semantic tasks
Train a network (like VGG-16) to solve that task.Then, choose the output o“ some intermediate layer o“ that networkUse it as a “eature vector, and learn a smaller network “or your problem which ”oes “rom those “eatures to thedesired output.
8
![Page 3: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/3.jpg)
/
TRANSFER LEARNINGTRANSFER LEARNINGVGG-16 does well on Ima”enet classi“ication
and ”ives you a “eature representation that is surprisin”ly use“ul “or a broad ran”e o“ tasks.
Remember computin” encodin” “rom . VGG-16’s pool5, “c1, “c2, “eatures can be the “or many tasks.
One can also initialize a network with the VGG-16 architecture to one trained with ima”enet, and then“inetune by replacin” the “inal layer as classi“ication “or another task.
In ”eneral, empirical question to determine when trainin” on Task A will provide ”ood “eatures “or Task B.
x
x x
9 /
OTHER TASKSOTHER TASKS
10
/
OTHER TASKSOTHER TASKS
11 /
OTHER TASKSOTHER TASKS
12
![Page 4: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/4.jpg)
/
OTHER TASKSOTHER TASKS
13 /
OTHER TASKSOTHER TASKS
14
/
OTHER TASKSOTHER TASKS
15 /
OTHER TASKSOTHER TASKS
16
![Page 5: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/5.jpg)
/
OTHER TASKSOTHER TASKS
17 /
OTHER TASKSOTHER TASKS
18
/
OTHER TASKSOTHER TASKS
19 /
OTHER TASKSOTHER TASKS
20
![Page 6: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/6.jpg)
/
OTHER TASKSOTHER TASKS
21 /
OTHER TASKSOTHER TASKS
22
/
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
23 /
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
24
![Page 7: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/7.jpg)
/
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
25 /
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
26
/
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
27 /
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
28
![Page 8: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/8.jpg)
/
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
29 /
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
30
/
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
31 /
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
32
![Page 9: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/9.jpg)
/
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
33 /
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
34
/
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
35 /
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
36
![Page 10: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/10.jpg)
/
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
37 /
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
38
/
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS
39 /
FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKSBut what about downsamplin” ?
Option 0: Just don’t use downsamplin”
Bad, because down-samplin” is a way to quickly increase the receptive “ield o“ your network.
Option 1: Just produce a label map at lower-resolution.
Option 2: I“ you downsample by (typically ) Feed every shi ed version o“ your input throu”h this FCN.
Bad because i“ you down-sample multiple times, you’re stillre-computin” activations prior to the last-downsamplin”.
Option 3: Dilated Convolutions
N N = 2
K
(N − 1) × (N − 1)
40
![Page 11: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/11.jpg)
/
DILATED CONVOLUTIONDILATED CONVOLUTION
41 /
DILATED CONVOLUTIONDILATED CONVOLUTION
42
/
DILATED CONVOLUTIONDILATED CONVOLUTION
43 /
DILATED CONVOLUTIONDILATED CONVOLUTION
44
![Page 12: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/12.jpg)
/
DILATED CONVOLUTIONDILATED CONVOLUTION
45 /
DILATED CONVOLUTIONDILATED CONVOLUTION
46
/
DILATED CONVOLUTIONDILATED CONVOLUTION
47 /
DILATED CONVOLUTIONDILATED CONVOLUTION
48
![Page 13: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/13.jpg)
/
SEMANTIC SEGMENTATIONSEMANTIC SEGMENTATION
49 /
SEMANTIC SEGMENTATIONSEMANTIC SEGMENTATION
50
/
DEEP ARCHITECTURESDEEP ARCHITECTURES
51 /
DEEP ARCHITECTURESDEEP ARCHITECTURES
52
![Page 14: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/14.jpg)
/
DEEP ARCHITECTURESDEEP ARCHITECTURES
53 /
DEEP ARCHITECTURESDEEP ARCHITECTURES
54
/
DEEP ARCHITECTURESDEEP ARCHITECTURES
55 /
DEEP ARCHITECTURESDEEP ARCHITECTURES
56
![Page 15: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/15.jpg)
/
DEEP ARCHITECTURESDEEP ARCHITECTURES
57 /
DEEP ARCHITECTURESDEEP ARCHITECTURES
58
/
DEEP ARCHITECTURESDEEP ARCHITECTURES
59 /
DEEP ARCHITECTURESDEEP ARCHITECTURES
60
![Page 16: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/16.jpg)
/
DEEP ARCHITECTURESDEEP ARCHITECTURESWith BatchNorm
He et al., Identity Mappin”s in Deep Residual Networks . 2016.
61 /
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
62
/
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
63 /
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
64
![Page 17: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/17.jpg)
/
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
65 /
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
66
/
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
67 /
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
68
![Page 18: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/18.jpg)
/
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
69 /
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
70
/
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
71 /
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
72
![Page 19: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/19.jpg)
/
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
73 /
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
74
/
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
75 /
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
76
![Page 20: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent](https://reader034.fdocuments.in/reader034/viewer/2022052103/603d652eb93a0437700d2cf7/html5/thumbnails/20.jpg)
/
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
77 /
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
78
/
MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES
79 /
TRANSPOSE CONVOLUTIONTRANSPOSE CONVOLUTIONAs it su””ests, it is the transpose o“ the operation o“ Convolution with Stride.In “act, this represents the operation “or back-propa”atin” ”radients throu”h a convolution-with-stride layer.
Lets ”o back to our matrix vector notation, represent convolution with and downsamplin” with .
What is the transpose o“ this operation ? O“ ?
What does represent ?
Upsamplin” by “illin” in zeros. is still convolution (with a “lipped kernel, but doesn’t matter).
So a convolution-transpose layer effectively does up-samplin” with zeros, and then a re”ular convolution.But up-samplin” with zeros o en leads to arti“acts. Newer architectures don’t use convolution transpose.Instead, they do bilinear or nearest-nei”hbor interpolation on the “eature maps to increase resolution, andthen do a re”ular convolution.
A
k
A
s
y = xA
s
A
k
A
s
A
k
( =A
s
A
k
)
T
A
T
k
A
T
s
A
T
s
A
T
k
80