STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in...

20
/ CSE 559A: Computer Vision Fall 2020: T-R: 11:30-12:50pm @ Zoom Instructor: Ayan Chakrabarti ([email protected]). Course Staff: Adith Boloor, Patrick Williams Dec 10, 2020 http://www.cse.wustl.edu/~ayan/courses/cse559a/ 1 / ANNOUNCEMENTS ANNOUNCEMENTS Last recitation, “or PSET 5, this Friday! Make sure you are all workin” on “inal projects. Leave yoursel“ enou”h time to write the report. 2 / OBJECT DETECTION OBJECT DETECTION 3 / OBJECT DETECTION OBJECT DETECTION 4

Transcript of STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in...

Page 1: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

CSE 559A: Computer Vision

Fall 2020: T-R: 11:30-12:50pm @ Zoom

Instructor: Ayan Chakrabarti ([email protected]).Course Staff: Adith Boloor, Patrick Williams

Dec 10, 2020

http://www.cse.wustl.edu/~ayan/courses/cse559a/

1 /

ANNOUNCEMENTSANNOUNCEMENTSLast recitation, “or PSET 5, this Friday!Make sure you are all workin” on “inal projects. Leave yoursel“ enou”h time to write the report.

2

/

OBJECT DETECTIONOBJECT DETECTION

3 /

OBJECT DETECTIONOBJECT DETECTION

4

Page 2: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

OBJECT DETECTIONOBJECT DETECTION

5 /

OBJECT DETECTIONOBJECT DETECTION

6

/

OBJECT DETECTIONOBJECT DETECTIONNewer methods also use a neural network to ”enerate re”ion proposalsEfficient Implementations: bulk o“ the computation happens once on the entire ima”e, and you crop a “eaturemap “or each re”ion.Even Faster Methods: Discretize ima”e locations into ”rid, and directly output upto a “ixed number o“boundin” boxes “or each ”rid block.

7 /

TRANSFER LEARNINGTRANSFER LEARNINGSay you want to train a network to solve a problem.

The task is complex, so you need a lar”e network.But you don’t have enou”h trainin” data to train such a network.

Pick a related task “or which you do have a lot o“ trainin” dataIma”eNet is a ”reat database “or this “or a variety o“ semantic tasks

Train a network (like VGG-16) to solve that task.Then, choose the output o“ some intermediate layer o“ that networkUse it as a “eature vector, and learn a smaller network “or your problem which ”oes “rom those “eatures to thedesired output.

8

Page 3: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

TRANSFER LEARNINGTRANSFER LEARNINGVGG-16 does well on Ima”enet classi“ication

and ”ives you a “eature representation that is surprisin”ly use“ul “or a broad ran”e o“ tasks.

Remember computin” encodin” “rom . VGG-16’s pool5, “c1, “c2, “eatures can be the “or many tasks.

One can also initialize a network with the VGG-16 architecture to one trained with ima”enet, and then“inetune by replacin” the “inal layer as classi“ication “or another task.

In ”eneral, empirical question to determine when trainin” on Task A will provide ”ood “eatures “or Task B.

x

 

x x

 

9 /

OTHER TASKSOTHER TASKS

10

/

OTHER TASKSOTHER TASKS

11 /

OTHER TASKSOTHER TASKS

12

Page 4: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

OTHER TASKSOTHER TASKS

13 /

OTHER TASKSOTHER TASKS

14

/

OTHER TASKSOTHER TASKS

15 /

OTHER TASKSOTHER TASKS

16

Page 5: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

OTHER TASKSOTHER TASKS

17 /

OTHER TASKSOTHER TASKS

18

/

OTHER TASKSOTHER TASKS

19 /

OTHER TASKSOTHER TASKS

20

Page 6: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

OTHER TASKSOTHER TASKS

21 /

OTHER TASKSOTHER TASKS

22

/

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

23 /

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

24

Page 7: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

25 /

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

26

/

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

27 /

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

28

Page 8: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

29 /

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

30

/

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

31 /

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

32

Page 9: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

33 /

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

34

/

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

35 /

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

36

Page 10: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

37 /

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

38

/

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKS

39 /

FULLY-CONVOLUTIONAL NETWORKSFULLY-CONVOLUTIONAL NETWORKSBut what about downsamplin” ?

Option 0: Just don’t use downsamplin”

Bad, because down-samplin” is a way to quickly increase the receptive “ield o“ your network.

Option 1: Just produce a label map at lower-resolution.

Option 2: I“ you downsample by (typically ) Feed every shi ed version o“ your input throu”h this FCN.

Bad because i“ you down-sample multiple times, you’re stillre-computin” activations prior to the last-downsamplin”.

Option 3: Dilated Convolutions

N N = 2

K

(N − 1) × (N − 1)

40

Page 11: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

DILATED CONVOLUTIONDILATED CONVOLUTION

41 /

DILATED CONVOLUTIONDILATED CONVOLUTION

42

/

DILATED CONVOLUTIONDILATED CONVOLUTION

43 /

DILATED CONVOLUTIONDILATED CONVOLUTION

44

Page 12: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

DILATED CONVOLUTIONDILATED CONVOLUTION

45 /

DILATED CONVOLUTIONDILATED CONVOLUTION

46

/

DILATED CONVOLUTIONDILATED CONVOLUTION

47 /

DILATED CONVOLUTIONDILATED CONVOLUTION

48

Page 13: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

SEMANTIC SEGMENTATIONSEMANTIC SEGMENTATION

49 /

SEMANTIC SEGMENTATIONSEMANTIC SEGMENTATION

50

/

DEEP ARCHITECTURESDEEP ARCHITECTURES

51 /

DEEP ARCHITECTURESDEEP ARCHITECTURES

52

Page 14: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

DEEP ARCHITECTURESDEEP ARCHITECTURES

53 /

DEEP ARCHITECTURESDEEP ARCHITECTURES

54

/

DEEP ARCHITECTURESDEEP ARCHITECTURES

55 /

DEEP ARCHITECTURESDEEP ARCHITECTURES

56

Page 15: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

DEEP ARCHITECTURESDEEP ARCHITECTURES

57 /

DEEP ARCHITECTURESDEEP ARCHITECTURES

58

/

DEEP ARCHITECTURESDEEP ARCHITECTURES

59 /

DEEP ARCHITECTURESDEEP ARCHITECTURES

60

Page 16: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

DEEP ARCHITECTURESDEEP ARCHITECTURESWith BatchNorm

He et al., Identity Mappin”s in Deep Residual Networks . 2016.

61 /

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

62

/

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

63 /

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

64

Page 17: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

65 /

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

66

/

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

67 /

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

68

Page 18: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

69 /

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

70

/

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

71 /

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

72

Page 19: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

73 /

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

74

/

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

75 /

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

76

Page 20: STEREO - Washington University in St. Louisayan/courses/cse559a/PDFs/lec...For a reference patch in le image, get patches from same line in the right image, and from a differ ent

/

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

77 /

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

78

/

MORE ABOUT ARCHITECTURESMORE ABOUT ARCHITECTURES

79 /

TRANSPOSE CONVOLUTIONTRANSPOSE CONVOLUTIONAs it su””ests, it is the transpose o“ the operation o“ Convolution with Stride.In “act, this represents the operation “or back-propa”atin” ”radients throu”h a convolution-with-stride layer.

Lets ”o back to our matrix vector notation, represent convolution with and downsamplin” with .

What is the transpose o“ this operation ? O“ ?

What does represent ?

Upsamplin” by “illin” in zeros. is still convolution (with a “lipped kernel, but doesn’t matter).

So a convolution-transpose layer effectively does up-samplin” with zeros, and then a re”ular convolution.But up-samplin” with zeros o en leads to arti“acts. Newer architectures don’t use convolution transpose.Instead, they do bilinear or nearest-nei”hbor interpolation on the “eature maps to increase resolution, andthen do a re”ular convolution.

A

k

A

s

y = xA

s

A

k

A

s

A

k

( =A

s

A

k

)

T

A

T

k

A

T

s

A

T

s

A

T

k

80