Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit...

24
Cross-Modal Scene Representations Lluís Castrejón University of Toronto

Transcript of Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit...

Page 1: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Cross-Modal Scene Representations

Lluís CastrejónUniversity of Toronto

Page 2: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Motivation

Page 3: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Motivation

Page 4: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Motivation

Page 5: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Motivation

Page 6: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Motivation

??

Page 7: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Motivation

?

Page 8: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Cross-Modal Scene Understanding

Page 9: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

CMPlaces

Dataset of 205 scene categories

Page 10: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

CMPlaces

Dataset of 205 scene categories

Page 11: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

CMPlaces

Dataset of 205 scene categories

Page 12: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Strong vs weak alignment

Page 13: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Strong vs weak alignment

Not scalable!

Page 14: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Cross-modal Networks

Page 15: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Cross-modal Networks

Problem: Parts of the network specialize to certain domains

Page 16: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Cross-modal Networks

Solution: Use regularization to enforce alignments

Page 17: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Cross-modal Networks

Chair

BedBedroom

Page 18: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Cross-modal Networks

Page 19: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

T-SNE

Page 20: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Visualizing Activations

Page 21: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Cross-Modal Retrieval

Page 22: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Cross-Modal Retrieval

Page 23: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Inverting the representation

Page 24: Department of Computer Science, University of …castrejon/content/xmodal...Unit 31 (Fountain) Unit 50 (Arcade) Unit 81 (Ring) Unit 86 (Car) Unit 104 (Castle) Unit 115 (Bed) Real Clip

Thanks!

http://cmplaces.csail.mit.edu/