Visual Semantic Role Labelling for Image Understanding ...

Situation Recognition:Visual Semantic Role Labelling for Image Understanding

Mark Yatskar, Luke Zettlemoyer, Ali Farhadi

Experiment presentation by Nayan Singhal

Motivation

● Human understanding of image● Verbs in English language.

Approach

● CRF with CNN

● Log linear Loss

CNN CRF1024

CARRYING

AGENT WOMAN

ITEM JAR

AGENTPART HEAD

PLACE OUTDOOR

How object plays role in image understanding?

neighboring images

Remove Cliff

neighboring imagesRemoving Cliff

Remove person

neighboring imagesRemoving Man

Remove Sky

neighboring imagesRemoving Sky

Image (2)

neighboring images

Remove Person

neighboring imagesRemoving man

Remove Background

neighboring imagesRemoving Sky and Man

Conclusion

Each object plays a significant role in image understanding.

Experiment

1) Analyzing Failure Cases2) Different moods of faces

Expt 1: Analyzing Failure Cases

Object Recognition (1)

Imsitu Result

Why is it happening?

Are these images difficult to categorize?

Let’s analyze these with ImageNet

Imsitu Result

ImageNet classification

Imsitu Result

Verb Role Noun Potential Labels

A (Verb Role Noun Potential) + B (Labels)Post Processing:

Imsitu Result

Verb Potential

Verb Role Noun Potential

Imagenet

Labels

Preprocessing

Future Work● Add labels in preprocessing.

Exp 2: Different moods● Laughing● Smiling● Frowning● Grimacing● Winking● Squinting● Shouting● Puckering

LaughingSmiling Frowning

PuckeringSquintingWinking

Success Case

Smiling

Agent place

0.35967

Laughing

Agent place

0.35777

Shouting

Agent place

0.37531

Success Case

Frowning

Agent place

0.24378

Grimacing

Agent place

0.21052

Failure Case

Winking

Agent place

0.20954

Puckering

Agent place

woman -

0.21052

Test Images (25)

● Conclusion: Detect different moods of faces with slight variation.

Some Interest Categorization

Camouflaging

Agent frog

Hiding Item pebble

Place -

Camouflaging

Agent owl

Hiding Item tree

Place outdoors

neighboring images

Thank You

No Agent(2)

Watering

Agent Person

Tool Bucket

Place garden

Shredding

Agent Person

Tool Shreder

Item paper

Place -

Visual Semantic Role Labelling for Image Understanding ...

Documents

Transcript of Visual Semantic Role Labelling for Image Understanding ...

Deep Visual-Semantic Hashing for Cross-Modal · PDF fileDeep Visual-Semantic Hashing for Cross-Modal Retrieval Yue ... semantic fusion network for ... Each fully-connected layer ‘learns

Effective Semantic Pixel labelling with Convolutional ... Semantic Pixel labelling with Convolutional Networks and Conditional Random Fields Sakrapee Paisitkriangkrai∗1, Jamie Sherrah†2,

Towards Semantic Embedding in Visual Vocabulary

Supplementary Material: Language-Agnostic Visual-Semantic ...openaccess.thecvf.com/content_ICCV_2019/...Language-Agnostic Visual-Semantic Embeddings 1. Hyper-Parameters and Training

SEMANTIC PHOTOGRAMMETRY: BOOSTING IMAGE-BASED 3D ...3dom.fbk.eu/sites/3dom.fbk.eu/files/stathopoulou... · contribution of semantic labelling towards 3D reconstruction in photogrammetry.

Semantic Role Labelling (SRL) for Information Extraction of Bio-medical data.

Video Thumbnail Selection Deep Visual Semantic …arunv/master_thesis_defence.pdfDeep Visual Semantic Embedding for Video Thumbnail Selection Arun Balajee Vasudevan Supervisors: Michael

NBSearch: Semantic Search and Visual Exploration of ...

Region-Based Active Learning for Efficient Labelling in ... spotlight presentation.pdf · Region-Based Active Learning for Efficient Labelling in Semantic Segmentation Tejaswi Kasarla1,

SEMANTiCS 2018 – 14th International Conference on Semantic ...polleres/publications/neum-poll-2018SEMANTiCS.pdf · Geo-Semantic Labelling of Open Data Sebastian Neumaiera,1, Axel

DeViSE: A Deep Visual-Semantic Embedding Model

Grounded Semantic Composition for Visual Scenes · 2006. 1. 11. · Grounded Semantic Composition for Visual Scenes 1.1 Grounded Semantic Composition We use the term grounded semantic

Lecture 3: Semantic Role Labelling

Visual Relationship Detection Using Joint Visual-Semantic ...

Visual Semantic Complex Network for Web Images

JOURNAL OF LA Geometry-Entangled Visual Semantic ...

Visual Representations for Semantic Target Driven Navigation · Visual Representations for Semantic Target Driven Navigation Arsalan Mousavian2; , Alexander Toshev1;?, Marek Fiˇser

Visual-Semantic Embeddings: some thoughts on Language

ICSC2015 KeyNote: Semantic links in visual web

Outline From Tuesday Shallow Semantics: Semantic Role ...verbs.colorado.edu/~mpalmer/Ling7800/LSA-SemanticRoleLabeling.pdf · 1 Shallow Semantics: Semantic Role Labelling, and VerbNet