Object Detection Meets Knowledge Graphs

20
Object Detection Meets Knowledge Graphs Yuan Fang, Kingsley Kuan, Jie Lin, Cheston Tan and Vijay Chandrasekhar Agency for Science, Technology and Research (A*STAR), Singapore

Transcript of Object Detection Meets Knowledge Graphs

Page 1: Object Detection Meets Knowledge Graphs

Object Detection Meets Knowledge Graphs

Yuan Fang, Kingsley Kuan, Jie Lin,

Cheston Tan and Vijay Chandrasekhar

Agency for Science, Technology and Research (A*STAR), Singapore

Page 2: Object Detection Meets Knowledge Graphs

Outline

• Problem & Motivation

• Approach: Semantic consistency

• Approach: Re-optimization

• Results

• Case studies

• Conclusion

Page 3: Object Detection Meets Knowledge Graphs

Problem

Page 4: Object Detection Meets Knowledge Graphs

Motivation

• Most existing methods

– only utilize image features

– Ignoring external knowledge: common sense or domain specific expertise

• Example knowledge

– Cat sits on table

– Bear sits on table

Page 5: Object Detection Meets Knowledge Graphs

Outline

• Problem & Motivation

• Approach: Semantic consistency

• Approach: Overall framework

• Results

• Case studies

• Conclusion

Page 6: Object Detection Meets Knowledge Graphs

Knowledge incorporation through semantic consistency

• Semantic consistency matrix S

– 𝑆𝑙,𝑙′: how related concepts 𝑙, 𝑙′ are

– 𝑆cat,table ≫ 𝑆bear,table

• Object detection probability

𝑃𝑏,𝑙 ≡ 𝑝(𝑙|𝑏): probability of concept 𝑙given bounding box 𝑏

Semantic consistency

Probability in the same image

Example (b, b’ are bounding boxes in the same image)

Large Comparable 𝑝 cat 𝑏 − 𝑝 table 𝑏′ ≈ 0

Small Different 𝑝 bear 𝑏 − 𝑝 table 𝑏′ ≫ 0

Page 7: Object Detection Meets Knowledge Graphs

Constructing semantic consistency: Frequency-based

• Co-occurrence frequency based on training set

• Weakness:– Cannot generalize to new co-occurrences

– The need of a training set

Pointwise mutual information

Page 8: Object Detection Meets Knowledge Graphs

Constructing semantic consistency: knowledge graph (KG) based

• Generalization: indirect relationships (person-plate)

• Robustness: multiple relationships (cat-table, cat-plate-table)

Page 9: Object Detection Meets Knowledge Graphs

Constructing semantic consistency: knowledge graph (KG) based

A random walk 𝑣0, 𝑣1, … , 𝑣𝑡 with restartlim𝑡→∞

𝑃(𝑣𝑡 = 𝑙′|𝑣0 = 𝑙)

Page 10: Object Detection Meets Knowledge Graphs

Outline

• Problem & Motivation

• Approach: Semantic consistency

• Approach: Re-optimization

• Results

• Case studies

• Conclusion

Page 11: Object Detection Meets Knowledge Graphs

Overall Framework

Re-optimization

Page 12: Object Detection Meets Knowledge Graphs

Approach: Re-optimization

Bounding box 𝑏 ∈ {1,2,… , 𝐵}Object labels 𝑙 ∈ {1,2, … , 𝐿}𝑆𝑙,𝑙′: semantic consistency between 𝑙, 𝑙′

𝑃𝑏,𝑙: original probability of label 𝑙 given bounding box 𝑏 𝑃𝑏,𝑙: re-optimized probability of label 𝑙 given bounding box 𝑏

Page 13: Object Detection Meets Knowledge Graphs

Weakness of the proposed approach

• The re-optimization step based on knowledge is a post processing step

• Independent of the object detection model

• Cannot feedback into the detection model (eg. through backpropagation)

• Thesis of this paper: only intends to demonstrate the benefits of utilizing knowledge in deep learning models

Page 14: Object Detection Meets Knowledge Graphs

Outline

• Problem & Motivation

• Approach: Semantic consistency

• Approach: Re-optimization

• Results

• Case studies

• Conclusion

Page 15: Object Detection Meets Knowledge Graphs

Results – MSCOCO dataset

FRCNN: Faster RCNN (knowledge-free)KF-500: Frequency based knowledge (500 images)KF-All: Frequency based knowledge (all)KG-CNet: knowledge graph based on ConceptNet

Up to 4.6% in recall

Page 16: Object Detection Meets Knowledge Graphs

Results – PASCAL VOC dataset

Up to 3.1% in recall

Page 17: Object Detection Meets Knowledge Graphs

Outline

• Problem & Motivation

• Approach: Semantic consistency

• Approach: Re-optimization

• Results

• Case studies

• Conclusion

Page 18: Object Detection Meets Knowledge Graphs

Case study – office scene

𝑆 keyboard, laptop ≈ 135x median valuegroundtruth

detected

Page 19: Object Detection Meets Knowledge Graphs

Case study – outdoor scene

𝑆 surfboard, person ≈ 5x median valuegroundtruth

detected

Page 20: Object Detection Meets Knowledge Graphs

Conclusion & future work

• External knowledge is helpful

• Complement existing methods to achieve better prediction results

• Next step: end-to-end learning with knowledge