Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon...

94
Image Retrieval Credit goes to: John Tait University of Sunderland, UK

Transcript of Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon...

Page 1: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

Image Retrieval

Credit goes to: John Tait

University of Sunderland, UK

Page 2: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

2

Outline of Afternoon

– Introduction • Why image retrieval is hard

• How images are represented

• Current approaches

– Indexing and Retrieving Images • Navigational approaches

• Relevance Feedback

• Automatic Keywording

– Advanced Topics, Futures and Conclusion • Video and music retrieval

• Towards practical systems

• Conclusions and Feedback

Page 3: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

3

Scope

General Digital Still Photographic Image Retrieval

– Generally colour

Some different issues arise

– Narrower domains

• E.g.Medical images especially where part of body and/or specific disorder is suspected

– Video

– Image Understanding - object recognition

Page 4: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

4

Thanks to

Chih-Fong Tsai

Sharon McDonald

Ken McGarry

Simon Farrand

And members of the University of

Sunderland Information Retrieval Group

Page 5: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

Introduction

Page 6: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

6

Why is Image Retrieval Hard ?

? What is the topic of

this image

? What are right

keywords to index

this image

? What words would

you use to retrieve

this image ?

The Semantic Gap

Page 7: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

7

Problems with Image Retrieval

A picture is worth a thousand words

The meaning of an image is highly

individual and subjective

Page 8: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

8

How similar are these two

images

Page 9: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

How Images are represented

Page 10: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

10

Page 11: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

11

Page 12: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

12

Compression

• In practice images are stored as compressed

raster

– Jpeg

– Mpeg

• Cf Vector …

• Not Relevant to retrieval

Page 13: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

13

Image Processing for Retrieval

• Representing the Images

– Segmentation

– Low Level Features

• Colour

• Texture

• Shape

Page 14: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

14

Image Features

• Information about colour or texture or shape

which are extracted from an image are

known as image features

– Also low-level features

• Red, sandy

– As opposed to high level features or concepts

• Beaches, mountains, happy, serene, George Bush

Page 15: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

15

Image Segmentation

• Do we consider the whole image or just part

?

– Whole image - global features

– Parts of image - local features

Page 16: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

16

Global features

• Averages across whole image

Tends to loose distinction between foreground and background

Poorly reflects human understanding of images

Computationally simple

A number of successful systems have been built using global image features including Sunderland’s CHROMA

Page 17: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

17

Local Features

• Segment images into parts

• Two sorts:

– Tile Based

– Region based

Page 18: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

18

Regioning and Tiling Schemes

(a) 5 tiles (b) 9 tiles

(c) 5 regions (d) 9 regions

Tiles

Regions

Page 19: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

19

Tiling

Break image down into simple geometric

shapes

Similar Problems to Global

Plus dangers of breaking up significant objects

Computational Simple

Some Schemes seem to work well in practice

Page 20: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

20

Regioning

• Break Image down into visually coherent

areas

Can identify meaningful areas and

objects

Computationally intensive

Unreliable

Page 21: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

21

Colour

• Produce a colour signature for

region/whole image

• Typically done using colour correllograms

or colour histograms

Page 22: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

22

Colour Histograms

Identify a number of buckets in which to sort the

available colours (e.g. red green and blue, or up to

ten or so colours)

Allocate each pixel in an image to a bucket and count

the number of pixels in each bucket.

Use the figure produced (bucket id plus count,

normalised for image size and resolution) as the index key (signature) for each image.

Page 23: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

23

Global Colour Histogram

0

10

20

30

40

50

60

70

80

90

Red Orange

Page 24: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

24

Other Colour Issues

• Many Colour Models

– RGB (red green blue)

– HSV (Hue Saturation Value)

– Lab, etc. etc.

• Problem is getting something like human

vision

– Individual differences

Page 25: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

25

Texture

• Produce a mathematical characterisation of

a repeating pattern in the image

– Smooth

– Sandy

– Grainy

– Stripey

Page 26: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

26

Page 27: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

27

Page 28: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

28

Texture

• Reduces an area/region to a (small - 15 ?)

set of numbers which can be used a

signature for that region.

• Proven to work well in practice

• Hard for people to understand

Page 29: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

29

Shape

• Straying into the realms of object

recognition

• Difficult and Less Commonly used

Page 30: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

30

Ducks again

• All objects have

closed boundaries

• Shape interacts in a

rather vicious way

with segmentation

• Find the duck shapes

Page 31: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

31

Page 32: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

32

Summary of Image

Representation • Pixels and Raster

• Image Segmentation

– Tiles

– Regions

• Low-level Image Features

– Colour

– Texture

– Shape

Page 33: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

Indexing and Retrieving

Images

Page 34: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

34

Overview of Section 2

Quick Reprise on IR

Navigational Approaches

Relevance Feedback

Automatic Keyword Annotation

Page 35: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

35

Reprise on Key Interactive IR

ideas Index Time vs Query Time Processing

Query Time

Must be fast enough to be interactive

Index (Crawl) Time

Can be slow(ish)

There to support retrieval

Page 36: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

36

An Index

A data structure which stores data in a suitably

abstracted and compressed form in order to

faciliate rapid processing by an application

Page 37: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

37

Indexing Process

Page 38: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

Navigational Approaches

to Image Retrieval

Page 39: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

39

Essential Idea

Layout images in a virtual space in an

arrangement which will make some sense to

the user

Project this onto the screen in a

comprehensible form

Allow them to navigate around this projected

space (scrolling, zooming in and out)

Page 40: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

41

Observation

It appears people can take in and will inspect

many more images than texts when searching

Page 41: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

42

CHROMA

Development in Sunderland:

mainly by Ting Sheng Lai now of National Palace

Museum, Taipei, Taiwan

Structure Navigation System

Thumbnail Viewer

Similarity Searching

Sketch Tool

Page 42: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

43

The CHROMA System

General Photographic Images

Global Colour is the Primary Indexing Key

Images organised in a hierarchical

classification using 10 colour descriptors and

colour histograms

Page 43: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

44

Access System

Page 44: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

45

The Navigation Tool

Page 45: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

46

Technical Issues

Fairly Easy to arrange image signatures so

they support rapid browsing in this space

Page 46: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

Relevance Feedback

More Like this

Page 47: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

48

Relevance Feedback

Well established technique in text retrieval

Experimental results have always shown it to work

well in practice

Unfortunately experience with search engines

has show it is difficult to get real searchers to

adopt it - too much interaction

Page 48: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

49

Essential Idea

User performs an initial query

Selects some relevant results

System then extracts terms from these to

augment the initial query

Requeries

Page 49: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

50

Many Variants

Pseudo

Just assume high ranked documents are relevant

Ask users about terms to use

Include negative evidence

Etc. etc.

Page 50: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

51

Query-by-Image-Example

Page 51: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

52

Why useful in Image Retrieval?

1. Provides a bridge between the users

understanding of images and the low level

features (colour, texture etc.) with which the

systems is actually operating

2. Is relatively easy to interface to

Page 52: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

53

Image Retrieval Process

Ducks

Green

Water Texture

Leaf Texture

Page 53: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

54

Observations

Most image searchers prefer to use key words

to formulate initial queries

Eakins et al, Enser et al

First generation systems all operated using low

level features only

Colour, texture, shape etc.

Smeulders et al

Page 54: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

55

Ideal Image Retrieval Process

Thumbnail

Browsing

Need Keyword

Query

More Like

this

Page 55: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

56

Image Retrieval as Text Retrieval

What we really want to do is make the image

retrieval problem text retrieval

Page 56: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

57

Three Ways to go

Manually Assign Keywords to each image

Use text associated with the images (captions,

web pages)

Analyse the image content to automatically

assign keywords

Page 57: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

58

1- Manual Keywording

Expensive

Can only really be justified for high value

collections – advertising

Unreliable

Do the indexers and searchers see the images in

the same way

Feasible

Page 58: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

59

2- Associated Text

Cheap

Powerful

Famous names/incidents

Tends to be “one dimensional”

Does not reflect the content rich nature of images

Currently Operational - Google

Page 59: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

60

Possible Sources

of Associated text

Filenames

Anchor Text

Web Page Text around the anchor/where the

image is embedded

Page 60: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

61

3- Automatic Keyword Assignment

A form of Content Based Image Retrieval

Cheap (ish)

Predictable (if not always “right”)

No operational System Demonstrated

Although considerable progress has been made

recently

Page 61: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

62

Basic Approach

Learn a mapping from the low level image

features to the words or concepts

Page 62: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

63

Two Routes

1. Translate the image into piece of text

n Forsyth and other s

n Manmatha and others

2. Find that category of images to which a

keyword applies

n Tsai and Tait

n (SIGIR 2005)

Page 63: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

64

Summary

Separating Index Time and Retrieval Time Operations

“First generation CBIR”

Navigation (by colour etc.)

Relevance Feedback

Keyword based Retrieval

Manual Indexing

Associated Text

Automatic Keywording

Page 64: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

Advanced Topics, Futures and

Conclusions

Page 65: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

66

Outline

Video and Music Retrieval

Towards Practical Systems

Conclusions and Feedback

Page 66: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

Video and Music

Retrieval

Page 67: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

68

Video Retrieval

• All current Systems are based on one or

more of:

– Narrow domain - news, sport

– Use automatic speech recognition to do

speech to text on the soundtrack

– Do key frame extraction and then treat the

problem as still image retrieval

Page 68: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

69

Missing Opportunities in Video

Retrieval

• Using delta’s - frame to frame

differences - to segment the image into

foreground/background, players, pitch,

crowd etc.

• Trying to relate image data to

language/text data

Page 69: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

70

Music Retrieval

• Distinctive and Hard Problem

– What makes one piece of music similar to

another

• Features

– Melody

– Artist

– Genre ?

Page 70: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

Towards Practical Systems

Page 71: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

72

Ideal Image Retrieval Process

Thumbnail

Browsing

Need Keyword

Query

More Like

this

Page 72: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

73

Requirements

> 5000 Key word vocabulary

> 5% accuracy of keyword assignment for all

keywords

> 5% precision in response to single key word queries

The Semantic Gap Bridged!

Page 73: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

74

CLAIRE

Example State of the Art Semantic CBIR System

Colour and Texture Features

Simple Tiling Scheme

Two Stage Learning Machine

SVM/SVM and SVM/k-NN

Colour to 10 basic colours

Texture to one texture term per category

Page 74: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

75

Tiling Scheme

Page 75: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

76

Texture

Classifier

Architecture of Claire

Data

Extractor

Known Key Word/class

Page 76: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

77

Training/Test Collection

Randomly Selected from Corel

Training Set

30 images per category

Test Collection

20 images per category

Page 77: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

78

SVM/SVM Keywording with 100+50

Categories

0%

10%

20%

30%

40%

50%

60%

70%

10 30 50 70 100

concrete classes

abstract classes

baseline

Page 78: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

79

Examples Keywords

Concrete

Beaches

Dogs

Mountain

Orchids

Owls

Rodeo

Tulips

Women

Abstract

Architecture

City

Christmas

Industry

Sacred

Sunsets

Tropical

Yuletide

Page 79: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

80

SVM vs kNN

0%

10%

20%

30%

40%

50%

60%

70%

10 30 50 70 100 150

SVM concrete

SVM abstract

baseline

kNN abstract

kNN concrete

Page 80: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

81

Reduction in Unreachable Classes

0

10

20

30

40

50

60

10 30 50 70 100 150

Missing Category Numbers

SVM concrete

SVM abstract

kNN concrete

kNN abstract

Page 81: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

82

Labelling Areas of Feature Space

Mountain

Sea

Tree

Page 82: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

83

Overlap in Feature Space

Page 83: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

84

Keywording 200+200 Categories

SVM/1-NN

0%

10%

20%

30%

40%

50%

60%

10 30 50 70 100 150 200

concrete keywords

abstract keywords

baseline

Expon. (abstractkeywords)

Page 84: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

85

Discussion

Results still promising 5.6% of images have at least one relevant keyword assigned

Still useful - but only for a vocabulary of 400 words !

See demo at http://osiris.sunderland.ac.uk/~da2wli/system/silk1/

High proportion of categories which are never assigned

Page 85: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

86

Segmentation

Are the results dependent on the specific

tiling/regioning scheme used ?

Page 86: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

87

Regioning

(a) 5 tiles (b) 9 tiles

(c) 5 regions (d) 9 regions

Page 87: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

88

Effectiveness Comparison

8% (81)

13.7% (25)

16.7% (15)

21.43% (5 )

27.9% (2)

36.67% (0 )

52.5% (0)

14.3% (30)

9 .13% (69)

27.79% (1 )

33% (0 )

40.67% (1 )

61.5% (0)

18.4% (9)

0.00%

10.00%

20.00%

30.00%

40.00%

50.00%

60.00%

70.00%

10 30 50 70 100 150 200

No. of concrete classes

Ac

cu

rac

y

tiles

regions

8% (81)8 .86% (44)

11.25% (21)

16.29% (7 )

22.55% (2 )26.33% (2 )

52.5% (0)

9% (51) 9 .13% (69)

9 .25% (27)

14.14% (7 )

31% (0 )

21.06% (0 )

48% (0 )

0.00%

10.00%

20.00%

30.00%

40.00%

50.00%

60.00%

10 30 50 70 100 150 200

No. of abstract classes

Ac

cu

rac

y

tiles

regionsFive Tiles vs Five Regions

1-NN Data Extractor

Page 88: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

89

Next Steps

More categories

Integration into complete systems

Systematic Comparison with Generative approach

pioneered by Forsyth and others

Page 89: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

90

Other Promising Examples

Jeon, Manmatha and others -

High number of categories - results difficult to interpret

Carneiro and Vasconcelos

Also problems with missing concepts

Srikanth et al

Possibly leading results in terms of precision and vocabulary scale

Page 90: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

91

Conclusions

Image Indexing and Retrieval is Hard

Effective Image Retrieval needs a cheap and

predictable way of relating words and images

Adaptive and Machine Learning approaches offer

one way forward with much promise

Page 91: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

Feedback

Comments and Questions

Page 92: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

Selected Bibliography

Page 93: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

94

Early Systems

The following leads into all the major trends in systems based on colour, texture and shape

A. Smeaulder, M. Worring, S. Santini, A. Gupta and R. Jain “Content-based Image Retrieval: the end of the early years” IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12):1349-1380, 2000.

CHROMA Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

of the 26th ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2003), Toronto, July, 2003. pp 80-87. ISBN 1-58113-646-3

Sharon McDonald, Ting-Sheng Lai and John Tait, “Evaluating a Content Based Image Retrieval System” Proceedings of the 24th ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2001), New Orleans, September 2001. W.B. Croft, D.J. Harper, D.H.

Kraft, and J. Zobel (Eds). ISBN 1-58113-331-6 pp 232-240.

Translation Based Approaches P. Duygulu, K. Barnard, N. de Freitas and D. Forsyth “Learning a Lexicon for a Fixed Image

Vocabulary” European Conference on Computer Vision, 2002.

K. Barnard, P. Duygulu, N. de Freitas and D. Forsyth “Matching Words and Pictures” Journal of machine Learning Research 3: 1107-1135, 2003.

Very recent new paper on this is: P. Virga, P. Duygulu “Systematic Evaluation of Machine Translation Methods for Image and Video

Annotation” Images and Video Retrieval, Proceedings of CIVR 2005, Singapore, Springer, 2005.

Page 94: Image Retrieval - Dalhousie Universityweb.cs.dal.ca/~anwar/ir/lecturenotes/l14.pdf · Sharon McDonald and John Tait “Search Strategies in Content-Based Image Retrieval” Proceedings

95

Cross-media Relevance Models etc J. Jeon, V. Lavrenko, R. Manmatha “Automatic Image Annotation and Retrieval using Cross-Media

Relevance Models” Proceedings of the 26th ACM SIGIR Conference on Research and Development

in Information Retrieval (SIGIR 2003), Toronto, July, 2003. Pp 119-126

See also recent unpublished papers on

http://ciir.cs.umass.edu/~manmatha/mmpapers.html

More recent stuff

G Carneiro and N. Vasconcelos “A Database Centric View of Sentic Image Annotation and Retrieval”

Proceedings of the 28th ACM SIGIR Conference on Research and Development in Information

Retrieval (SIGIR 2005), Salvador, Brazil, August, 2005

M. Srikanth, J. Varner, M. Bowden, D. Moldovan “Exploiting Ontologies for Automatic Image

Annotation” Proceedings of the 28th ACM SIGIR Conference on Research and Development in

Information Retrieval (SIGIR 2005), Salvador, Brazil, August, 2005

See also the SIGIR workshop proceedings

http://mmir.doc.ic.ac.uk/mmir2005