Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department...

55
©2014 Per-Erik Forssén ©2014 Per-Erik Forssén Visual Object Recognition Lecture 2: Image Formation Per-Erik Forssén, docent Computer Vision Laboratory Department of Electrical Engineering Linköping University

Transcript of Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department...

Page 1: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n© 2 0 1 4 P e r - E r i k F o r s s é n

Visual Object RecognitionLecture 2: Image Formation

Per-Erik Forssén, docentComputer Vision Laboratory

Department of Electrical EngineeringLinköping University

Page 2: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Lecture 2: Image Formation• Pin-hole, and thin lens cameras

Projective geometry, lens distortion, vignetting, intensity, colour

• Geometric and Photometric Invariance Colour constancy, colour spaces, affine illumination model, homographies, epipolar geometry, canonical frames

Page 3: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

The Pin-Hole Camera

• A brightly illuminated scene will be projected onto a wall opposite of the pin-hole.

• The image is rotated 180º.

Page 4: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

The Pin-Hole Camera

• From similar triangles we get:

zx y

Optical Centre

fx

y

X=XY( )Z

X

Y

Z

x = f

X

Z

y = fY

Z

2

4x

y

1

3

5 =

2

4f 0 00 f 00 0 1

3

5

2

4X

Y

Z

3

5

Page 5: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• More generally, we write:f-focal length, s-skew, a-aspect ratio, c-projection of optical centre

The Pin-Hole Camera�

2

4x

y

1

3

5 =

2

4f 0 00 f 00 0 1

3

5

2

4X

Y

Z

3

5

2

4x

y

1

3

5 =

2

4f s c

x

0 fa c

y

0 0 1

3

5

2

4X

Y

Z

3

5

Page 6: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• Motivation:f-focal length, s-skew, a-aspect ratio, c-projection of optical centre

The Pin-Hole Camera

Optical Centre

w

h

f

Image Plane Image Grid

cxcy( )

oFOV

OFOV

x

zy

2

4x

y

1

3

5 =

2

4f s c

x

0 fa c

y

0 0 1

3

5

2

4X

Y

Z

3

5

Page 7: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• Motivation:f-focal length, s-skew, a-aspect ratio, c-projection of optical centre

The Pin-Hole Camera

Optical Centre

w

h

f

Image Plane Image Grid

cxcy( )

oFOV

OFOV

x

zy

2

4x

y

1

3

5

|{z}x

=

2

4f s c

x

0 fa c

y

0 0 1

3

5

| {z }K

2

4X

Y

Z

3

5

| {z }˜

X

x ⇠ K

˜

X

,

Page 8: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• Also normalized image coordinates:

The Pin-Hole Camera�

2

4x

y

1

3

5

|{z}x

=

2

4f s c

x

0 fa c

y

0 0 1

3

5

| {z }K

2

4X

Y

Z

3

5

| {z }˜

X

x ⇠ K

˜

X

,

u =

2

4X/ZY/Z1

3

5x = Ku ⇠ KX̃

u = K

�1x

Page 9: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

The Pin-Hole Camera• For a general position of the world coordinate

system (WCS) we have:

u ⇠

2

4r11 r12 r13 t1r21 r22 r23 t2r31 r32 r33 t3

3

5

| {z }[R|t]

2

664

XYZ1

3

775

| {z }X

Page 10: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

The Pin-Hole Camera• For a general position of the world coordinate

system (WCS) we have:

,u ⇠

2

4r11 r12 r13 t1r21 r22 r23 t2r31 r32 r33 t3

3

5

| {z }[R|t]

2

664

XYZ1

3

775

| {z }X

u ⇠ [R|t]X

Page 11: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

The Pin-Hole Camera• For a general position of the world coordinate

system (WCS) we have: and thus

,

x ⇠ K[R|t]X

u ⇠

2

4r11 r12 r13 t1r21 r22 r23 t2r31 r32 r33 t3

3

5

| {z }[R|t]

2

664

XYZ1

3

775

| {z }X

u ⇠ [R|t]X

Page 12: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Epipolar geometry• The epipolar geometry of two cameras:

• e1, e2 are called epipoles. o1,o2 are the optical centres.

baseline

X=XY( )Z

e1e2

o1 o2

camera 1 camera 2

x2x1

epipolar plane

epipolar lines

Page 13: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Homographies• For a planar object, we can imagine a world

coordinate system fixed to the object

X

Y

Z

zx y

Optical Centre

fx

y

X=XY( )0

2

4x

y

1

3

5 = K

2

4r11 r12 r13 t1

r21 r22 r23 t2

r31 r32 r33 t3

3

5

2

664

X

Y

Z

1

3

775

Page 14: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Homographies• For a planar object, we can imagine a world

coordinate system fixed to the object

X

Y

Z

zx y

Optical Centre

fx

y

X=XY( )0

2

4x

y

1

3

5 = K

2

4r11 r12 t1

r21 r22 t2

r31 r32 t3

3

5

2

4X

Y

1

3

5 =

2

4h11 h12 h13

h21 h22 h23

h31 h32 h33

3

5

| {z }H

2

4X

Y

1

3

5

Page 15: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Homographies• Projections into two cameras:

• As the homography is invertible, we can now map from camera 2 to the object and on to camera 1:

Page 16: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Epipolar Geometry• So in general, two view geometry only tells

us that a corresponding point lies somewhere along a line.

• In practice, we often know more, as objects often have planar, or near planar surfaces.i.e., we are close to the homography case.

• Also: If the views have a short relative baseline, we can use even more simple models.

Page 17: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Thin Lens Camera• An actual pinhole lets in too little light, and a

bigger hole blurs the picture.

• Real cameras instead use lenses to obtain a sharp image using more light.

Page 18: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Thin Lens Camera• A thin lens is a (positive) lens with

• Parallel rays converge at the focal points

• Rays through the optical centre are not refracted

Optical Centre

fFocal point

d

d << f

Page 19: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Thin Lens Camera

• Thin lens relation (from similar triangles):

Optical Centre

f

Focal point

d

Image plane

l

Z

Focal point

1

f=

1

Z+

1

l

Page 20: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Thin Lens Camera

• Focus at one depth only.

• Objects at other depths are blurred.

Optical Centre

f

Focal point

d

Image plane

l

Z

Focal point

Page 21: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• Adding an aperture increases the depth-of-field, the range which is sharp in the image.

• A compromise between pinhole and thin lens.

Optical Centre

f

Focal point

d

Image plane

l

Z

Focal point

Aperture

Thin Lens Camera

Page 22: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Lens distortion

• For zoom lenses:- Barrel at wide FoV - pin-cushion at narrow FoV

Barrel distortionCorrect Pin-cushion distortion

Page 23: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Lens distortion

• Modelling

• Used in optimisation such as BA

DistortedCorrect image

)

x ⇠ Kf(u,⇥0)

Page 24: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Lens distortion

• Rectification

• Used in dense stereo

)

Distorted image Correct

x

0 ⇠ f�1(Ku,⇥)

Page 25: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Lens Effects

• Vignetting and cos4-law- more severe on wide angle cameras

Darkened peripheryCorrect

Page 26: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Lens Effects• Vignetting

• cos4-lawdampening withcos4(w)

http://software.canon-europe.com/files/documents/EF_Lens_Work_Book_10_EN.pdf

Page 27: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• Sensor activation is linear

• s-sensor absorption spectrum, r-reflectance spectrum of object, e-emission spectrum of light source (attenuated by the atmosphere)

a(x) =

Zs(�)r(�,x)e(�)d�

Image intensity

Page 28: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• Sensor activation is linear

• s-sensor absorption spectrum, r-reflectance spectrum of object, e-emission spectrum of light source (attenuated by the atmosphere)

• However, most images are gamma corrected

a(x) =

Zs(�)r(�,x)e(�)d�

i(x) = a(x)�

Image intensity

Page 29: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• Sensor activation is linear

• HVS handles a 1010 dynamic range on

• Exposure time and aperture size also scale the activation.

• Unless we know all aspects of image formation, we cannot trust the absolute intensity value.

a(x) =

Zs(�)r(�,x)e(�)d�

Image intensity

Page 30: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• Colour perception is done using three different activation functions

Colour

http://publiclab.org/wiki/ndvi-plots-ir-kit

Page 31: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• Colour perception is done using three different activation functions

• Sensor activation is not colour. Colour is an object property, i.e. a representation of .

• In order to estimate colour, we need to somehow compensate for the illumination .

Colour

Page 32: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Invariance Transformations

varying camera pose

varying illumination

• Two categories of nuisance factors for recognition/matching

Page 33: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• For matching we need either to know the changes, or an invariance transformation

• Ideally, an invariance transformation should keep information intrinsic to the object, but remove all influence from the imaging process

varying camera pose

varying illumination

Invariance Transformations

Page 34: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• Photometric invariance gives robustness toillumination changes

• Geometric invariance gives robustness to view changes

varying camera pose

varying illumination

Invariance Transformations

Page 35: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Colour Constancy

http://web.mit.edu/persci/people/adelson/checkershadow_illusion.html

Page 36: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Colour Constancy

http://web.mit.edu/persci/people/adelson/checkershadow_illusion.html

Page 37: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• The colours we perceive are not the activations of cones in the retina.

• Colour constancy is an attempt by the HVS to transform the retinal activation into a normalized (white) reference illumination.

• Complex process that takes place at many levels (retina, V2,…) and uses high level information (e.g. known object colour).

• White balancing in cameras is a low-level technical equivalent.

Colour Constancy

Page 38: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• Object basedcolour transfer

• If you know what you are looking at, you also know something about the illumination

Colour Constancy

Forssén & Moe, View Matching with Blob Features, JIVC’09

Page 39: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• If illumination changes, direct matching fails

Input

I(x) = k1I0(x)

J(x) = k2I0(x)

Photometric invariance

X

x

||I(x)� J(x)|| = large

Page 40: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• If illumination changes, direct matching fails

• We seek a function that is invariant to scalings

Input

I(x) = k1I0(x)

J(x) = k2I0(x)

Photometric invariance

X

x

||I(x)� J(x)|| = large

X

x

||f(I(x))� f(J(x))|| = small

Page 41: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• For cameras with non non-linear radiometricresponse (and e.g. gamma correction), or if two different cameras are used we may use the affine model:

• How should we choose f ? we want:

Photometric invariance

I(x) = k1I0(x) + k0

X

x

||f(I(x))� f(J(x))|| = small

Page 42: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• Mean subtraction, derivatives, and other DC free linear filters remove a constant offset in intensity

• Normalising a patch by e.g. the standard deviation, removes scalings of the intensity.

• Affine invariance by combining both:

Photometric invariance

f(I(x)) = (I(x)� µI)/�I

µI = mean of patch

�I = std of patch

Page 43: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• IllustrationInput Gradient Normalised

gradient Normalised input

J(x)

I(x)

Photometric invariance

Page 44: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• Transform each pixel separately

• Move intensity change into one dimension. E.g. HSV space.

• In matching, V is then downweighted (or discarded)

Other colour spaces

Page 45: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• Other popular choices are the CIE Lab space (left)

• and the Yuv space.

• Colour space changes do not handle changes in the illumination colour.

Other colour spaces

Page 46: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• The geometric invariances we use make a locally planar assumption. (see also today’s paper)

• They can thus be described using homographies.

Geometric invariance

Page 47: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• A homography is a transformation between points x on one plane, and points y on another.

• At most 8 degrees-of-freedom (dof) as define the same transformation

• See e.g. R. Hartley and A. Zisserman, Multiple View Geometry for Computer Vision

Geometric invariance

2

4y1

y2

1

3

5 = H

2

4x1

x2

1

3

5 =

2

4h11 h12 h13

h21 h22 h23

h31 h32 h33

3

5

2

4x1

x2

1

3

5

H and kH, k 2 R \ 0

Page 48: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• A hierarchy of transformations: scale+translation (3dof)

• similarity (4dof)(scale+translation+rotation)

• affine (6dof)(similarity+skew)

• plane projective (8dof)(affine+forshortening)

Geometric invariance2

4s 0 t10 s t20 0 1

3

5

2

4c s t1�s c t20 0 1

3

5

2

4a11 a12 t1a21 a22 t20 0 1

3

5

2

4h11 h12 h13

h21 h22 h23

h31 h32 h33

3

5

Page 49: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Canonical Frames• Aka. covariant frames, and invariant frames.

Resample patches to canonical frame. Points from e.g. Harris detector, or DoG maxima.

Page 50: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

• After resampling matching is much easier!

Canonical Frames

Page 51: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Canonical Frames• Combinatoral issues

From Harris or DoG we get images full of keypoints.

Page 52: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Canonical Frames• Combinatoral issues

• From Harris or DoG we get images full of keypoints.

• Using the points, we want to generate frames in both reference and query view and match them.

• We don’t want to miss a combination in one of the images, but we don’t want to generate too many combinations either.

Page 53: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Canonical Frames• Solutions:

• Use each point as a reference point.

• Restrict frame construction to k-nearest neighbours in scale space (or image plane).

• Remove duplicate groupings, and reflections.

Page 54: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Summary• Photometric invariance is needed to handle

changes in illumination.

• Common approaches: colour constancy, other colour spaces, and normalisation

• Geometric invariance handles changes in object pose

• A locally planar assumption is very useful for geometric invariance

Page 55: Per-Erik Forssén, docent Computer Vision Laboratory ... · Computer Vision Laboratory Department of Electrical Engineering ... Lens distortion • For zoom lenses: ... I = mean of

© 2 0 1 4 P e r - E r i k F o r s s é n

Discussion• Questions/comments on paper: M. Brown, D. Lowe, "Invariant Features from Interest Point Groups", BMVC 2002

• PhD students only, round robin scheduling