Scale Invariant Feature Transform (SIFT)


Outline

What is SIFT

Algorithm overview

Object detection

Summary

Overview

Introduced by David Lowe in 1999

Generates image features, “keypoints”
– invariant to image scaling and rotation
– partially invariant to changes in illumination and 3D camera viewpoint
– many can be extracted from typical images
– highly distinctive

Algorithm overview

Scale-space extrema detection
– Uses a difference-of-Gaussian function

Keypoint localization
– Sub-pixel location and scale fit to a model

Orientation assignment
– One or more for each keypoint

Keypoint descriptor
– Created from local image gradients

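Scale space

Definition:

$$L(x, y, \sigma) = G(x, y, \sigma) * I(x, y)$$

where $*$ is convolution in $x$ and $y$, $I(x, y)$ is the input image, and

$$G(x, y, \sigma) = \frac{1}{2\pi\sigma^2} e^{-(x^2 + y^2) / (2\sigma^2)}$$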

Keypoints are detected using scale-space extrema in the difference-of-Gaussian function D

D definition:

$$D(x, y, \sigma) = (G(x, y, k\sigma) - G(x, y, \sigma)) * I(x, y) = L(x, y, k\sigma) - L(x, y, \sigma)$$

Efficient to compute: the smoothed images L are needed anyway, so D is a simple image subtraction
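The definition of D can be sketched in a few lines of NumPy. This is a minimal illustration, not a reference implementation: the helper names, the truncation of the kernel at about 4σ, and the edge padding are all assumptions made here for the example.

```python
import numpy as np

def gaussian_kernel_1d(sigma):
    """Sampled, normalized 1D Gaussian, truncated at about 4 sigma (assumption)."""
    radius = max(1, int(np.ceil(4.0 * sigma)))
    x = np.arange(-radius, radius + 1, dtype=float)
    kernel = np.exp(-(x ** 2) / (2.0 * sigma ** 2))
    return kernel / kernel.sum()

def gaussian_blur(image, sigma):
    """L(x, y, sigma): blur rows, then columns (the 2D Gaussian is separable)."""
    kernel = gaussian_kernel_1d(sigma)
    radius = len(kernel) // 2
    # Edge padding is an arbitrary border choice for this sketch.
    padded = np.pad(np.asarray(image, dtype=float), radius, mode="edge")
    rows = np.apply_along_axis(np.convolve, 1, padded, kernel, mode="valid")
    return np.apply_along_axis(np.convolve, 0, rows, kernel, mode="valid")

def difference_of_gaussian(image, sigma, k):
    """D(x, y, sigma) = L(x, y, k * sigma) - L(x, y, sigma)."""
    return gaussian_blur(image, k * sigma) - gaussian_blur(image, sigma)
```

For a single bright pixel, the response at that pixel is negative, because the wider Gaussian has the lower peak.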

Relationship of D to $\sigma^2 \nabla^2 G$

Close approximation to the scale-normalized Laplacian of Gaussian, $\sigma^2 \nabla^2 G$

Diffusion equation:

$$\frac{\partial G}{\partial \sigma} = \sigma \nabla^2 G$$

Approximate ∂G/∂σ:

$$\frac{\partial G}{\partial \sigma} \approx \frac{G(x, y, k\sigma) - G(x, y, \sigma)}{k\sigma - \sigma}$$

– giving,

$$G(x, y, k\sigma) - G(x, y, \sigma) \approx (k - 1) \sigma^2 \nabla^2 G$$

When D has scales differing by a constant factor it already incorporates the σ² scale normalization required for scale invariance
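The approximation can be checked numerically by comparing the difference of two sampled Gaussians with the analytic Laplacian of Gaussian. The grid, the value k = 1.01, and the function names below are choices made for this sketch only.

```python
import numpy as np

def gaussian(x, y, sigma):
    """G(x, y, sigma) as defined for the scale space."""
    return np.exp(-(x ** 2 + y ** 2) / (2 * sigma ** 2)) / (2 * np.pi * sigma ** 2)

def laplacian_of_gaussian(x, y, sigma):
    """Analytic Laplacian of G: ((x^2 + y^2 - 2 sigma^2) / sigma^4) * G."""
    return (x ** 2 + y ** 2 - 2 * sigma ** 2) / sigma ** 4 * gaussian(x, y, sigma)

sigma, k = 1.6, 1.01
coords = np.linspace(-8.0, 8.0, 65)
x, y = np.meshgrid(coords, coords)

# Left-hand side: G(x, y, k sigma) - G(x, y, sigma)
dog = gaussian(x, y, k * sigma) - gaussian(x, y, sigma)
# Right-hand side: (k - 1) sigma^2 Laplacian(G)
scaled_log = (k - 1) * sigma ** 2 * laplacian_of_gaussian(x, y, sigma)

relative_error = np.max(np.abs(dog - scaled_log)) / np.max(np.abs(scaled_log))
print(f"max relative error: {relative_error:.4f}")
```

The error shrinks as k approaches 1, since the finite difference is a first-order approximation of ∂G/∂σ.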

Scale space construction

Scale space images

[Figure: Gaussian-smoothed images for the first, second, third, and fourth octaves]

Difference-of-Gaussian images

[Figure: difference-of-Gaussian images for the first, second, third, and fourth octaves]

Frequency of sampling

There is no minimum sample spacing that detects all extrema

Best frequency determined experimentally

Prior smoothing for each octave

Increasing σ increases robustness, but costs more computation

σ = 1.6 is a good tradeoff

Doubling the image size before building the pyramid increases the number of keypoints
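As a rough sketch of how the blur levels inside one octave follow from these choices: the slides give σ = 1.6 but not the number of scales per octave, so the 3 intervals per octave, the factor k = 2^(1/s), and the s + 3 images per octave used below are taken from Lowe's paper and should be read as assumptions here. The function name is hypothetical.

```python
def octave_sigmas(sigma0=1.6, intervals=3):
    """Blur levels for one octave: sigma0 * k**i with k = 2**(1 / intervals).

    intervals + 3 images are produced so that extrema can be searched
    over `intervals` complete scales (values assumed from Lowe's paper).
    """
    k = 2.0 ** (1.0 / intervals)
    return [sigma0 * k ** i for i in range(intervals + 3)]

sigmas = octave_sigmas()
```

With the defaults, the fourth level has twice the starting σ (about 3.2), which is the image that would be downsampled to start the next octave.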

Finding extrema

A sample point is selected only if it is a minimum or a maximum of its neighbors: the 8 surrounding points in the same DoG image and the 9 points in each of the two adjacent scales (26 in total)

[Figure: DoG scale space, and the extrema found in this image]
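The neighbor comparison can be sketched as follows, assuming the DoG images of one octave are stacked in a NumPy array indexed as `dog[scale, row, col]`. The array layout and function name are assumptions for this example, and only interior samples are handled.

```python
import numpy as np

def is_extremum(dog, s, y, x):
    """True if dog[s, y, x] is strictly above or below all 26 neighbors."""
    cube = dog[s - 1:s + 2, y - 1:y + 2, x - 1:x + 2]
    center = dog[s, y, x]
    # Index 13 is the center of the flattened 3x3x3 cube; drop it.
    neighbors = np.delete(cube.ravel(), 13)
    return bool(center > neighbors.max() or center < neighbors.min())
```

Strict comparison means a flat region produces no extrema.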

Localization

A 3D quadratic function is fit to the local sample points

Start with the Taylor expansion of D with the sample point as the origin:

$$D(X) = D + \left(\frac{\partial D}{\partial X}\right)^T X + \frac{1}{2} X^T \frac{\partial^2 D}{\partial X^2} X$$

– where $X = (x, y, \sigma)^T$ is the offset from the sample point

Take the derivative with respect to X, and set it to 0, giving

$$\hat{X} = -\left(\frac{\partial^2 D}{\partial X^2}\right)^{-1} \frac{\partial D}{\partial X}$$

$\hat{X}$ is the location of the keypoint

This is a 3x3 linear system:

$$\frac{\partial^2 D}{\partial X^2} \hat{X} = -\frac{\partial D}{\partial X}$$

Derivatives approximated by finite differences,
– example: $\frac{\partial D}{\partial x} \approx \frac{D(x+1, y, \sigma) - D(x-1, y, \sigma)}{2}$

If $\hat{X}$ is > 0.5 in any dimension, the extremum lies closer to a different sample point, so the process is repeated about that point
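A minimal sketch of the 3x3 solve, assuming the same `dog[scale, row, col]` stacking as an illustration and unit spacing between samples in all three dimensions. The function name and return convention are hypothetical; the finite-difference stencils are the standard central differences.

```python
import numpy as np

def localize(dog, s, y, x):
    """Return the offset X_hat = (dx, dy, dsigma) and the value D(X_hat)."""
    d = np.asarray(dog, dtype=float)
    c = d[s, y, x]
    # Gradient by central differences.
    grad = 0.5 * np.array([
        d[s, y, x + 1] - d[s, y, x - 1],
        d[s, y + 1, x] - d[s, y - 1, x],
        d[s + 1, y, x] - d[s - 1, y, x],
    ])
    # Second derivatives.
    dxx = d[s, y, x + 1] - 2 * c + d[s, y, x - 1]
    dyy = d[s, y + 1, x] - 2 * c + d[s, y - 1, x]
    dss = d[s + 1, y, x] - 2 * c + d[s - 1, y, x]
    dxy = 0.25 * (d[s, y + 1, x + 1] - d[s, y + 1, x - 1]
                  - d[s, y - 1, x + 1] + d[s, y - 1, x - 1])
    dxs = 0.25 * (d[s + 1, y, x + 1] - d[s + 1, y, x - 1]
                  - d[s - 1, y, x + 1] + d[s - 1, y, x - 1])
    dys = 0.25 * (d[s + 1, y + 1, x] - d[s + 1, y - 1, x]
                  - d[s - 1, y + 1, x] + d[s - 1, y - 1, x])
    hessian = np.array([[dxx, dxy, dxs],
                        [dxy, dyy, dys],
                        [dxs, dys, dss]])
    # Solve the 3x3 system rather than inverting the Hessian explicitly.
    offset = -np.linalg.solve(hessian, grad)
    value = c + 0.5 * grad @ offset
    return offset, value
```

On samples of an exact quadratic the recovered offset is exact, which makes a convenient sanity check. A real implementation would also need to handle a singular Hessian and the repeat step when any offset component exceeds 0.5.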

Filtering

Contrast (use prev. equation): substituting $\hat{X}$ into the Taylor expansion gives $D(\hat{X}) = D + \frac{1}{2} \left(\frac{\partial D}{\partial X}\right)^T \hat{X}$
– If $|D(\hat{X})| < 0.03$ (image values in [0, 1]), throw it out

Edge-iness:
– Use ratio of principal curvatures to throw out poorly defined peaks
– Curvatures come from the Hessian:

$$H = \begin{bmatrix} D_{xx} & D_{xy} \\ D_{xy} & D_{yy} \end{bmatrix}$$

– Ratio of $\mathrm{Tr}(H)^2$ and $\mathrm{Det}(H)$:

$$\mathrm{Tr}(H) = D_{xx} + D_{yy}, \qquad \mathrm{Det}(H) = D_{xx} D_{yy} - (D_{xy})^2$$

– If ratio > $(r+1)^2 / r$, throw it out (SIFT uses r = 10)
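The edge test can be sketched on a single DoG image indexed as `dog_image[row, col]`. The function name is hypothetical, the Hessian entries use central differences, and rejecting points with a non-positive determinant follows Lowe's paper rather than the slide.

```python
import numpy as np

def passes_edge_test(dog_image, y, x, r=10.0):
    """Keep the point only if Tr(H)^2 / Det(H) < (r + 1)^2 / r."""
    d = np.asarray(dog_image, dtype=float)
    dxx = d[y, x + 1] - 2 * d[y, x] + d[y, x - 1]
    dyy = d[y + 1, x] - 2 * d[y, x] + d[y - 1, x]
    dxy = 0.25 * (d[y + 1, x + 1] - d[y + 1, x - 1]
                  - d[y - 1, x + 1] + d[y - 1, x - 1])
    trace = dxx + dyy
    det = dxx * dyy - dxy ** 2
    if det <= 0:
        # Curvatures of opposite sign, or no curvature in one direction.
        return False
    return trace ** 2 / det < (r + 1) ** 2 / r
```

With r = 10 the threshold is 12.1. A round blob with equal curvatures has a ratio of 4 and passes, while a ridge with curvature in only one direction is rejected.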

Orientation assignment

Descriptor computed relative to keypoint’s orientation achieves rotation invariance

Gradient orientation θ is precomputed along with magnitude m for all levels (useful in descriptor computation)

$$m(x, y) = \sqrt{(L(x+1, y) - L(x-1, y))^2 + (L(x, y+1) - L(x, y-1))^2}$$

$$\theta(x, y) = \operatorname{atan2}\big(L(x, y+1) - L(x, y-1),\; L(x+1, y) - L(x-1, y)\big)$$

Multiple orientations assigned to keypoints from an orientation histogram
– Significantly improves stability of matching
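The two formulas translate directly into array slicing. This sketch assumes a smoothed image stored as `L[row, col]` and returns values for interior pixels only. The magnitude-weighted histogram is a simplified illustration: the 36-bin default comes from Lowe's paper, not from the slides, and the Gaussian window and peak selection used in SIFT are left out.

```python
import numpy as np

def gradient_magnitude_orientation(L):
    """m(x, y) and theta(x, y) at interior pixels of L[row, col]."""
    L = np.asarray(L, dtype=float)
    dx = L[1:-1, 2:] - L[1:-1, :-2]   # L(x+1, y) - L(x-1, y)
    dy = L[2:, 1:-1] - L[:-2, 1:-1]   # L(x, y+1) - L(x, y-1)
    return np.sqrt(dx ** 2 + dy ** 2), np.arctan2(dy, dx)

def orientation_histogram(m, theta, bins=36):
    """Histogram of orientations over [0, 2*pi), weighted by magnitude."""
    angles = np.mod(theta, 2 * np.pi).ravel()
    hist, _ = np.histogram(angles, bins=bins, range=(0.0, 2 * np.pi),
                           weights=m.ravel())
    return hist
```

For an image that increases only along x, every gradient points at angle 0, so all the weight falls in the first bin.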

Keypoint images

Descriptor

Descriptor has 3 dimensions (x, y, θ)

Orientation histogram of gradient magnitudes

Position and orientation of each gradient sample rotated relative to keypoint orientation

Descriptor

Weight magnitude of each sample point by Gaussian weighting function

Distribute each sample to adjacent bins by trilinear interpolation (avoids boundary effects)

Descriptor

Best results achieved with 4x4x8 = 128 descriptor size (a 4x4 array of histograms with 8 orientation bins each)

Normalize to unit length
– Reduces effect of illumination change

Cap each element to 0.2, normalize again
– Reduces effect of non-linear illumination changes
– 0.2 determined experimentally
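The normalize, cap, normalize sequence is short enough to sketch directly. The function name and the small guard against a zero-length vector are additions for this example.

```python
import numpy as np

def normalize_descriptor(vec, cap=0.2):
    """Normalize to unit length, cap each element at `cap`, normalize again."""
    v = np.asarray(vec, dtype=float)
    v = v / max(np.linalg.norm(v), 1e-12)   # guard against an all-zero vector
    v = np.minimum(v, cap)
    return v / max(np.linalg.norm(v), 1e-12)
```

Scaling the input by a constant, as a uniform contrast change would, leaves the result unchanged. The cap limits how much a single very large gradient can dominate the vector.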

Object Detection

Create a database of keypoints from training images

Match keypoints to a database
– Nearest neighbor search
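A brute-force version of the nearest neighbor search, using Euclidean distance between descriptor vectors, might look like the sketch below. The function name and array shapes are assumptions; exhaustive search is used only because it is the simplest correct form, and a large database would normally call for an approximate index instead.

```python
import numpy as np

def nearest_neighbors(query, database):
    """For each query descriptor, return the index of and distance to
    the closest database descriptor.

    query: array of shape (n, d); database: array of shape (m, d).
    """
    diffs = query[:, None, :] - database[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=2))
    idx = dists.argmin(axis=1)
    return idx, dists[np.arange(len(query)), idx]
```

For SIFT, d would be 128 and each database row would carry a reference back to its training image.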

PCA-SIFT

Different descriptor (same keypoints)

Apply PCA to the gradient patch

Descriptor size is 20 (instead of 128)

More robust, faster
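The projection step can be illustrated with a generic PCA built on NumPy's SVD. This is not the PCA-SIFT authors' code: the function names are hypothetical, and each row of `patches` is assumed to be one flattened gradient patch from a training set.

```python
import numpy as np

def fit_pca(patches, n_components=20):
    """Learn the mean and top principal directions from training patches.

    patches: array of shape (num_patches, patch_length).
    """
    mean = patches.mean(axis=0)
    # Rows of vt are orthonormal principal directions, strongest first.
    _, _, vt = np.linalg.svd(patches - mean, full_matrices=False)
    return mean, vt[:n_components]

def project(patch, mean, components):
    """Reduce one flattened patch to an n_components-element descriptor."""
    return (patch - mean) @ components.T
```

The basis is learned once from training patches. Each new keypoint then costs one subtraction and one matrix-vector product.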

Summary

Difference-of-Gaussian

Filtering

Orientation assignment

Descriptor, 128 elements
