CSE 473/573 Computer Vision and Image Processing (CVIP)

Ifeoma Nwogu

Lectures 21 & 22 – Segmentation and clustering

Schedule

• Last class – We started on segmentation

• Today– Segmentation continued

• Readings for today: – Forsyth and Ponce chapter 9; – Szelinski chapter 5

Digital image manipulations

• Image processing image in → image out

• Image analysis image in → measurements out

• Image understanding image in → high-level description out

Perceptual grouping

Motion and perceptual organization

Humans interpret information “collectively” or in groups

6Slides accompanying Forsyth and Ponce “Computer Vision - A Modern Approach” 2e by D.A. Forsyth

Image segmentationThe main goal is to identify groups of

pixels/regions that “go together perceptually”

Image segmentationSeparate image into “coherent objects”

Examples of segmented images

Why do segmentation?

• To obtain primitives for other tasks• For perceptual organization, recognition• For graphics, image manipulation

Task 1: Primitives for other tasks

• Group together similar-looking pixels for efficiency of further processing– “Bottom-up” process– Unsupervised

X. Ren and J. Malik. Learning a classification model for segmentation. ICCV 2003.

“superpixels”

• Image parsing or semantic segmentation:

Example of segments as primitives for recognition

J. Tighe and S. Lazebnik, ECCV 2010, IJCV 2013

Task 2: Recognition• Separate image into coherent “objects”

– “Bottom-up” or “top-down” process?– Supervised or unsupervised?

Berkeley segmentation database:http://www.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/segbench/

image human segmentation

Task 3: Image manipulation• Interactive segmentation for graphics

Challenges with segmentation

High-level approaches to segmentation

• Bottom-up: group tokens with similar features• Top-down: group tokens that likely belong to

the same object

[Levin and Weiss 2006]

Approaches to segmentation

• Segmentation as clustering• Segmentation as graph partitioning• Segmentation as labeling (?)

Segmentation as clustering

• Clustering: grouping together similar points and represent them with a single token

• Key Challenges: 1. What makes two points/images/patches similar? 2. How do we compute an overall grouping from

pairwise similarities?

Segmentation as clustering

Source: K. Grauman

K-means algorithm

Illustration: http://en.wikipedia.org/wiki/K-means_clustering

1. Randomly select K centers

2. Assign each point to nearest center

3. Compute new center (mean) for each cluster

K-means algorithm

Illustration: http://en.wikipedia.org/wiki/K-means_clustering

1. Randomly select K centers

2. Assign each point to nearest center

3. Compute new center (mean) for each cluster

Back to 2

K-means1. Initialize cluster centers: c0 ; t=0

2. Assign each point to the closest center

3. Update cluster centers as the mean of the points

4. Repeat 2-3 until no points are re-assigned (t=t+1)

211argmin xcδδ

21argmin xccc

K-means: design choices• Initialization

– Randomly select K points as initial cluster center– Or greedily choose K points to minimize residual

• Distance measures– Traditionally Euclidean, could be others

• Optimization– Will converge to a local minimum– May want to perform multiple restarts

How to choose the number of clusters?

• Minimum Description Length (MDL) principle for model comparison

• Minimize Schwarz Criterion – also called Bayes Information Criteria (BIC)

sum squared error

How to choose the number of clusters?

• Validation set– Try different numbers of clusters and look at

performance• When building dictionaries (discussed in a previous

class), more clusters typically work better

How to evaluate clusters?

• Generative– How well are points reconstructed from the

clusters?

• Discriminative– How well do the clusters correspond to labels?

• Purity

Note: unsupervised clustering does not aim to be discriminative

Common similarity/distance measures• P-norms

– City Block (L1)

– Euclidean (L2)

– L-infinity

• Mahalanobis– Scaled Euclidean

• Cosine distance

Here xi is the distance between two points

Conclusions: K-means

Good• Finds cluster centers that minimize conditional variance (good

representation of data)• Simple to implement, widespread application

Bad• Prone to local minima• Need to choose K• All clusters have the same parameters (e.g., distance measure is

non-adaptive)• Can be slow: each iteration is O(KNd) for N d-dimensional points

K-medoids

• Just like K-means except– Represent the cluster with one of its members,

rather than the mean of its members– Choose the member (data point) that minimizes

cluster dissimilarity

• Applicable when a mean is not meaningful– E.g., clustering values of hue or using L-infinity

similarity

K-Means pros and cons• Pros

– Simple and fast– Easy to implement

• Cons– Need to choose K– Sensitive to outliers

• Usage– Rarely used for pixel

segmentation

• Versatile technique for clustering-based segmentation

D. Comaniciu and P. Meer, Mean Shift: A Robust Approach toward Feature Space Analysis, PAMI 2002.

Mean shift segmentation

Mean shift algorithm• Try to find modes of this non-parametric

density

Kernel density estimation (KDE)

• A non-parametric way to estimate the probability density function of a random variable.

• Inferences about a population are made based only on a finite data sample.

• Also termed the Parzen–Rosenblatt window method,

• Named fter Emanuel Parzen and Murray Rosenblatt, who are usually credited with independently creating it in its current form

Kernel density estimation

Kernel density estimation function

Gaussian kernel

Kernel density estimationKernel

Data (1-D)

Estimated density

Region ofinterest

Center ofmass

Mean Shiftvector

Slide by Y. Ukrainitz & B. Sarel

Mean shift

Region ofinterest

Center ofmass

Mean Shiftvector

Mean shift

Region ofinterest

Center ofmass

Mean Shiftvector

Mean shift

Region ofinterest

Center ofmass

Mean Shiftvector

Mean shift

Region ofinterest

Center ofmass

Mean Shiftvector

Mean shift

Region ofinterest

Center ofmass

Mean Shiftvector

Mean shift

Region ofinterest

Center ofmass

Mean shift

Simple Mean Shift procedure:• Compute mean shift vector

• Translate the Kernel window by m(x)

x - xx

m x xx - x

g( ) ( )kx x

Computing the Mean Shift

Real Modality Analysis

• Attraction basin: the region for which all trajectories lead to the same mode

• Cluster: all data points in the attraction basin of a mode

Attraction basin

Mean shift filtering and segmentation for grayscale data; (a) input data (b) mean shift paths for the pixels on the plateaus (c) filtering result (d) segmentation result

http://www.caip.rutgers.edu/~comanici/Papers/MsRobustApproach.pdf

Mean shift clustering

• The mean shift algorithm seeks modes of the given set of points1. Choose kernel and bandwidth2. For each point:

a) Center a window on that pointb) Compute the mean of the data in the search windowc) Center the search window at the new mean locationd) Repeat (b,c) until convergence

3. Assign points that lead to nearby modes to the same cluster

• Compute features for each pixel (color, gradients, texture, etc); also store each pixel’s position

• Set kernel size for features Kf and position Ks

• Initialize windows at individual pixel locations• Perform mean shift for each window until convergence• Merge modes that are within width of Kf and Ks

Segmentation by Mean Shift

http://www.caip.rutgers.edu/~comanici/MSPAMI/msPamiResults.html

Mean shift segmentation results

http://www.caip.rutgers.edu/~comanici/MSPAMI/msPamiResults.html

Mean shift pros and cons• Pros

– Good general-purpose segmentation

– Flexible in number and shape of regions

– Robust to outliers

• Cons– Have to choose kernel size in advance

– Not suitable for high-dimensional features

• When to use it– Oversegmentation

– Multiple segmentations

– Tracking, clustering, filtering applications• D. Comaniciu, V. Ramesh, P. Meer:

Real-Time Tracking of Non-Rigid Objects using Mean Shift, Best Paper Award, IEEE Conf. Computer Vision and Pattern Recognition (CVPR'00), Hilton Head Island, South Carolina, Vol. 2, 142-149, 2000

CSE 473/573 Computer Vision and Image Processing (CVIP)

Documents

Transcript of CSE 473/573 Computer Vision and Image Processing (CVIP)

CSE 473/573 Computer Vision and Image Processing (CVIP)

· PDF filePhone # 573-334-082 Attn: Ed Bagby Fax# Phone# 573-751-9040 Attn: Fax # Phone # 573-290-5710 Attn: Brian Deuster Fax # 573-651-2532 Phone # 573-651-2000

EOSC 473/573 Oceanographic Methods · • Research projects in BC. The deep fjords and many islands provide a dynamic ... ‐ final project report in the form of a scientific ...

CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.

CSE 473/573 Computer Vision and Image Processing (CVIP)inwogu/teaching/Coursepage573... · 2016-04-18 · CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu inwogu@buffalo.edu.

cap.ksu.edu.sa€¦ · cuMucaT1vE SAMPLE OF N. our OF .00000 .90000 .70000 .60000 aea 743 2X2 473 2425 383 230. 89 687.33 9ts.78 344.67 573, t' 8C'.S6 . a—LA . 1030.0

ACADEMIC AFFAIRS COUNCIL - sdbor.edu · 2019-02-15 · GEOG 474-574 primarily focuses on advanced GIS modeling methods and how to use them in real-world GIS applications. GEOG 473-573

FIRAS (473)

MAE 473-573: Lecture #5 - 2D Graphics

Bleach 473

CSE 473/573 RANSAC & Least Squares Devansh Arpit.

CSE 473/573 Computer Vision and Image Processing (CVIP)inwogu/teaching/Coursepage573_fa14/lect… · But derivatives amplify noise • Derivative estimates appear below images –

CSCI 473/573 Human-Centered Robotics - Today at Minesinside.mines.edu/.../Lectures/03-ROS_Overview.pdf · 2020-01-22 · ROS: Robot Operating System 40 Ecosystem: ROS is supported

CSE 473/573 Computer Vision and Image Processing (CVIP)inwogu/teaching/Course... · Joseph Nicéphore Niépce 1826 Reproduction, 1952 . Image formation • Let’s design a camera:

CVIP Spring 2017 Newsletter...CVIP Spring 2017 Newsletter COMMUNITY VOLUNTEERS FOR INTERNATIONAL PROGRAMS 427 N. Shaw Lane, Room 300 F, International Center, MSU, East Lansing, MI

SBE Data Processing - University of British Columbiaswaterma/473-573/Handouts/... · SBE Data Processing CTD Data Processing and Plotting Software for Windows XP, Windows Vista, or

MAE 473-573: Lecture #5 - 2D Graphics€¦ · MAE 473-573: Lecture #5 - 2D Graphics • Overview of computer graphics: End result: A 2D picture on a computer screen Often a projection

Dongqing Chen, Ph.D. Computer Vision & Image Processing (CVIP) Laboratory

JCU 473/573 Narrative Expository PowerPoint

Biology 2 - University of British Columbiaswaterma/473-573/Biology2.pdf · Biology 2 . Plankton = organisms unable to swim against currents. Drifters. (Hensen 1887) Zooplankton =