SVCL Automatic detection of object based Region-of-Interest for image compression Sunhyoung Han.

Automatic detection of object-based Region-of-Interest for image compression

Sunhyoung Han

Transmission over an erroneous channel

Basic Motivation

Spatially different super resolution

Constraints: limited resources & channel errors

Basic motivation

By having information about the importance of regions, one can wisely use the limited resources

User-adaptive Coder

• visual concepts of interest can be anything
• main idea:
• let users define a universe of objects of interest
• train saliency detector for each object
• e.g. regions of “people”, “the Capitol”, “trees”, etc.

User Adaptive Coder

[Figure labels: query provided by user; train detector; current training sets]

User-adaptive coder

• user-adaptive coder:
  – detector should be generic enough to handle large numbers of object categories
  – training needs to be reasonably fast (including example preparation time)

[Figure: example categories “face”, “lamp”, “car”]

User-adaptive coder

• proposed detector
  – top-down object detector (object category specified by user)
  – focus on weak supervision instead of highly accurate localization
  – composed of saliency detection and saliency validation
  – discriminant saliency:

[Figure labels: training; find best features; saliency filters]

Discriminant Saliency

• start from a universe of classes (e.g. “faces”, “trees”, “cars”, etc.)
• design a dictionary of features: e.g. linear combinations of DCT coefficients at multiple scales
• salient features: those that best distinguish the object class of interest from random background scenes
• salient regions are the regions of the image where these detectors have strong response
• see [Gao & Vasconcelos, NIPS, 2004].
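The feature selection step described above can be sketched as follows. This is a minimal illustration, assuming that "best distinguish" is measured by the mutual information between a quantized feature response and a binary class label (object vs. background). The histogram estimator, the bin count, and the function names `mutual_information` and `select_salient_features` are illustrative assumptions, not taken from the slides.

```python
import numpy as np

def mutual_information(responses, labels, bins=8):
    """Estimate I(X; Y) in bits between one feature's responses and 0/1 labels."""
    # Quantize the responses into equal-width bins over their observed range.
    edges = np.histogram_bin_edges(responses, bins=bins)
    idx = np.clip(np.digitize(responses, edges[1:-1]), 0, bins - 1)
    # Empirical joint distribution of (bin, label).
    joint = np.zeros((bins, 2))
    np.add.at(joint, (idx, np.asarray(labels, dtype=int)), 1.0)
    joint /= joint.sum()
    px = joint.sum(axis=1, keepdims=True)   # marginal over bins
    py = joint.sum(axis=0, keepdims=True)   # marginal over labels
    nz = joint > 0
    return float(np.sum(joint[nz] * np.log2(joint[nz] / (px @ py)[nz])))

def select_salient_features(feature_matrix, labels, n_keep):
    """Rank features (columns) by mutual information with the class label."""
    scores = np.array([mutual_information(feature_matrix[:, k], labels)
                       for k in range(feature_matrix.shape[1])])
    order = np.argsort(scores)[::-1]         # most discriminant first
    return order[:n_keep], scores
```

Each row of `feature_matrix` would hold the responses of all dictionary features at one image location, with label 1 for the object class and 0 for background.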

Top-down Discriminant Saliency Model

[Figure: block diagram with stages labeled Faces, Background, Original Feature Set, Discriminant Feature Selection, Salient Features, Scale Selection, Malik-Perona pre-attentive perception model, Saliency Map]

$k^* = \arg\max_k I(X_k; Z)$

• saliency detector
• salient point $sal_i$:
  – magnitude (indexed by i)
  – location $l_i$
  – scale $s_i$
• saliency map approximated by a Gaussian mixture
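The Gaussian-mixture approximation of the saliency map can be sketched as below. The slides do not say how each salient point's magnitude and scale enter the mixture, so this sketch assumes one isotropic Gaussian per point, with standard deviation equal to the scale and mixture weight equal to the normalized magnitude. The function name `saliency_map_from_points` is hypothetical.

```python
import numpy as np

def saliency_map_from_points(shape, locations, scales, magnitudes):
    """Render salient points as a weighted sum of isotropic 2-D Gaussians.

    Assumptions (not from the slides): the standard deviation of each
    Gaussian equals the point's scale, and the mixture weights are the
    magnitudes normalized to sum to one.
    """
    h, w = shape
    ys, xs = np.mgrid[0:h, 0:w]
    weights = np.asarray(magnitudes, dtype=float)
    weights = weights / weights.sum()
    sal = np.zeros(shape, dtype=float)
    for (cy, cx), s, a in zip(locations, scales, weights):
        d2 = (ys - cy) ** 2 + (xs - cx) ** 2
        # Normalized Gaussian density centered on the salient point.
        sal += a * np.exp(-0.5 * d2 / s**2) / (2.0 * np.pi * s**2)
    return sal
```

With this normalization the map sums to approximately one when the points lie well inside the image, so it can be read as a probability map over pixel locations.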

Saliency representation

[Figure panels: image; saliency map; salient points; probability map]

Saliency validation

• saliency detection:
  – due to limited feature dictionary and/or limited training set
  – coarse detection of object class of interest
• need to eliminate false positives
• saliency validation:
  – geometric consistency
  – reject salient points whose spatial configuration is inconsistent with training examples

[Figure: example of saliency map. Panels: original image; saliency map for ‘street sign’]

Saliency validation

• learning a geometric model of salient point configuration
• two components:
  - image alignment
  - configuration model
• model:
  - classify points into
    • true positives
    • false positives
  - model each as Gaussian

Saliency validation

• model: two classes of points, Y ∈ {0, 1}
  – Y = 1: true positive
  – Y = 0: false positive
• saliency map: mixture of true and false positive saliency distributions
• each distribution approximated by a Gaussian
• this is a two-class clustering problem
  – can be solved by expectation-maximization
• graphical model
• non-standard issues
  – we start from distributions, not points
  – alignment does not depend on false negatives
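For orientation, the two-class clustering step can be illustrated with plain EM on 2-D points. This is a simplified sketch, not the variant described in the slides: it works on points rather than distributions and omits the per-example alignment. The initialization (one tight and one broad component sharing the overall mean) and the names `gaussian_pdf` and `em_two_class` are assumptions made for the example.

```python
import numpy as np

def gaussian_pdf(x, mean, cov):
    """Density of a 2-D Gaussian evaluated at each row of x."""
    diff = x - mean
    inv = np.linalg.inv(cov)
    expo = -0.5 * np.einsum("ni,ij,nj->n", diff, inv, diff)
    return np.exp(expo) / (2.0 * np.pi * np.sqrt(np.linalg.det(cov)))

def em_two_class(points, n_iter=50):
    """Fit a two-component Gaussian mixture to 2-D points with plain EM.

    Index 1 plays the role of true positives (tight cluster) and
    index 0 the role of false positives (broad cluster).
    """
    center = points.mean(axis=0)
    base = np.cov(points.T) + 1e-6 * np.eye(2)
    # Initialization (an assumption): same mean, broad class 0, tight class 1.
    means = np.stack([center, center])
    covs = np.stack([4.0 * base, 0.25 * base])
    prior = np.array([0.5, 0.5])
    for _ in range(n_iter):
        # E-step: posterior responsibility of each class for each point.
        lik = np.stack([prior[j] * gaussian_pdf(points, means[j], covs[j])
                        for j in range(2)], axis=1)
        h = lik / lik.sum(axis=1, keepdims=True)
        # M-step: re-estimate class priors, means and covariances.
        nj = h.sum(axis=0)
        prior = nj / len(points)
        for j in range(2):
            means[j] = (h[:, j, None] * points).sum(axis=0) / nj[j]
            diff = points - means[j]
            outer = np.einsum("ni,nj->nij", diff, diff)
            covs[j] = (h[:, j, None, None] * outer).sum(axis=0) / nj[j]
            covs[j] += 1e-6 * np.eye(2)   # keep covariances invertible
    return prior, means, covs, h
```

The returned responsibilities `h[:, 1]` give a soft true-positive assignment for every point; comparing the two columns of `h` yields a hard classification.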

Saliency validation

E-step / M-step

L ~ uniform
Y ~ Bernoulli($\pi_1$)
C | Y = i ~ multinomial (parameter indexed by i)
X | Y = i, L = l, S = s, $\delta$ ~ G(x, l − $\delta$, ·)

Saliency Validation

For K training examples (the number of salient points is $N_k$ for the kth example):
• Missing data: Y = j, j ∈ {1, 0}
• Parameters:
  – $\pi_j$ (probability of class j)
  – $\Sigma_j$ (covariance of class j)
  – $\delta_k$ (displacement of the kth example)

For robust update

DERIVATION DETAILS

Saliency Validation

• visualization of EM algorithm

[Figure panels: saliency detection result; initial salient points overlapped over 40 samples; overlapped points classified as “object”, with visualized variance $\Sigma_1$; overlapped points classified as “noise”, with visualized variance $\Sigma_0$]

Saliency Validation

• examples of classified points
• in summary, during training we learn
  – discriminant features
  – the “right” configuration of salient points

[Figure: examples of classified salient points; white if $h_{ij}^1 > h_{ij}^0$, black otherwise]

Region of interest detection

• find image window that best matches the learned configuration

• mathematically: find the location p where the posterior probability of the object class is the largest

Region of interest detection

• by Bayes rule
  – Posterior ∝ Likelihood × Prior
  – likelihood is given by matching saliencies within the window & the model
  – prior measures the saliency mass inside the window
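The search for the best window location can be sketched as an exhaustive scan that scores each candidate center by log-likelihood plus log-prior. The exact likelihood and prior used in the slides are not recoverable from the transcript, so this sketch makes its own assumptions: true positives follow a Gaussian centered on the candidate location, false positives follow a uniform density over the image, and the prior is the fraction of saliency mass inside a square window. The name `locate_roi` and the parameters `half` and `p_obj` are hypothetical.

```python
import numpy as np

def locate_roi(points, weights, sal_map, cov, half, p_obj=0.5):
    """Return the window center maximizing log-likelihood + log-prior.

    points  : (N, 2) salient point locations as (row, col)
    weights : (N,) salient point magnitudes
    sal_map : 2-D saliency map
    cov     : 2x2 covariance of the learned true-positive configuration
    half    : half-size of the square window used for the prior
    p_obj   : assumed probability that a point is a true positive
    """
    h, w = sal_map.shape
    inv = np.linalg.inv(cov)
    norm = 2.0 * np.pi * np.sqrt(np.linalg.det(cov))
    background = (1.0 - p_obj) / (h * w)   # uniform false-positive density
    total = sal_map.sum()
    best_score, best_p = -np.inf, None
    for r in range(half, h - half):
        for c in range(half, w - half):
            diff = points - np.array([r, c], dtype=float)
            g = np.exp(-0.5 * np.einsum("ni,ij,nj->n", diff, inv, diff)) / norm
            # Likelihood: how well all points match the model centered at (r, c).
            log_lik = np.sum(weights * np.log(p_obj * g + background))
            # Prior: fraction of saliency mass that falls inside the window.
            window = sal_map[r - half:r + half + 1, c - half:c + half + 1]
            score = log_lik + np.log(window.sum() / total + 1e-12)
            if score > best_score:
                best_score, best_p = score, (r, c)
    return best_p, best_score
```

The double loop keeps the example readable; for full-size images the prior term could be computed once with an integral image.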

Region of Interest Detection

• given the model
  – the likelihood, under it, of a set of points drawn from the observed saliency distribution is
  – and the optimal location is given by

Prior for location P with saliency detector

DERIVATION DETAILS

1. Measure configuration matching
2. Determine scale (shape) of ROI mask: the observation ($\Sigma^*$) from data and the prior ($\Sigma_1$) from training data are used
3. Threshold $P_{Y|X,P}(1 \mid x, p^*)$ to get binary ROI mask

** Once the center point is known, the assignment of each point is given by

The observed configuration for Y = 1 is

[Figure labels: $\Sigma_1$, $\Sigma^*$]
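The final thresholding step can be sketched as follows: given a center and a covariance for the object class, compute a per-pixel posterior of belonging to the object and threshold it. How the slides combine the observed and prior covariances is not recoverable from the transcript, so the sketch takes a single covariance `cov` as input. The uniform background density, the default `p_obj` and `threshold` values, and the name `roi_mask` are assumptions.

```python
import numpy as np

def roi_mask(shape, center, cov, p_obj=0.5, threshold=0.5):
    """Threshold a per-pixel object posterior to obtain a binary ROI mask."""
    h, w = shape
    ys, xs = np.mgrid[0:h, 0:w]
    diff = np.stack([ys - center[0], xs - center[1]], axis=-1).astype(float)
    inv = np.linalg.inv(cov)
    maha = np.einsum("hwi,ij,hwj->hw", diff, inv, diff)
    # Object class: Gaussian around the detected center.
    g = np.exp(-0.5 * maha) / (2.0 * np.pi * np.sqrt(np.linalg.det(cov)))
    # Background class: uniform density over the image (an assumption).
    background = 1.0 / (h * w)
    post = p_obj * g / (p_obj * g + (1.0 - p_obj) * background)
    return post >= threshold, post
```

Because the covariance shapes the posterior, an elongated covariance produces an elongated mask, which is how the scale and shape of the ROI follow from the estimated configuration.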

• Example of ROI Detection

[Figure panels: saliency detection (for Statue of Liberty); probability map (saliency only); probability map (with configuration info.); ROI mask]

Evaluation

• Using the CalTech “Face” database & UIUC “Car side” database
• Evaluate robustness of learning
  – dedicated training set vs. web training set
• Evaluation metrics
  – area under the ROC curve
  – PSNR gain for ROI coding vs. normal coding
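Both metrics are standard and can be computed as in the sketch below. PSNR is optionally restricted to an ROI mask, and the ROC area is computed with the rank (pairwise comparison) formulation. The function names `psnr` and `roc_area`, and the 8-bit peak value of 255, are assumptions for illustration; the slides do not describe the evaluation code.

```python
import numpy as np

def psnr(reference, decoded, mask=None, peak=255.0):
    """Peak signal-to-noise ratio in dB, optionally only inside a boolean mask."""
    ref = np.asarray(reference, dtype=float)
    dec = np.asarray(decoded, dtype=float)
    err = (ref - dec) ** 2
    mse = err[mask].mean() if mask is not None else err.mean()
    return float("inf") if mse == 0 else float(10.0 * np.log10(peak**2 / mse))

def roc_area(scores, labels):
    """Area under the ROC curve as the fraction of correctly ordered pairs."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    pos, neg = scores[labels], scores[~labels]
    greater = (pos[:, None] > neg[None, :]).sum()
    ties = (pos[:, None] == neg[None, :]).sum()
    # Ties between a positive and a negative count as half a correct pair.
    return float((greater + 0.5 * ties) / (len(pos) * len(neg)))
```

The PSNR gain of ROI coding would then be the difference between `psnr(original, roi_coded, mask)` and `psnr(original, normally_coded, mask)` at the same bit rate.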

Number of positive examples: 550

Number of positive examples: 100

Evaluation

• area under the ROC curve

[Figure: ROC curves for “Car” and “Face”; horizontal axis: False Positive]

Evaluation

• PSNR performance comparison

[Figure: PSNR plots for “Car” and “Face”; horizontal axis: Bit Per Pixel]

For the same image quality, 14.3% of the bits can be saved even in the web-train uniform case

Result Examples

Result: comparison of the bits needed to get the same PSNR (30 dB) for the ROI

At best, ¼ of the bits are enough to get the same quality for the ROI area

Result Examples

[Figure panels: normal coding; ROI coding]

EM derivation

• Want to fit lower level observation
• For a virtual sample X = {$X_{ik}$ | i = 1, …, $N_k$ and k = 1, …, K} with size $M_{ik}$ (a factor indexed by ik, times N), the likelihood becomes
• For the complete set, the log likelihood becomes

EM derivation

Maximization in the M-step is carried out by maximizing the Lagrangian
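As a worked illustration of this step, the textbook Lagrangian for the class probabilities of a two-class mixture is shown below. It is a generic sketch rather than the slide's own derivation: it omits the virtual-sample sizes and the displacement terms, and the notation ($h_n^j$ for the E-step responsibility of class $j$ for observation $n$, $\pi_j$ for the class probability, $\lambda$ for the multiplier) is assumed.

```latex
\begin{align}
\mathcal{L}(\pi, \lambda) &= \sum_{n} \sum_{j \in \{0,1\}} h_n^j \log \pi_j
  + \lambda \Bigl(1 - \sum_{j \in \{0,1\}} \pi_j\Bigr) \\
\frac{\partial \mathcal{L}}{\partial \pi_j} &= \frac{1}{\pi_j} \sum_{n} h_n^j - \lambda = 0
  \;\Rightarrow\; \pi_j = \frac{1}{\lambda} \sum_{n} h_n^j \\
\sum_{j} \pi_j = 1 &\;\Rightarrow\; \lambda = \sum_{n} \sum_{j} h_n^j
  \;\Rightarrow\; \pi_j = \frac{\sum_{n} h_n^j}{\sum_{n} \sum_{j'} h_n^{j'}}
\end{align}
```

The multiplier only enforces that the class probabilities sum to one, so the update reduces to the normalized sum of responsibilities.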

ROI Detection

For one sample point $x_1$

For samples having distribution of

ROI Detection

Therefore,
