MKL for Category Recognition

Post on 13-Jan-2016


MKL for Category Recognition

Kumar Srijan, Syed Ahsan Ishtiaque

Dataset

• 19 categories considered
• Currently:
  – Minimum of 58 images in each
  – Average of 101 images
• The images have been taken from Google Images (http://images.google.com) and supplemented by images from Flickr (http://flickr.com).


Code Walkthrough – Relevant Files

• preprocCal101.sh – Rescales images and renames them according to the code
• cal_preprocDatabases.m – Builds the image and ROI databases
• cal_preprocVocabularies.m – Prepares the visual word vocabularies
• cal_preprocFeatures.m – Computes features for all the images, projects them onto visual words and builds map files for each
• cal_preprocHistograms.m – Prepares histograms for the visual words
• cal_preprocKernels.m – Computes training and testing kernel matrices
• cal_classAll.m – Final classification

Code Walkthrough

Construct Visual Words – cal_preprocVocabularies

Calculating Local Descriptors


Vector quantization – bk_calcVocabulary
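The vocabulary construction and quantization steps can be sketched as follows (a minimal Python/NumPy illustration of plain k-means, not the toolkit's bk_calcVocabulary code; the descriptor data and function names are stand-ins):

```python
import numpy as np

def build_vocabulary(descriptors, vocab_size, iters=20, seed=0):
    """Cluster local descriptors into vocab_size visual words (plain k-means)."""
    rng = np.random.default_rng(seed)
    centers = descriptors[rng.choice(len(descriptors), vocab_size, replace=False)]
    for _ in range(iters):
        # assign each descriptor to its nearest center
        d = ((descriptors[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
        labels = d.argmin(1)
        # move each center to the mean of its assigned descriptors
        for k in range(vocab_size):
            if (labels == k).any():
                centers[k] = descriptors[labels == k].mean(0)
    return centers

def quantize(descriptors, centers):
    """Project descriptors onto the vocabulary: index of the nearest visual word."""
    d = ((descriptors[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
    return d.argmin(1)

# toy data: 200 random 8-D "descriptors", a 10-word vocabulary
X = np.random.default_rng(1).normal(size=(200, 8))
vocab = build_vocabulary(X, 10)
words = quantize(X, vocab)
```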

Calculate Image Descriptors – bk_calcFeatures

Preparing Database (separate training and testing images, define regions of interest and add jitters to them) – cal_preprocDatabases

Compute the features for all the images, project them onto visual words, and produce map files for each – cal_preprocFeatures

Compute and quantize descriptors for training and test images – bk_calcFeatures

Prepare the visual word histograms – cal_preprocHistograms
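A sketch of the histogram step, including the spatial pyramid later controlled by pyrLevels (Python/NumPy illustration; the grid conventions and function name are assumptions, not taken from cal_preprocHistograms):

```python
import numpy as np

def pyramid_histogram(words, xs, ys, width, height, vocab_size, levels=2):
    """Concatenate visual-word histograms over a spatial pyramid:
    level l splits the image into 2^l x 2^l cells (levels=2 -> 1+4+16 cells)."""
    hists = []
    for l in range(levels + 1):
        n = 2 ** l
        for i in range(n):
            for j in range(n):
                in_cell = ((xs * n // width) == i) & ((ys * n // height) == j)
                hists.append(np.bincount(words[in_cell],
                                         minlength=vocab_size).astype(float))
    h = np.concatenate(hists)
    return h / max(h.sum(), 1.0)  # L1-normalize the stacked histogram

# toy data: 300 quantized features with pixel locations in a 640x480 image
rng = np.random.default_rng(0)
words = rng.integers(0, 50, size=300)
xs, ys = rng.integers(0, 640, 300), rng.integers(0, 480, 300)
h = pyramid_histogram(words, xs, ys, 640, 480, vocab_size=50, levels=2)
```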

Compute Training and Testing Kernel Matrices – cal_preprocKernels

Run on all categories – cal_classAll

Train SVM with MKL (one vs. rest classifiers) – bk_trainAppModel

Evaluate SVM on test data – bk_testAppModel

Documentation for modifications and adjustment of parameters for code execution

Changing the number of Training and testing images

• Default value is 15.
• Change drivers/cal_filenames.txt accordingly – this file contains the names of the images for each category to be processed as training or testing images.

Changing the number of Training images

• To change the number of final training images, which includes jittered images: in the drivers/cal_conf.m file, change conf.numPos to the desired value.
• To change the number of initial training images (without jitters), which are input to the code: in drivers/cal_preprocDatabases.m, change

    if ni <= 15 % Hard Coded

  to

    if ni <= conf.numPos % Changed; set conf.numPos to your desired value

  so that the block reads:

    if ni <= conf.numPos
      imdb.images(ii).set = imdb.sets.TRAIN ;
    else
      imdb.images(ii).set = imdb.sets.TEST ;
    end

Changing the number of test images

• In drivers/cal_setupTrainTest.m:

    for cl = fieldnames(roidb.classes)'
      selCla = findRois(testRoidb, 'class', char(cl)) ;
      keep(selCla(1 : min(15, length(selCla)))) = true ; % Hard Coded
    end

  change the hard-coded line to

    keep(selCla(1 : min(conf.numPos, length(selCla)))) = true ; % Changed
    % you can change conf.numPos to the desired value

Adding a new Feature

• In drivers/cal_conf.m:
  – Add the feature name to conf.featNames.
  – Specify the properties and parameters for that feature as conf.feat.<your_feature_name>.<parameter>.
• Add your extractFn, quantizeFn and clusterFn in the features directory (check the input and output format for each).

Parameters

• Parameters should include:
  – format – dense or sparse
    • In the dense format, one stores features on a grid, specifying the x and y pixel coordinates of each column/row of the grid. One then stores an "image" whose pixels correspond to grid elements and specify the corresponding visual words.
    • In the sparse format, one stores a list of visual words and their x, y locations in the image.
  – extractFn – pointer to the function called to extract the feature
  – clusterFn – pointer to the clustering (k-means) function
  – quantizeFn – pointer to the function used to project onto the k-means clusters
  – vocabSize – k-means vocabulary size (number of visual words)
  – numImagesPerClass – number of images per class used to sample features to train the vocabulary with k-means
  – numFeatsPerImage – number of features per image sampled to train the vocabulary with k-means
  – compress – "false" generally
  – pyrLevels – pyramid levels used when building histograms based on this feature
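The dense and sparse formats described above can be illustrated with two tiny containers (a Python sketch with hypothetical names, not the toolkit's MATLAB structures):

```python
import numpy as np

# Sparse format: a list of visual words with their (x, y) image locations.
sparse = np.array([(12, 40, 3), (100, 55, 17), (200, 220, 3)],
                  dtype=[("x", int), ("y", int), ("word", int)])

# Dense format: grid coordinates plus a word "image" over the grid
# (one visual word per grid cell, 8-pixel grid step assumed here).
grid_x = np.arange(0, 320, 8)   # x pixel coordinate of each grid column
grid_y = np.arange(0, 240, 8)   # y pixel coordinate of each grid row
word_image = np.random.default_rng(0).integers(
    0, 300, size=(len(grid_y), len(grid_x)))

# Look up the word at a pixel: index the grid (dense) vs. scan the list (sparse).
wx, wy = 100, 55
dense_word = word_image[wy // 8, wx // 8]
sparse_word = sparse["word"][(sparse["x"] == wx) & (sparse["y"] == wy)][0]
```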

Changing Jitters

• Jitters are basic modifications (zooming, flipping and rotating) of an image. In the code they are used to create more training data out of the basic training data, which helps to increase the accuracy.
• The jitters currently supported are rp5, rm5, fliplr, fliplr_rp5, fliplr_rm5, zm1 and zm2 – all of these are modifications of zoom, rotate and flip only.
• To change the jitters to be used, change conf.jitterNames accordingly in the drivers/cal_conf.m file.
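Reading the names above as left-right flips, small rotations (rp5/rm5 presumably ±5 degrees) and zooms, the flip and zoom jitters can be sketched as follows (a Python stand-in on a plain image array, not the toolkit's jitter code; rotation is omitted to keep the sketch dependency-free):

```python
import numpy as np

def jitter_fliplr(img):
    """Left-right flip (the fliplr jitter)."""
    return img[:, ::-1]

def jitter_zoom(img, factor=1.2):
    """Zoom in by cropping the central 1/factor region (a simple stand-in
    for the zm1/zm2 zoom jitters; no resampling back to the original size)."""
    h, w = img.shape[:2]
    ch, cw = int(h / factor), int(w / factor)
    top, left = (h - ch) // 2, (w - cw) // 2
    return img[top:top + ch, left:left + cw]

img = np.arange(100 * 80).reshape(100, 80)   # toy 100x80 "image"
flipped = jitter_fliplr(img)
zoomed = jitter_zoom(img)
```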

Changing Features

• The features currently supported are:
  – gb – sparse Geometric Blur words
  – gist
  – bow – sparse SIFT words, bag of words
  – phog180, phog360 – dense edge-based shape
  – phowColor, phowGray – dense SIFT words
• To change the features to be used, change conf.featNames accordingly in the drivers/cal_conf.m file.
• When using the bow feature, also run cal_preprocDiscrimScores after the cal_preprocFeatures step.

Changing the weight learning method

• The weight learning methods currently supported are:
  – Manik
  – equalMean – the weights are set to the inverse of the average of the kernel matrices. This is a simple heuristic whose only purpose is to "balance" the kernels when they are combined additively.
• To change the weight learning method, change conf.learnWeightMethod accordingly in the drivers/cal_conf.m file.
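Reading equalMean as one scalar weight per kernel (the reciprocal of that kernel's mean entry), the heuristic can be sketched as follows (a Python illustration of that reading, not the toolkit's code):

```python
import numpy as np

def equal_mean_weights(kernels):
    """One weight per kernel: the inverse of its mean entry, so that each
    kernel contributes on a comparable scale when combined additively."""
    return np.array([1.0 / K.mean() for K in kernels])

def combine(kernels, weights):
    """Weighted additive combination of the kernel matrices."""
    return sum(w * K for w, K in zip(weights, kernels))

rng = np.random.default_rng(0)
K1 = rng.random((5, 5)) * 10.0   # a "large-scale" kernel
K2 = rng.random((5, 5)) * 0.1    # a "small-scale" kernel
w = equal_mean_weights([K1, K2])
K = combine([K1, K2], w)         # both kernels now have mean entry 1
```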

Obtaining Results

• Calculate SVM score for the image for all the classes.

• The image is assigned the class which has the highest score.

• Use this information to create the confusion matrix.

• Use confusion matrix to calculate the final accuracy.
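Applied to the 10-class confusion matrix reported under Code Execution - I below, these steps reproduce the 61% figure:

```python
import numpy as np

# Rows: true class, columns: predicted class (Code Execution - I, 15 test
# images per class).
classes = ["BADGE", "BULB", "CAMERA", "CELL", "FROG", "HORSE",
           "KEYBOARD", "KINGFISHER", "LOCKET", "MOON"]
conf = np.array([
    [8, 0, 1, 0, 2, 0, 1, 0, 2, 1],
    [0, 10, 0, 0, 0, 1, 0, 1, 1, 2],
    [1, 1, 8, 2, 1, 0, 1, 0, 1, 0],
    [0, 2, 0, 7, 1, 1, 0, 2, 2, 0],
    [0, 1, 0, 0, 7, 3, 1, 2, 1, 0],
    [0, 0, 0, 0, 6, 9, 0, 0, 0, 0],
    [0, 1, 0, 0, 1, 0, 13, 0, 0, 0],
    [0, 0, 0, 0, 5, 2, 0, 7, 1, 0],
    [1, 2, 0, 0, 1, 0, 0, 0, 9, 2],
    [1, 0, 0, 0, 0, 0, 0, 1, 0, 13],
])
# Final accuracy: correctly classified images (diagonal) over all test images.
accuracy = np.trace(conf) / conf.sum()
print(round(100 * accuracy, 1))   # 60.7, reported as 61%
```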

Code Execution - I

• In the current execution, we have taken 10 classes: Badge, Bulb, Camera, Cell, Frog, Horse, Keyboard, Kingfisher, Locket, Moon.
• 15 train + 15 test images were used for the execution of the code.

Kernel Matrices

[Figure: kernel matrices echi2_phowGray_L0, echi2_phowGray_L1, echi2_phowGray_L2 and el2_gb]

Aggregate SVM Scores

[Figure: aggregate SVM scores, test images vs. categories; color scale from lowest to highest score]

Confusion matrix (rows: true class, columns: predicted class)

             BADGE  BULB  CAMERA  CELL  FROG  HORSE  KEYBOARD  KINGFISHER  LOCKET  MOON
BADGE            8     0       1     0     2      0         1           0       2     1
BULB             0    10       0     0     0      1         0           1       1     2
CAMERA           1     1       8     2     1      0         1           0       1     0
CELL             0     2       0     7     1      1         0           2       2     0
FROG             0     1       0     0     7      3         1           2       1     0
HORSE            0     0       0     0     6      9         0           0       0     0
KEYBOARD         0     1       0     0     1      0        13           0       0     0
KINGFISHER       0     0       0     0     5      2         0           7       1     0
LOCKET           1     2       0     0     1      0         0           0       9     2
MOON             1     0       0     0     0      0         0           1       0    13

Confusion Matrix

[Figure: confusion matrix heat map, category vs. category; color scale from lowest to highest score]

Analysis

• Overall accuracy is 61%.
• Moon and keyboard have very high classification rates – they have relatively low intraclass variance.

• Cell phone, frog and kingfisher have very low classification rates.

• There is appreciable confusion in horse vs. frog and kingfisher vs. frog. These are found in natural surroundings, possibly creating the confusion.

• Artificial objects don’t get confused with natural ones very frequently.

Code Execution - II

• In this execution, we have taken 19 classes: badge, bulb, camera, cell, frog, horse, keyboard, kingfisher, locket, moon, owl, photo, piggy, pliers, remote, shirt, shoe, spoon, sunflower.
• 15 train + 15 test images were used for the execution of the code.

Kernel Matrices

[Figure: kernel matrices echi2_phowGray_L0, echi2_phowGray_L1, echi2_phowGray_L2 and el2_gb]

Aggregate SVM Scores

[Figure: aggregate SVM scores, test images vs. categories; color scale from lowest to highest score]

Confusion Matrix

[Figure: confusion matrix heat map, category vs. category; color scale from lowest to highest score]

Analysis

• Overall accuracy is 50.5% (lower than in the 10-category classification).

• Moon, keyboard and shirt have very high classification rate.

• Cell phone, frog and kingfisher have very low classification rates.

• There is appreciable confusion in photo-frame vs. cell phone and kingfisher vs. frog.

Analysis

• The classification of bulb was good in the 10-category case, but was very bad in the 19-category case.

• Similar-looking objects (low interclass difference) like camera, cell phone, remote control and photo frame are more likely to get confused amongst themselves than with other groups.

Code Execution - III

• In this execution, we have taken 19 classes: badge, bulb, camera, cell, frog, horse, keyboard, kingfisher, locket, moon, owl, photo, piggy, pliers, remote, shirt, shoe, spoon, sunflower.
• 20 train + 15 test images were used for the execution of the code.

Kernel Matrices

[Figure: kernel matrices echi2_phowGray_L0, echi2_phowGray_L1, echi2_phowGray_L2 and el2_gb]

Aggregate SVM Scores

[Figure: aggregate SVM scores; color scale from lowest to highest score]

Confusion Matrix

[Figure: confusion matrix heat map; color scale from lowest to highest score]

Analysis

• Overall accuracy is 53.3% (slightly higher than when 15 training images were taken per category).

• Moon, keyboard and shirt have very high classification rate.

• Cell phone, frog and kingfisher have very low classification rates.

• There is appreciable confusion in photo-frame vs. cell phone and kingfisher vs. frog.

Analysis

• The number of correct classifications for photo frame increased from 2 to 10 (out of 15).
• The number of correct classifications for piggy bank decreased from 10 to 5 (out of 15).
• For objects with low intra-class variation (moon), the classification error has increased.
• For objects with high intra-class variation (photo frame), the classification error has decreased significantly.

• Increasing the number of training images did not significantly increase accuracy in the case of classes with low inter-class variability.

Code Execution - IV

• 19 classes considered – Badge, Bulb, Camera, Cell Phone, Frog, Horse, Keyboard, Kingfisher, Locket, Moon, Owl, Photo Frame, Piggy Bank, Pliers, Remote Control, Shirt, Shoe, Spoon and Sunflower.
• Features used are phog180 and phog360.
• No jitters are used.
• Kernel type is echi2.
• 25 train + 15 test images were used for the execution of the code.

Kernel Matrices

[Figure: kernel matrices]

Aggregate SVM Scores

[Figure: aggregate SVM scores; color scale from lowest to highest score]

Confusion Matrix

[Figure: confusion matrix heat map; color scale from lowest to highest score]

Analysis

• Overall accuracy was 45.6 percent.
• In spite of having more training images, the accuracy decreased.
• This shows that jitter helps in better training and, in turn, better accuracy.

Code Execution - V

• 10 classes considered – Badge, Bulb, Camera, Cell Phone, Frog, Horse, Keyboard, Kingfisher, Locket and Moon.
• Kernel type is echi2.
• 25 train and 25 test images are used for execution of the code.
• The code is run on the new test feature "lowesift".
• This feature is similar to the bow feature; instead of using Laplace-Harris for calculating interest points, it uses the SIFT detector itself.
• In our case, we used both David Lowe's and Andrea Vedaldi's implementations to test the feature.
• Since the feature is similar to the bow feature, the clusterFn and quantizeFn of bow were used.

Kernel Matrices

[Figure: kernel matrices]

Aggregate SVM Scores

[Figure: aggregate SVM scores; color scale from lowest to highest score]

Confusion Matrix

[Figure: confusion matrix heat map; color scale from lowest to highest score]

Analysis

• Overall accuracy is 47.2 percent.
• Keyboard is the most correctly classified class, because of its low intraclass variance.
• Cell phone, Badge and Bulb have the least accuracy.
• There is appreciable confusion between Badge and Locket, as these two classes are very similar.
• Kingfisher got confused with Horse and Locket; this did not happen in earlier executions.