
Active Learning for Structured Probabilistic Models with Histogram Approximation

Qing Sun1, Ankit Laddha2, Dhruv Batra1

1Virginia Tech. 2Carnegie Mellon University.

Figure 1: Overview of our approach. We begin with a structured probabilistic model (CRF) trained on a small set of labeled images; we then search the large unlabeled pool for a set of informative images to annotate, where our current model is most uncertain, i.e. has highest entropy. Since computing the exact entropy is NP-hard for loopy models, we approximate the Gibbs distribution with a coarsened histogram over M bins. The bins we use are 'circular rings' of varying hamming-ball radii around the highest-scoring solution. This leads to a novel variational approximation of entropy in structured models, and an efficient active learning algorithm.

A number of problems in Computer Vision – image segmentation, geometric labeling, human body pose estimation – can be written as a mapping from an input image x ∈ X to an exponentially large space Y of structured outputs. For instance, in semantic segmentation, Y is the space of all possible (super-)pixel labelings, |Y| = L^n, where n is the number of (super-)pixels and L is the number of object labels that each (super-)pixel can take.

As a number of empirical studies have found [4, 8, 13], the amount of training data is one of the most significant factors influencing the performance of a vision system. Unfortunately, unlike unstructured prediction problems – binary or multi-class classification – data annotation is a particularly expensive activity for structured prediction. For instance, in image segmentation annotations, we must label every (super-)pixel in every training image, which may easily run into millions. In pose estimation annotations, we must label 2D/3D locations of all body parts and keypoints of interest in thousands of images. As a result, modern dataset collection efforts such as PASCAL VOC [3], ImageNet [2], and MS COCO [6] typically involve spending thousands of human-hours and dollars on crowdsourcing websites such as Amazon Mechanical Turk.

Active learning [10] is a natural candidate for reducing annotation efforts by seeking labels only on the most informative images, rather than having the annotator passively label all images, many of which may be uninformative. Unfortunately, active learning for structured-output models is challenging. Even the simplest definition of "informative" involves computing the entropy of the learnt model over the output space:

H(P) = −E_{P(y|x)}[log P(y|x)]          (1a)

     = −∑_{y∈Y} P(y|x) log P(y|x),      (1b)

which is intractable due to the summation over the exponentially-large output space Y.
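To make the intractability concrete, here is a minimal sketch (our own illustration, not code from the paper; the pairwise model, grid size, and label count are assumed) that evaluates Eq. (1) exactly by enumerating all L^n labelings of a tiny pairwise CRF – feasible only for a handful of variables, and hopeless at the scale of real segmentations.

```python
import itertools
import numpy as np

def exact_entropy(unary, pairwise, edges):
    """Exact entropy of a toy pairwise CRF by brute-force enumeration.

    unary:    (n, L) array of unary log-potentials
    pairwise: (L, L) array of pairwise log-potentials shared by all edges
    edges:    list of (i, j) variable-index pairs

    Enumerates all L**n labelings, so it is only feasible for tiny n.
    """
    n, L = unary.shape
    scores = []
    for y in itertools.product(range(L), repeat=n):      # one pass over |Y| = L**n labelings
        s = sum(unary[i, y[i]] for i in range(n))
        s += sum(pairwise[y[i], y[j]] for i, j in edges)
        scores.append(s)
    scores = np.array(scores)
    logZ = np.logaddexp.reduce(scores)                    # log partition function
    logp = scores - logZ                                  # log P(y|x) for every labeling y
    return -np.sum(np.exp(logp) * logp)                   # H(P) = -sum_y P(y|x) log P(y|x)

# A 2x2 'image' with n = 4 pixels and L = 2 labels: only 16 labelings to enumerate.
# At realistic sizes (say n = 500 superpixels, L = 21 labels), |Y| = 21**500 and
# this loop, and hence the exact entropy, is hopelessly out of reach.
rng = np.random.default_rng(0)
unary = rng.normal(size=(4, 2))
pairwise = np.array([[0.5, -0.5], [-0.5, 0.5]])           # Potts-like smoothness term
edges = [(0, 1), (2, 3), (0, 2), (1, 3)]
print("H(P) =", exact_entropy(unary, pairwise, edges))
```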

Overview and Contributions. In this paper, we study active learning for probabilistic models such as Conditional Random Fields (CRFs) that encode probability distributions over an exponentially-large structured output space.

Our main technical contribution is a variational approach [12] for approximate entropy computation in such models.

This is an extended abstract. The full paper is available at the Computer Vision Foundation webpage.

(a) Binary Segmentation   (b) Geometric Labeling

Figure 2: Accuracy vs. the number of images annotated (shaded regions indicate confidence intervals, achieved from 20 and 30 runs respectively). We can see that our approach Active-PDivMAP outperforms all baselines and is very quickly able to reach the same performance as annotating the entire dataset.

Specifically, we present a crude yet surprisingly effective histogram approximation to the Gibbs distribution, which replaces the exponentially-large support with a coarsened distribution that may be viewed as a histogram over M bins. As illustrated in Fig. 1, each bin in the histogram corresponds to a subset of solutions – for instance, all segmentations where the size of the foreground (number of ON pixels) lies in a specific range [L, U]. Computing the entropy of this coarse distribution is simple since M is a small constant (∼10). Importantly, we prove that the optimal histogram, i.e. the one that minimizes the KL-divergence to the Gibbs distribution, is composed of the mass of the Gibbs distribution in each bin, i.e. ∑_{y∈bin} P(y|x). Unfortunately, the problem of estimating sums of the Gibbs distribution under general hamming-ball constraints continues to be #P-complete [11]. Thus, we upper bound the mass of the distribution in a bin by the maximum entry in the bin multiplied by the size of the bin. Fortunately, finding the most probable configuration in a hamming ball has recently been studied in the graphical-models literature [1, 7, 9], and efficient algorithms have been developed, which we use in this work.
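As a rough illustration of this construction, the following toy-scale sketch (our own, with assumed function and variable names) bins the labelings of a handful of binary variables into hamming-distance 'rings' around the MAP solution, upper-bounds each bin's mass by its most probable configuration times its size, and takes the entropy of the resulting M-bin histogram. Brute-force enumeration stands in here for the hamming-ball constrained MAP solvers of [1, 7, 9], and normalizing the bounded masses into a distribution is our simplification.

```python
import itertools
import numpy as np

def histogram_entropy(score_fn, n, M):
    """Entropy of a coarse M-bin histogram over hamming-distance rings.

    score_fn maps a binary labeling y (tuple of 0/1) to its log-potential.
    Enumerates all 2**n labelings, so only usable for tiny toy problems;
    the paper instead finds each bin's best configuration with hamming-ball
    constrained MAP inference.
    """
    labelings = list(itertools.product((0, 1), repeat=n))
    scores = np.array([score_fn(y) for y in labelings])
    logZ = np.logaddexp.reduce(scores)                    # log partition function
    y_map = np.array(labelings[int(np.argmax(scores))])   # MAP labeling

    # Split hamming distances 0..n into M contiguous ranges (the 'rings').
    rings = np.array_split(np.arange(n + 1), M)
    mass = np.zeros(M)
    for b, dists in enumerate(rings):
        in_bin = [k for k, y in enumerate(labelings)
                  if dists[0] <= np.sum(np.array(y) != y_map) <= dists[-1]]
        best_prob = np.exp(scores[in_bin].max() - logZ)   # most probable entry in the bin
        mass[b] = len(in_bin) * best_prob                 # upper bound: max entry x bin size
    q = mass / mass.sum()                                 # normalize the coarse histogram
    return -np.sum(q[q > 0] * np.log(q[q > 0]))           # entropy of the M-bin histogram

# Toy 'segmentation': 8 binary pixels with random unary potentials, M = 4 bins.
rng = np.random.default_rng(0)
theta = rng.normal(size=8)
print(histogram_entropy(lambda y: float(theta @ np.array(y)), n=8, M=4))
```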

We perform experiments on figure-ground image segmentation and coarse 3D geometric labeling [5]. As shown in Fig. 2, our proposed algorithm significantly outperforms a large number of baselines and can help save hours of human annotation effort.

[1] Dhruv Batra, Payman Yadollahpour, Abner Guzman-Rivera, and Greg Shakhnarovich. Diverse M-Best Solutions in Markov Random Fields. In ECCV, 2012.

[2] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. ImageNet: A Large-Scale Hierarchical Image Database. In CVPR, 2009.

[3] M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The PASCAL Visual Object Classes (VOC) Challenge. IJCV, 88(2):303–338, June 2010.

[4] James Hays and Alexei A. Efros. im2gps: estimating geographic information from a single image. In CVPR, 2008.

[5] Derek Hoiem, Alexei A. Efros, and Martial Hebert. Recovering surface layout from an image. IJCV, 75(1), 2007.

[6] Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C. Lawrence Zitnick. Microsoft COCO: Common objects in context, 2014.

[7] Franziska Meier, Amir Globerson, and Fei Sha. The More the Merrier: Parameter Learning for Graphical Models with Multiple MAPs. In ICML Workshop on Inferning: Interactions between Inference and Learning, 2013.

[8] D. Parikh and C. L. Zitnick. The role of features, algorithms and data in visual recognition. In CVPR, 2010.

[9] Adarsh Prasad, Stefanie Jegelka, and Dhruv Batra. Submodular meets structured: Finding diverse subsets in exponentially-large structured item sets. 2014.

[10] B. Settles. Active Learning. Synthesis Lectures on Artificial Intelligence and Machine Learning. Morgan & Claypool, 2012.

[11] Leslie G. Valiant. The complexity of computing the permanent. Theoretical Computer Science, 8(2), 1979.

[12] Martin J. Wainwright and Michael I. Jordan. Graphical models, exponential families, and variational inference. Foundations and Trends in Machine Learning, 1(1-2):1–305, 2008.

[13] Xiangxin Zhu, Carl Vondrick, Deva Ramanan, and Charless Fowlkes. Do we need more training data or better models for object detection? In BMVC, 2012.