ICML 2016: The Information Sieve

The Information Sieve
Greg Ver Steeg and Aram Galstyan

Soup = data

“Main ingredient” extracted at each layer

Factorial code

• Carry recipe instead of soup
• Missing ingredients?
• Make more soup

• Compression
• Prediction
• Generative model

Recipe
- Ingredient 1
- Ingredient 2
- …

An invertible transform that makes the components independent
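In symbols (notation mine, not from the slide): a factorial code is an invertible map

$$Y = f(X), \qquad p(y) = \prod_i p(y_i),$$

i.e. the transformed components carry all of the information in X while being mutually independent.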

Finding such a transform is, in general, an intractable problem. We use a sequence of transforms that incrementally removes dependence.

Two Steps
1. Find the most informative function of the input data, call it Y_k
2. Transform the data to remove the information in Y_k, and then repeat
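Sketched symbolically (a paraphrase of the paper, where X^{k-1} is the data entering layer k and X^k the remainder leaving it; the explained total correlation TC is defined below):

$$Y_k = f_k\!\left(X^{k-1}\right) \ \text{chosen to maximize}\ TC\!\left(X^{k-1}; Y_k\right), \qquad X^{k} = \text{remainder}\!\left(X^{k-1}, Y_k\right).$$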

(Figure: the main ingredient is sifted out of the soup.)

The main ingredient: multivariate information
• Multivariate mutual information, or Total Correlation (Watanabe, 1960)

• TC(X|Y) = 0 if and only if Y “explains” all the dependence in X
• So we search for Y that minimizes TC(X|Y)
• Equivalently, we define the total correlation explained by Y as TC(X;Y):
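Spelled out (these are the standard definitions: Watanabe's total correlation and its conditional version as used in the CorEx papers):

$$TC(X) = \sum_{i=1}^{n} H(X_i) - H(X) = D_{KL}\!\left( p(x) \,\|\, \prod_i p(x_i) \right), \qquad TC(X \mid Y) = \sum_{i} H(X_i \mid Y) - H(X \mid Y),$$

$$TC(X;Y) \;=\; TC(X) - TC(X \mid Y).$$

TC(X) = 0 exactly when the X_i are independent, and maximizing the explained correlation TC(X;Y) is the same as minimizing the unexplained dependence TC(X|Y).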

The main ingredient: Total Correlation Explanation (CorEx)

• Optimize over all probabilistic functions
• Solution has special form that makes it tractable
• Computational complexity is linear in the number of variables
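Roughly why this is tractable (paraphrasing the CorEx papers; see them for the precise statement): the optimizing conditional distribution is self-consistent and depends on the data only through the marginals p(x_i | y), with a form like

$$p(y \mid x) \;\propto\; p(y) \prod_{i=1}^{n} \frac{p(x_i \mid y)}{p(x_i)},$$

so each update touches only the n marginals, giving the linear scaling in the number of variables.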

Sift out the main ingredient: remainder info

The remainder is a transformation of the inputs with two properties:
1. The remainder contains no info about Y
2. The transformation is invertible
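Stated in information-theoretic terms (a compact restatement of the two properties above, writing X̄ for the remainder):

$$I(\bar{X} ; Y) = 0 \qquad \text{and} \qquad X \ \text{is recoverable from}\ (\bar{X}, Y),$$

so the extracted factor Y and the remainder together losslessly represent the data, while the remainder itself tells you nothing about Y.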

Iterative sifting as a decomposition of information:
(Equation figure: the multivariate mutual information in the data (Total Correlation) = the optimized contribution from each layer of the sieve + the remainder at layer r.)
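Written as an equation (following the paper's notation, with X^0 = X the data, Y_k the factor learned at layer k, and X^k the remainder after layer k):

$$TC(X) \;=\; \sum_{k=1}^{r} TC\!\left(X^{k-1}; Y_k\right) \;+\; TC\!\left(X^{r}\right).$$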

Iterative sifting: the dependence remaining at each layer of the sieve decreases until we get to zero, i.e. complete independence.
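Equivalently (assuming the ideal remainder construction above), each layer can only shrink the dependence that is left:

$$TC\!\left(X^{r}\right) \;=\; TC\!\left(X^{r-1}\right) \;-\; TC\!\left(X^{r-1}; Y_r\right) \;\le\; TC\!\left(X^{r-1}\right),$$

and sifting stops once TC(X^r) = 0, i.e. the remaining components are independent.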

Extracting dependence

Recover spatial clusters from fMRI data

(Figure panels: Ground truth, ICA, Sieve.)
Example of recovering spatial clusters in brain data from temporal activation patterns.

Lossy compression and in-painting
• Sieve representation with 12 layers/bits/binary latent factors on MNIST digits

We can use the sieve for standard prediction and generative model tasks

Lossless compression (on MNIST)
• Same-size codebooks for Random and Sieve-based codes
• (gzip is sequence-based, shown for reference)
Proof of principle for lossless compression, though specialized compression techniques are better on MNIST.

Method            Bits per digit
Naive             784
gzip              328
Random codebook   267
Sieve codebook    243
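As a rough illustration of where per-digit bit counts like these come from (a generic independent-coding bound, not the authors' actual codebook construction): coding each binary variable separately at its marginal entropy costs the sum of marginal entropies, and the excess over the true joint entropy is exactly the total correlation remaining in the representation, which is what the sieve drives toward zero. A minimal sketch:

```python
import numpy as np

def independent_coding_cost(X):
    """Bits per example when every binary column is coded on its own
    with an ideal entropy coder. This upper-bounds the joint entropy;
    the gap equals the total correlation left in the representation,
    so a near-factorial code (like the sieve's output) wastes few bits.

    X: array of shape (n_examples, n_variables) with 0/1 entries.
    """
    p = X.mean(axis=0).clip(1e-12, 1 - 1e-12)          # marginal P(x_i = 1)
    h = -(p * np.log2(p) + (1 - p) * np.log2(1 - p))   # per-column entropy
    return float(h.sum())

# Example (hypothetical data): raw binarized 28x28 MNIST digits cost at
# most 784 bits per digit with naive one-bit-per-pixel coding; marginal
# entropy coding and increasingly factorial representations reduce this.
# X = (np.random.rand(1000, 784) < 0.13).astype(np.uint8)
# print(independent_coding_cost(X))
```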

Conclusion

• Incrementally decomposing multivariate information is useful, practical, and delicious
• Could improve with joint optimization and better transformations for the remainder info

Link to all papers and code: http://bit.ly/corex_info

Contact: gregv@isi.edu, galstyan@isi.edu

• The extension to continuous random variables is nontrivial but more practical and demonstrates connections to “common information”: “Sifting Common Information from Many Variables”, arXiv:1606.02307.