
Dimensionality Reduction and Principal Components Analysis


Outline

•  What is dimensionality reduction?
•  Principal Components Analysis (PCA)
   – Example (Bishop, ch 12)
   – PCA vs linear regression
   – PCA as a mixture model variant
   – Implementing PCA
•  Other matrix factorization methods
   – Applications (collaborative filtering)
   – MF with SGD
   – MF vs clustering vs LDA vs …


A DIMENSIONALITY REDUCTION METHOD YOU ALREADY KNOW….

http://ufldl.stanford.edu/wiki/

Outline

•  What’s new in ANNs in the last 5-10 years?
   – Deeper networks, more data, and faster training
      •  Scalability and use of GPUs ✔
      •  Symbolic differentiation ✔
      •  Some subtle changes to cost functions, architectures, and optimization methods ✔
•  What types of ANNs are most successful, and why?
   – Convolutional networks (CNNs) ✔
   – Long short-term memory networks (LSTMs) ✔
   – Word2vec and embeddings ✔
   – Autoencoders
•  What are the hot research topics for deep learning?

Neural network auto-encoding

•  Assume we would like to learn the following (trivial?) output function:

   Input      Output
   00000001   00000001
   00000010   00000010
   …          …
   10000000   10000000

•  One approach is to use this network: [figure: a network with a narrow hidden layer between the 8-unit input and 8-unit output layers; a code sketch follows below]
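As an illustration (not from the slides), here is a minimal numpy sketch of such an auto-encoder, trained to reproduce the eight one-hot inputs; the 8-3-8 layer sizes, learning rate, and step count are arbitrary choices:

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.eye(8)                        # the eight one-hot training examples

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# encoder (8 -> 3) and decoder (3 -> 8) weights and biases
W1 = rng.normal(0, 0.1, (8, 3)); b1 = np.zeros(3)
W2 = rng.normal(0, 0.1, (3, 8)); b2 = np.zeros(8)

lr = 1.0
for step in range(20000):
    # forward pass
    H = sigmoid(X @ W1 + b1)         # hidden codes, shape (8, 3)
    Y = sigmoid(H @ W2 + b2)         # reconstructions, shape (8, 8)
    # backward pass for the squared-error loss 0.5 * ||Y - X||^2
    dY = (Y - X) * Y * (1 - Y)
    dH = (dY @ W2.T) * H * (1 - H)
    W2 -= lr * H.T @ dY;  b2 -= lr * dY.sum(axis=0)
    W1 -= lr * X.T @ dH;  b1 -= lr * dH.sum(axis=0)

print(np.round(H, 2))                # hidden codes: often roughly binary
print(np.abs(Y - X).max())           # worst-case reconstruction error
```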

Neural network auto-encoding (continued)

•  Same input/output pairs as above. Maybe learning something like this: [figure: a table of hidden-unit activations, one compact code per input]

Neural network autoencoding

•  The hidden layer is a compressed version of the data
•  If training error is zero, then the training data is perfectly reconstructed
   – It probably won’t be zero, unless you overfit
•  Other, similar examples will be imperfectly reconstructed
•  High reconstruction error usually means the example is different from the training data
   – Reconstruction error on a vector x is related to the probability P(x) under the distribution the auto-encoder was trained on
•  Autoencoders are thus related to generative models of the data
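To make that concrete, a small sketch (the function name is mine; it reuses the weights trained in the earlier sketch):

```python
import numpy as np

def reconstruction_error(x, W1, b1, W2, b2):
    """Squared reconstruction error of x; a high value suggests x is unlike the training data."""
    h = 1.0 / (1.0 + np.exp(-(x @ W1 + b1)))   # encode
    y = 1.0 / (1.0 + np.exp(-(h @ W2 + b2)))   # decode
    return float(np.sum((y - x) ** 2))

# a training-like one-hot input should score low; an unfamiliar pattern, high:
# reconstruction_error(np.eye(8)[0], W1, b1, W2, b2)
# reconstruction_error(np.array([1., 1, 0, 0, 1, 0, 1, 0]), W1, b1, W2, b2)
```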

(Deep) Stacked Autoencoders

[figure: train an auto-encoder on the data X, then apply its encoder to produce a new dataset H of hidden-unit activations for X; train the next auto-encoder on H, and so on]

[figure: modeling documents using the top 2,000 words]

Applications of dimensionality reduction

•  Visualization
   – examining data in 2 or 3 dimensions
•  Semi-supervised learning
   – supervised learning in the reduced space
•  Anomaly detection and data completion
   – coming soon
•  Convenience in data processing
   – Go from words (10^6 dimensions) to word vectors (hundreds of dimensions)
   – Use dense, GPU-friendly matrix operations instead of sparse ones


PCA


PCA is basically this picture (the auto-encoder network above)…

•  with linear units (not sigmoid units)

•  trained to minimize squared error

•  with a constraint that the “hidden units” are orthogonal

Motivating Example of PCA

[a sequence of figure-only slides: “A Motivating Example”]

PCA as matrices

With V the 1000 images × 10,000 pixels matrix (1000 × 10,000 = 10,000,000 entries), where V[i,j] = pixel j in image i, PCA with 2 “prototypes” approximates

$$
\underbrace{V}_{1000 \times 10{,}000}
\;\approx\;
\begin{bmatrix} x_1 & y_1 \\ x_2 & y_2 \\ \vdots & \vdots \\ x_n & y_n \end{bmatrix}
\begin{bmatrix} a_1 & a_2 & \cdots & a_m \\ b_1 & b_2 & \cdots & b_m \end{bmatrix}
$$

where the rows (a_1, …, a_m) and (b_1, …, b_m) are the two principal components PC1 and PC2, and (x_i, y_i) are the coordinates of image i along them.

PCA as matrices: the optimization problem

PCA as vectors: the optimization problem
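The equations on these two slides did not survive extraction; what follows is a standard reconstruction of the objective (following Bishop, ch 12), in the notation above. In matrix form, writing the factorization above as $V \approx WH$:

$$
\min_{W,\,H}\ \lVert V - W H \rVert_F^2
\qquad \text{subject to the rows of } H \text{ being orthonormal.}
$$

In vector form, with mean-centered examples $x_i$ and orthonormal directions $w_1, \dots, w_k$:

$$
\min_{w_1,\dots,w_k}\ \sum_{i=1}^{n} \Big\lVert x_i - \sum_{j=1}^{k} (w_j^\top x_i)\, w_j \Big\rVert^2 .
$$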

[figure-only slides: “A cartoon of PCA”, “A 3D cartoon of PCA”, “More cartoons”]


PCA vs linear regression

[figure-only slides; the usual contrast: linear regression minimizes vertical distances to the fitted line, while PCA minimizes perpendicular (orthogonal) distances]

PCA vs mixtures of Gaussians

[figure-only slides relating PCA to a mixture-model variant]

PCA: IMPLEMENTATION


Finding principal components of a matrix

•  There are two algorithms I like:
   – EM (Roweis, NIPS 1997)
   – Eigenvectors of the correlation matrix (next)


Implementing PCA

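The implementation details on this slide did not survive extraction. A minimal numpy sketch of the standard recipe (center the data, then take the top-k eigenvectors of the covariance matrix); the function and variable names are mine:

```python
import numpy as np

def pca(X, k):
    """Top-k principal components of X (one example per row) and the projected data."""
    mu = X.mean(axis=0)
    Xc = X - mu                           # center each feature
    C = Xc.T @ Xc / (len(X) - 1)          # (d, d) covariance matrix
    evals, evecs = np.linalg.eigh(C)      # eigh is for symmetric matrices
    order = np.argsort(evals)[::-1]       # sort eigenvalues in descending order
    W = evecs[:, order[:k]]               # (d, k): principal components as columns
    Z = Xc @ W                            # (n, k): low-dimensional coordinates
    return W, Z, mu

# usage: project, then reconstruct from k components
# W, Z, mu = pca(X, k=2)
# X_hat = Z @ W.T + mu
```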

Implementing PCA: why does this work?

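The body of this slide is also missing; the standard argument, summarized: the variance of the data projected onto a unit vector $w$ is $w^\top \Sigma w$ (with $\Sigma$ the covariance matrix), and maximizing it under the constraint $w^\top w = 1$ with a Lagrange multiplier yields an eigenvector equation, so the best direction is the top eigenvector of $\Sigma$:

$$
\max_w\ w^\top \Sigma\, w - \lambda\,(w^\top w - 1)
\;\Rightarrow\; \Sigma w = \lambda w,
\qquad \operatorname{Var}(w^\top x) = w^\top \Sigma w = \lambda .
$$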

Some more examples of PCA: Eigenfaces

[a sequence of figure-only slides showing eigenfaces]

Image denoising

[figure-only slides: PCA for image denoising]

PCA FOR MODELING TEXT (SVD = SINGULAR VALUE DECOMPOSITION)


A Scalability Problem with PCA

•  The covariance matrix is large in high dimensions
   – With d features, the covariance matrix is d×d
•  SVD is a closely related method that can be implemented more efficiently in high dimensions
   – Don’t explicitly compute the covariance matrix
   – Instead, decompose X as X = USV^T
   – S is k×k, where k << d
   – S is diagonal, and S[i,i] = sqrt(λ_i) for V[:,i]
   – Columns of V ≈ principal components
   – Rows of US ≈ embeddings for the examples
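A minimal numpy sketch of this recipe (not from the slides; the rank k and the names are mine):

```python
import numpy as np

def truncated_svd(X, k):
    """Rank-k SVD of the centered data, without forming the d x d covariance matrix."""
    Xc = X - X.mean(axis=0)
    # full_matrices=False gives the economy-size decomposition
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    components = Vt[:k, :]        # rows ~= principal components
    embedding = U[:, :k] * s[:k]  # rows of U*S ~= per-example embeddings
    return components, embedding
```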


SVD example


•  The Neatest Little Guide to Stock Market Investing
•  Investing For Dummies, 4th Edition
•  The Little Book of Common Sense Investing: The Only Way to Guarantee Your Fair Share of Stock Market Returns
•  The Little Book of Value Investing
•  Value Investing: From Graham to Buffett and Beyond
•  Rich Dad’s Guide to Investing: What the Rich Invest in, That the Poor and the Middle Class Do Not!
•  Investing in Real Estate, 5th Edition
•  Stock Investing For Dummies
•  Rich Dad’s Advisors: The ABC’s of Real Estate Investing: The Secrets of Finding Hidden Profits Most Investors Miss

https://technowiki.wordpress.com/2011/08/27/latent-semantic-analysis-lsa-tutorial/

TF-IDF scores would be better than raw counts.
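As an illustration of the LSA pipeline with TF-IDF weighting (not from the slides), a scikit-learn sketch; the shortened title list and n_components=2 are just for demonstration:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import TruncatedSVD

titles = [
    "The Neatest Little Guide to Stock Market Investing",
    "Investing For Dummies, 4th Edition",
    "The Little Book of Value Investing",
    "Investing in Real Estate, 5th Edition",
    "Stock Investing For Dummies",
]

# build the (documents x terms) TF-IDF matrix, then take a rank-2 SVD
V = TfidfVectorizer().fit_transform(titles)
svd = TruncatedSVD(n_components=2, random_state=0)
Z = svd.fit_transform(V)   # per-document embeddings (rows of U*S)
print(Z.round(2))          # related titles get similar coordinates
```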

Recovering latent factors in a matrix

With V the n documents × m terms doc-term matrix, where V[i,j] = TF-IDF score of term j in doc i, we again seek a low-rank approximation

$$
\underbrace{V}_{n \times m}
\;\approx\;
\begin{bmatrix} x_1 & y_1 \\ x_2 & y_2 \\ \vdots & \vdots \\ x_n & y_n \end{bmatrix}
\begin{bmatrix} a_1 & a_2 & \cdots & a_m \\ b_1 & b_2 & \cdots & b_m \end{bmatrix}
$$

[figure: documents plotted by their latent-factor coordinates, with labels such as “Investing for real estate”, “Rich Dad’s Advisor’s: The ABCs of Real Estate Investment …”, “The little book of common sense investing: …”, and “Neatest Little Guide to Stock Market Investing”]