
Learning From Data: Lecture 27

Learning Aides

Input Preprocessing

Dimensionality Reduction and Feature Selection

Principal Components Analysis (PCA)

Hints, Data Cleaning, Validation, . . .

M. Magdon-Ismail, CSCI 4100/6100

Learning Aides

Additional tools that can be applied to all techniques

Preprocess data to account for arbitrary choices during data collection (input normalization)

Remove irrelevant dimensions that can mislead learning (PCA)

Incorporate known properties of the target function (hints and invariances)

Remove detrimental data (deterministic and stochastic noise)

Better ways to validate (estimate Eout) for model selection


Nearest Neighbor

Mr. Good and Mr. Bad were both given credit cards by the Bank of Learning (BoL).

Mr. Good: (47, 35)    Mr. Bad: (22, 40)

(Coordinates: Age in years, Income in $×1,000)

Mr. Unknown, who has "coordinates" (21 yrs, $36K), applies for credit. Should the BoL give him credit, according to the nearest neighbor algorithm?

What if income is measured in dollars instead of "K" (thousands of dollars)?

[Figure: the three customers plotted as Age (yrs) vs. Income ($K)]


Nearest Neighbor Uses Euclidean Distance

Mr. Good and Mr. Bad were both given credit cards by the Bank of Learning (BoL).

Mr. Good: (47, 35000)    Mr. Bad: (22, 40000)

(Coordinates: Age in years, Income in $)

Mr. Unknown, who has "coordinates" (21 yrs, $36,000), applies for credit. Should the BoL give him credit, according to the nearest neighbor algorithm?

When income is measured in dollars instead of "K" (thousands of dollars), the income axis dominates the Euclidean distance, and Mr. Unknown's nearest neighbor flips from Mr. Bad to Mr. Good.

[Figure: the three customers plotted as Age (yrs) vs. Income ($)]

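To see this concretely, here is a minimal NumPy sketch (not from the lecture; the names are illustrative) that runs the nearest neighbor rule under both scalings:

```python
import numpy as np

# The same three people under the two scalings of income.
good_k, bad_k, unknown_k = np.array([47, 35]), np.array([22, 40]), np.array([21, 36])
good_d, bad_d, unknown_d = np.array([47, 35000]), np.array([22, 40000]), np.array([21, 36000])

def nearest(unknown, good, bad):
    # Euclidean nearest neighbor between the two credit histories
    return "Mr. Good" if np.linalg.norm(unknown - good) < np.linalg.norm(unknown - bad) else "Mr. Bad"

print(nearest(unknown_k, good_k, bad_k))  # income in $K: Mr. Bad  -> deny credit
print(nearest(unknown_d, good_d, bad_d))  # income in $:  Mr. Good -> give credit
```

An arbitrary choice of units decides whether Mr. Unknown gets a credit card, which is exactly why the inputs should be preprocessed first.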

Uniform Treatment of Dimensions

Most learning algorithms treat each dimension equally

Nearest neighbor: $d(\mathbf{x}, \mathbf{x}') = \|\mathbf{x} - \mathbf{x}'\|$

Weight decay: $\Omega(\mathbf{w}) = \lambda\,\mathbf{w}^{\mathrm{T}}\mathbf{w}$

SVM: margin defined using Euclidean distance

RBF: bump function decays with Euclidean distance

Input Preprocessing

Unless you want to emphasize certain dimensions, the data should be preprocessed to present each dimension on an equal footing.


Input Preprocessing is a Data Transform

$$X = \begin{bmatrix} \mathbf{x}_1^{\mathrm{T}} \\ \mathbf{x}_2^{\mathrm{T}} \\ \vdots \\ \mathbf{x}_N^{\mathrm{T}} \end{bmatrix}, \qquad \mathbf{x}_n \mapsto \mathbf{z}_n = \Phi(\mathbf{x}_n), \qquad g(\mathbf{x}) = \tilde{g}(\Phi(\mathbf{x})).$$

The raw $\mathbf{x}_n$ have (for example) arbitrary scalings in each dimension; the $\mathbf{z}_n$ will not.


Centering

[Figure: raw data (x₁, x₂) → centered data (z₁, z₂)]

Centering: $\mathbf{z}_n = \mathbf{x}_n - \bar{\mathbf{x}}$, which gives $\bar{\mathbf{z}} = \mathbf{0}$.


Normalizing

[Figure: raw data (x₁, x₂) → centered (z₁, z₂) → normalized (z₁, z₂)]

Normalizing (after centering): $\mathbf{z}_n = D\mathbf{x}_n$ with $D_{ii} = 1/\sigma_i$, which gives $\sigma_i = 1$ in every dimension.


Whitening

[Figure: raw data (x₁, x₂) → centered → normalized → whitened (z₁, z₂)]

Whitening (after centering): $\mathbf{z}_n = \Sigma^{-1/2}\mathbf{x}_n$, where

$$\Sigma = \frac{1}{N}\sum_{n=1}^{N} \mathbf{x}_n \mathbf{x}_n^{\mathrm{T}} = \frac{1}{N} X^{\mathrm{T}} X,$$

which gives $\frac{1}{N} Z^{\mathrm{T}} Z = I$.

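A minimal NumPy sketch of all three transforms, assuming the rows of X are the data points (the function name and test data are illustrative, not from the lecture):

```python
import numpy as np

def preprocess(X):
    """Center, normalize, and whiten the rows of X (one data point per row)."""
    Xc = X - X.mean(axis=0)                   # centering: z_n = x_n - xbar
    Xn = Xc / Xc.std(axis=0)                  # normalizing: D_ii = 1 / sigma_i
    Sigma = Xc.T @ Xc / len(Xc)               # covariance of the centered data
    lam, V = np.linalg.eigh(Sigma)            # Sigma = V diag(lam) V^T (assumes full rank)
    Xw = Xc @ V @ np.diag(lam ** -0.5) @ V.T  # whitening: z_n = Sigma^(-1/2) x_n
    return Xc, Xn, Xw

X = np.random.randn(500, 2) @ np.array([[3.0, 1.0], [0.0, 0.5]]) + [5.0, -2.0]
Xc, Xn, Xw = preprocess(X)
print(np.allclose(Xw.T @ Xw / len(Xw), np.eye(2)))  # whitened covariance is the identity
```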

Only Use Training Data For Preprocessing

WARNING! Transforming data into a more convenient format has a hidden trap which leads to data snooping.

When using a test set, determine the input transformation from training data only.

Rule: lock away the test data until you have your final hypothesis.

Split $\mathcal{D}$ into $\mathcal{D}_{\text{train}}$ and $\mathcal{D}_{\text{test}}$. Determine the input preprocessing $\mathbf{z} = \Phi(\mathbf{x})$ from $\mathcal{D}_{\text{train}}$ alone and learn $g(\mathbf{x}) = \tilde{g}(\Phi(\mathbf{x}))$; only then touch $\mathcal{D}_{\text{test}}$ to compute $E_{\text{test}}$.

[Figure: cumulative profit (%) vs. day (0–500), "no snooping" vs. "snooping"; the snooped curve shows much higher apparent profit]

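In code, the rule amounts to fitting the transform's parameters on the training set alone, then applying that same fixed transform everywhere. A minimal sketch using whitening as the example preprocessing (names are illustrative, not from the lecture):

```python
import numpy as np

def fit_whitener(X_train):
    """Learn the whitening transform from training data only."""
    mean = X_train.mean(axis=0)
    Xc = X_train - mean
    lam, V = np.linalg.eigh(Xc.T @ Xc / len(Xc))
    W = V @ np.diag(lam ** -0.5) @ V.T
    return lambda X: (X - mean) @ W            # the *training* transform, applied to any data

X_train, X_test = np.random.randn(100, 3), np.random.randn(50, 3)
whiten = fit_whitener(X_train)                 # parameters come from X_train alone
Z_train, Z_test = whiten(X_train), whiten(X_test)  # same fixed transform for both sets
```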

Principal Components Analysis

[Figure: original data → rotated data]

Rotate the data so that it is easy to identify the dominant directions (information).

Throw away the smaller dimensions (noise)

$$\begin{bmatrix} x_1 \\ x_2 \end{bmatrix} \to \begin{bmatrix} z_1 \\ z_2 \end{bmatrix} \to \begin{bmatrix} z_1 \end{bmatrix}$$


Projecting the Data to Maximize Variance

(Always center the data first)

$$z_n = \mathbf{x}_n^{\mathrm{T}}\mathbf{v}$$

[Figure: original data with the projection direction v]

Find v to maximize the variance of z


Maximizing the Variance

$$\mathrm{var}[z] = \frac{1}{N}\sum_{n=1}^{N} z_n^2 = \frac{1}{N}\sum_{n=1}^{N} \mathbf{v}^{\mathrm{T}}\mathbf{x}_n\mathbf{x}_n^{\mathrm{T}}\mathbf{v} = \mathbf{v}^{\mathrm{T}}\left(\frac{1}{N}\sum_{n=1}^{N}\mathbf{x}_n\mathbf{x}_n^{\mathrm{T}}\right)\mathbf{v} = \mathbf{v}^{\mathrm{T}}\Sigma\,\mathbf{v}.$$

Choose v as v1, the top eigenvector of Σ — the top principal component (PCA)

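A minimal sketch (illustrative, not the lecture's code) that picks v₁ as the top eigenvector of Σ and forms the maximal-variance projection:

```python
import numpy as np

def top_principal_component(X):
    """The direction v maximizing var[z] = v^T Sigma v over unit vectors v."""
    Xc = X - X.mean(axis=0)            # always center the data first
    Sigma = Xc.T @ Xc / len(Xc)        # Sigma = (1/N) X^T X
    lam, V = np.linalg.eigh(Sigma)     # eigenvalues returned in ascending order
    return V[:, -1]                    # eigenvector of the largest eigenvalue

X = np.random.randn(200, 2) @ np.array([[2.0, 0.3], [0.3, 0.5]])
v1 = top_principal_component(X)
z = (X - X.mean(axis=0)) @ v1          # z_n = x_n^T v1: the maximal-variance projection
```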

The Principal Components

$$z_1 = \mathbf{x}^{\mathrm{T}}\mathbf{v}_1, \quad z_2 = \mathbf{x}^{\mathrm{T}}\mathbf{v}_2, \quad z_3 = \mathbf{x}^{\mathrm{T}}\mathbf{v}_3, \ \ldots$$

where $\mathbf{v}_1, \mathbf{v}_2, \ldots, \mathbf{v}_d$ are the eigenvectors of $\Sigma$ with eigenvalues $\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_d$.

Theorem [Eckart-Young]. These directions give the best reconstruction of the data; they also capture the maximum variance.


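To illustrate the theorem, a short sketch (an assumed helper, not from the lecture) that keeps the top-k principal components and reconstructs the data from them:

```python
import numpy as np

def pca_reconstruct(X, k):
    """Project onto the top-k principal components, then map back to input space."""
    xbar = X.mean(axis=0)
    Xc = X - xbar                                # always center first
    lam, V = np.linalg.eigh(Xc.T @ Xc / len(Xc))
    Vk = V[:, ::-1][:, :k]                       # top-k eigenvectors, largest lambda first
    Z = Xc @ Vk                                  # k-dimensional features z_i = x^T v_i
    return Z, Z @ Vk.T + xbar                    # best rank-k reconstruction (Eckart-Young)

X = np.random.randn(300, 10)
Z, X_hat = pca_reconstruct(X, k=3)
print(np.linalg.norm(X - X_hat))                 # error from the discarded directions
```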

PCA Features for Digits Data

[Figures: % reconstruction error vs. number of components k (0–200); digits data ("1" vs. "not 1") plotted in the top two PCA features (z₁, z₂)]

Principal components are automated: they capture the dominant directions of the data.

But they may not capture the dominant dimensions for f.


Other Learning Aides

1. Nonlinear dimension reduction:

[Figure: nonlinear dimension reduction illustrated in the (x₁, x₂) plane]

2. Hints (invariances and prior information): rotational invariance, monotonicity, symmetry, . . .

3. Removing noisy data:

[Figures: digits data in the (intensity, symmetry) plane, illustrating the removal of noisy examples]

4. Advanced validation techniques: Rademacher and permutation penalties. More efficient than CV; more convenient and accurate than VC.

© AML. Creator: Malik Magdon-Ismail.