
Applied Machine Learning in Biomedicine

Enrico Grisan

enrico.grisan@dei.unipd.it

Algorithm’s objective cost

Formal objective for algorithms:
- minimize a cost function
- maximize an objective function

Proving convergence:
- does the objective monotonically improve?

Considering alternatives:
- does another algorithm score better?
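In symbols, and purely as a reconstruction (the symbols $C$ and $J$ are assumed, not from the slides), the two formulations are

$$ \hat{w} = \arg\min_w C(w) \qquad \text{or} \qquad \hat{w} = \arg\max_w J(w) $$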

Loss function

Choosing a loss function

• Motivated by the application – 0-1 error, achieving a tolerance, business cost

• Computational convenience: – Differentiability, convexity

• Beware of loss dominated by artifacts: – Outliers

– Unbalanced classes
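As a concrete example (standard definitions, not taken from the slides): the squared loss is differentiable and convex, hence computationally convenient, while the 0-1 loss matches classification error directly but is hard to optimize:

$$ L_{\mathrm{sq}}(y, \hat{y}) = (y - \hat{y})^2, \qquad L_{0\text{-}1}(y, \hat{y}) = \mathbb{I}[y \neq \hat{y}] $$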

A step into linear regression


Vector form for RSS
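The slide's formula did not survive extraction; a standard reconstruction, with $X$ the $N \times D$ design matrix, $y$ the $N \times 1$ target vector and $w$ the $D \times 1$ weight vector:

$$ \mathrm{RSS}(w) = (y - Xw)^\top (y - Xw) = \lVert y - Xw \rVert^2 $$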

Least squares estimation
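Setting the gradient of $\mathrm{RSS}(w)$ to zero gives the normal equations and the closed-form estimate (a standard result, reconstructed here):

$$ \nabla_w \mathrm{RSS}(w) = -2 X^\top (y - Xw) = 0 \quad \Rightarrow \quad \hat{w} = (X^\top X)^{-1} X^\top y $$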

Geometry of least squares
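Geometrically (the standard reading of this slide's figure), the fitted values are the orthogonal projection of $y$ onto the column space of $X$:

$$ \hat{y} = X \hat{w} = X (X^\top X)^{-1} X^\top y $$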

Least squares estimation

% ww = Dx1 weight vector

% X = NxD design matrix of training cases

% Y = Nx1 target vector

ww = X\Y;
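A note on the idiom: for a rectangular X, MATLAB's backslash operator solves the least-squares problem through a QR factorization, which is numerically more stable than explicitly computing inv(X'*X)*X'*Y.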

Least squares estimation (2)

Iterative solution:

1. Initialize the weights

2. Update the weights along the negative gradient

The importance of the step size
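A minimal sketch of the update loop, assuming a squared-error loss and a fixed step size eta (the variable names and the iteration count are illustrative, not from the slides):

% Gradient descent for least squares: minimize the RSS ||Y - X*ww||^2
eta = 1e-3;                        % step size: too large diverges, too small is slow
ww = zeros(size(X,2), 1);          % 1. initialize the weights
for it = 1:1000
    grad = -2 * X' * (Y - X*ww);   % gradient of the RSS
    ww = ww - eta * grad;          % 2. update along the negative gradient
end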

Least squares classifier

Why not use linear least squares to fit a regressor to binary targets?

% fit yy = xx*ww

% ww = Dx1 weights

% xx = NxD training cases

% yy = Nx1 binary targets

ww = xx\yy;
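To use the fitted regressor as a classifier, threshold its output; a minimal sketch assuming the targets were encoded as +/-1 (the encoding is an assumption, not stated on the slide):

% predict labels for new cases xnew
ypred = sign(xnew * ww);    % assumes yy was encoded as +/-1

The question on the slide points at the known weakness: the squared error also penalizes points that are far on the correct side of the decision boundary, so a few such points (or outliers) can pull the boundary away from a good solution.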

Least squares classifier


Least squares in practice (1)

Prostate cancer study (Stamey, 1989)

Relating clinical measures to the PSA marker

Least squares in practice (2)

1) Normalize all data

Least squares in practice (3)

2) Split the data into train and test set

3) Fit the regression on the training set

4) Evaluate the error on the test set (a compact sketch of the four steps follows)
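A minimal sketch of steps 1-4 (the variable names and the 50/50 split are illustrative assumptions):

% xx = NxD raw inputs, yy = Nx1 targets
X = (xx - mean(xx)) ./ std(xx);            % 1) normalize all data
n = floor(size(X,1)/2);                    % 2) split into train and test
Xtr = X(1:n,:);      ytr = yy(1:n);
Xte = X(n+1:end,:);  yte = yy(n+1:end);
ww = Xtr \ ytr;                            % 3) fit on the training set
mse = mean((yte - Xte*ww).^2);             % 4) error on the test set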

Least squares in practice (5)

Least squares in practice (6)

Least squares in practice (6b)

Multiclass linear classifier

K classes, K > 2: build K-1 binary classifiers

One versus all

Multiclass linear classifier

K classes, K > 2: build K(K-1)/2 binary classifiers (for K = 4, that is 6 classifiers, one per pair)

One versus one

Multiclass linear classifier

Least squares approach

Linear regression (with features)

X = [ones(N,1), xx, xx.^2, xx.^3, xx.^4, xx.^5, xx.^6];

W = X\yy;

% Function interpolation

Xnew = [ones(numel(xnew),1), xnew, xnew.^2, xnew.^3, xnew.^4, xnew.^5, xnew.^6];

Ynew = Xnew*W;
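A practical caveat with this polynomial basis: raw powers up to xx.^6 make the design matrix badly conditioned unless xx is first centred and scaled, which is one more reason for the normalization step seen earlier.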

Neighbour-based regression

Predict the target (the "height" of the curve) of the nearest training input
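A minimal 1-nearest-neighbour sketch for a single scalar query xq (names illustrative):

% 1-NN regression: copy the target of the closest training input
[~, idx] = min(abs(xx - xq));   % nearest training input (1-D case)
ypred = yy(idx);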

Kernel smoothing

Weight points in proportion to a kernel
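A kernel-smoothing (Nadaraya-Watson) sketch with a Gaussian kernel; the bandwidth h is an assumed free parameter:

% kernel regression at query xq: kernel-weighted average of training targets
h = 0.5;                             % bandwidth (illustrative value)
k = exp(-(xx - xq).^2 / (2*h^2));    % Gaussian kernel weights
ypred = sum(k .* yy) / sum(k);       % normalized weighted average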


Overfitting

We can make the empirical loss zero, e.g., with a model flexible enough to interpolate every training point.

Generalization

Want to do well on future, unknown data:


Expected error

- Training error
- Generalization error
- Expected test error
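Standard definitions, reconstructed here since the slide's formulas were lost: with loss $L$, fitted model $f$ and data distribution $p(x, y)$,

$$ E_{\mathrm{train}} = \frac{1}{N} \sum_{n=1}^{N} L\big(y_n, f(x_n)\big), \qquad E_{\mathrm{gen}} = \mathbb{E}_{(x,y) \sim p}\big[ L\big(y, f(x)\big) \big] $$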

Validation set

[Figure: training and validation error for polynomial fits of order p = 2, 5, 9]

Learning curves

[Figure: mean squared error vs. p, polynomial order]

Using validation set

The validation set is used to estimate the test error of the fitted model.

It is possible to overfit the validation set.

Tracking a validation set is also used during the fitting of a single model (early stopping):

- ad hoc
- depends on the optimizer
- sometimes fast
- sometimes works annoyingly well

Model selection and assessment

Model selection: estimating the performance of different models in order to choose the best one.

Model assessment: having chosen a final model, estimating its prediction error on new data.

Data split: Train | Validation | Test

Cross validation

We want a procedure that estimates at least the average generalization error even when data are scarce and we cannot set aside a validation set.

[Figure: the data are split into 5 folds; in each round one fold (e.g., fold 5, then fold 3) is held out as the test set and the remaining four folds form the training set]

K-Fold Cross Validation

Choosing K:
- K = N (leave-one-out): approximately unbiased estimator of the prediction error, but high variance (the training sets are very similar)
- Small K: the small training sets may overestimate the prediction error
- Rule of thumb: K = 5 or K = 10
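A minimal K-fold sketch for the least-squares model above (the random fold assignment is an illustrative choice):

% K-fold cross-validation of a least-squares fit
K = 5;
N = size(X,1);
fold = mod(randperm(N), K) + 1;      % random fold labels in 1..K
errs = zeros(K,1);
for k = 1:K
    tr = (fold ~= k);                % K-1 folds for training
    te = (fold == k);                % held-out fold for testing
    ww = X(tr,:) \ yy(tr);
    errs(k) = mean((yy(te) - X(te,:)*ww).^2);
end
cv_error = mean(errs);               % estimate of the average generalization error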

Cross validation scenario

P1. Cleveland Heart Disease Data from V.A. Medical Center, Long Beach and Cleveland Clinic Foundation, 1988.

297 patients, 14 attributes per patient

Predict heart disease (possibly on a scale of 1-4)

P2. HIV cleavage site

Rögnvaldsson, You and Garwicz (2015), "State of the art prediction of HIV-1 protease cleavage sites", Bioinformatics, 31(8), 1204-1210.

Kontijevskis, Wikberg and Komorowski (2007), "Computational Proteomics Analysis of HIV-1 Protease Interactome", Proteins: Structure, Function, and Bioinformatics, 68, 305-312.

You, Garwicz and Rögnvaldsson (2005), "Comprehensive Bioinformatic Analysis of the Specificity of Human Immunodeficiency Virus Type 1 Protease", Journal of Virology, 79, 12477-12486.

Knowledge of the mechanism of HIV protease cleavage specificity is critical to the design of specific and effective HIV inhibitors. An accurate, robust and fast method for predicting cleavage sites in proteins is therefore crucial in the search for possible inhibitors. The goal is to predict whether a sequence of amino acids constitutes a cleavage site.
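To cast this in the framework above, each residue of a short peptide can be one-hot ("orthogonal") encoded so that a linear classifier applies; a sketch under that assumption (the 8-residue window and the example peptide are illustrative; the cited 2015 paper uses a comparable orthogonal encoding):

% one-hot encoding of an 8-residue peptide into a 160-dimensional vector
aa = 'ARNDCQEGHILKMFPSTWYV';           % the 20 amino acids
pep = 'SQNYPIVQ';                      % example 8-mer (illustrative)
x = zeros(1, 8*20);                    % 8 positions x 20 letters
for i = 1:8
    x((i-1)*20 + strfind(aa, pep(i))) = 1;
end
% stack such vectors as rows of xx, with yy = +/-1 cleavage labels,
% then fit the linear classifier as before: ww = xx\yy;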