Machine Learning Logistic Regression

Jeff Howbert, Introduction to Machine Learning, Winter 2012

Name is somewhat misleading. Really a technique for classification, not regression.

– “Regression” comes from fact that we fit a linear model to the feature space.

Involves a more probabilistic view of classification.

Logistic regression

Consider a two-outcome probability space, where:

– p( O1 ) = p

– p( O2 ) = 1 – p = q

Can express probability of O1 as:

Different ways of expressing probability

                        notation        range equivalents
standard probability    p               0      0.5    1
odds                    p / q           0      1      +∞
log odds (logit)        log( p / q )    -∞     0      +∞

Numeric treatment of outcomes O1 and O2 is equivalent

– If neither outcome is favored over the other, then log odds = 0.

– If one outcome is favored with log odds = x, then other outcome is disfavored with log odds = -x.

Especially useful in domains where relative probabilities can be minuscule

– Example: multiple sequence alignment in computational biology

Log odds

From probability to log odds (and back again)

logit function: $z = \log\left( \frac{p}{1 - p} \right)$

logistic function: $p = \frac{e^{z}}{1 + e^{z}} = \frac{1}{1 + e^{-z}}$
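
The round trip between probability and log odds can be sketched in a few lines of Python. This is a minimal illustration using only the standard library; the function names `logit` and `logistic` and the sample probabilities are chosen here for illustration and do not come from the slides.

```python
import math

def logit(p):
    """Map a probability p in (0, 1) to log odds in (-inf, +inf)."""
    return math.log(p / (1.0 - p))

def logistic(z):
    """Map log odds z back to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

# Illustrative probabilities: disfavored, neutral, favored.
for p in (0.1, 0.5, 0.9):
    z = logit(p)
    print(f"p={p:.1f}  odds={p / (1 - p):.3f}  log odds={z:+.3f}  back to p={logistic(z):.3f}")
```

Note that p = 0.1 and p = 0.9 give log odds of equal magnitude and opposite sign, which is the symmetry between the two outcomes described earlier.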

Standard logistic function

Scenario:

– A multidimensional feature space (features can be categorical or continuous).

– Outcome is discrete, not continuous. We’ll focus on case of two classes.

– It seems plausible that a linear decision boundary (hyperplane) will give good predictive accuracy.

Logistic regression

Model consists of a vector β in d-dimensional feature space

For a point x in feature space, project it onto β to convert it into a real number z in the range -∞ to +∞

Map z to the range 0 to 1 using the logistic function

Overall, logistic regression maps a point x in d-dimensional feature space to a value in the range 0 to 1

Using a logistic regression model

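$z = \alpha + \boldsymbol{\beta} \cdot \mathbf{x} = \alpha + \beta_1 x_1 + \cdots + \beta_d x_d$

$p = \frac{1}{1 + e^{-z}}$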

Can interpret prediction from a logistic regression model as:

– A probability of class membership

– A class assignment, by applying a threshold to the probability; the threshold represents the decision boundary in feature space
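
Both interpretations can be sketched as follows, assuming the model is an intercept α plus the projection β · x passed through the logistic function. The parameter values, the test points, the 0/1 class labels, and the helper names `predict_proba` and `predict_class` are hypothetical, chosen only for illustration.

```python
import math

def predict_proba(alpha, beta, x):
    """Return p = 1 / (1 + exp(-z)) with z = alpha + beta . x."""
    z = alpha + sum(b * xi for b, xi in zip(beta, x))
    return 1.0 / (1.0 + math.exp(-z))

def predict_class(alpha, beta, x, threshold=0.5):
    """Assign class 1 if the predicted probability reaches the threshold, else class 0."""
    return 1 if predict_proba(alpha, beta, x) >= threshold else 0

# Hypothetical parameters and points, not taken from the slides.
alpha, beta = -1.0, [2.0, -0.5]
for x in ([0.0, 0.0], [1.0, 1.0], [2.0, 0.0]):
    print(x, round(predict_proba(alpha, beta, x), 3), predict_class(alpha, beta, x))
```

With a threshold of 0.5, the class changes exactly where z = 0, so the decision boundary is the hyperplane α + β · x = 0. A different threshold shifts the boundary parallel to itself.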

Using a logistic regression model

Need to optimize β so the model gives the best possible reproduction of training set labels

– Usually done by numerical approximation of maximum likelihood

– On really large datasets, may use stochastic gradient descent
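
As a rough sketch of the stochastic gradient option, the loop below climbs the log-likelihood one training sample at a time. It is not the numerical maximum-likelihood routine a statistics package would use. The learning rate, epoch count, 0/1 label encoding, the tiny training set, and the name `train_sgd` are all assumptions made for illustration.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_sgd(X, y, learning_rate=0.1, epochs=200, seed=0):
    """Fit alpha (intercept) and beta (coefficients) by stochastic gradient
    ascent on the log-likelihood; labels in y must be 0 or 1."""
    rng = random.Random(seed)
    alpha, beta = 0.0, [0.0] * len(X[0])
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)  # visit samples in a new random order each epoch
        for i in order:
            p = sigmoid(alpha + sum(b * xi for b, xi in zip(beta, X[i])))
            error = y[i] - p  # derivative of this sample's log-likelihood w.r.t. z
            alpha += learning_rate * error
            beta = [b + learning_rate * error * xi for b, xi in zip(beta, X[i])]
    return alpha, beta

# Made-up 1-D training set: class 1 tends to have larger x, with one overlap.
X = [[0.5], [1.0], [1.5], [2.0], [2.5], [3.0], [3.5], [4.0]]
y = [0, 0, 0, 1, 0, 1, 1, 1]
alpha, beta = train_sgd(X, y)
print("alpha =", round(alpha, 3), "beta =", [round(b, 3) for b in beta])
```

The classes in this toy set overlap on purpose: with perfectly separable data the maximum-likelihood coefficients grow without bound, so a sketch like this would need a stopping rule or regularization.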

Training a logistic regression model

Logistic regression in one dimension

Parameters control shape and location of sigmoid curve

– α (the intercept) controls location of midpoint

– β (the coefficient on x) controls slope of rise
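
A short worked derivation makes both roles explicit, assuming the one-dimensional model has the form z = α + βx with α the intercept and β the single coefficient:

```latex
\begin{align*}
p(x) &= \frac{1}{1 + e^{-(\alpha + \beta x)}} \\
p(x) = \tfrac{1}{2} &\iff \alpha + \beta x = 0 \iff x = -\frac{\alpha}{\beta} \\
\frac{dp}{dx} &= \beta \, p(x) \, \bigl(1 - p(x)\bigr),
\qquad
\left.\frac{dp}{dx}\right|_{x = -\alpha/\beta} = \frac{\beta}{4}
\end{align*}
```

For a fixed β, changing α slides the midpoint along the x axis without changing the curve's shape, while the slope at the midpoint is proportional to β.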

Subset of Fisher iris dataset

– Two classes

– First two columns (SL, SW)

Logistic regression in two dimensions

decision boundary

Interpreting the model vector of coefficients

From MATLAB: B = [ 13.0460 -1.9024 -0.4047 ]

α = B( 1 ), β = [ β1 β2 ] = B( 2 : 3 )

α, β define location and orientation of decision boundary

– -α / |β| is the signed distance of the decision boundary from the origin, measured along β

– decision boundary is perpendicular to β

Magnitude of β defines gradient of probabilities between 0 and 1
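
These geometric quantities can be computed from the quoted coefficients. The sketch below assumes the first entry of B is the intercept and the remaining two are the weights on the two features in the order (SL, SW), which is how the iris example above appears to be set up. The helper name `boundary_sw` is hypothetical.

```python
import math

# Coefficients quoted in the slide: intercept first, then one weight per feature.
B = [13.0460, -1.9024, -0.4047]
alpha, beta = B[0], B[1:]

norm_beta = math.hypot(*beta)          # |beta|
signed_distance = -alpha / norm_beta   # boundary's distance from origin, along beta
closest_point = [signed_distance * b / norm_beta for b in beta]

def boundary_sw(sl):
    """Second feature (assumed SW) on the p = 0.5 boundary for a given first feature (assumed SL)."""
    return -(alpha + beta[0] * sl) / beta[1]

print("|beta| =", round(norm_beta, 4))
print("signed distance from origin =", round(signed_distance, 4))
print("boundary point closest to origin =", [round(c, 4) for c in closest_point])
print("boundary value of second feature at first feature = 6.0:", round(boundary_sw(6.0), 4))
```

Any point produced by `boundary_sw` satisfies α + β · x = 0, so the model assigns it a probability of exactly 0.5.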

Logistic regression in two dimensions

13 attributes (see heart.docx for details)

– 2 demographic (age, gender)

– 11 clinical measures of cardiovascular status and performance

2 classes: absence ( 1 ) or presence ( 2 ) of heart disease

270 samples

Dataset taken from UC Irvine Machine Learning Repository:

http://archive.ics.uci.edu/ml/datasets/Statlog+(Heart)

Preformatted for MATLAB as heart.mat.

Heart disease dataset

matlab_demo_05.m

MATLAB interlude

Advantages:

– Makes no assumptions about distributions of classes in feature space

– Easily extended to multiple classes (multinomial regression)

– Natural probabilistic view of class predictions

– Quick to train

– Very fast at classifying unknown records

– Good accuracy for many simple data sets

– Resistant to overfitting

– Can interpret model coefficients as indicators of feature importance

Disadvantages:

– Linear decision boundary

Logistic regression
