Topic 12: Multiple Linear Regression

Outline

• Multiple Regression– Data and notation–Model – Inference

• Recall notes from Topic 3 for simple linear regression

Data for Multiple Regression

• Yi is the response variable• Xi1, Xi2, … , Xi,p-1 are p-1

explanatory (or predictor) variables • Cases denoted by i = 1 to n

Multiple Regression Model

• Yi is the value of the response variable for the ith case

• β0 is the intercept• β1, β2, … , βp-1 are the regression

coefficients for the explanatory variables

0 1 1 2 2 1 , 1...i i i p i p iY X X X

Multiple Regression Model

• Xi,k is the value of the kth explanatory variable for the ith case

• ei are independent Normally distributed random errors with mean 0 and variance σ2

0 1 1 2 2 1 , 1...i i i p i p iY X X X

Multiple Regression Parameters

• β0 is the intercept• β1, β2, … , βp-1 are the regression

coefficients for the explanatory variables

• σ2 the variance of the error term

Interesting special cases

• Yi = β0 + β1Xi + β2Xi2 +…+ βp-1Xi

p-1+ ei

(polynomial of order p-1)• X’s can be indicator or dummy variables

taking the values 0 and 1 (or any other two distinct numbers)

• Interactions between explanatory variables (represented as the product of explanatory variables)

Interesting special cases• Consider the model Yi= β0 + β1Xi1+ β2Xi2+β3X i1Xi2+ ei

• If X2 a dummy variable–Yi = β0 + β1Xi + ei (when X2=0)

–Yi = β0 + β1Xi1+β2+β3Xi1+ ei (when X2=1)

= (β0+β2) + (β1+β3)Xi1+ ei

–Modeling two different regression lines at same time

Model in Matrix Form

n 1 n p n 1p 1

~N(0, )

~ ( , )

Y N X I

Least Squares

YXXbX equations normal Obtain

)XbY()XbY(SSE minimize to b Find

Least Squares Solution

Fitted (predicted) values

YX)XX(b 1

HY YX)XX(XXbY 1

Residuals

Y)HI( HYY YYe

is symetric and idempotent ( )( ) ( )

I H I H I H

Covariance Matrix of residuals

• Cov(e)=σ2(I-H)(I-H)΄= σ2(I-H)

• Var(ei)= σ2(1-hii)

• hii= X΄i(X΄X)-1Xi

• X΄i =(1,Xi1,…,Xi,p-1)

• Residuals are usually correlated

• Cov(ei,ej)= -σ2hij

Estimation of σ2

n( ) ( )

pSSE MSEdf

s s Root MSE

Y Xb Y Xb

Distribution of b• b = (X΄X)-1X΄Y• Since Y~N(Xβ, σ2I)• E(b)=((X΄X)-1X΄)Xβ=β• Cov(b)=σ2 ((X΄X)-1X΄)((X΄X)-1X΄)΄

=σ2(X΄X)-1

• σ2 (X΄X)-1 is estimated by s2 (X΄X)-1

ANOVA Table• Sources of variation are –Model (SAS) or Regression (KNNL)–Error (SAS, KNNL) or Residual–Total

• SS and df add as before–SSM + SSE =SSTO–dfM + dfE = dfTotal

Sums of Squares

i ii 1

ˆSSM Y Y

ˆSSE (Y Y )

SSTO (Y Y)

Degrees of Freedom

Mean Squares

SSTO/df MSTSSE/df MSESSM/dfMSM

Mean Squares

i ii 1

ˆMSM Y Y / ( 1)

ˆMSE (Y Y ) / (n )

MST (Y Y) / (n 1)

ANOVA Table

Source SS df MS F

Model SSM dfM MSM MSM/MSE

Error SSE dfE MSE Total SSTO dfTotal MST

ANOVA F test

• H0: β1 = β2 = … = βp-1 = 0

• Ha: βk ≠ 0, for at least one k=1,., p-1

• Under H0, F ~ F(p-1,n-p)

• Reject H0 if F is large, use P-value

P-value of F test• The P-value for the F significance test

tells us one of the following:– there is no evidence to conclude that

any of our explanatory variables can help us to model the response variable using this kind of model (P ≥ .05)–one or more of the explanatory

variables in our model is potentially useful for predicting the response variable in a linear model (P ≤ .05)

• The squared multiple regression correlation (R2) gives the proportion of variation in the response variable explained by all the explanatory variables

• It is usually expressed as a percent• It is sometimes called the coefficient of

multiple determination (KNNL p 226)

• R2 = SSM/SST– the proportion of variation explained

• R2 = 1 – (SSE/SST)– 1 – the proportion not explained

• Can express F test is terms of R2

F = [ (R2)/(p-1) ] / [ (1- R2)/(n-p) ]

Background Reading

• We went over KNNL 6.1 - 6.5

Topic 12: Multiple Linear Regression

Documents

Transcript of Topic 12: Multiple Linear Regression

1 Topic 2 – Simple Linear Regression KKNR Chapters 4 – 7.

Lecture 11: Regression Methods I (Linear Regression)math.arizona.edu/~hzhang/math574m/2017Lect11_lm.pdf · Lecture 11: Regression Methods I (Linear Regression) 7/40. Linear Model

Topic 12: Multiple Linear Regression. Outline Multiple Regression –Data and notation –Model –Inference Recall notes from Topic 3 for simple linear regression.

Linear regression: Part 1 - NTNU · Linear regression: Part 1. Lecture Outline What are linear models?-EX1: What is the ‘best’ line? What is linear regression? ... Linear regression

Linear and Non-Linear Regression: Powerful and Very ...business.expertjournals.com/ark:/16759/EJBM_322... · Keywords: Linear Regression, Non-Linear Regression, Best-Fitting Model,

Linear Regression - Wharton Finance - Finance …finance.wharton.upenn.edu/.../Linear-Regression-Slides.pdf · 2009-10-07 · regression model, 2-variable linear regression model)

New Linear Regression (Part II) · 2020. 9. 23. · Linear Regression (Part II) nanopoulos@ismll.de Outline • Generalize the linear regression • Simple multiple regression •

Multiple Linear Regression - Analysis Made Easy · Multiple Linear Regression The MULTIPLE LINEAR REGRESSION command performs simple multiple regression using least squares. Linear

Polynomial Functions Topic 2: Linear Equations and Regression.

Two-Variable Analysis: Simple Linear Regression/ Correlationmeonline.engin.umich.edu/courses/blackbelt/6s... · II. Simple Linear Regression • Simple Linear Regression examines

1 Curve-Fitting Polynomial Interpolation. 2 Curve Fitting Regression Linear Regression Polynomial Regression Multiple Linear Regression Non-linear Regression.

Linear Regression - SAS · MULTIPLE LINEAR REGRESSION SAS/STAT SYNTAX & EXAMPLE DATA ... SIMPLE LINEAR REGRESSION ... title 'Best Models Using All-Regression Option'; run;

Regression analysis Linear regression Logistic regression.

Chapter 11 Multiple Linear Regression Chapter 11 Multiple Linear Regression.

TOPIC 12 Simple Linear Regression and Correlation.

Basic Statistics Linear Regression. X Y Simple Linear Regression.

Mathematics Instructional Plan Linear Regression...Linear Regression Strand: Algebra and Functions Topic: Determining lines of best fit Primary SOL: AFDA.3 The student will collect

Chapter 3 Multiple Linear Regression Model - IITKhome.iitk.ac.in/~shalab/regression/Chapter3-Regression-Multiple...Chapter 3 Multiple Linear Regression Model ... So a simple linear

Course Introduction / Review of Linear Regression and ...faculty.washington.edu/cadolph/panUW/topic1.pw.pdfOutline for Topic 1 1.Review of linear regression (notation, estimation)

LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.