MARE 250 Dr. Jason Turner Correlation & Linear Regression.

15
MARE 250 Dr. Jason Turner Correlation & Linear Regression
  • date post

    22-Dec-2015
  • Category

    Documents

  • view

    218
  • download

    1

Transcript of MARE 250 Dr. Jason Turner Correlation & Linear Regression.

Page 1: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

MARE 250Dr. Jason Turner

Correlation & Linear Regression

Page 2: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

Means Tests Vs. AssociationsMeans tests – t-test, ANOVA – test for differences between/among means (Responses among/between factors)

Associations – tests for relationships between/among variables (responses)

Page 3: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

Linear RegressionLinear regression investigates and models the linear relationship between a response (Y) and predictor(s) (X)Both the response and predictors are continuous variables (“Responses”)

Linear regression analysis is used to: - determine how the response variable changes as

a particular predictor variable changes

- predict the value of the response variable for any value of the predictor variable

Page 4: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

Regression vs. CorrelationLinear regression investigates and models the linear relationship between a response (Y) and predictor(s) (X)Both the response and predictors are continuous variables (“Responses”)

Correlation coefficient (Pearson) – measures the extent of a linear relationship between two continuous variables (“Responses”)

Page 5: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

When Regression vs. Correlation?Linear regression - used to predict relationships, extrapolate data, quantify change in one versus other is weighted direction

Correlation coefficient (Pearson) – used to determine whether there is a relationship or not

IF Regression – then it matters which variable is the Response (Y) and which is the predictor (X)

Y – (Dependent variable) X – (Independent)X causes change in Y (Y outcome dependent upon X)Y Does Not cause change in X (X –Independent)

Page 6: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

Linear RegressionRegression provides a line that "best" fits the data (from response & predictor)

The least-squares criterion (method used to draw this "best line“) requires that the best-fitting regression line is the one with the smallest sum of the squared error terms (the distance of the points from the line).

Page 7: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

Linear Regression

The R2 and adjusted R2 values represent the proportion of variation in the response data explained by the predictors

Adjusted R2 is a modified R2 that has been adjusted for the number of terms in the model. If you include unnecessary terms, R2 can be artificially high

Page 8: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

y

Is This Them? Are These They?

y = b0 + b1xy = dependent variable

b0 + b1 = are constants

b0 = y intercept

b1 = slope

x = independent variable

Urchin density = b0 + b1(salinity)

Page 9: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

Effects of OutliersOutliers may be influential observations

A data point whose removal causes the regression equation (line) to change considerably

Consider removal much like an outlier

If no explanation – up to researcher

Page 10: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

Warning on Regression

Regression is based upon assumption that data points are scattered about a straight line

What can we do to determine if a Regression is warranted?

Page 11: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

Correlation Coefficient (r)(Pearson) – measures the extent of a linear relationship between two continuous variables (responses)

Pearson correlation of cexa Ant and cexa post = 0.811P-Value = 0.000

IF p < 0.05 THEN the linear correlation between the two variables is significantly different than 0

IF p > 0.05 THEN you cannot assume a linear relationship between the two variables

Correlation Coefficient

Page 12: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

Correlation Coefficient

Page 13: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

“R2 D2 it is you, it is you”

Coefficient of Determination (R2) - Expression of the proportion of the total variability in the response (s) attributable to the dependence of all of the factors

R2 – used for assessing the “goodness of fit” of a regression model

Should use Adjusted R2 as it is a more conservative measure

R2 values range from 0 to 100%. An R2 of 100% means that all of the variability in the data can be explained by the model

Page 14: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

Coefficient Relationships

The coefficient of determination (r2) is the square of the linear correlation coefficient (r)

Page 15: MARE 250 Dr. Jason Turner Correlation & Linear Regression.

Next Week

Regression Analysis: _ Urchins versus % Rock

The regression equation is_ Urchins = - 0.557 + 0.0361 % Rock

Predictor Coef SE Coef T PConstant -0.5569 0.3820 -1.46 0.146% Rock 0.036116 0.0062 5.80 0.000

S = 3.27363 R-Sq = 11.0% R-Sq(adj) = 10.6%