Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time...

20
Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present? What do lag scatter plots tell us? What does ACF tell us? Conduct the Durbin-Watson test

Transcript of Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time...

Page 1: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Nonstationary Time Series Examples

• Consider the Chemical Process Viscosity data set

• Is this time series stationary? Why or why not?• Time effect present?• What do lag scatter plots tell us?• What does ACF tell us?• Conduct the Durbin-Watson test

Page 2: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

The Durbin-Watson Test

Page 3: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Nonstationary Time Series Examples

• We want to predict the viscosity at time periods 101, 102, and 103.• Since data is nonstationary, time period is

important.• Could use a simple linear model to make the

predictions:

Page 4: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Nonstationary Time Series Examples

• We want to predict the viscosity at time periods 101, 102, and 103.

Note that in Minitab under Time Series the predictions won’t come with intervals. But, under regression they will.

Page 5: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Nonstationary Time Series Examples

• How do we know how well our model works compared to other models?• Mean Absolute Percentage Error (MAPE)• Mean Absolute Deviation (MAD)• Mean Squared Deviation (MSD)

For all three measures, smaller values indicate a better fitting model.

(hand outs)

Page 6: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Nonstationary Time Series Examples

The measures MAPE, MAD, and MSD are not very informative by themselves, but can be used to compare the fits obtained by using different models.

Mean absolute percentage error (MAPE) – Expresses accuracy as a percentage of the error. Because this number is a percentage, it may be easier to understand than the other statistics. For example, if the MAPE is 5, on average the forecast is off by 5%.

Mean absolute deviation (MAD) – Expresses accuracy in the same units as the data, which helps conceptualize the amount of error. Outliers have less of an affect on MAD than on MSD.

Mean squared deviation (MSD) – A commonly-used measure of accuracy of fitted time series values. Outliers have more influence on MSD than MAD.

Page 7: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Nonstationary Time Series Examples

• Minitab will compute MAPE, MAD, and MSD for this model:

Page 8: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Nonstationary Time Series Examples

• Now, fit the quadratic model and compare the MAPE, MAD, and MSD values to the linear model.

How do the predictions and prediction intervals for this model compare to the linear model?

Page 9: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Smoothing Techniques

Consider the Chemical Process Viscosity data set and in Minitab go to Time Series > Moving Average.

Choose a moving average of length 4

Look at the graph and comment

Page 10: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Smoothing Techniques

Consider the Chemical Process Viscosity data set and in Minitab go to Time Series > Moving Average.

Use a moving average of length 4 and predict the 101, 102, and 103 time period viscosities.

What do you notice?

Page 11: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Comparing Models

Compare predicting viscosity at time period 101 using a linear, quadratic, and moving average model with moving average length of your choice.

Which model do you prefer? Why?Did you check your residuals plots?

Page 12: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Comparing Models

Consider the Cheese data set again and compare predicting production in 1998 using a linear, quadratic, and moving average model with moving average length of your choice.

Which model do you prefer? Why?Did you check your residuals plots?

Page 13: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Comparing Models

Consider the Champagne data set again and compare predicting sales in January, 1970 using:• Linear• Quadratic• Linear or quadratic with month dummy variables• Moving average model with moving average length of

your choice (notice this data set is very sensitive to moving average length).

Which model do you prefer? Why?Did you check your residuals plots?

Page 14: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Comparing Models

Consider the Violent Crime Rate data set and model the data.

Who has the best model?

Remove the last observation from your data set and predict it. Who’s prediction is best?

Page 15: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Comparing Models

Consider the drowning children data set and model the data.

Who has the best model?

Remove the last observation from your data set and predict it. Who’s prediction is best?

Page 16: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Stepwise Regression

Open the data set Wage2

There are a lot of potential predictor variables to include in a model. One way to deal with this is stepwise regression.

Page 17: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Stepwise Regression

Response is: WagePotential predictors: Hours, IQ, KWW, educ, exper, tenure, age, sibs, brthord, meduc, feduc

Page 18: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Stepwise Regression

Forward selection, which involves starting with no variables in the model, trying out the variables one by one and including them if they are 'statistically significant'.

Backward elimination, which involves starting with all candidate variables and testing them one by one for statistical significance, deleting any that are not significant.

Methods that are a combination of the above, testing at each stage for variables to be included or excluded.

Page 19: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Forward or Backward?

If you have a very large set of potential independent variables from which you wish to extract a few, you should generally go forward.

If you have a modest-sized set of potential variables from which you wish to eliminate a few, you should generally go backward.

Page 20: Nonstationary Time Series Examples Consider the Chemical Process Viscosity data set Is this time series stationary? Why or why not? Time effect present?

Best Subsets Regression

The ultimate in fishing trips. Minitab examines all possible subsets of the predictors, beginning with all models containing one predictor, and then all models containing two predictors, and so on. By default, Minitab displays the two best models for each number of predictors.