Forecasting Crude Oil Price (Revisited )

Forecasting Crude Oil Price (Revisited)

Imad Haidar* and Rodney C. Wolff * PhD student

The University of Queensland, Brisbane, QLD 4072, Australia

E-mail: [email protected], [email protected]

mailto:[email protected]

mailto:[email protected]

This paper attempts to answer the following questions:

• What type of dynamics is governing crude oil prices and returns? – Specifically, we investigate if there is any non-linear deterministic

dynamics (chaos) which could be miss specified as a random walk.• From a statistical point of view have the dynamics of crude oil

returns changed significantly during the past twenty years?• Do we have strong empirical evidence that crude oil spot

returns are predictable in the short-term? • Can we forecast the direction of crude oil return for multi-

steps ahead?

Data• Crude oil daily spot prices/ returns for West Texas

Intermediate (WTI),• official closing price are from 2 January 1986 to 2 March 2010

(6194 daily observation).• The data were retrieved on 11 March 2010 from the Energy

Information Administration (EIA).

Diagnostic Tests

• The autocorrelation (ACF) is much more evident in the squared log-returns, especially for return II;

• AFC was significantly over the upper confidence level. This could present evidence of heteroskedasticity.

• The Augmented Dicky-Fuller and Phillips–Perron test for crude oil price and return at 1% significant level are as follows:– Crude oil price for the whole series from January 1986 to March 2010

is integrated of the first order, or I(1). – Crude oil price for the first subsection from January 1986 to January

1998 is I(0).– Crude oil price for the second subsection from end of January 1998-

March 2010 is I(1),– The returns for all subsections are I(0).

Testing for non-linearityThree tests:

• The Brock, Dechert and Scheinkman (BDS) test (Brock et al. 1996);

• The Fuzzy Classification System (FCS), by Kaboudan (1999);• The Time Domain Test for Non-linearity, by Barnett and Wolff

(2005);

The BDS test

• The Correlation Integral measures how often a temporal pattern appears in the data.

• The null hypothesis is that the data are pure whiteness (iid).

m

nmBnmW

),,(),,(

]),,1(),,([),,( mnCnmCnnmB

where

The BDS testReturn I

ε =0.5 ε =1 ε =1.5 ε =2m W SIG W SIG W SIG W SIG2 63.28235 0 32.41485 0 28.294 0 24.21126 05 277.9648 0 63.43841 0 41.301 0 34.10912 08 1569.915 0 113.0694 0 51.747 0 38.16579 0

Return IIε =0.5 ε =1 ε =1.5 ε =2

m W SIG W SIG W SIG W SIG2 69.901 0 30.065 0 21.751 0 19.484 05 370.17 0 80.356 0 37.184 0 30.265 08 2574.9 0 194.74 0 57.465 0 36.404 0

The BDS testReturn III

m W SIG2 16.08305 05 30.51802 08 47.82609 0

The FCS test

M

m ss

yy

emCemCemCemC

M2 21

211

),(ln),(ln),(ln),(ln

)1(

2RARIMACreate fuzzy Membership rules

75.085.075.095.085.0

195.01

0)1.0/)85.0(1(

1)05.0/)95.0(

0

Class Membership class Degree of Membership Rule no.

3938373635

Source: Kaboudan (1999)

90.0

90.070.0

70.0

2

2

2

R

R

R

1)2.0/)90.0(1(

02R

321

SL

HNNL

The FCS results

Data set Fitted ARIMA R2 θ DecisionOil price (all) Simple 0.99 0.96 SL-NL-HNOil price I ARIMA(3,1,3) 0.90 0.94 SL-NL-HNOil price II Simple 0.90 0.87 SL-NL-HNOil return (all) ARIMA(0,0,5) 0.005 0.96 NL-NL-HNOil return I ARIMA(3,0,3) 0.01 0.93 NL-NL-HNOil return II ARIMA(2,0,0) 0.02 0.99 NL-WN3-MA return II* ARIMA (2,0,6) 0.77 0.00 SL-NLWavelet return II**

ARIMA (3, 0, 3) 0.018 0.29 WL-NL

SL: strongly linear; NL: non-linear; HN: high noise; WN: white noise. * is smoothed return II with a simple three days moving average. ** is filtered return II using a wavelet filter.

The FCS over time

Testing for Chaos

• Chaos is characterize by sensitivity of a time series to the changes in the initial condition.

• Lyapunov exponents (LE) is a quantitative measure of the existence of chaos.

– Let be a dynamical system where are the data, e is a random error and f is a non-linear function. Also, let be the Jacobian matrix of f, and

– Lyapunov exponents λ are estimated as – – where represents the largest eigenvalue of the Jacobian matrix

Lyapunov exponents Return Lyapunov Exponent 99% Confidence level

1000 times bootstrapm λ Highest Lowest

d 1 0.0924 1.45E-18 0d 2 0.1951 0.063865 -2.20E-20d 3 3.48E-19 0.078794 -1.10E-19d 4 0.0204 0.176158 0.021725d 5 0.0318 1.875162 -1.30E-18d 6 9.74E-20 0.648368 -0.0502

3MA return Lyapunov Exponent 99% Confidence level 1000 time bootstrap

m λ1 Highest Lowestd 1 0.07 1.32E-18 0d 2 0.3405 0.062429 -3.10E-20d 3 0.1101 0.046871 -8.40E-20d 4 0.0766 0.127102 -1.40E-19d 5 0.1353 2.08148 -9.30E-19d 6 2.82E-19 0.594717 -0.03211

Lyapunov exponents

• We cautiously conclude that, the dynamics of crude oil returns series are non-linear deterministic, possibly chaotic.

• This conclusion contradicts the findings of Moshiri and Foroutan (2006) in which they found no evidence of chaos in crude oil futures price.

• It is important to note that Moshiri and Foroutan (2006) were testing LE using raw price of crude oil futures contracts and not the spot return.

FORECASTING

We use three types of Models:• ARIMA• EGARCH• ANN

ANN

• ANN were designed in an attempt to imitate the human brain functionality;

• the fundamental idea of ANN is to learn the desirable behaviour from the data with no a priori assumptions.

• From an econometrics view, ANN falls in the non-linear, non-parametric, and multivariate group of models (Grothmann 2002).

• This makes it a suitable approach to model non-linear relationship in high dimensional space.

Input layer Hidden layer Output layer

ANN (cont.) ji

j

iji wxbu

)( ii ufy

Σ

ws1

wsn

ws3

ws2

xn

x3

x2

X1

b

Input

Summing junction

Base

OutputActivation function

weights

y

b

• Training neural network in the backpropagation paradigm involves continuous change to the values of the network parameters in the direction that reduces the error between the input and the target

• • • • where w is the network weights, t is

the current step, T the number of iterations, λ is the learning rate (step size), is the gradients of the error surface, E the global error

ANN resultsOut-of-sample Benchmark Squared Wav1 Wav2 3 MA

Hit rate (%) 48.82 55.12605 61.99 74.97 79.67

RMSE 0.0356 0.032827 0.0337 0.0196 0.0131

R2 0.0182 0.004447 0.0293 0.0837 0.6219

IC 0.7529 0.965628 0.7554 0.9693 0.806

MSE 0.0013 0.001164 0.0011 0.0004 0.0002

MAE 0.0253 0.02222 0.0233 0.0114 0.0093

SSE 0.3756 0.27694 0.3208 0.1127 0.0829

DA -0.4304 1.51874 4.011 8.4024 13.45

P-value 0.6665 0.091106 0.0008 0 0

ANN results (cont.)

Metric 3 days MA 5 days MA

ANN RW ANN RW

Hit rate 79.67 72.40265 83.52 80.97166

RMSE 0.0131 0.012578 0.0066 0.011199

R2 0.6219 0.408819 0.7302 0.611197

Multi-steps Forecast



)(

)(

)1(ˆ

qt

t

t

y

y

y


)(ˆ ty

)1(

)1(

)2(

ˆ

ˆ

qt

t

t

y

y

y

)(ty

where q is the number of lagsand is the forecast horizon

Multi-steps Forecast

Forecast horizon Hit rate renege Confidence limit Mean hit rate for 1000 tests

19 days 52-60% 95% 56%

20 days 52-60% 95% 56%

21 days 52-59% 95% 56%

22 days 52-58% 95% 55%

23 days 51-57% 95% 54%

24 days 52-58% 95% 54%

25 days 52-58% 95% 55%

Conclusion

• The BDS statistic indicates the existence of non-linear behaviour in all crude oil prices and returns subseries .

• The FCS test also suggests that the dynamics of crude oil series are non-linear stochastic.

• Finally, the Lyapunov exponents for crude oil returns (and smoothed returns) highlights the possibility of low dimensional deterministic dynamics, i.e., chaos.

• The Lyapunov exponent results could explain the random-walk like behaviour of the crude oil return.

Conclusion (cont.)

• Several data transformations and smoothing with the hope that we could reduce the noise and highlight certain dynamics, such as mean reversion.

• Our empirical results showed that some of these measures are effective in improving the forecast accuracy.

• We show that for smoothed data multi-step forecasting is possible (for 19-25 steps ahead) with reasonable accuracy.

Thank you

Forecasting Crude Oil Price (Revisited )

Documents

Transcript of Forecasting Crude Oil Price (Revisited )