A H YBRID A RTIFICIAL N EURAL N ETWORK M ODEL WITH L INEAR & N ONLINEAR C OMPONENTS Yolcu Ufuk,...

A HYBRID ARTIFICIAL NEURAL NETWORK MODEL WITH LINEAR & NONLINEAR COMPONENTS

Yolcu Ufuk, ([email protected])

Department of Statistics, Giresun University, Giresun 28000, Turkey,

Egrioglu Erol ([email protected])

Department of Statistics, Ondokuz Mayis University, Samsun 55139, Turkey

Aladag Cagdas H. ([email protected])

Department of Statistics, Hacettepe University, Ankara 06800, Turkey

mailto:[email protected]



CONTENT

1. Introduction

2. The Proposed Method (L&NL-ANN

Model)

3. Application of L&NL-ANN Model

3.1. Data Set 1

3.2. Data Set 2

4. Conclusions

INTRODUCTION

In the literature, while linear models such as autoregressive integrated moving average (ARIMA; [2]) have being used for linear time series, nonlinear models such as artificial neural networks (ANN), bilinear, and threshold autoregressive (TAR; [15]) have being preferred for nonlinear time series.

It is a well-known fact that real life time series can generally contain both linear and non-linear components.

It is almost impossible that a time series is pure linear or pure nonlinear.

For some time series, linear models can

produce satisfactory results when linear part of

the time series is superior to nonlinear part.

In a similar way, when nonlinear part of the

time series is superior to linear part, nonlinear

models can give satisfactory results.

However, in both case, one of these parts is

not taken into consideration.

Thus, it can lead to deceptive results.

To deal with this problem, various hybrid

approaches have been suggested in the

literature.

Tseng et al. (2002) proposed a hybrid forecasting

model, which combines the seasonal ARIMA

(SARIMA) and ANN [16].

Zhang (2003) improved a hybrid model based on

ARIMA and ANN [20].

Pai and Lin (2005) combined ARIMA and support

vector machines (SVM) [12].

Chen and Wang (2007) combined SARIMA and

SVM [4].

Aladag et al. (2009) combined ARIMA and Elman

neural networks [1].

Lee and Tong (2011) combined ARIMA and

genetic algorithms [9].

Besides, Ince and Traffalis (2006) proposed a hybrid

model which incorporates parametric techniques

such as ARIMA, vector autoregressive (VAR) and

co-integration techniques, and nonparametric

techniques such as support vector regression

(SVR) and ANN [6].

Wang et al. (2006) introduced some hybrid

approaches which are called threshold ANN,

cluster based ANN, and periodic ANN [18].

BuHamra et al. (2003) and Jain and Kumar (2007)

suggested hybrid approaches in which the inputs

of ANN are determined by Box-Jenkins procedure

([3]; [7]).

In addition to these studies, hybrid approaches

combine SARIMA and ANN also proposed to

analyze fuzzy time series by Egrioglu et al. (2009)

and Uslu et al. (2010) ([5]; [17]).

These hybrid approaches generally consist of two

phases.

After linear component of time series is modeled

with a linear model in the first phase, nonlinear

component is modeled by utilizing a nonlinear

model in the next phase.

In two-phase methods, it is assumed that time

series has only linear structure in the first phase

and it is assumed that time series has only

nonlinear structure in the second phase.

Therefore, this causes model specification error.

In this study, a new ANN model composed of both linear and nonlinear structures is proposed to deal with this problem and to increase forecasting accuracy.

Therefore, this method has the ability to model both linear and nonlinear parts in time series at the same time.

In the proposed model, Multiplicative and Mc Culloch-Pitts neuron structures are used for nonlinear and linear parts, respectively.

In addition, the modified particle swarm optimization (MPSO) method is used to train the proposed neural network model.

To show the applicability of the proposed method,

it is applied to two real life time series in the

implementation.

For the aim of comparison, the obtained results

are compared to those calculated from other

approaches available in the literature.

As a result of the implementation, it is seen that

the proposed method has the best forecasting

accuracy.

2. THE PROPOSED METHOD

Hybrid models include advantages of both linear

and nonlinear models have also been proposed in

the literature. In hybrid approaches, it is assumed

that the time series is composed of sum of linear

and nonlinear parts and can be defined by

where yt, Lt, and Nt represent the time series, the linear

part and the nonlinear part of the time series,

respectively.

In the literature, Zhang (2003), Pai and Lin (2005), Chen and Wang (2007), Aladag et al. (2009) and Lee and Tong (2011) employed two-phase hybrid approaches.

After the linear part of time series is modeled in the first phase, by assuming that residuals obtained in the first phase contain the nonlinear part, these residuals are analyzed with nonlinear models in the second phase.

Employing a linear model in the first phase means that nonlinear relations are not taken into consideration.

This situation causes model specification error.

To overcome this problem, one-phase method

which can simultaneously analyze both linear and

nonlinear structures is needed when time series

given in (1) are analyzed.

Therefore, a novel linear & nonlinear artificial

neural network (L&NL-ANN) model is proposed in

this study.

The broad structure of the proposed model is

illustrated in Fig. 2.1.

Fig. 2.1 The architecture of L&NL-ANN model

In Fig. 2.1, ∑ and ∏ represent neuron models Mc

Culloch and Pitts, and multiplicative neuron

models, respectively. The functions f1 and f2 are

given in (2) and (3), respectively.

As seen in Fig. 2.1, L&NL-ANN model includes two

components linear and nonlinear.

W1 is a vector that includes the weights between

the inputs of the linear component and neurons

in the hidden layer corresponding to the linear

part of the model.

Similarly, the vector W2 contains the weights

between the inputs of the nonlinear component

and neurons in the hidden layer corresponding to

the nonlinear part of the model.

Each component has m inputs so both W1 and W2

are mx1. The vector W3, which is 2x1, consists of

two weights which are used to combine outputs

calculated from linear and nonlinear components.

Thus, calculation of the output of L&NL-ANN

model is given in three stages.

Stage 1. The output value of the neuron in the hidden

layer corresponding to the linear component (o1) is

calculated. Firstly, the activation value net1 for the

neuron is obtained by using the formula given as

follows:

where w1j (j=1,2,…,m) are elements of W1, and b1 is bias

weight for the linear part. The activation function used

in this neuron is f1 given in (2) so the output value o1 is

calculated by

Stage 2. The output value of the neuron in the hidden

layer corresponding to the nonlinear component (o2) is

calculated. Before calculating o2, the activation value

net2 for the neuron is computed by using the formula

given in (6).

where w2j, and b2j (j=1,2,…,m) are elements of W2, and

bias weight values for the nonlinear part. The

activation function used in this neuron is f2 given in (3)

so the output value o2 is calculated by

Stage 3. The output value of the model is

calculated. First of all, the activation value net3

for the neuron in the output layer is obtained by

using the formula given in (8).

In (8), b3 is bias weight. Then, the output value is

computed as follows:

As seen from (9), the output value is obtained from the

weighted sum of linear and nonlinear autoregressive

models. Unlike the model given in (1), L&NL-ANN

model can be expressed as in (10).

It should be noted in here that in the model given

in (1), weights of linear and nonlinear

components are equal. However, in L&NL-ANN

model, these weights are determined during the

optimization process of ANN due to the structure

of the data.

In the proposed approach, L&NL-ANN model, which

is also defined in this study, is trained using

MPSO method. In the MPSO, positions of a

particle are weights of L&NL-ANN model. Hence,

a particle has 3m + 4 positions. Structure of a

particle is illustrated in Fig. 2.2.

Fig. 2.2 Structure of a particle

Mean square error (MSE), which is a well-known

forecasting performance criterion, is used as

evaluation function. MSE can be calculated using

the formula given in (11).

where n represents the number of learning sample. The

algorithm for calculation of the output value of the

proposed L&NL-ANN model is presented below.

Algorithm: The algorithm for calculation of the

output value of the proposed L&NL-ANN model.

Step 1. The parameters of MPSO are determined. In

the first step, the parameters which direct the

MPSO algorithm are determined. These

parameters are pn, vm, c1i, c1f, c2i, c2f, w1, and w2

that were given in the [10] and [13].

Step 2. Initial values of positions and velocities

are determined. The initial positions and

velocities of each particle in a swarm are

randomly generated from uniform distribution

(0,1) and (-vm,vm), respectively.

Step 3. Evaluation function values are computed.

Evaluation function values for each particle are

calculated. MSE given in (11) is used as

evaluation function.

Step 4. Pbestk (k = 1,2, …, pn) and Gbest are

determined due to evaluation function values

calculated in the previous step. Pbestk is a vector

stores the positions corresponding to the kth

particle’s best individual performance, and Gbest

is the best particle, which has the best evaluation

function value, found so far.

Step 5. The parameters are updated. The

updated values of cognitive coefficient c1, social

coefficient c2, and inertia parameter w are

calculated like in [10] and [13].

Step 6. New values of positions and velocities

are calculated. New values of positions and

velocities for each particle are computed like in

[10] and [13]. If maximum iteration number is

reached, the algorithm goes to Step 3; otherwise,

it goes to Step 7.

Step 7. The optimal solution is determined. The

elements of Gbest are taken as the optimal

weight values of the L&NL-ANN.

3. APPLICATION OF L&NL-ANN MODEL

In order to evaluate the performance of the

proposed approach based on the L&NL-ANN

model, which also defined in this study, and the

MPSO algorithm, the proposed approach is

applied to two real time series in the

implementation. In all computations, MATLAB

version 7.12.0 (R2011a) computer package was

used.

3.1 DATA SET 1

The first time series is the amount of carbon

dioxide measured monthly in Ankara capitol of

Turkey (ANSO) between March 1995 and April

2006.

It has both trend and seasonal components and its

period is 12.

The first 124 observations are used for training and

the last 10 observations are used for test set.

In addition to the proposed approach,

Seasonal Autoregressive Integrated Moving

Average (SARIMA),

Winters Multiplicative Exponential Smoothing

(WMES),

Feed Forward Neural Networks (FFANN)

methods and the fuzzy time series forecasting

methods proposed by Song [14], Egrioglu [5] and

Uslu [17] are used to analyze ANSO data.

For the test set, the forecasts and root mean

square error (RMSE) values produced by all

methods are summarized in Table 3.1.

Table 3.1 The obtained results for ANSO data

The results of the methods SARIMA, WMES, Song

[14], Egrioglu et al. [5] and Uslu et al. [17] were

taken from Uslu et al.s’ study.

Test Data

SARIMA WMESSong

[14]Egrioglu et al. [5]

Uslu et al. [17]

FFANN L&NL-ANN

21 22.9300 15.4000 41.6667 20 22.7536 24.0916 25.417327 22.3500 16.1100 27.5000 30 22.7536 24.1705 25.735825 23.6100 17.7700 41.6667 20 22.7536 24.6201 27.640628 28.8100 25.1200 41.6667 30 22.7536 25.9042 29.477538 46.9700 41.1100 41.6667 30 42.0558 47.0788 37.604445 54.6200 46.1200 46.7857 50 42.0558 44.2092 40.202338 58.1300 49.8000 45.0000 40 42.0558 38.4641 40.684636 46.9900 44.2400 46.7857 30 42.0558 34.7330 34.937824 37.8500 31.9600 46.7857 30 22.7536 28.5170 28.602722 24.7600 18.3900 27.5000 20 22.7536 25.5381 26.7366

RMSE 9.6249 7.1062 12.7410 4.5607 3.6611 3.7402 3.2465

When FFANN is used, the numbers of neurons in

both the hidden and input layers are changed

from 1 to 12 and one output neuron is employed.

The best architecture among them is found as the

architecture contains 12 neurons in the input

layer and one neuron in the hidden layer.

Thus, the inputs of the best FFANN model are the

lagged variables Xt-1, Xt-2, …, Xt-12. Inputs of L&NL-

ANN model are taken as Xt-1, Xt-2, …, Xt-12 like in

the FFANN model.

And, in the training process of L&NL-ANN model,

the parameters of the modified particle swarm

optimization are determined as follows:(c1i, c1f) =

(2, 3), (c2i, c2f) = (2, 3), (w1, w2) = (1, 2), pn = 30, and

maxt = 1000.

According to Table 3.1, the proposed approach has

the best forecasting accuracy for ANSO data in

terms of RMSE.

To examine the results visually, the graph of the

real observations and the forecasts produced by

the proposed approach for test set is given in Fig.

3.1

Fig. 3.1 The graph of observations and forecasts for test data

As clearly seen from this graph, the proposed

approach produces very accurate forecasts for

ANSO data.

3.1 DATA SET 2

L&NL-ANN model is also applied to Canadian lynx

data consisting of the set of annual numbers of

lynx trappings in the Mackenzie River District of

North-West Canada for the period from 1821 to

1934.

This data has also been extensively analyzed in the

time series literature.

We use the logarithm (to the base 10) of the data in

the analysis.

In addition to the proposed approach, logarithm of

Canada lynx data is examined by using the methods

proposed by Zhang [20], Katijani et al. [8], Pai and

Lin[12], Aladag et al. [1].

When the proposed method is used, the order of the

L&NL-ANN model is m=3.

MSE values obtained from the methods are presented

in Table 3.2.

Table 3.2 The obtained MSE values for Test Data of Logarithmic Canadian Lynx

Data.ARIMA FANN

Zhang [20]

Kajitani et al.[8]

Pai and Lin [12]

Aladag et al.[1]

L&NL-ANN

0.015 0.02 0.017 0.014 0.035 0.009 0.006

As seen from Table 3.2, the best forecasts are

obtained when L&NL-ANN model is used.

The graph of the real observations and the

forecasts obtained from the proposed approach

for test set is given in Fig. 3.2.

Fig. 3.2 The graph of observations and forecasts for test data of data

set 2

It is clearly seen from the graph that the forecasts

produced by the proposed approach are very

accurate.

5. CONCLUSIONS

It is a well-known fact that real life time series can

contain both linear and nonlinear structures.

In the literature, various hybrid approaches, which

are generally two-phase methods, have been

proposed to deal with such time series.

After linear component of time series is modeled

with a linear model in the first phase, nonlinear

component is modeled by utilizing a nonlinear

model in the next phase.

In two-phase methods, it is assumed that time

series has only linear structure in the first phase

and it is assumed that time series has only

nonlinear structure in the second phase.

Therefore, this causes model specification error.

To overcome this problem and to reach high

forecasting accuracy level, a new ANN model

which can simultaneously analyze both linear and

nonlinear structures is introduced in this study.

This model can be considered as one-phase hybrid

approach.

In the other hybrid approaches available in the

literature, weights of linear and nonlinear

components are equal.

Unlike the other hybrid approaches suggested in

the literature, in the proposed neural network

model, weights of linear and nonlinear

components are determined during the

optimization process of ANN due to the structure

of the data.

In the proposed model, Multiplicative and Mc

Culloch-Pitts neuron structures are used for

nonlinear and linear parts, respectively.

To show forecasting performance of the proposed

method, it is applied to two real life time series in

the implementation.

As a result of the implementation, it is clearly

observed that the proposed method produced the

best forecasts for these two real time series.

REFERENCES1. Aladag, C.H., Egrioglu, E., Kadilar, C.: Forecasting nonlinear time series with a

hybrid methodology, Applied Mathematic Letters, 22, 1467-1470 (2009).

2. Box, G.E.P., Jenkins, G.M.: Time Series Analysis: Forecasting and Control Holdan-Day, San Francisco, CA, (1976).

3. BuHamra, S., Smaoui, N., Gabr, M.: The Box-Jenkins analysis and neural networks: prediction and time series modeling, Applied Mathematical Modeling, 27, 805-815 (2003).

4. Chen, K.Y., Wang, C.H.: A Hybrid SARIMA and support vector machines in forecasting the production values of the machinery industry in Taiwan, Expert Systems with Applications, 32(1), 254-264 (2007).

5. Egrioglu, E., Aladag, C.H., Yolcu, U., Basaran, M.A., Uslu, V.R.: A new hybrid approach based on SARIMA and partial high order bivariate fuzzy time series forecasting model, Expert Systems with Applications, 36, 7424-7434 (2009).

6. Ince, H., Traffalis, T.B.: A hybrid model for exchange rate prediction, Decisions Support Systems, 42, 1054-1062 (2006).

7. Jain, A. Kumar, A.M.; Hybrid neural network models for hydrological time series forecasting, Applied Soft Computing, 7, 585-592 (2007).

8. Katijani, Y., Hipel, W.K. Mcleod, A.I.: Forecasting nonlinear time series with feed forward neural networks: A case study of Canadian Lynx Data, Journal of Forecasting, 24 105-117 (2005).

9. Lee, Y-S., Tong, L-I.: Forecasting time series using a methodology based on autoregressive integrated moving average and genetic programming, Knowledge-Based Systems, 24, 66-72 (2011).

10. Ma, Y., Jiang, C., Hou, Z., Wang, C.: The formulation of the optimal strategies for the electricity producers based on the particle swarm optimization algorithm, IEEE Trans Power System, 21(4):1663–71 (2006).

11. Mc Culloch, W.S., Pitts, W.: A logical calculus of the ideas immanent in nervous activity. Bulletin of Mathematical Biophysics, 5, 115–133 (1943).

12. Pai, P.F., Lin, C.S.: A Hybrid ARIMA and support vector machines model in stock price forecasting, Omega, 33(6), 497-505 (2005).

13. Shi, Y, Eberhart, R.C.: Empirical study of particle swarm optimization. Proc IEEE Int Congr Evol Comput, 3:101–6 (1999).

14. Song, Q.: Seasonal forecasting in fuzzy time series. Fuzzy Sets and Systems, 107, 235–236 (1999).

15. Tong, H.: Non-linear time series: A Dynamical system approach, New York, Oxford University Press (1990).

16. Tseng, F.M., Yu, H.C., Tzeng, G.H.: Combining neural network model with seasonal time series ARIMA model, Technological Forecasting&Social Change, 69, 71-87 (2002).

17. Uslu, V.R., Aladag, C.H., Yolcu, U., Egrioglu, E.: A new hybrid approach for forecasting a seasonal fuzzy time series, International Symposium Computing Science and Engineering Proceeding Book, 1152-1158 (2010).

18. Wang, W., VanGelder, P.H.A.J.M., Vrijling, J.K., Ma, J.: Forecasting daily stream flow using hybrid ANN models, Journal of Hydrology, 324, 383-399 (2006).

19. Yadav, R.N., Kalra, P.K., John, J.: Time series prediction with single multiplicative neuron model, Applied Soft Computing, 7, 1157-1163 (2007).

20. Zhang, G.: Time series forecasting using a hybrid ARIMA and neural network model, Neurocomputing, 50, 159-175 (2003).

A H YBRID A RTIFICIAL N EURAL N ETWORK M ODEL WITH L INEAR & N ONLINEAR C OMPONENTS Yolcu Ufuk,...

Documents

Transcript of A H YBRID A RTIFICIAL N EURAL N ETWORK M ODEL WITH L INEAR & N ONLINEAR C OMPONENTS Yolcu Ufuk,...