Download - Pricing and Hedging Short Sterling Options Using Neural Networks

ICMA Centre Discussion Papers in Finance, DP2011-18

Pricing and Hedging Short Sterling Options Using

Neural Networks

Fei Chen and Charles Sutcliffe

ICMA Centre - University of Reading

September 2011

ICMA Centre Discussion Papers in Finance DP2011-18

ICMA Centre · University of Reading

Whiteknights · PO Box 242 · Reading RG6 6BA · UK

Tel: +44 (0)1183 787402 · Fax: +44 (0)1189 314741

Web: www.icmacentre.ac.uk

Director: Professor John Board, Chair in Finance

The ICMA Centre is supported by the International Capital Market Association

Pricing and Hedging Short Sterling Options UsingNeural Networks

Fei Chen and Charles Sutcliffe* **

ABSTRACT

This paper compares the performance of artificial neural networks (ANNs) with thatof the modified Black model in both pricing and hedging Short Sterling options.Using high frequency data, standard and hybrid ANNs are trained to generate optionprices. The hybrid ANN is significantly superior to both the modified Black modeland the standard ANN in pricing call and put options. Hedge ratios for hedging ShortSterling options positions using Short Sterling futures are produced using thestandard and hybrid ANN pricing models, the modified Black model, and alsostandard and hybrid ANNs trained directly on the hedge ratios. The performance ofhedge ratios from ANNs directly trained on actual hedge ratios is significantlysuperior to those based on a pricing model, and to the modified Black model.

Key Words: Short Sterling, Neural Networks, Option Pricing, Option Hedging

14 September 2011

* The ICMA Centre, University of Reading, PO Box 242, Reading RG6 6BA, UK,[email protected]

** The ICMA Centre, University of Reading, PO Box 242, Reading RG6 6BA, UK,[email protected]

We wish to thank Owain ap Gwilym (Bangor) and Chris Brooks (Reading) for their comments onan earlier draft.

Pricing and Hedging Short Sterling Options Using Neural Networks

1. Introduction

Artificial neural networks (ANN) have proved useful in a large variety of financial applications

including business failure prediction, bond rating assessment, term structure forecasting, volatility

forecasting and financial instrument pricing and hedging. This study focuses on option pricing and

hedging. It compares the performance of a parametric option pricing model with that of an ANN in

pricing and hedging Short Sterling options traded on NYSE Liffe. The underlying asset for Short

Sterling options is futures on 3-month sterling interest rates.

Conventional parametric approaches to option pricing and hedging are based on theory. For some

types of option closed form pricing models have been derived, while the pricing of other options

relies on numerical procedures such as Monte Carlo simulation and the binomial model. These

pricing models are based on theoretical arguments using assumptions concerning the behaviour of

the underlying asset price and the riskless interest rate. It is well known that the conventional

parametric pricing models have systematic biases, due to the use of simplifying and unrealistic

assumptions (e.g. Black, 1975; Rubinstein, 1985; Backus et al., 2004).

ANNs are information processing tools commonly used for function approximation and

classification, and they offer an alternative way of developing option pricing and hedging models.

Their particular strength lies in their ability to approximate highly non-linear and multivariate

relationships without the restrictive assumptions implicit in parametric approaches. This property

of ANNs makes them attractive for problems such as pricing and hedging options. In addition, neural

networks are adaptive and respond to structural changes in the markets. The drawback of this

approach is that it is highly data driven, requiring large quantities of historical prices. Table 1

summarises the main differences and links between parametric models and ANNs.

1

Table 1 Comparison of Parametric and ANN Models

Parametric Model ANN

Restrictive assumptions leading tosystematic bias

Recognizes patterns and relationshipswithout restrictive assumptions

Multivariate and non-linearEffectively approximates non-linearfunctions

Fails to adjust to changing marketbehaviour

Adaptive

Easy to apply and no historical datais required

Requires a large amount of data and issometimes unable to converge rapidly

Although many previous researchers have compared ANNs and parametric models in pricing options

and a few of them have also investigated hedging, this study is novel in a number of ways. First, no

earlier study has looked at the pricing of Short Sterling options. Second, most previous papers use

daily or weekly prices for the options and the underlying. However, when daily closing prices are

used, the closing times for trading in options and futures are non-synchronous. This study uses high

frequency data, making it possible to find virtually synchronous options and underlying futures

prices. Third, it is the first to look at hedging interest rate options. Fourth, it is only the second study

after Carverhill and Cheuk (2003) to train an ANN directly on the hedge ratios. Fifth, it is only the

second study after White (2000) to compare the performance of an ANN with the parametric model

of Black (1976), modified for futures-style margining. Finally, it is the first to examine hybrid

hedging models.

Section 2 summarises the previous literature, while section 3 sets out the rival pricing and hedging

models used in this study. In section 4 we discuss the dataset and the implied parameter estimates

we derive, and explain how the models are fitted to the data. Section 5 describes the estimation of

the ANN model, while section 6 has the results and section 7 concludes.

2 Literature Review

2.1 Pricing

Previous studies have used ANNs to price options on a variety of financial instruments. These 98

studies, listed in table 2, have generally found that ANNs produce prices closer to market prices than

2

do parametric pricing models, such as Black-Scholes. As table 2 shows, 47 previous studies of the1

pricing of financial options by ANN have focused on equity indices (S&P 500, S&P 100, FTSE 100,

DAX, OMX, KOSPI 200, CAC 40, S. African All Share and S&P CNX Nifty), and 15 have

considered individual equity options. Seventeen studies have analysed the pricing of options on stock

index futures (S&P 500, SPI, TAIFEX, Ibex 35 and Nikkei 225), while two have looked at currency

options, and two have considered options on currency futures. With 12 studies using simulated data,

this leaves just three previous studies of the pricing of interest rate options. White (2000) studied

options on 3-month eurodollar futures which are traded on NYSE Liffe, and so use futures style

margining. Using high frequency data for January to July 1994 and the same inputs as Black-Scholes,

ANN option prices were found to be superior to those of the Modified Black (MB) model . Raberto2

et al (2000) investigated the pricing of options on German Treasury bond futures traded on NYSE

Liffe, but do not appear to have benchmarked their ANN option prices against a parametric model.

Zhou, Yang and Han (2007b) analysed data on three convertible bonds traded on the Shanghai Stock

Exchange. Using Black-Scholes and the binomial model as benchmarks, their ANN prices were

superior.

2.2 Hedging

Hedge ratios can be derived analytically from the chosen parametric pricing model. Since ANNs are

differentiable functions of the input variables, option sensitivities (the Greeks) such as the hedge

ratio can also be derived analytical from the ANN pricing approximation. Alternatively, ANNs can

be trained directly on the desired hedge ratios. There have been 20 previous studies of hedging using

ANNs and they are listed in table 3, but none have analysed interest rate options. All 20 of these

studies derived the hedge ratios analytically from an ANN fitted to prices. In addition, Carverhill and

Cheuk (2003) also used a new ANN to model the hedge ratio directly. They found that the best delta

hedging performance was produced by the binomial model, followed by the new ANN, while hedge

ratios derived from the pricing ANN model were worst.

Most of these studies have been conducted by artificial intelligence researchers.1

The MB model is described below in section 3.1.1.2

3

Table 2 Studies Using ANN to Price Financial Options

S&P 500 Andreou et al (2002, 2004, 2006,

2008, 2010)Dugas et al (2001)Garcia & Gençay (1998, 2000)Gençay & Gibson (2007)Gençay & Qi (2001)Gençay & Salih (2003)Ghaziri, Elfakhani & Assi (2000)Ghosn & Bengio (2002)Gradojevic et al (2009)Liu (1996)Qi & Maddala (1996)Saito & Jun (2000)Tzastoudis et al (2006)S&P 500 FuturesCarverhill & Cheuk (2003)Geigle & Aronson (1999)Hamid & Habib (2005)Hutchinson, Lo & Poggio (1994)S&P 100Blynski & Faseruk (2006)Kitamura & Ebisuda (1998)Malliaris et al (1993a, 1993b)Samur & Temur (2009)FTSE 100 Bennell & Sutcliffe (2004)Gregoriou et al (2007)Healy et al (2002, 2003, 2004,

2007)Mostafa & Dillon (2008)Niranjan (1996)Schittenkopf & Dorffner (2001)Ibex 35 FuturesMartel et al (2009)

SPI Futures Boek et al (1995)Lajbcygier (2002, 2003, 2004))Lajbcygier, Boek, Flitman &

Palaniswami (1996)Lajbcygier, Boek, Palaniswami &

Flitman (1995)Lajbcygier & Connor (1997a,

1997b)Lajbcygier & Flitman (1996)Lajbcygier, Flitman, Swan &

Hyndman (1997)DAXAnders, Korn & Schmitt (1998)Hanke (1999a, 1999b)Herrmann et al (1997a & 1997b)Krause (1996)Ormoneit (1999)OMXAmilon (2003)KOSPI 200Choi, Lee, Han & Lee (2004)CAC 40De Winne et al (2001)Nikkei 225 FuturesYao, Li & Tan (2000)TAIFEX FuturesLin & Yeh (2005)Tseng et al (2008)S. African All SharePires (2005)Pires & Marwala (2005)S&P CNX NiftyMitra (2006)Saxena (2008)

Individual EquitiesAmornwattana et al (2007)Dindar & Marwala (2004)Kakati (2005, 2008)Kelly (1994)Lachtermacher et al (1996)Liang, Zhang & Yang (2006) Meissner & Kawano (2001)Pande & Sahu (2006)Pires & Marwala (2004, 2007) Zapart (2002, 2003a, 2003b)Zhou, Yang & Han (2007a)Deustche Mark-US$Carelli, Silani & Stella (2000)Deustche Mark-US$ VolatilityKaraali et al (1997)Sterling-US$ FuturesTeddy, Lai & Quek (2006)Tung & Quek (2005)Eurodollar FuturesWhite (2000)German Treasury Bond FuturesRaberto et al (2000)Convertible BondsZhou, Yang & Han (2007b)Simulated DataBarucci et al (1996, 1997)Charalambous et al (2005)Galindo-Flores (2000)Hanke (1997)Le Roux & Du Toit (2001)Lu & Ohta (2003a, 2003b)Montagna et al (2003)Morelli et al (2004)Tsaih (1999)White (1998)

Table 3 Studies Using ANN to Hedge Financial Options

S&P500Andreou et al (2008, 2010) Garcia & Gençay (2000) Gençay & Qi (2001) S&P500 FuturesCarverhill & Cheuk (2003)Hutchinson, Lo & Poggio (1994) FTSE 100Mostafa & Dillon (2008)Schittenkopf & Dorffner (2001)

SPI FuturesLajbcygier & Connor (1997a) TAIFEX FuturesKo (2009)Ko et al (2005)DAXHerrmann & Narr (1997a, 1997b) Ormoneit (1999) OMXAmilon (2003)

Individual EquitiesKakati (2005)Kelly (1994) Simulated DataHanke (1997) Morelli et al (2004) Ibex 35 FuturesMartel et al (2009)

4

3 The Rival Pricing and Hedging Models

The Black (1976) model is a well-established closed-form model for pricing and hedging short term

interest rate options, and options on short term interest rate futures. The performance of various

ANNs will be compared with that of the Modified Black (MB) model.

3.1 Pricing Model

3.1.1 Modified Black Model. NYSE Liffe, which uses futures-style margining for all its option

contracts, trades American-style options on Short Sterling futures. Lieu (1990) modified the Black

(1976) model to price and hedge futures options with futures-style margining. As Lieu (1990) states,

under the futures-style margining system, both the long and the short positions post risk-based initial

margin on entering their option positions. During the life of the option it is marked to market daily,

and gains and losses are paid and collected on a daily basis, just like futures positions. As the entire

option premium is not paid at the time of purchase, option premiums are potentially higher because

the shorts require a higher price to compensate for the loss of interest on the full premium.

Chen and Scott (1993) have shown that the results derived by Lieu can be applied to interest rate

futures options, and that it is never optimal to exercise early an American-style option on an interest

rate future with futures-style margining. Therefore these American-style options can be priced as

though they are European-style using the MB model. The only difference between the Black (1976)

model and the MB model is that the interest rate drops out of the futures-style option formula. This

is because it is no longer necessary for longs to borrow to pay the option premium, or for shorts to

invest the premium. Because the discount factor is less than one, both call and put prices are higher

under futures-style margining. The MB model for pricing short-term interest rate options under

futures-style margining is as follows:

2 1C = [(100!X)×N(!d )]![(100!F)×N(!d )]

1 2P = [(100!F)×N(d )]![(100!X)×N(d )] (1)

1 f X 2 1where d = [ln(R /R )+0.5(S T)]/(S%T)] and d = d !S%T2

fwhere C is the call premium, P is the put premium, F is the futures price, X is the strike price, R is

Xthe interest rate implied by the futures price (i.e., 100!F), R is the interest rate implied by the strike

price (i.e., 100!X), S is the volatility of 3-month interest rates measured by the annual standard

deviation, T is the time to expiration in years, and N(d) is the cumulative probability distribution

5

function for a standardized normal variable. Equation (1) requires the value of four parameters:

underlying futures price (F), strike price (X), maturity (T) and volatility (S). Volatility is the only

variable that cannot be directly observed in the market.

The partial derivatives (i.e. hedge ratios) of this MB model with respect to F are as follows:

C 1Delta = äC/äF = N(!d )(2)

P 1Delta = äP/äF = !N(d )

The goal of delta neutral hedging is to make the value of a portfolio of options, futures and bonds

immune to changes in the price of the underlying asset. However, a delta hedged portfolio involving

options is only immune to small changes in the underlying asset. To maintain delta neutrality the

portfolio position must be adjusted frequently.

3.1.2 Standard ANN Pricing Model. The standard ANN (SANN) uses actual option market prices

as the output. We chose to use the Multilayer Perceptron (MLP). MLP is also known as a back

propagation network which is by far the most popular type of neural network. Hornik et al. (1990)

showed that a one-hidden-layer MLP can approximate a large class of linear and non-linear functions

with arbitrary precision.

A back propagation network is trained using a two-step procedure. The activity from the input

pattern flows forward through the network, and the error signal flows backward to adjust the weights.

The objective function is to minimize the squared error between the observed option prices and the

ANN output, as shown in equation (3).

iMin 0.53 (d(i)!o(i)) (3)2

where d(i) is the desired output, and o(i) is the model output.

In order to construct an MLP, various decisions must be made including: the number of hidden

layers, the number of processing elements, and the transfer function. Choices must also be made for

the input and output variables and the length of the training and testing periods. Our standard ANN

(SANN) has three inputs: moneyness, volatility and time to maturity. The network is trained on

actual option prices from NYSE Liffe.

6

3.1.3 Hybrid ANN Model. A hybrid ANN (HANN) is also used, where the target output is the

difference between the actual option market price and that estimated by the parametric model. The

idea of constructing a hybrid model is to use the MB model as a base, and allow the ANN to augment

its performance. In effect the HANN is modelling the bias in the parametric pricing model. This bias

is then added to the MB price to produce the final HANN model output. Previous studies have shown

that the hybrid model usually outperforms both the parametric and SANN models (see Lajbcygier

et al., 1997; Lajbcygier, 2003; Andreou et al., 2006). This superior performance is achievable

because the parametric model is highly accurate, thus providing a good building block for

statistically-based models, such as ANNs, to model their biases. In addition, accurate option pricing

can be achieved for illiquid options, such as deep-in-the-money or very long maturity options,

because, unlike ANNs, the accuracy of parametric models is unaffected by liquidity.

3.2 Hedging Model

Delta hedging performance is also examined in this study. The delta of the MB model is defined in

equation (2), and we use two ANN hedging models, based on either option prices (analytical hedging

models) or price changes (directly trained hedging models).

3.2.1 Analytical Hedging Model (SANN1 and HANN1). Delta measures the sensitivity of the option

price to underlying price movements. Once the network is trained on prices, delta can be analytically

derived from both the SANN and HANN models. For SANN, the partial derivatives of the output

(Price/Strike) with respect to one of the inputs (Moneyness) give the delta, called SANN1. The

HANN model output is the bias of the MB Model (MB Price/Strike)!(Market Price/Strike) called

HANN1. The HANN model delta is the MB delta minus the sensitivity of the ANN output to

moneyness.

3.2.2 Directly Trained Hedging Model (SANN2 and HANN2). In this case the hedge ratio is modelled

directly. SANN2 trains the network on the desired delta, and HANN2 trains the network on the

difference between the MB model delta and the desired delta. Changes in option prices and futures

prices for the same contract are used to compute the desired delta.

4 Data

7

Short Sterling futures traded on NYSE Liffe are listed on a quarterly expiry cycle of March, June,

September and December. In addition two serial months are traded on two of the next three months,

so that at any time 23 delivery months are available, with the nearest three delivery months being

consecutive. Options on Short Sterling futures are listed on the nearest ten futures, with identical

expiration dates and times. Every day, ten contracts with different maturities are traded - eight on a

quarterly cycle, and two serial options. The strike price intervals are 0.125 for all serial delivery

months and the nearest four quarterly delivery months, and 0.25 for the remaining quarterly delivery

months. This study uses tick data on Short Sterling call and put options and Short Sterling futures

traded on NYSE Liffe for 4th January 2005 to 29th December 2006. Table 4 summarises the options

and futures data.

Table 4 Observations in the Options and Futures Database3

Call Options

Ask Bid TradeSpreadTrade

BlockTrade

Volatility All

2005 18055 16374 3615 6005 2161 - 46210

2006 504652 450177 3735 7070 2970 - 968604

Put Options

2005 16762 16766 2452 3340 828 - 40148

2006 517241 446903 2239 3306 1488 - 971177

Futures

2005 12697759 12579266 772710 566368 346 560 26617009

2006 22701523 22738706 1012472 907558 728 587 47361577

There are five types of options data: Ask, Bid, Trade, Spread Trade and Block Trade, where Ask and

Bid are quote prices, and all other data types are trades. For pricing and hedging purposes only

options trade data is considered, giving 39,209 observations, of which 25,556 are call prices and

13,563 are put prices. When matching each options trade with an underlying futures price all the

available futures data, including quote prices, is considered . If there is no futures quote or trade price4

The massive increase in the number of recorded quotes in 2006 is because it includes quotes of the same price3

at the same second, whereas in 2005 two or more quotes at the same price occurring simultaneously were

recorded as only one quote.

Although subject to bid-ask bounce, trades represent economic transactions, while quotes can be4

unrepresentative of market prices. The unreliability of quotes is particularly important for option quotes a long

way from the money. This moneyness-related problem is not relevant for futures quotes, making them more

reliable than options quotes. There is a need to use synchronous futures and options prices, but the modest

number of options trades makes it impossible to generate a large sample of synchronous trade prices. Since there

is a very large number of futures quotes with tight bid-ask spreads, options trade prices were synchronised with

8

available at the same second as an options trade, the time-weighted average of the futures price

immediately before and after the option trade is used. For 98% of the options trades, the time

difference between the options and underlying futures prices is less than one minute.

For Short Sterling options with futures-style margining, the European-style put-call parity gives the

following lower boundary conditions on options prices:

C > F ! X (4)

P > X ! F (5)

Of the 39,209 options trades, 448 trades fail to satisfy the boundary condition in either equation (4)

or (5), and are deleted. The futures data from 1 February 2005 to 4 February 2005 is missing fromst th

the raw database supplied by NYSE Liffe and so this period is dropped from the analysis. In addition,

options with a time to maturity of less than seven calendar days are deleted. The cleansed database

has 38,531 call and put trades.

The MB model requires four inputs (volatility, time to maturity, the futures price and the strike

price). These four inputs are also used for the pricing ANNs. In common with most previous studies,

the variables are transformed using the homogeneity hint (Garcia and Gençay, 1998). Provided

returns on the underlying asset are distributed independently of its price, the MB model is

homogeneous of degree one in the underlying and the strike, and so both sides of equation (1) can

be divided through by the strike price. Therefore, the inputs are volatility, time to maturity and

f X Xmoneyness (R /R ), while the output is C/R . Time to maturity (T) is measured as the number of

calendar days until expiration, divided by the number of calendar days in that year.

To price an option, the most important input is the expected volatility (S). Some studies of S&P 500

options use historical volatility estimates, or the VIX. The present study uses the volatility implied

by the price of recent options trades. Computing implied volatilities at time t involves solving the

MB model in equation (1) iteratively for volatility, given the observed values of the other inputs at

time t. For a specified expiration date and strike price, the implied volatilities for calls and puts are

computed using option trades over the past 15 days (i.e. t!15 days) up to five minutes before the

futures quotes and trades. This enables very closely synchronized options and futures prices to be obtained.

9

trade being priced (i.e. t!5 mins). This five minute lag is to permit the option pricing model to be

implemented in real time. The volatility used in the MB and ANN models is the equally weighted

average of these implied volatilities. Some options, such as serial trades, are highly illiquid. If less

than ten implied volatility estimates are available, the option trade is excluded. This leads to the

exclusion of 1,042 trades, and the database for pricing and hedging options now contains 37,489

option observations.

For options on Short Sterling futures, the current minimum price movement is 0.005% (0.5 basis

points). Since the contract size of the future is £500,000, this is equivalent to (£500,000×0.005%÷4),

i.e., £6.25 . The raw data was recorded to four decimal places until May 2006, and to three decimal5

places thereafter but without a decimal point. Therefore in the raw data base, the minimum price

movement is five units before May 2006 and 50 units afterwards. To convert the price data into

percentages it was divided by 1,000 before May 2006, and then by 10,000, so that the inputs

(underlying and strike) are measured as (100!Yield).

The data is further filtered for the pricing and hedging models. We set the maximum time to maturity

at 360 days, maximum option prices at 0.5 (99.97% quantile), the maximum moneyness at 1.2 for

calls and the minimum moneyness at 0.85 for puts. Therefore, the sample space is restricted by

excluding deep-in-the-money options (normally with extreme prices) and very long date options.

These account for 5.8% of the call volume and 5.6% of the put volume.

5 Estimating the ANN Models

The data is separated into six categories for pricing: all calls, all puts, out-of-the-money (OTM) calls

and puts, and in-the-money (ITM) calls and puts. Table 5 summaries the training and testing sample

sizes.

Division by four is because the tick size of 0.005% is a 3 month rate, which needs to be increased to an annual5

rate for the computation of the monetary value corresponding to one tick.

10

Table 5 Sample Size Summary

No. of Obs. Size of Training Sample No. of Testing Periods

All Call 23226 6000 43

All Put 12104 6000 15

OTM Call 19314 6000 33

OTM Put 8681 6000 6

ITM Call 3747 2000 4

ITM Put 3307 2000 3

For most categories, the size of the training sample is 6,000 observations. As we only have 3,747

ITM calls and 3,307 ITM puts, the training sample is 2,000 for the ITM options. The out-of-sample

testing samples are the 400 subsequent observations after the end of each training sample. Using a

rolling windows approach, the training and testing samples are then rolled forward by 400

observations, and the ANN re-estimated. For each training sample, weights and bias information is

used to derive the hedge ratios. The ANN design uses a three layer MLP with 10 neurons in the

hidden layer. A tan-sigmoid transfer function is applied for the hidden layer and a linear function is

applied for the output layer of the pricing models (SANN1 and HANN1). A tan-sigmoid transfer

function is applied for both the hidden and output layers of the hedging models (SANN2 and

HANN2).Given a sufficient number of neurons in the hidden layer, this network can approximate

any function with a finite number of discontinuities arbitrarily well. There are a number of back

propagation algorithms in the Matlab neural network toolbox, and we use the Levenberg-Marquardt

algorithm as it is much faster than the other methods. Cross validation has been widely used in

training neural networks to prevent over-fitting, and since it is part of the Matlab “train” function,

it is applied automatically in our analysis.

6 Results

6.1 Pricing Model Performance

Using the homogeneity hint improves the MLP performance, relative to using the underlying and

strike as separate inputs; i.e. C/K = f(S, T, M) and P/K = f(S, T, M) with the inputs of moneyness (M),

volatility (S) and time to maturity (T). Compared with other input combinations, this specification

gives the best results for all categories of option. Volume was tried as an additional input, but pricing

accuracy becomes slightly worse.

11

Although ANNs are trained to minimise the sum of the squared errors, there is no agreement on the

appropriate measure of pricing accuracy. In this study, four alternative summary measures of

performance are used: (a) mean squared error (MSE); (b) mean absolute error (MAE); © the

correlation between actual and computed prices; and (d) mean error (ME)

MSE = (3(y!x) /n (6)2

MAE = (3|y!x|)/n (7)

ME = (3(y!x))/n (8)

where x is the actual value of the dependent variable, y is the estimated value of the dependent

variable, and n is the number of observations.

Table 6 shows the performance of MB, SANN and HANN in terms of MSE, MAE, correlation and

ME for the overall out-of-sample period. The figures in bold in table 6 indicate the estimation

technique with the best score. Each ANN is trained to minimise the total squared error, and so the

MSE is used as the prime performance measure. Using the MSE criterion HANN is best in every

case. For the MAE criterion HANN is best in five cases. For correlation, MB is best in all cases.

Using ME, SANN is the best in three cases, while HANN is the best for the other three. Thus,

overall, HANN appears to be the preferable technique.

To test whether these results are statistically significant, a paired t-test is the obvious choice.

However, this test requires that the differences between the errors from the two models are normally

distributed. As the data does not satisfy this assumption, the bootstrap method is applied as a non-

parametric alternative to the paired t-test. The bootstrap is a procedure that involves choosing

random samples with replacement from a data set. In our case the MSE numbers in table 6 indicate

that HANN is the best model, but how certain is this conclusion? We define the difference in the

MSEs as the test statistic. The sign column in tables 7 and 8 are the signs of the MSE of model A

minus the MSE of model B, and P indicates a positive difference, while N indicates a negative

difference. To determine whether the sign is significant, squared errors for the two models are re-

sampled 1,000 times and a distribution for the test statistic produced. Tables 7 and 8 have the

bootstrap test results for calls and puts. For each category, we compare the three models pairwise.

For example, we first compare the MB model with the SANN model in the All Call Option category.

The sign is positive, which means that the mean error of MB (Model A) is higher than SANN (Model

12

B). To test whether the difference is significant, we compute the 95% confidence interval. If both

the lower bound and the higher bound are positive, we conclude that the test statistic is significantly

positive at the 95% confidence level and Model B is preferred. If the lower bound is less than zero

and the higher bound is greater than zero, we conclude that the difference in MSE is not significantly

different from zero. If both bounds are negative, we conclude that the difference is negative and

Model A is preferred.

Table 6 Pricing Performance of ANNs and MB

MSE MAE Correlation ME

AllCall

Options

MB 0.0093% 0.6452% 98.6864% 0.2363%

SANN 0.0087% 0.6095% 98.4930% 0.1144%

HANN 0.0082% 0.5796% 98.5620% 0.1029%

AllPut

Options

MB 0.0148% 0.9368% 99.2792% 0.6621%

SANN 0.0124% 0.8116% 99.1469% !0.0439%

HANN 0.0112% 0.7796% 99.2310% 0.0150%

OTMCall

Options

MB 0.0083% 0.5844% 97.3122% 0.1589%

SANN 0.0084% 0.5608% 96.8118% 0.0657%

HANN 0.0078% 0.5236% 97.0618% 0.0911%

OTMPut

Options

MB 0.0100% 0.7070% 97.7356% 0.4021%

SANN 0.0103% 0.6793% 97.3966% !0.3110%

HANN 0.0091% 0.6209% 97.7037% !0.2883%

ITMCall

Options

MB 0.0187% 1.0941% 98.9414% 0.9262%

SANN 0.0167% 1.0493% 98.3328% !0.0552%

HANN 0.0164% 1.0570% 98.3271% !0.1077%

ITMPut

Options

MB 0.0128% 0.8969% 99.5818% 0.5243%

SANN 0.0111% 0.8022% 99.4794% !0.0453%

HANN 0.0105% 0.7845% 99.5075% !0.0646%

Exactly the same orderings are found for both call and put options. For all call options and all put

options categories, the HANN model is clearly superior to SANN, which is superior to MB. For

OTM call and put options, HANN is superior to both MB and SANN, while for ITM call and put

options MB is inferior to both HANN and SANN. So for OTM calls and puts HANN is clearly

superior, while for ITM calls and puts it is not inferior to any other pricing model. Therefore, for

pricing Short Sterling call and put options, the hybrid ANN is superior to the standard ANN, and

both are superior to the MB model.

13

Table 7 Significance Tests for Calls

Model A Model B Sign2.5% LowerBound

2.5%UpperBound

Significance Rank

All Call Options

MB SANN P 4.42E!06 8.71E!06 Plus SANN > MB

MB HANN P 9.05E!06 1.29E!05 Plus HANN > MB

SANN HANN P 2.85E!06 6.13E!06 Plus HANN > SANN

OTM Call Options

MB SANN N !3.22E!06 2.61E!06 Insignificant Insignificant



ITM Call Options



SANN HANN P !4.40E!06 9.20E!06 Insignificant Insignificant

Note: The test statistic is the MSE of Model A minus the MSE of Model B. The Sign column is the sign of the MSE of

model A minus the MSE of model B, and P indicates a positive difference, while N indicates a negative difference. The

significance column is the significance of the test statistics. Plus indicates that the test statistic is significantly positive,

while Insignificant indicates that the test statistic is not significantly different from zero.

Table 8 Significance Tests for Puts


2.5%UpperBound

Significance Rank

All Put Options




OTM Put Options

MB SANN N !1.13E!05 4.55E!06 Insignificant Insignificant



ITM Put Options



SANN HANN P !5.60E!07 1.22E!05 Insignificant Insignificant

See notes for table 7.

14

6.2 Hedging Model Performance

Option pricing theory is based on replicating an option with a portfolio of other assets. In a stock-

style margining system, options can be replicated using positions in the underlying security and a

money market account (or a bond position). In a futures-style margining system, a bond position is

not required, and options can be hedged using just futures contracts. Rival hedge ratios (deltas) are

obtained from the MB model and an ANN. Portfolios of options and futures are formed which hedge

against changes in the underlying futures price, with the hedged portfolio being rebalanced in

discrete time.

ANN hedge ratios can be obtained in two different ways. The partial derivatives with respect to the

underlying futures price can be computed analytically from an ANN pricing model - i.e. analytical

hedging models. In this case the ANN is treated as a function, and the chain rule of differentiation

applied. This can be done because the transfer functions for both the hidden and output layers are

differentiable. SANN1 and HANN1 deltas are derived from the pricing models SANN and HANN

respectively. In the case of HANN1, the output from the pricing model is the bias of the MB model.

Therefore, the HANN1 delta is the partial derivative of the bias with respect to the underlying futures

price, plus the MB delta. Alternatively, a new ANN can be trained directly to target the desired hedge

ratios, i.e. directly trained hedging models. We fit a standard (SANN2) and a hybrid (HANN2) ANN.

HANN2 aims to adjust the MB hedge ratios using a non-parametric enhancement. The idea of

HANN2 is the same as the other hybrid models where the MB deltas are an additional input to the

ANN, and the MB hedge ratio biases are the output.

6.2.1 Analytical Hedging Model (SANN1 and HANN1). Deltas can be derived analytically from the

two ANNs used for pricing. Figures 1 to 4 show the scatter plot for SANN1 deltas and HANN1

deltas compared with MB deltas. Figures 1 and 2 are deltas estimated for calls, and figures 3 and 4

are deltas estimated for puts. The x-axis is the MB delta and the y-axis is the ANN delta. The

diagonal line indicates y=x. Deltas derived from the pricing models are more or less in line with the

MB deltas for both call and put options, especially when options are OTM. When options are ITM,

the ANN deltas tend to be bigger, and can be far away from the corresponding MB delta. As the non-

parametric model has no constraint on the range of delta, it can occasionally be greater than one, or

less than minus one.

15

The distributions of the estimated deltas are also shown in figures 1 to 4. We can see that the ANN

delta distributions have much longer tails, but are still similar to the MB delta distributions,

especially for puts. For both calls and puts, the MB deltas have a slightly lower correlation with the

SANN1 deltas (calls 0.988; puts 0.989) than with the HANN1 deltas (calls 0.990; puts 0.993).

Hedging performance will be discussed in section 6.2.2, where the analytical hedging models will

be compared with the directly trained hedging models.

Figure 1 SANN1 Deltas Versus MB Deltas (Calls)Figure 2 HANN1 Deltas Versus MB Deltas (Calls)Figure 3 SANN1 Deltas Versus MB Deltas (Puts)Figure 4 HANN1 Deltas Versus MB Deltas (Puts)

6.2.2 Directly Trained Hedging Model (SANN2 and HANN2). As the aim of delta hedging is to

replicate changes in option prices (ÄP) using delta multiplied by the change in the underlying futures

price (ÄF), a delta of (ÄP/ÄF) serves as a perfect hedge ratio. Two new ANNs are trained to directly

estimate deltas - a standard ANN model (SANN2) and a hybrid ANN model (HANN2). The input

variables for SANN2 are the same as for SANN1: moneyness, time to maturity and volatility, while

the output is the desired delta, defined as the actual (ÄP/ÄF). For HANN2 the MB delta is included

as an input (as it is for HANN1), while the output is the MB delta’s bias, relative to the actual delta.

The aim of training each ANN is to learn how the hedge ratio varies with moneyness, time to

maturity and volatility. These sensitivities are defined for very small changes in prices, and so high

frequency data is used to train the ANNs. However, most institutions do not implement intraday

hedging because of the considerable transaction costs, and so hedging performance is tested using

daily data. The ANNs are trained using the data previously used for the two pricing ANNs

(henceforward, the high frequency database), while testing uses a daily database constructed from

this high frequency database. This data is used to train and test the four ANNs (SANN1, HANN1,

SANN2 and HANN2).

In the pricing section we considered 23,226 call and 12,104 put observations. The training data

comprises the odd observations (11,613 calls and 6,052 puts) from the high frequency database, and

the resulting models are tested on daily observations constructed from the even observations (11,613

calls and 6,052 puts) of the high frequency database. This odd-even method of splitting the data for

training and testing is often used in ANN applications (see Wikel et al., 1996, Mirsepassi, 2004, and

16

Bowers and Shedrow, 2000), and is superior to simply using the first half for training, and the second

half for testing. While the size of the training data set is still half the data, the daily out-of-sample

dataset is much bigger than half the available contract days. This is because, after removing the odd

data for training, as long as contracts generally have more than one observation per day, this permits

the computation of a daily value of (ÄP/ÄF) for most contracts.

To get the desired delta (ÄP/ÄF), the high frequency data is sorted by strike and maturity, so that

observations of the same contract are “stacked” to produce a single sequence of observations. The

odd sequence of observations is then differenced to produce actual values for (ÄP/ÄF). As these

ratios are much noisier than option prices, some restrictions are applied to this high frequency data

on which the ANNs are trained to assist in finding a function that links the inputs with the output.

Since delta is a slope measure for a continuous option pricing function, the time difference between

observations is restricted to a maximum of two days. As there can be two observations at the same

time, a minimum time change of 0.01 days is also applied. Sometimes the option price makes a big

movement while the futures price hardly moves, and vice versa, and so the actual delta can be as high

as 10 (or !10), or as low as 0. To find a relatively accurate sensitivity formula, only actual values

of (ÄP/ÄF) in the same range as the theoretical delta (0 to 1 for calls, and !1 to 0 for puts) are

considered as the output when training each ANN. Only price changes with the same strike and

maturity are considered. The application of these restrictions reduces the number of values of

(ÄP/ÄF) to 2,153 calls and 1,211 puts. The four ANNs were trained on this high frequency data set.

Since the performance of daily hedges is being examined, testing the ANNs is conducted using a

daily data set. The even high frequency data (11,613 calls and 6,052 puts) have not been used in

constructing the high frequency data set used for training the models, and so are out-of-sample. For

contracts with more than one observation in a day, the even observation at, or closest to, 15:00 is

chosen. Some options are not traded every day and the time difference between two observations for

a contract can be as long as a month, so the hedging rebalance period between the daily prices is

restricted to be no more than two days. Applying this restriction produces 2,816 daily values of

(ÄP/ÄF) for calls and 1,675 for puts. The four ANNs are then tested on these daily data sets.

17

Table 9 Hedging Data Set Summary

Out-of-SampleWhole Sample

Effective Training Effective Test

Calls 2,153 2,816 23,226

Puts 1,211 1,675 12,104

The six correlations between the deltas of each of the analytical hedging models (MB, SANN1 and

HANN1) and the two directly trained hedging models (SANN2 and HANN2) are between 0.75 and

0.90, while the correlation between SANN2 and HANN2 is 0.81 for calls and 0.85 for puts. This

indicates that, while the deltas for the three analytical hedging models were very similar with a

correlation of about 0.99, those for the directly trained hedging models are more dissimilar. There

is also a difference in the mean deltas and their variances, as shown in table 10. The directly trained

hedging models have larger absolute deltas of just under a half, with a much lower variance.

Table 10: Means and Variances of the Deltas

Call Deltas Put Deltas

Mean Variance Mean Variance

MB 0.292 0.042 !0.403 0.046

SANN1 0.271 0.045 !0.396 0.056

HANN1 0.277 0.031 !0.344 0.046

SANN2 0.434 0.008 !0.459 0.007

HANN2 0.436 0.007 !0.445 0.015

Hedging performance is measured using MSE, MAE, correlation and ME as in the pricing section.

Error is defined as ÄP!(delta×ÄF). Table 11 has the hedging performance for MB, SANN1,

HANN1, SANN2, and HANN2 out-of-sample. Apart from the MAE measure, the directly trained

models (SANN2 and HANN2) are clearly superior for both calls and puts. Table 12 contains the

pair-wise rankings of the five models that are statistically significant at the 5% level. These

statistically significant rankings enable the creation of the two quasi-orderings (one per data set)

which appear in figure 5.

In every case the two best models are the direct trained hedging models (SANN2 and HANN2),

which are significantly superior to both the analytical hedging models (SANN1 and HANN1) and

the MB model. For call options SANN2 and HANN2 are equally attractive, while for puts SANN2

18

dominates HANN2. Since HANN2 has more information than SANN2, its lack of dominance for

hedging is surprising.

Table 11 Out-of-Sample Hedging Performance

Hedging MSE MAE Correlation ME

Calls

MB 0.087% 0.987% 63.360% 0.442%

SANN1 0.090% 0.992% 61.972% 0.450%

HANN1 0.087% 1.001% 62.801% 0.431%

SANN2 0.071% 1.168% 72.916% 0.198%

HANN2 0.069% 1.124% 72.285% 0.269%

Puts

MB 0.228% 1.552% 70.297% 1.005%

SANN1 0.237% 1.571% 70.209% 1.037%

HANN1 0.203% 1.472% 68.503% 0.946%

SANN2 0.143% 1.519% 72.619% 0.759%

HANN2 0.160% 1.505% 67.623% 0.725%

Table 12: Significance Tests for Out-of-Sample Testing

Out-of-sample Call (Hedging Model 2)

Model A Model B Sign2.5%

Lower Bound

2.5% Upper Bound

Significance Rank

MB SANN1 !1 -6.13E-05 4.62E-06 Insignificant Insignificant

MB HANN1 1 -1.34E-05 1.77E-05 Insignificant Insignificant

SANN1 HANN1 1 -9.33E-06 7.27E-05 Insignificant Insignificant

MB SANN2 1 3.94E-06 3.37E-04 Plus SANN2 > MB

MB HANN2 1 5.66E-05 3.10E-04 Plus HANN2 > MB

SANN2 HANN2 1 -4.84E-05 7.59E-05 Insignificant Insignificant

SANN1 SANN2 1 2.44E-05 3.71E-04 Plus SANN2 > SANN1

HANN1 HANN2 1 5.39E-05 3.03E-04 Plus HANN2 > HANN1

HANN2 SANN1 !1 -3.43E-04 -7.44E-05 Minus HANN2 > SANN1

HANN1 SANN2 1 5.06E-05 3.50E-04 Plus SANN2 > HANN1

19

Out-of-sample Put (Hedging Model 2)


2.5% UpperBound

Significance Rank

MB SANN1 !1 -2.35E-04 8.14E-05 Insignificant Insignificant


SANN1 HANN1 1 1.82E-04 5.17E-04 Plus HANN1 > SANN1

MB SANN2 1 4.51E-04 1.28E-03 Plus SANN2 > MB


SANN2 HANN2 !1 -3.37E-04 -1.57E-05 Minus SANN2 > HANN2

SANN1 SANN2 1 5.36E-04 1.42E-03 Plus SANN2 > SANN1

HANN1 HANN2 1 1.43E-04 7.62E-04 Plus HANN2 > HANN1

HANN2 SANN1 !1 -1.25E-03 -3.70E-04 Minus HANN2 > SANN1

HANN1 SANN2 1 2.73E-04 9.67E-04 Plus SANN2 > HANN1

Note: The Sign column are the signs of the MSE of model A minus the MSE of model B, while 1 indicates a positive

difference and -1 indicates a negative difference. The Significance columns gives the significance of the test statistics.

Plus indicates that the test statistic is significantly positive. Insignificant indicates that the test statistic is not significantly

different from zero. Minus indicates that the test statistic is significantly negative.

Figure 5 Quasi-Order Diagrams for Hedging Models

7 Conclusions

The aim of this research is to investigate the use of artificial neural networks (ANN) as a tool for

pricing and delta hedging interest rate options, and Short Sterling options were selected for testing.

The MB model is widely used to price such options, and is used to benchmark the performance of

the ANNs. In our analysis we have used the average historical implied volatility as an estimate for

expected volatility. Neural networks are fitted using the Levenberg-Marquardt training algorithm.

It is shown that ANNs are a suitable tool for both pricing and hedging Short Sterling options,

capturing the nonlinear behaviour of option prices and hedge ratio deltas with a high degree of

accuracy. The results for pricing call and put options show that the hybrid ANN model is

significantly superior to the MB model. The standard ANN is indistinguishable from the MB model

for OTM options, but outperforms the MB model when both ITM and OTM options are tested.

We considered the performance of both analytical hedging ANN models and directly trained hedging

ANN models in hedging Short Sterling options. The analytical hedging models are based on the

ANN pricing model, and their hedging performance is practically indistinguishable from that of the

20

MB model. In contrast to Carverhill and Cheuk (2003) and S&P500 futures options, for Short

Sterling options we find that the directly trained hedging ANN models provide significantly better

hedging results than the MB and analytical models. Surprisingly, although the hybrid ANN is

superior for pricing, HANN2 is not dominant for hedging.

Neural networks are therefore a promising alternative to closed-form models in both pricing and

hedging options, and may play an important role for exotic options for which no closed-form model

exists. However, as fitting an ANN is data intensive, there may be problems in pricing and hedging

newly traded or illiquid options.

References

Amilon, H., 2003, A Neural Network Versus Black-Scholes: A Comparison of Pricing and HedgingPerformances, Journal of Forecasting, vol 22, no. 4. July, pp. 317-335.

Amornwattana, S., Enke, D. and Dagli, C.H., 2007, A Hybrid Option Pricing Model Using a Neural Networkfor Estimating Volatility, International Journal of General Systems, vol. 26, no. 5, October, pp. 558-573.

Anders, U., Korn, O, and Schmitt, C., 1998, Improving the Pricing of Options: A Neural Network Approach,Journal of Forecasting, vol.17, no. 5-6, September-November, pp.369-388.

Andreou, P.C., Charalambous, C. and Martzoukos, S.H., 2002, Critical Assessment of Option PricingMethods Using Artificial Neural Networks. In Proceedings of the 2002 International Conferenceon Artificial Neural Networks, pp. 1131-1136.

Andreou, P.C., Charalambous, C. and Martzoukos, S.H., 2006, Robust Artificial Neural Networks for Pricingof European Options, Computational Economics, vol. 27, no. 2-3, May, pp. 329-351.

Andreou, P.C., Charalambous, C. and Martzoukos, S.H., 2008, Pricing and Trading European Options byCombining Artificial Neural Networks and Parametric Models with Implied Parameters, . EuropeanJournal of Operational Research, vol. 185, no. 3, March, pp. 1415-1433.

Andreou, P.C., Charalambous, C. and Martzoukos, S.H., 2010, Generalized Parameter Functions for OptionPricing, Journal of Banking and Finance, vol. 34, no. 3, March, pp. 633-646.

Andreou, P.C., Martzoukos, S.H. and Charalambous, C., 2004, Option Pricing and Trading with ArtificialNeural Networks and Advanced Parametric Models with Implied Parameters. In Proceedings of the2004 International Joint Conference on Neural Networks, vol. 4, IEEE, pp. 2741-2746.

Backus, D., Foresi, S. Li, K. and Wu. L., 2004, Accounting for Biases in Black-Scholes, Working Paper,Stern School of Business.

Barucci, E., Cherubini, U. and Landi, L., 1996, No-Arbitrage Asset Pricing With Neural Networks UnderStochastic Volatility. In Neural Networks in Financial Engineering: Proceedings of the ThirdInternational Conference on Neural Networks in the Capital Markets, edited by A.P.N. Refenes, Y.Abu-Mostafa, J. Moody and A. Weigend, World Scientific, New York, pp. 3-16.

Barucci, E., Cherubini, U. and Landi, L., 1997, Neural Networks for Contingent Claim Pricing Via theGalerkin Method. In Computational Approaches to Economic Problems, edited by H. Amman, B.Rustem and A. Whinston, Kluwer Academic Publishers, Dordrecht, pp. 127-141.

Bennell, J. and Sutcliffe, C.M.S., 2004, Black-Scholes Versus Artificial Neural Networks in Pricing FTSE100 Options, Intelligent Systems in Accounting , Finance and Management vol 12, no. 4, October-December, pp. 243-260.

Black, F., 1975, Fact and Fantasy in the Use of Options., Financial Analysts Journal, vol. 31, no. 4, July-August, pp. 36-41, 61-72.

Black, F, 1976, The Pricing of Commodity Contracts, Journal of Financial Economics, vol.3, no. 1-2,January-March, pp.167-179.

21

Blynski, L. and Faseruk, A., 2006, Comparison on the Effectiveness of Option Price Forecasting: Black-Scholes Versus Simple and Hybrid Neural Networks, Journal of Financial Management andAnalysis, vol. 19, no. 2, July-December, pp. 46-58.

Boek, C., Lajbcygier, P., Palaniswami, M. and Flitman, A., 1995, A Hybrid Neural Network Approach to thePricing of Options, Proceedings of the International Conference on Neural Networks 95, IEEE, vol2, pp. 813-817.

Bowers, J. A. and Shedrow, C. B., 2000, Predicting Stream Water Quality Using Artificial Neural Networks(ANN). In Development and Application of Computer Techniques to Environmental Studies VIII,edited by G. Ibarra-Berastegi, G.A. Brebbia and P. Zannetti, WIT Press, pp. 89-97.

Carelli, A., Silani, S. and Stella, F., 2000, Profiling Neural Networks for Option Pricing, InternationalJournal of Theoretical and Applied Finance, vol. 3, no. 2, April, pp. 183-204.

Carverhill, A. and Cheuk, T.H.F., 2003, Alternative Neural Network Approach for Option Pricing andHedging, Working Paper, School of Business, University of Hong Kong.

Charalambous, C. and Martzoukos, S.H., 2005, Hybrid Artificial Neural Networks for Efficient Valuationof Real Options and Financial Derivatives, Computational Management Science, vol. 2, no. 2,March, pp. 155-161.

Chen, R. R. and Scott, Louis, 1993, Pricing Interest Rate Futures Options with Futures-Style Margining,Journal of Futures Markets, vol. 13, no. 1, February, pp.15-22.

Choi, H.H., Lee, H.S., Han, G.S. and Lee, J., 2004, Efficient Option Pricing Via a Globally RegularizedNeural Network, In Proceedings of an International Symposium on Neural Networks 2004, Part 2,pp. 988-993.

De Winne, R., Francois-Heude, A. and Meurisse, B., 2001, Market Microstructure and Option Pricing: ANeural Network Approach, Working Paper, Facultés Universitaires Catholiques de Mons, August.

Dindar, Z.A. and Marwala, T., 2004, Option Pricing Using a Committee of Neural Networks and OptimizedNetworks. In Proceedings of the 2004 IEEE International Conference on Systems, Man andCybernetics, vol. 1, pp. 434-438.

Dugas, C., Bengio, Y., Bélisle, F., Nadeau, C. and Garcia, R., 2001, Incorporating Second-Order FunctionalKnowledge for Better Option Pricing. In Advances in Neural Information Processing Systems 13,edited by T.K. Leen, T.G. Dietterich and V. Tresp, The MIT Press.

Galindo-Flores, J. A., 2000, Framework for Comparative Analysis of Statistical and Machine LearningMethods: An Application to the Black-Scholes Option Pricing Model. In Computational Finance1999, edited by Y.S. Abu-Mostafa, B. LeBaron, A.W. Lo and A.S. Weigend, The MIT Press,Cambridge, Massachusetts, pp. 635-660.

Garcia, R. and Gençay, R., 1998, Option Pricing with Neural Networks and a Homogeneity Hint, In DecisionTechnologies for Computational Finance: Proceedings of the Fifth International ConferenceComputational Finance, edited by A.P.N. Refenes, A.N. Burgess and J.E. Moody, Advances inComputational Management Science, vol. 2, pp. 195-205.

Garcia, R. and Gençay, R., 2000, Pricing and Hedging Derivatives Securities with Neural Networks and aHomogeneity Hint, Journal of Econometrics, vol 94, no. 1-2, pp. 93-115.

Geigle, D.S. and Aronson, J.E., 1999, An Artificial Neural Network Approach to the Valuation of Optionsand Forecasting of Volatility, Journal of Computational Intelligence in Finance, vol. 7, no. 6,November-December, pp. 19-25.

Gençay, R and Gibson, R., 2007, Model Risk for European Style Stock Index Options, IEEE Transactionson Neural Networks 18, vol. 18, no. 1, January, pp. 193-202.

Gençay, R and Qi, M., 2001, Pricing and Hedging Derivative Securities with Neural Networks: BayesianRegularization, Early Stopping and Bagging, IEEE Transactions on Neural Networks, vol. 12, no.4, July, pp. 726-734.

Gençay, R and Salih, A., 2003, Degree of Mispricing with the Black-Scholes Model and Non ParametricCures, Annals of Economics and Finance, vol. 4, no. 1, May, pp. 73-101.

Ghaziri, H., Elfakhani, S. and Assi, J., 2000, Neural Networks Approach to Pricing Options, Neural NetworkWorld, vol. 10, no. 1-2, pp. 271-277.

Ghosn, J. and Bengio, Y., 2002, Multi-Task Learning for Option Pricing, Working Paper 2002s-53,CIRANO, Montréal.

Gradojevic, N., Gençay, R. and Kukolj, D., 2009, Option Pricing with Modular Neural Networks, IEEETransactions on Neural Networks, vol. 20, no. 4, April, pp. 626-637.

22

Gregoriou, A., Healy, J. and Ioannidis, C., 2007, Hedging Under the Influence of Transaction Costs: AnEmpirical Investigation on FTSE 100 Index Options, Journal of Futures Markets, vol. 27, no. 5,May, pp. 471-494.

Hamid, S.A. and Habib, A., 2005, Can Neural Networks Learn the Black-Scholes Model?: A SimplifiedApproach, Working Paper, Southern University of New Hampshire.

Hanke, M., 1997, Neural Network Approximation of Option Pricing Formulas for Analytically IntractableOption Pricing Models, Journal of Computational Intelligence in Finance, vol. 5, no. 5, September-October, pp. 20-27.

Hanke, M., 1999a, Neural Networks Versus Black-Scholes: An Empirical Comparison of the PricingAccuracy of Two Fundamentally Different Option Pricing Methods, Journal of ComputationalIntelligence in Finance, vol. 7, no. 1, January-February, pp. 26-34.

Hanke, M., 1999b, Adaptive Hybrid Neural Network Option Pricing, Journal of Computational Intelligencein Finance, vol. 7, no. 5, September-October, pp. 33-39.

Healy, J., Dixon, M., Read, B. and Cai, F.F., 2002, A Data-Centric Approach to Understanding the Pricingof Financial Options, European Physical Journal B, vol. 27, no. 2, May, pp. 219-227.

Healy, J.V., Dixon, M., Read, B.J. and Cai, F.F., 2003, Confidence in Data Mining Model Predictions: AFinancial Engineering Application, Proceedings of the 29th Annual Conference of the IEEEIndustrial Electronics Society Special Session on Intelligent Systems, vol. 2, pp. 1926-1931.

Healy, J.V., Dixon, M., Read, B.J. and Cai, F.F., 2004, Confidence Limits for Data Mining Models ofOptions Prices, Physica A: Statistical Mechanics and its Applications, vol. 344, no. 1-2, December,pp. 162-167.

Healy, J., Dixon, M., Read, B. and Cai, F.F., 2007, Non-Parametric Extraction of Implied Asset PriceDistributions, Physica A: Statistical Mechanics and its Applications, vol. 382, no. 1, August, pp.121-128.

Herrmann, R. and Narr, A., 1997a, Neural Networks and the Valuation of Derivatives - Some Insights intothe Implied Pricing Mechanism of German Stock Index Options, Working Paper no. 202, Departmentof Finance and Banking, University of Karlsruhe, 45 pages.

Herrmann, R. and Narr, A., 1997b, Risk Neutrality, Risk, vol. 10, August, Technology Supplement, pp. 23,26-29.

Hornik, K, Stinchcombe, M., and White, H., 1990, Universal Approximation of an Unknown Mapping andits Derivatives Using Multilayer Feedforward Networks, Neural Networks, vol. 3. pp. 551-560.

Hutchinson, J.M., Lo, A.W. and Poggio, T., 1994, A Non Parametric Approach to Pricing and HedgingDerivative Securities Via Learning Networks, Journal of Finance, vol. 49, no. 3, July, pp. 851-889.

Kakati, M., 2005, Pricing and Hedging Performances of Artificial Neural Net in Indian Stock Option Market,The ICFAI Journal of Applied Finance, vol. 11, no. 1, January, 62-73.

Kakati, M., 2008, Option Pricing Using Adaptive Neuro-Fuzzy System (ANFIS), ICFAI Journal ofDerivatives Markets, vol. 5, no. 2, April, pp. 53-62.

Karaali, O., Edelberg, W. and Higgins, J., 1997, Modelling Volatility Derivatives Using Neural Networks,Proceedings of the IEEE-IAFE 1997 Computational Conference for Financial Engineering, IEEE,pp. 280-286.

Kelly, D.L., 1994, Valuing and Hedging American Put Options Using Neural Networks, Working Paper,University of California, December 22 pages.

Kitamura, T. and Ebisuda, S., 1998, Pricing Options Using Neural Networks, Working Paper, GraduateSchool of Industrial Administration, Carnegie Mellon University, 11 pages.

Ko, P.C. (2009) Option Valuation Based on the Neural Regression Model, Expert Systems with Applications,vol. 26, no. 1, January, pp. 464-471.

Ko, P.C., Lin, P.C., Chien, W.L. and Cheng, Y.S. (2005) Hedging Derivative Securities Based on the NeuralNetwork Coefficient Model. Proceedings of the Eighth Joint Conference on Information Sciences,Salt Lake City, Utah, pp. 1163-1166.

Krause, J., 1996, Option Pricing with Neural Networks, Proceedings of the Fourth European Congress onIntelligent Techniques and Soft Computing, Verlag Mainz, Aachen, vol. 3, pp. 2206-2210.

Lachtermacher, G. and Rodrigues Gaspar, L.A. , 1996, Neural Networks in Derivative Securities PricingForecasting in Brazilian Capital Markets. In Neural Networks in Financial Engineering: Proceedingsof the Third International Conference on Neural Networks in the Capital Markets, edited by A.P.N.Refenes, Y. Abu-Mostafa, J. Moody and A. Weigend, World Scientific, New York, pp. 92-97.

23

Lajbcygier, P., 2002, Comparing Conventional and Artificial Neural Network Models for the Pricing ofOptions. In Neural Networks in Business: Techniques and Applications, edited by K.A. Smith andJ.N.D. Gupta, Idea Group Publishing, pp. 220-235.

Lajbcygier, P., 2003, Option Pricing with the Product Constrained Hybrid Neural Network. In Proceedingsof a Joint International Conference on Artificial Neural Networks and Neural InformationProcessing 2003, pp. 615-621.

Lajbcygier, P., 2004, Improving Option Pricing with the Product Constrained Hybrid Neural Network, IEEETransactions on Neural Networks, vol. 15, no. 2, March, pp. 465-476.

Lajbcygier, P., Boek, C., Flitman, A. and Palaniswami, M., 1996, Comparing Conventional and ArtificialNeural Network Models for the Pricing of Options on Futures, NeuroVe$t Journal, vol. 4, no. 5,September-October, pp. 16-24.

Lajbcygier, P., Boek, C., Palaniswami, M. and Flitman, A., 1995, Neural Network Pricing of All OrdinariesSPI Options on Futures. In Neural Networks in Financial Engineering: Proceedings of the ThirdInternational Conference on Neural Networks in the Capital Markets, edited by A.P.N. Refenes, Y.Abu-Mostafa, J. Moody and A. Weigend, World Scientific, New York, pp. 64-77.

Lajbcygier, P. and Connor, J.T., 1997a, Improved Option Pricing Using Artificial Neural Networks andBootstrap Methods, International Journal of Neural Systems, vol. 8, no. 4, August, pp. 457-471.

Lajbcygier, P. and Connor, J.T., 1997b, Improved Option Pricing Using Bootstrap Methods, Proceedings ofthe 1997 IEEE International Conference on Neural Networks, IEEE, New York, vol. 4, pp. 2193-2197.

Lajbcygier, P. and Flitman, A., 1996, A Comparison of Non-parametric Regression Techniques for thePricing of Options Using an Implied Volatility. In Decision Technologies for Financial Engineering:Proceedings of the Fourth International Conference on Neural Networks in Capital Markets, 1996,edited by A.P. Refenes, Y. Abu-Mostafa, J. Moody and J. Weigend, World Scientific, New York,pp. 201-213.

Lajbcygier, P., Flitman, A., Swan, A. and Hyndman, R., 1997, The Pricing and Trading of Options Using aHybrid Neural Network Model With Historical Volatility, NeuroVe$t Journal, vol. 5, no. 1, January-February, pp. 27-41.

Le Roux, L.J. and Du Toit, G.S., 2001, Emulating the Black-Scholes Model With a Neural Network,Southern African Business Review, vol. 5, no. 1, July.

Liang, X., Zhang, H. and Yang, J., 2006, Pricing Options in Hong Kong Market Based on Neural Networks.In Proceedings of an International Conference on Neural Information Processing 2006, part 3, pp.410-419.

Lieu, D., 1990, Option Pricing with Futures-Style Margining, Journal of Futures Markets, vol. 10, no. 4,August, pp.327-338.

Lin, C.T. and Yeh, H.Y., 2005, The Valuation of Taiwan Stock Index Option Price - Comparison ofPerformances Between Black-Scholes and Neural Network Model, Journal of Statistics andManagement Systems, vol. 8, no. 2, July, pp. 355-367.

Liu, M., 1996, Option Pricing with Neural Networks, Progress in Neural Information Processing edited byS.I. Amari, L. Xu, L.W. Chan, I. King and K.S. Leung, vol. 2. Springer-Verlag, pp. 760-765.

Lu, J. and Ohta, H., 2003a, A Data and Digital Contracts Driven Method for Pricing Complex Derivatives,Quantitative Finance, vol. 3, no. 3, June, pp. 212-219.

Lu, J. and Ohta, H. , 2003b, Digital Contracts Driven Method for Pricing Complex Derivatives, Journal ofthe Operational Research Society, vol. 54, no. 9, September, pp. 1002-1010.

Malliaris, M. and Salchenberger, L., 1993a, A Neural Network Model for Estimating Option Prices, Journalof Applied Intelligence, vol. 3, no. 3, pp. 193-206.

Malliaris, M. and Salchenberger, L., 1993b, Beating the Best: A Neural Network Challenges the Black-Scholes Formula, Proceedings of the Ninth Conference on Artificial Intelligence for Applications,IEEE, pp. 445-449.

Martel, C.G., Artiles, M.D.G. and Rodriguez, F.F. (2009) A Financial Option Pricing Model Based onLearning Algorithms, Proceedings of the World Multiconference on Applied Economics, Businessand Development, AEBD’09, pp. 153-157.

Meissner, G. and Kawano, N., 2001, Capturing the Volatility Smile of Options on High-Tech Stocks - ACombined GARCH-Neural Network Approach, Journal of Economics and Finance, vol. 25, no. 3,Fall, pp. 276-292.

24

Mirsepassi, A., 2004, Application of Intelligent System for Water Treatment Plant Operation, IranianJournal of Environmental Health, Science and Engineering, vol. 1, no. 2, pp. 51-57.

Mitra, S.K., 2006, Improving Accuracy of Option Price Estimation Using Artificial Neural Networks,Working Paper, Institute of Rural Management, India, January.

Montagna, G., Morelli, M., Nicrosini, O., Amato, P. and Farina, M., 2003, Pricing Derivatives by PathIntegral and Neural Networks, Physica A: Statistical Mechanics and its Applications, vol. 324, no.1-2, June, pp. 189-195.

Morelli, M.J., Montagna, G., Nicrosini, O., Treccani, M., Farina, M., and Amato, P., 2004, Pricing FinancialDerivatives with Neural Networks, Physica A: Statistical Mechanics and its Applications, vol. 338,no. 1-2, July, pp. 160-165.

Mostafa, F. and Dillon, T., 2008, A Neural Network Approach to Option Pricing. In Computational Financeand its Applications III, edited by M. Constantino, M. Larran and C.A. Brebbia, WIT Press, pp. 71-86.

Niranjan, M., 1996, Sequential Tracking in Pricing Financial Options Using Model Based and NeuralNetwork Approaches. In Advances in Neural Information Processing Systems, vol. 9, edited by M.C.Mozer and T. Petsche, MIT Press, Cambridge, pp. 960-966.

Ormoneit, D., 1999, A Regularization Approach to Continuous Learning with an Application to FinancialDerivatives Pricing, Neural Networks, vol. 12, no. 10, December, pp. 1405-1412.

Pande, A. and Sahu, R., 2006, A New Approach to Volatility Estimation and Option Price Prediction forDividend Paying Stocks, Working Paper.

Pires, M.M., 2005, American Option Pricing Using Computational Intelligence Methods, University ofWitwatersrand, MSc Report.

Pires, M.M. and Marwala, T., 2004, American Option Pricing Using Multi-Layer Perceptron and SupportVector Machine. In Proceedings of the 2004 IEEE International Conference on Systems, Man andCybernetics, vol. 2, pp. 1279-1285.

Pires, M.M. and Marwala, T., 2005, American Option Pricing Using Bayesian Multi-Layer Perceptrons andBayesian Support Vector Machines. In Proceedings of the 3rd IEEE International Conference onComputational Cybernetics, pp. 219-224.

Pires, M.M. and Marwala, T., 2007, Option Pricing Using Bayesian Neural Networks, Working Paper,University of Witwatersrand, 6 pages.

Qi, M. and Maddala, G.S., 1996, Option Pricing Using Artificial Neural Networks: The Case of S&P 500Index Call Options. In Neural Networks in Financial Engineering: Proceedings of the ThirdInternational Conference on Neural Networks in the Capital Markets, edited by A.P.N. Refenes, Y.Abu-Mostafa, J. Moody and A. Weigend, World Scientific, New York, pp. 78-91.

Raberto, M., Cuniberti, G., Riani, M., Scales, E., Mainardi, F. and Servizi, G., 2000, Learning Short-OptionValuation in the Presence of Rare Events, International Journal of Theoretical and Applied Finance,vol. 3, no. 3, July, pp. 563-564.

Rubinstein, M., 1985, Nonparametric Tests of Alternative Option Pricing Models Using All Reported Tradesand Quotes on the 30 Most Active CBOE Option Classes from August 23, 1976 Through August 31,1978, Journal of Finance, vol. 40, no. 2, June, pp. 455-480.

Saito, S. and Jun, L., 2000, Neural Network Option Pricing in Connection with the Black and Scholes Model.In Proceedings of the Fifth Conference of the Asian Pacific Operations Research Society, Singapore2000, edited by P. Kang, H. Phua, C.G. Wong, D.H. Ming and W. Koh.

Samur, Z.I. and Temur, G.T., 2009, The Use of Artificial Neural Network in Option Pricing: The Case ofS&P 100 Index Options, World Academy of Science, Engineering and Technology, vol. 54, June, pp.326-331.

Saxena, A., 2008, Valuation of S&P CNX Options: Comparison of Black-Scholes and Hybrid ANN Model,Paper presented at the SAS Global Forum 2008, Paper 162-2008.

Schittenkopf, C. and Dorffner, G., 2001, Risk Neutral Density Extraction from Option Prices: ImprovedPricing with Mixture Density Networks, IEEE Transactions on Neural Networks, vol. 12, no. 4, July,pp. 716-725.

Teddy, S.D., Lai, E.M.K. and Quek, C., 2006, A Brain-Inspired Cerebellar Associative Memory Approachto Option Pricing and Arbitrage Pricing. In Proceedings of an International Conference on NeuralInformation Processing 2006, Part 3, pp. 370-379.

25

Tsaih, R., 1999, Sensitivity Analysis, Neural Networks and the Finance, Proceedings of the InternationalJoint Conference on Neural Networks, IEEE vol. 6, pp. 3830-3835.

Tseng, C.H., Cheng, S.T., Wang, Y.H. and Peng, J.T. (2008) Artificial Neural Network Model of the HybridEGARCH Volatility of the Taiwan Stock Index Option Prices, Physica A Statistical Mechanics andits Applications, vol. 387, no. 13, May, pp. 3192-3200.

Tung, W.L. and Quek, C., 2005, GenSo-OPATS: A Brain Inspired Dynamically Evolving Option PricingModel and Arbitrage Trading System. In Proceedings of the 2005 IEEE Congress on EvolutionaryComputation, vol. 3, no. 2429-2436.

Tzastoudis, V.S., Thomaidis, N.S. and Dounias, G.D., 2006, Improving Neural Network Based Option PriceForecasting. In Proceedings of the 4th Helenic Conference on Advances in Artificial Intelligence2006, pp. 378-388.

White, A.J., 1998, A Genetic Adaptive Neural Network Approach to Pricing Options: A SimulationAnalysis, Journal of Computational Intelligence in Finance, vol. 6, no. 2, March-April, pp. 13-23.

White, A.J., 2000, Pricing Options with Futures-Style Margining - A Genetic Adaptive Neural NetworkApproach, Garland Publishing.

Wikel, J. H., Dow, E. R., and Heathman, M., 1996, Interpretative Neural Networks for QSAR, NetworkScience Computational Chemistry, March.

Yao, J., Li, Y. and Tan, L., 2000, Option Price Forecasting Using Neural Networks, Omega, vol. 28, no. 4,August, pp. 455-466.

Zapart, C.A., 2002, Stochastic Volatility Options Pricing with Wavelets and Artificial Neural Networks,Quantitative Finance, vol. 2, no. 6, June, pp. 487-495.

Zapart, C.A., 2003a, Beyond Black-Scholes: A Neural Networks-Based Approach to Options Pricing,International Journal of Theoretical and Applied Finance, vol. 6, no. 5, August, pp. 469-489.

Zapart, C.A., 2003b, Statistical Arbitrage Trading with Wavelets and Artificial Neural Networks. InProceedings of the 2003 IEEE International Conference on Computational Intelligence for FinancialEngineering, pp. 429-435.

Zhou, W., Yang, M. and Han, L., 2007a, Black-Scholes Versus Artificial Neural Networks in Pricing CallWarrants: The Case of China Market, Third International Conference on National Computation(ICNC 2007), vol. 1, pp. 528-532.

Zhou, W., Yang, M. and Han, L., 2007b, A Non-Parametric Approach to Pricing Convertible Bond ViaNeural Network, Eighth ACIS International Conference on Software Engineering, ArtificialIntelligence, Networking and Parallel Distributed Computing 2007, (SNPD 2007), vol. 2, pp. 564-569.

11,011 words

Figure 1 SANN1 Deltas Versus MB Deltas (Calls)

26

Figure 2 HANN1 Deltas Versus MB Deltas (Calls)

Figure 3 SANN1 Deltas Versus MB Deltas (Puts)

27

Figure 4 HANN1 Deltas Versus MB Deltas (Puts)

Figure 5 Quasi-Order Diagrams for Hedging Models

Puts Quasi Ordering Out of SampleCalls Quasi Ordering Out of Sample

28