Arbitrage and Its Physical Limits -...

53
Arbitrage and Its Physical Limits Louis H. Ederington Price College of Business, University of Oklahoma Chitru S. Fernando Price College of Business, University of Oklahoma Kateryna V. Holland Krannert School of Management, Purdue University Scott C. Linn Price College of Business, University of Oklahoma January 15, 2017 Abstract We examine how physical constraints limit arbitrage by studying the effect of crude oil storage constraints on arbitrage activity in the U.S. crude oil market. We document both temporary and long-term violations of the no-arbitrage conditions that are robustly attributable to storage capacity constraints. When crude oil storage levels are well below available storage capacity, temporary violations of the upper no-arbitrage bound occur but tend to be eliminated within a few days. However, as the amount of oil in storage approaches the capacity limit, the price adjustment process slows and violations of the upper no-arbitrage limit persist. We find evidence of temporary, but not long-term, violations of the lower no-arbitrage futures pricing bound, with the latter being consistent with our observation that there were no periods of stock- out conditions during our sample period. We also find that arbitrage was limited by financial constraints over our 2004-2015 sample period. However, the evidence in support of physical constraints impeding arbitrage is independently strong and remains robust when we control for the effect of financial constraints. Our results are also robust to the use of different measures of physical constraints. Our evidence further indicates that arbitrage normally impacts the spot price more than the futures price. Our findings highlight the importance of accounting for physical arbitrage limits in the pricing of commodity futures. We also contribute to the Theory of Storage literature by highlighting the consequences for prices when inventories approach storage capacity limits. JEL Classifications: law of one price, limits to arbitrage, commodity markets, oil futures markets, oil storage, physical constraints, cash-and-carry arbitrage. Keywords: G13, G18, Q41. Author contact information: Ederington: [email protected], (405)325-5697; Fernando: [email protected], (405)325-2906; Holland: [email protected]; (765)496-2194; Linn: [email protected] (405)325-3444. We thank Bruce Bawks, John Conti, Thomas Lee, Yongjia Li, Anthony May, Glen Sweetnam, John Zyren, and seminar participants at the U.S. Energy Information Administration, and the Second IEA IEF OPEC Workshop on the interactions between physical and financial energy markets for valuable discussions and comments. We gratefully acknowledge financial support from the U.S. Department of Energy -- Energy Information Administration and the University of Oklahoma Office of the Vice President for Research. We also thank Moody’s Analytics for making the company’s EDF (Expected Default Frequency) data available to us and to Sue Zhang and Robert Tran for their gracious help in assembling the data. The views expressed in this paper reflect the opinions of the authors only, and do not necessarily reflect the views of the Energy Information Administration or the U.S. Department of Energy. The authors are solely responsible for all errors and omissions.

Transcript of Arbitrage and Its Physical Limits -...

Arbitrage and Its Physical Limits

Louis H. Ederington

Price College of Business, University of Oklahoma

Chitru S. Fernando

Price College of Business, University of Oklahoma

Kateryna V. Holland

Krannert School of Management, Purdue University

Scott C. Linn

Price College of Business, University of Oklahoma

January 15, 2017

Abstract

We examine how physical constraints limit arbitrage by studying the effect of crude oil storage

constraints on arbitrage activity in the U.S. crude oil market. We document both temporary

and long-term violations of the no-arbitrage conditions that are robustly attributable to storage

capacity constraints. When crude oil storage levels are well below available storage capacity,

temporary violations of the upper no-arbitrage bound occur but tend to be eliminated within a

few days. However, as the amount of oil in storage approaches the capacity limit, the price

adjustment process slows and violations of the upper no-arbitrage limit persist. We find

evidence of temporary, but not long-term, violations of the lower no-arbitrage futures pricing

bound, with the latter being consistent with our observation that there were no periods of stock-

out conditions during our sample period. We also find that arbitrage was limited by financial

constraints over our 2004-2015 sample period. However, the evidence in support of physical

constraints impeding arbitrage is independently strong and remains robust when we control for

the effect of financial constraints. Our results are also robust to the use of different measures

of physical constraints. Our evidence further indicates that arbitrage normally impacts the spot

price more than the futures price. Our findings highlight the importance of accounting for

physical arbitrage limits in the pricing of commodity futures. We also contribute to the Theory

of Storage literature by highlighting the consequences for prices when inventories approach

storage capacity limits.

JEL Classifications: law of one price, limits to arbitrage, commodity markets, oil futures

markets, oil storage, physical constraints, cash-and-carry arbitrage.

Keywords: G13, G18, Q41.

Author contact information: Ederington: [email protected], (405)325-5697; Fernando: [email protected],

(405)325-2906; Holland: [email protected]; (765)496-2194; Linn: [email protected] (405)325-3444.

We thank Bruce Bawks, John Conti, Thomas Lee, Yongjia Li, Anthony May, Glen Sweetnam, John Zyren, and

seminar participants at the U.S. Energy Information Administration, and the Second IEA IEF OPEC Workshop

on the interactions between physical and financial energy markets for valuable discussions and comments. We

gratefully acknowledge financial support from the U.S. Department of Energy -- Energy Information

Administration and the University of Oklahoma Office of the Vice President for Research. We also thank

Moody’s Analytics for making the company’s EDF (Expected Default Frequency) data available to us and to Sue

Zhang and Robert Tran for their gracious help in assembling the data. The views expressed in this paper reflect

the opinions of the authors only, and do not necessarily reflect the views of the Energy Information Administration

or the U.S. Department of Energy. The authors are solely responsible for all errors and omissions.

2

Arbitrage and Its Physical Limits

Abstract

We examine how physical constraints limit arbitrage by studying the effect of crude oil

storage constraints on arbitrage activity in the U.S. crude oil market. We document both

temporary and long-term violations of the no-arbitrage conditions that are robustly

attributable to storage capacity constraints. When crude oil storage levels are well below

available storage capacity, temporary violations of the upper no-arbitrage bound occur but

tend to be eliminated within a few days. However, as the amount of oil in storage approaches

the capacity limit, the price adjustment process slows and violations of the upper no-arbitrage

limit persist. We find evidence of temporary, but not long-term, violations of the lower no-

arbitrage futures pricing bound, with the latter being consistent with our observation that

there were no periods of stock-out conditions during our sample period. We also find that

arbitrage was limited by financial constraints over our 2004-2015 sample period. However,

the evidence in support of physical constraints impeding arbitrage is independently strong

and remains robust when we control for the effect of financial constraints. Our results are

also robust to the use of different measures of physical constraints. Our evidence further

indicates that arbitrage normally impacts the spot price more than the futures price. Our

findings highlight the importance of accounting for physical arbitrage limits in the pricing of

commodity futures. We also contribute to the Theory of Storage literature by highlighting the

consequences for prices when inventories approach storage capacity limits.

JEL Classifications: law of one price, limits to arbitrage, commodity markets, oil futures

markets, oil storage, physical constraints, cash-and-carry arbitrage.

Keywords: G13, G18, Q41.

3

Arbitrage and Its Physical Limits

“…contango would have to widen much more to signal real storage distress.”

Spencer Jakab, Wall Street Journal, December 2, 2015

1. Introduction

While the Law of One Price (LOP) is one of the most powerful concepts in the

financial economics tool chest, a number of recent papers explore the financial limits to the

arbitrage that enforces LOP, and the implications of these limits for asset pricing.1 Physical

limits, specifically limits on the availability of either inventory or storage capacity, are

potentially equally important in the arbitrage pricing relationship for financial assets whose

value is derived from the value of commodities.2 While several recent studies have examined

the role of financial limits to arbitrage in the pricing of commodity derivatives,3 the potential

effect of physical limits has not been empirically studied. In this paper, we address this gap

in the literature by studying the effect of physical storage limits on arbitrage activity in the

U.S. crude oil market. We specifically focus on the futures physical delivery hub at

Cushing, Oklahoma, which is also the major storage center in the U.S. for crude oil. We draw

inferences through an examination of the behavior of the spread between futures and spot

prices for West Texas Intermediate (WTI) crude oil.

In commodity markets, if the price of a commodity for future delivery exceeds the

price for near-term delivery by more than the carrying (including storage) and transaction

costs, arbitrageurs should be able to make a riskless profit by simultaneously executing

contracts to buy in the spot market and sell in the forward market, while storing the

commodity over the interim period. Exploitation of such an arbitrage opportunity, commonly

referred to as “cash-and-carry arbitrage,” results in the now familiar relation between futures

and spot prices derived from the Theory of Storage.4 More precisely, cash-and-carry

1 See, for example, Shleifer and Vishny (1997), Gromb and Vayanos (2002), Mitchell, Pulvino and Stafford

(2002), Hong and Stein (2003), Gabaix, Krishnamurthy, and Vigneron (2007), Brunnermeier and Pedersen

(2009), Etula (2013), and Acharya, Lochstoer, and Ramadorai (2013). 2 See Routledge, Seppi, and Spatt (2000) and Gorton, Hayashi, and Rouwenhorst (2013). 3 See, for example, Mou (2010), Hong and Yogo (2012), Etula (2013), Acharya, Lochstoer, and Ramadorai

(2013), and Cheng, Kirilenko, and Xiong (2015). 4 Kaldor (1939), Working (1949), Brennan (1958), Deaton and Laroque (1992), and Routledge, Seppi, and Spatt

(2000).

4

arbitrage should place an upper limit on the spread between the price for near-term delivery,

the “spot price,”5 and the price for longer-term delivery, henceforth the “futures price.”

Likewise, the possibility of reverse cash-and-carry arbitrage, which involves a short sale in

the spot market covered by a purchase in the forward market, should set a lower limit on the

futures-spot spread. In many commodity markets, however, storage capacity is limited --

especially at the delivery location for the futures contract. Once all available storage

facilities are full (or, in the case of reverse cash-and-carry arbitrage, the available inventory is

depleted), the corresponding arbitrage described above is no longer possible and the no-

arbitrage limits on the futures-spot spread should no longer hold.

In a classical cash-and-carry arbitrage transaction involving a physical commodity at

a spot-futures market hub like the WTI Cushing hub, the arbitrageur will execute three

simultaneous transactions: (1) purchase the commodity in the spot market; (2) sell a

corresponding quantity of the commodity in the futures market; and (3) contract to store the

commodity at the hub until the futures transaction is closed out or settled. The ability to

execute this trade depends on the availability of storage space at the hub.6 If storage capacity

or inventory is not immediately available, violations of the no-arbitrage condition may persist

until storage becomes available. Storage contracts are typically over-the-counter agreements

and thus may require some time to arrange. Moreover, they have elements of counterparty

risk, such as force majeure and physical delivery default that are not present in purely

financial arbitrage trades. These elements may discourage or delay the actions necessary to

implement the arbitrage trades. If delayed, spreads above the no-arbitrage limit may persist

until storage can be arranged. Consequently, the physical limits to executing an arbitrage

may contribute to the persistence of futures-spot spread no-arbitrage violations above and

beyond the limits imposed by financial constraints.

5 As discussed below, contracts in the crude-oil market are normally for delivery over a month since (unlike for

commodities like gold) immediate spot delivery of large quantities of oil is very difficult and expensive. For

futures contracts held to physical delivery, the CME group allows one calendar month for the oil to be delivered

to the Cushing hub. Hence the quoted “spot” prices are normally forward or futures prices for delivery over the

nearest calendar month. Contracts for immediate delivery are rare and typically for small quantities that can be

moved by tanker truck. Despite the absence of immediate spot delivery in this market, we keep with convention

by using the term “spot price” to refer to the settlement price for near-term-delivery contracts. 6 In a reverse cash-and-carry arbitrage transaction, the arbitrageur will simultaneously (1) borrow and sell the

commodity in the spot market; and (2) purchase a corresponding quantity of the commodity in the futures

market. The ability to execute this contract depends on the availability of inventory at the hub that can be

borrowed.

5

We find evidence of both short- and long-term violations of the no-arbitrage

conditions in the U.S. crude oil futures market at the WTI Cushing hub. Consistent with the

argument that often storage cannot be arranged immediately, we find evidence of numerous

temporary violations of the no-arbitrage upper bound. When storage levels are well below

capacity, these temporary violations tend to be eliminated quickly, i.e., within a couple of

days. However, as the amount of oil stored approaches the available capacity, the adjustment

process takes longer and violations of the upper no-arbitrage limit persist for longer than a

few days. In contrast, we observe only temporary violations of the no-arbitrage lower bound

indicating the absence of persistent physical limits on reverse cash-and-carry arbitrage. Our

finding for reverse cash-and-carry arbitrage is consistent with the absence of any periods of

inventory stock-out conditions during our sample period, with crude oil inventory in storage

never dropping below 30% of available storage capacity. Our evidence further indicates that

the spot price adjusts more than the futures price in bringing the spread back within the no-

arbitrage bounds, which again points to physical arbitrage limits being a major factor

determining the mispricing of the futures-spot spread.

Testing for violations of the no-arbitrage conditions is complicated because we are

unable to exactly identify the no-arbitrage limits at each point in time since (as discussed

below) historical data on storage and transaction costs are not available. Hence, we test for

evidence of cash-and-carry and reverse cash-and-carry arbitrage by examining the behavior

of the futures-spot spread. We find evidence of both cash-and-carry arbitrage and reverse

cash-and-carry arbitrage normally operating to return the futures-spot spread to within no-

arbitrage bounds. When the futures-spot spread is positive on day t-2 and rises further on day

t-1 (and storage capacity is not exhausted), there is a strong tendency for the spread to fall on

day t which is what we would expect if the further rise in the spread on day t-1 sets off cash-

and-carry arbitrage. This reversal tendency is stronger when the spread on day t-2 is high

than when it is positive but low. This is again what we would expect since the further rise on

day t-1 is more likely to raise the spread above the upper no-arbitrage bound if the spread is

already very high. Likewise, if the spread on day t-2 is negative and the spread declines

further on day t-1, there is a strong tendency for the spread to rise or reverse on day t. Again,

this is what one would expect if the further decline in the spread on day t-1 triggers reverse

6

cash-and-carry arbitrage and this reversal tendency is stronger the more negative is the

spread on day t-2. Both of these spread reversal tendencies are stronger when spread

volatility is high. In contrast, if the level of the spread on day t-2 and the change on day t-1

are of opposite sign, making it less likely that the change in the spread on day t-1 carried it

outside the no-arbitrage bound, the spread changes on days t-1 and t are basically

uncorrelated.

While normally a further rise in the spread on day t-1 from an already high level on

day t-2 tends to be reversed on day t, this is not the case when oil inventories are close to

capacity, which indicates that the lack of available storage space hinders the cash-and-carry

arbitrage which would normally operate to pull the spread back down on day t. On the other

hand, as alluded to previously, we find no evidence that reverse cash-and-carry arbitrage is

impeded by low inventories.

We also examine possible financial limits to arbitrage in the WTI crude oil market

concurrently with measures of physical limits to arbitrage. The importance of financial limits

to arbitrage in the oil market is highlighted by Acharya, Lochstoer, and Ramadorai (2013).7

Those authors document the effect on futures and spot prices when producers’ hedging

demand in the futures market is not fully met due to broker-dealer capital constraints. We

also present evidence that arbitrage was restricted by financial constraints over our 2004-

2015 sample period. However, the evidence in support of physical constraints impeding

arbitrage is considerably stronger and remains robust when we control for the effect of the

financial constraint measures we examine. While Acharya, Lochstoer, and Ramadorai (2013)

emphasize the importance of an inventory stock-out in giving rise to commodity sector

default risk, there is no significant oil inventory stock-out that occurs in the 2004-2015

period of our study. In contrast, our study highlights the role played by the unavailability of

storage capacity due to high inventory levels as the cause of a decoupling between the futures

and spot market in oil, an event that occurs several times during our sample period.

Therefore, our study builds on the existing literature on the financial limits to arbitrage in

commodity markets by also establishing the importance of the physical limits to arbitrage in

these markets. We therefore provide a more complete picture of the role that limits to

7 Birge, Hortacsu, and Mercadal (2016) show that financial constraints impede arbitrage in electricity markets.

7

arbitrage, both physical limits as well as financial limits, play in the pricing of commodity

futures.

We contribute to the literature on the Theory of Storage by examining the effect of

inventory storage capacity limits. The existing literature has focused on the effect of stock-

outs when inventories are very low with little or no attention to the possible pricing effects

when inventories are very high and storage capacity becomes limited or is exhausted. For

instance, Routledge, Seppi, and Spatt (2000), who explore the consequences of the non-

negativity inventory constraint for forward and futures prices, write, “Inventory can always

be added to keep current spot prices from being too low relative to expected future spot

prices.” We contribute to this literature by exploring the price consequences when

inventories approach storage capacity limits so that additional inventory cannot be added.

Modeling inventories as buffers to supply and demand shocks, Deaton and Laroque

(1992) show that the increase in the risk of an inventory stock-out when inventories are low

carries through to an increase in expected future spot price volatility. Routledge, Seppi, and

Spatt (2000) extend the Deaton and Laroque (1992) model by including a forward market

and show that inventory stock-outs can break the arbitrage link between the spot and forward

markets. Similarly, we show that when storage capacity is limited or exhausted, the

commodity’s spot price will also be decoupled from the forward price. Therefore, in the case

of both inventory stockouts and full storage situations, the arbitrage pricing relation between

forward and spot prices will break down. Additionally, consistent with the Theory of Storage,

we find that when inventories are neither very high nor very low, arbitrage restores the

futures-spot spread to its no-arbitrage bounds following temporary violations.

The rest of the paper is structured as follows. In the next section, we discuss arbitrage

and storage in the crude oil market and develop our primary hypotheses. The data is

described and basic results are presented in section 3, where we test for how the market

responds to temporary violations of the no-arbitrage limits and present evidence that

violations of the upper limit persist as storage approaches full capacity. In section 4, we

explore how reverse cash-and-carry arbitrage enforces the lower no-arbitrage limit on the

spread and whether this spread enforcing arbitrage is hindered when available oil inventories

are low. In section 5, we expand the analysis to consider financial as well as physical limits

8

to arbitrage. In section 6, we examine how spot and futures prices change due to arbitrage

and ask whether arbitrage impacts primarily the spot price, primarily the futures price, or

both. Various robustness checks are presented in section 7, and section 8 concludes.

2. Oil market arbitrage and storage

2.1. No arbitrage spread conditions in the oil market

Consider the limits that arbitrage places on the futures-spot and futures-futures

spreads at the crude oil futures contract delivery/pricing hub assuming that storage is

available for lease at the hub location.8 Let Pt,t+v designate the price at time t for delivery at

time t+v and Pt,t+s represent the time t price for delivery at time t+s where s>v. If t+v is the

first available delivery time, Pt,t+v may be referred to as the spot price and Pt,t+s-Pt,t+v as the

futures-spot spread. We will follow that convention here. Since large quantities of crude oil

cannot be delivered instantaneously, virtually all physical delivery contracts in the crude oil

market, including spot contracts, are contracts for delivery over a future period of time –

generally one month (Kaminski, 2012). For futures contracts held to physical delivery, the

CME group allows one calendar month for the oil to be delivered to the Cushing hub. For

example, suppose the current month is June. The three-month futures contract will be the

contract that, if held to expiration, will result in physical delivery of crude oil commencing

September 1 and ending on or before September 30. Similarly, delivery on the two-month

futures contract will occur from August 1-31. In the case of the “spot” contract traded in the

month of June, physical delivery of crude oil will commence July 1 and end on or before July

31. Hence by convention the quoted “spot” prices in the crude oil market are typically prices

for delivery over the nearest forward calendar month.9 Contracts that stipulate physical

delivery over shorter periods, including immediate physical delivery, are rare and typically

for small quantities that can be moved by tanker truck.

8 More generally, storage may also be available at remote locations, in which case the availability of such

remote storage will be determined by both storage and transportation constraints. 9 The CME group stipulates several methods by which the buyer can opt to receive physical delivery. At the

buyer's option, delivery can be made by: (1) by inter-facility transfer ("pumpover") into a designated pipeline or

storage facility with access to seller's incoming pipeline or storage facility; (2) by in-line (or in-system) transfer,

or book-out of title to the buyer; or (3) if the seller agrees to such transfer and if the facility used by the seller

allows for such transfer, without physical movement of product, by in-tank transfer of title to the buyer.

Especially with the third option, physical delivery will effectively be instantaneous and subject only to the

provision that the buyer has acquired the right to store oil in the tank/s used previously by the seller.

9

Let SCt,t+v,t+s represent the present value as of time t of the cost of storing one unit of

the commodity from time t+v to t+s including transaction costs on the futures trades.10 Let

CVt,t+v,t+s designate the (assumed known) present value as of time t of the convenience yield

from holding physical units of the commodity from time t+v to t+s. If Pt,t+s > [Pt,t+v +

SCt,t+v.t+s – CVt,t+v,t+s](1+rt,t+v,t+s) where rt,t+v,t+s is the interest rate from t+v to t+s, arbitrageurs

can earn a “riskless” profit by simultaneously: (1) buying the near-term contract Pt,t+v, (2)

shorting the longer term contract Pt,t+s, and (3) assuming storage capacity is available,

arranging for storage from t+v to t+s.11 An implicit assumption is that funding of the

transaction is not constrained, which we will relax in the analysis presented in section 5. As

arbitrageurs transact to capture the riskless profit, Pt,t+v should rise and Pt,t+s fall until Pt,t+s ≤

[Pt,t+v + SCt,t+v.t+s – CVt,t+v,t+s](1+rt,t+v,t+s). Hence this arbitrage should ensure that:

[Pt,t+s -Pt,t+v] ≤ [Pt,t+vrt,t+v,t+s + (SCt,t+v.t+s – CVt,t+v,t+s)(1+rt,t+v,t+s)] (1)

Ederington, Fernando, Holland, Lee, and Linn (2016) provide strong evidence in support of

this arbitrage relationship for U.S. crude oil futures at the Cushing delivery point.

Assuming trades and storage can be contracted the instant violations of equation 1 are

observed, violations of equation 1 should be fleeting and only observable in high frequency

data. However, if storage takes time to arrange (as explored in section 2.2 below), violations

of equation 1 could arise but be temporary. If storage cannot be arranged immediately, a

trader pursuing riskless arbitrage would need to wait to arbitrage the mispricing between the

spot and futures contracts until storage becomes available.12 In the latter case, the spread may

continue to exceed the no-arbitrage upper bound in equation 1 until sufficient storage

capacity becomes available. Assuming storage can be arranged, any temporary violation of

10 For ease of exposition, we disregard the possibility of storage at a location away from the hub, in which case

any transportation costs between the delivery points for the t+v and t+s contracts need to be added to the

transaction costs. 11 This trade is not completely riskless if the convenience yield is uncertain. For pedagogical simplicity we

allow future storage costs to be uncertain but treat the convenience yield as uncertain but effect of uncertainty

regarding either is basically the same. Nonetheless, risk cannot be completely eliminated due to physical and

financial performance risk. 12 Speculators could execute a naked speculative transaction involving only the spot and futures trades, hoping

that storage can be arranged in the future on terms that would not eliminate arbitrage profit. However, such

transactions are not riskless. Ederington et al. (2016) show that most cash-and-carry arbitrage transactions in

this market tend to be riskless.

10

equation 1 should be followed by a fall in the spread as arbitrage trades take place.13

However if storage is at capacity, violations of equation 1 can persist.

Our treatment of SCt,t+v.t+s in equation 1 warrants clarification. We recognize that

both SCt,t+v.t+s and CVt,t+v,t+s are endogenous. In particular, as discussed below, SCt,t+v.t+s will

tend to rise and CVt,t+v,t+s will tend to fall as inventories increase. Thus, it could be argued

that if storage cannot be arranged immediately, the cost of storage is effectively infinite so

that equation 1 always holds but this is void of any predictive content. To obtain predictive

hypotheses, when we refer to equation 1 being violated, we are treating SCt,t+v.t+s as the cost

of storage when it can be arranged.

We next examine the no-arbitrage lower bound. Consider a trader who holds the

commodity in inventory. If Pt,t+s < [Pt,t+v + SCSt,t+v.t+s – CVt,t+v,t+s](1+rt,t+v,t+s) where SCSt,t+v,t+s

is the saving on storage costs by not storing oil from t+v to t+s minus transaction costs, the

trader can profit by simultaneously: (1) selling the oil for delivery at time t+v at Pt,t+v and (2)

purchasing for delivery at time t+s for Pt,t+s. This frees up storage from time t+v to t+s. If

alternative uses for the storage can be arranged immediately or SCSt,t+v,t+s is known, this

arbitrage is riskless and arbitrage should ensure that:

[Pt,t+s -Pt,t+v] ≥ [Pt,t+vrt,t+v,t+s + (SCSt,t+v.t+s – CVt,t+v,t+s)(1+rt,t+v,t+s)] (2)

Note that this lower bound on the spread may be either positive or negative.14 If alternative

storage uses cannot be arranged immediately and SCSt,t+v,t+s is uncertain, this trade is risky

unless the trades are delayed until alternative uses for the storage have been arranged.

Therefore, it is possible that inequality (2) is violated temporarily. If inventories are depleted

so that there is no oil to sell for delivery at time t+v, then the violation of equation 2 may

persist longer.

13 Additionally, temporary violations of arbitrage bounds could occur because of inattentive traders (Duffie,

2010) or lack of sufficient financial traders in the market, which is also inhabited by physical traders who have

traditionally dominated the market. 14 In commodity futures market analyses, it is sometimes assumed that (1) transaction costs are negligible, and

(2) any storage costs can be completely recaptured if the storage is not used so that SCSt,t+v,t+s = SCt,t+v,t+s and

hence [Pt,t+s -Pt,t+v] = [Pt,t+vrt,t+v,t+s + (SCt,t+v.t+s – CVt,t+v,t+s)(1+rt,t+v,t+s)]. However we argue in section 2.2 below

that in commodity markets, and crude oil in particular, SCSt,t+v,t+s is generally less than SCt,t+v,t+s either because

transaction costs are not negligible or because storage costs cannot be totally recouped if the storage is unused.

Hence there is normally a gap between the upper and lower spread limits in equations 1 and 2.

11

2.2. Storage and storage costs

If oil is purchased for delivery at time t+s and sold for delivery at time t+v in a cash-

and-carry arbitrage, storage must be arranged for the time period between t+s and t+v. Since

the availability and cost of storage are important to our analysis, it is helpful to summarize

relevant characteristics of crude oil storage. Once produced, crude oil may be stored in tank

farms, underground caverns, refineries, and pipelines, or off-shore in tankers. Particularly

important for our purpose are storage levels and costs at the pricing point and delivery hub

for the WTI futures contract, which is in Cushing, Oklahoma. Ederington et al. (2016) find

that most arbitrage in the WTI crude oil market entails Cushing oil inventories. The U.S.

Energy Information Administration (EIA) estimates the working capacity of tank farms in the

U.S. at 399.7 million barrels as of September 2015 of which 73 million barrels, or 18.0%, are

at Cushing, making it the largest oil storage facility in the world (EIA, 2015,

http://www.eia.gov/petroleum/storagecapacity/table1.pdf ). Cushing, labeled the “pipeline

capital of the world,” is connected to crude oil production facilities and oil refineries

throughout the United States through an extensive pipeline network. Oil is stored in Cushing

for operational, arbitrage, and speculative purposes. While anecdotal and media reports

appear from time to time about investment banks and other oil traders leasing storage at

Cushing for arbitrage and speculative purposes (see, for example, Leff, 2015), hard data is

unavailable. However according to the EIA, in spring 2015 approximately 80% of the storage

at Cushing was leased by the owner-operators to others while the percentage leased to others

at other tank farms in the U.S. was only about 29%. Storage away from Cushing entails

additional transportation costs or additional risk to arbitrage using crude oil futures since the

delivery point for the NYMEX oil futures contract is Cushing.15 This suggests that much of

the storage at Cushing is leased for arbitrage or speculative purposes. Storage capacity at

Cushing has grown considerably over the last decade. The EIA reports that working capacity

increased from 46.0 million barrels in September 2010 to 71.4 million in March 2015.

Capacity figures prior to 2010 are unavailable but the maximum held in storage prior to

January 2006 was only 22.8 million barrels. The business media commonly attribute at least

15 Nonetheless, the press has published articles describing crude oil being stored on floating tankers in

conjunction with arbitrage trades (See, for example, Kent and Kantchev, 2015). However, a precise time series

of tanker storage data is not available.

12

some of this storage construction to demand for storage by WTI futures traders (see, for

example, Blas, 2015, and Kaufman, 2015).

Storage contracts at Cushing and elsewhere are typically over-the-counter and thus

may require some time to arrange.16 Moreover, they have counterparty risk, force majeure

and physical delivery risk elements that are not present in purely financial arbitrage trades,

which may discourage or delay the actions necessary to implement the arbitrage trades. If

delayed, spreads above the no-arbitrage limit may persist until storage can be arranged.17

Unfortunately, historical figures for the storage cost measures SCt,t+v,t+s and SCSt,t+v,t+s

are unavailable so we cannot directly test for violations of equations 1 and 2. Instead, as

explained below, we test for indirect evidence of violations and consequent market

corrections by examining changes in the futures-spot spread. Average crude oil storage costs

at Cushing are commonly estimated around $0.40 to $0.50 per barrel per month but

reportedly vary considerably depending on capacity utilization.18 If a trader wants to execute

an arbitrage transaction but has not yet leased storage capacity, the cost to him is the storage

cost per barrel stated in the new lease. If a trader has already leased storage, what matters to

him in considering a particular arbitrage possibility is the storage unit’s opportunity cost,

which will vary with capacity utilization and quite possibly across individual traders.

Consider, for instance, a trader who has leased storage capacity for a year at $0.50 a

barrel/month. After the lease is signed, the $0.50 becomes a sunk cost and what then matters

is the marginal opportunity cost of using the storage capacity. Depending on whether it is

possible to re-lease the unused storage capacity, this marginal opportunity cost may vary

from zero to the re-lease rate. Moreover the unused storage has an option value. If the trader

institutes a cash and carry arbitrage as soon as the spread widens sufficiently to make the

arbitrage profitable and hence fills his storage units to capacity, he loses the option to

conduct the arbitrage on even more favorable terms in the future if spreads should widen

16 The CME has recently launched an oil storage futures contract at the Louisiana Offshore Port but not as yet

for storage at Cushing. 17 Faced with an apparently profitable futures-spot spread that exceeds expected storage costs, some traders may

trade the futures and spot immediately. In doing so, they accept the risk that storage cannot be arranged or will

be more expensive than anticipated and the trade will remain a speculation unless and until the exposure is

covered in the physical market. 18 Private communication with a company specializing in oil and petroleum product storage confirms that the

typical price has been about $.50/barrel per month but that the cost increases whenever the market is in

contango, suggesting those are periods in which demand for storage capacity is high.

13

further. Thus, in this case, SCt,t+v,t+s is hard to measure and may vary across traders but

undoubtedly varies positively with capacity utilization.

While storage costs likely vary positively with capacity utilization, the convenience

yield likely varies inversely as Einloth (2009), Gorton, Hayashi, and Rouwenhorst (2013)

and others point out, reinforcing the tendency for the upper no-arbitrage bound in equation 1

to vary positively with capacity utilization levels. Gorton, Hayashi, and Rouwenhorst (2013)

argue that the convenience yield in commodity futures should vary inversely with inventory

levels since low inventory levels increase price volatility. Consistent with this, using data for

33 commodity markets, they find that: 1) the cash-futures basis is an inverse function of

inventory levels, and 2) returns to a strategy of holding long futures positions are positive and

inversely correlated with inventory levels. Turning to reverse-cash-and-carry arbitrage, the

storage cost savings if the trader draws down his inventory, SCSt,t+v,t+s, depend on whether

the storage tank can be re-leased or used for other purposes since he will pay the storage cost

per barrel of leased capacity whether he has oil stored or not. If it cannot be re-leased,

SCSt,t+v,t+s is zero.

Since we hypothesize that arbitrage possibilities are limited by available storage

capacity and that storage costs vary directly with capacity utilization, we need measures of

both actual storage levels and storage capacity at Cushing. Since the EIA began reporting

actual weekly Cushing storage levels in April 2004, our data period begins April 5, 2004.19

The EIA began surveying and reporting storage capacity figures semi-annually in September

2010.20 Since the EIA’s capacity figures cover only the latter third of our data period and

only estimate shell and working capacity, not effective capacity, we use as our primary proxy

for effective capacity a measure based on historical peaks in actual storage. In Figure 1, we

chart Cushing estimated working storage capacity levels and actual storage levels from April

2004 to April 2015 on a logarithmic scale.

***Insert Figure 1 about here***

Searching for the lowest number of peaks or inflection points for a linear spline

19 Prior to that time the Cushing figures were lumped into those for the Midwest region. 20 The EIA reports both shell capacity and working capacity where the latter lower figure adjusts for the fact

that oil at the bottom of the tank is not obtainable and that the tanks cannot be filled to the very top. Both the

EIA and others stress that the unknown effective capacity is less than either figure since some space is required

for effective operation.

14

function, which bounds all observed storage levels with inflection points at the chosen peaks,

yields the linear spline with peaks at 4/18/2005, 2/2/2009, 1/4/2013, and 4/3/2015 shown as

the solid blue line in Figure 1 where we also graph actual storage levels. We use this log

linear function as our initial and primary proxy for computing effective storage capacity. In

section 7, we also use the EIA measures of working capacity for the October 2010 – April

2015 sub-period for which these figures are available.

2.3. Storage and spreads – Initial evidence

According to the analysis in section 2.l, large positive spreads between the prices of

contracts for longer-term and near-term delivery should only persist when storage levels

approach capacity so that arbitrageurs find it difficult or impossible to arrange storage.

Initial evidence on this is presented in Figure 2 where we chart 10-day moving averages of

both capacity utilization and the futures-spot spread.

***Insert Figure 2 about here***

Capacity utilization is measured as the ratio of the actual level of crude oil stocks as

reported by the EIA to the proxy for effective storage capacity described in section 2.2. The

futures-spot spread in Figure 2 is measured as the difference between the price of the second

and nearby futures contracts. As predicted, Figure 2 shows that large positive futures-spot

spreads are generally associated with high levels of capacity utilization.

2.4. Hypotheses

While Figure 2 indicates that high futures-spot spreads are generally associated with

high levels of capacity utilization, this does not necessarily indicate that physical limits to

arbitrage were a constraint since other factors could account for the correlation in Figure 2.

For instance, an unexpected short-term decline in the demand for crude oil which is not

expected to persist could lead to both an increase in crude oil inventories and a

disproportionate decline in the spot price, and therefore an increase in the spread.

For further evidence on the effect of arbitrage and its limits, we analyze the behavior

of the futures-spot spread across time. Ideally, if we could observe the storage cost and

convenience yield terms, SCt,t+v,t+s, SCSt,t+v,t+s, and CVt,t+v,t+s, in equations 1 and 2, we could

15

explore how they change as storage levels approach capacity and test for violations of

equations 1 and 2. However, as explained in section 2.2 those data are unobservable.

Consequently, we test for arbitrage and its limits based on the behavior of the futures-spot

spread.

Consider the implications of the analysis in section 2.1 for the behavior of the futures-

spot spread. As long as the futures-spot spread is between the upper bound defined by

equation 1 and the lower bound defined by equation 2, arbitrage should not occur. In this

case, if markets are weak form efficient and news arrives randomly, the change in the spread

one day should be independent of previous spread changes. However, if a change in the

spread in period t-1 carries the spread above the no-arbitrage bound, this should set off

arbitrage in which arbitrageurs buy for delivery in the near-term and sell for delivery in the

longer-term resulting in a decline in the spread in period t. Likewise, a fall in the spread

below the no-arbitrage upper bound in period t-1 should set off arbitrage that raises the

spread in period t. Thus, we expect successive changes in the spread to be uncorrelated

within the upper and lower no-arbitrage bounds and negatively correlated outside the no-

arbitrage bounds.

Testing for evidence of arbitrage based on spread autocorrelations is complicated by

the fact that we cannot observe storage costs, transaction costs, the convenience yield, or the

storage cost savings when storage levels are reduced, and so we cannot compute the upper

and lower no-arbitrage bounds. However, we expect storage costs to vary directly and the

convenience yield to vary inversely with storage capacity utilization and thus, the upper

bound should rise as capacity utilization rises. In addition, for given values of the no-

arbitrage bounds, the likelihood that a change in the spread leads to a violation of the upper

or lower bounds should depend on the sign and size of both the change in the spread and its

16

prior level.21 For instance, suppose the period t-1 change in the spread is positive. In this

case, it is more likely that it crosses the upper bound and thus leads to a fall in the spread in

period t if the spread level at the beginning of period t-1 is positive and high than if the

spread at the beginning of period t-1 is negative or low. Likewise, a given decline in the

spread in period t-1 is more likely to cross the lower bound, and thus lead to a rise in period t,

if the spread at the beginning of period t is already negative or low. This leads to our first

hypothesis:

H1: If there are no limits to arbitrage, an increase (decrease) in the futures-spot spread should

be more likely to lead to arbitrage and a subsequent fall (rise) in the spread if the spread is

already high (low).

It is not clear a priori whether we should observe this predicted autocorrelation

pattern in weekly, daily, hourly, or higher frequency data. If the arbitrage can be arranged

almost instantaneously, then we should observe these patterns only in high frequency data if

at all. However, we have argued above in sections 2.1 and 2.2 that, when the spread rises

above the upper bound, risk or the inability to arrange storage may lead arbitrageurs to delay

going long in the near-term contract and shorting the longer-term contract until they can

contract for storage. Thus it may take hours or days until the spread change reverses. We test

for arbitrage patterns in daily data.

We have argued above that there are likely physical limits to arbitrage. When storage

is near capacity, both storage costs and the lead time required to contract storage will likely

increase. Given these difficulties in storage contracting when storage is near capacity, a

21 To see this, suppose the upper bound is fixed and equal to U. Suppose also that at t-1 the actual spread equals

A. The distance to U therefore equals U-A. The narrower the gap the smaller is the required change in the

spread before the upper bound is breached. Assume arbitrage opportunities arrive randomly (that is the change

in the spread arrives randomly each period) and that the size of the change is a drawing from a stationary

distribution with mean 0 and constant variance. The probability that U will be breached given U and A equals

the probability that the change will exceed U-A. To illustrate, suppose the change in the spread is a drawing

from a 2(0, )N distribution. Therefore, the probability that the change will breach U equals PrU A

z

,

which is increasing in A since U is fixed. Conditional on the change equaling an arbitrary value P, the

probability evaluated at t-1 equals PrU A P

z

which is increasing in P given U and A.

17

spread change reversing arbitrage is less likely, implying lower first-order autocorrelation.

This leads to our second hypothesis:

H2: When storage levels are at or near capacity, cash-and-carry arbitrage is more costly or

difficult, thus if the futures-spot spread is positive and high, a further increase in the spread is

less likely to be followed by arbitrage and a reversal in the spread.

Similarly, if there is an inventory stockout or if inventories are at the minimum required for

operational purposes, reverse cash-and-carry arbitrage, in which arbitrageurs sell from

inventory in the spot market, cannot occur, leading to our third hypothesis:

H3: When tradeable inventory levels are at or near zero, reverse cash-and-carry arbitrage is

more costly or difficult, thus if the futures-spot spread is negative and low, a further decrease

in the spread is less likely to be followed by arbitrage and a reversal in the spread.

3. Results

3.1. Data description and initial evidence.

While futures and spot price data are available from 1983, our data period begins in

April 2004 when the EIA began reporting crude oil stock levels at Cushing. We examine

daily prices of NYMEX WTI crude oil futures contracts from April 6 2004 through May 6,

2015 obtained from the website of the Energy Information Administration. Descriptive

statistics for the futures-spot spread measured as the difference between the prices of the

second and nearby contracts are presented in Table 1 for both the level and daily changes in

the spread.

***Insert Table 1 about here***

Interestingly, while Schwartz (1997) and Routledge, Seppi, and Spatt (2000) argue

that in this market backwardation should be more common than contango, the market

actually tended to be in contango over much of this period with the spread averaging $0.52,

as shown in Panel A. The spread was positive on 76.5% of the observed days. With a

standard deviation of $0.39, daily changes in the spread were fairly large.

18

Partial autocorrelations out to a four-day lag are reported in Panel B for both the

spread and its components. Several patterns are worth noting. First, consistent with our

arbitrage argument, there is evidence of fairly strong mean reversion with a first order

autocorrelation of -0.246 between changes on successive days. Clearly, there is a tendency

for increases and decreases in the spread to be partially reversed on subsequent days. In the

absence of arbitrage, this would seem to violate weak-form efficiency. Second, the spread

displays considerably more mean reversion than either of its two components. The first order

autocorrelation for the futures and spot price changes are only -0.045 and -0.053 respectively.

This indicates that the mean reversion of the spread is not simply a reflection of mean

reversion in the spot and futures prices due to some other cause such as bid-ask bounce.

Third, while the first-order autocorrelation in spread changes is clearly the largest, there is

also evidence of negative partial correlation at lags of two and three days. As we discussed

above, it is unclear a priori how long it would take arbitrageurs to contract storage and thus

how quickly arbitrage should reverse violations of the no-arbitrage bounds. If indeed the

mean reversion observed in the spread is due to arbitrage, this indicates that most of the

reversal takes place in one day but that full reversal may take several days.

3.2. Testing for evidence of arbitrage activity and physical limits to arbitrage

According to the arbitrage hypothesis, mean reversion in the spread should be

observed only when the change in the spread crosses the no-arbitrage bounds – and then only

if arranging the arbitrage transactions occurs with a lag. As long as the spread is fluctuating

within the no-arbitrage bounds, weak form efficiency implies that there should be little, if

any, autocorrelation. As discussed above, testing is complicated by the difficulty in

measuring the no-arbitrage bounds. However, as we argued in section 2.4 and H1, an

increase in the spread in period t-1 is more likely to cross the no-arbitrage upper bound, and

thus lead to mean reversion in period t, if the spread is already positive and high than if it is

negative or low. Likewise, a negative change in the spread is more likely to cross the no-

arbitrage lower bound leading to mean reversion if the spread is already low.

To test this, we examine variations of a simple regression ΔSPt = β0 + β1ΔSPt-1+et, 22

22 The estimated β1 = -0.246 which is significant at the 1% level based on Newey-West standard errors.

19

where ΔSPt = SPt – SPt-1 and SPt = Ft – St with Ft being the futures price on day t and St

being the spot price. We consider several variations. First, we divide the sample into (1)

cases when the change in the spread and the beginning spread have the same sign and (2)

cases when they have different signs. We define DSamet-1 =1 if ΔSPt-1*SPt-2 >0 and =0

otherwise. Thus DSamet-1 =1 if a positive spread at time t-2 is followed by a further increase

in the spread at time t-1 or if a negative spread at time t-2 is followed by a further decrease at

time t-1, i.e. if the spread at time t-2 and the change in day t-1 have the same sign. We define

DDifft-1 =1-DSamet-1. Thus DDifft-1=1 if ΔSPt-1*SPt-2 <=0, i.e., if the spread change on day t-

1 is opposite in sign to the spread on day t-2. As reported in Table 2 DDifft-1=1 is slightly

more common than DSamet-1=1 since it includes the cases when ΔSPt-1 or SPt-2 are zero.

***Insert Table 2 about here***

Second, we separate those cases when the change in the spread (at time t-1) and the

beginning spread (at time t-2) have the same sign into two groups: contango, where both

signs are positive (so that the upper bound might be violated) and backwardations, where

both signs are negative (or zero) (so that the lower bound might be violated). DPost-1=1 if

SPt-2>0 and ΔSPt-1>0 and zero otherwise and DNegt-1=1 if SPt-2<0 and ΔSPt-1<0 and zero

otherwise. Table 2 shows that during our sample period DPos is more common than DNeg.

Third, given that the spread is positive and in contango for 76.5% of the observations

in our sample, we further divide the set when a positive spread is followed by a further

increase in the spread into: (1) those cases when the time t-2 spread is positive but less than

$0.50 and (2) those cases when the time t-2 spread is positive and greater than $0.50. We

choose $0.50 since this is a common estimate of storage costs and since the median spread is

$0.45. Specifically, we take cases where the spread is positive (SPt-2>0) and the change in

spread is positive (ΔSPt-1>0) and define DPosLowt-1=1 if the spread is between 0 and $0.50,

0<SPt-2<=0.50, and zero otherwise and DPosHight-1=1 if the spread is above $0.5 (SPt-2>0.5)

and zero otherwise.

Fourth, we consider instances of contango where the spread is more likely to provide

opportunities for arbitrage (positive, increasing and above $0.50) but storage capacity is

exhausted. To test this, we define the dummy variable DCap_Utilt=1 if the ratio of actual

Cushing storage levels announced by the EIA for that week divided by our estimate of

20

effective capacity based on historical production peaks as described in section 2.2 is in the

top 20% of observed levels of capacity utilization. This translates to capacity utilization rates

exceeding approximately 92%.23 Since storage levels should only affect the likelihood of

arbitrage when the spread increases from already high levels, we interact this variable with

DPosHigh t-1. As reported in Table 2, DCap_Utilt-1*DPosHight-1=1 for approximately 7.4%

of our observations.

Table 3 presents results of the four variations of the simple regression ΔSPt = β0 +

β1ΔSPt-1+et described above. Model 1 examines periods where change in the spread and the

beginning spread have the same sign and those with the opposite sign by modifying the

above mentioned simple regression in the following manner: ΔSPt = β0 + β1DDifft-1*ΔSPt-1

+β2 DSamet-1*ΔSPt-1+ et. According to our hypothesis H1, the likelihood of setting off

arbitrage by crossing either no-arbitrage bounds is greater if DSamet-1=1; thus H1 implies β2-

β1<0. Model 2 examines periods where change in the spread and the beginning spread have

the same sign and further separates those into periods of backwardation (negative spreads)

and contango (positive spreads). We expect that the coefficient of DNegt-1*ΔSPt-1 will be

more negative than that of DPost-1*ΔSPt-1.24

***Insert Table 3 about here***

Model 3 further divides contango cases into those where spreads are above and below

$0.50 We expect more instances of spread movements above the $0.50 level to set off cash-

and-carry arbitrage (given that the approximate cost of storage is $0.5), and thus more mean

reversion, when DPosHight-1=1. Model 4 adds a physical storage capacity limit. In

hypothesis H2 in section 2.4., we hypothesized that as crude oil storage tanks approach

capacity, storage costs rise, thereby elevating the no-arbitrage upper bound for cash-and-

carry arbitrage. This implies that, for a given level of SPt-2, a further increase in the spread is

less likely to cross the no-arbitrage bound setting off cash-and-carry arbitrage and mean

reversion when capacity utilization is high. We also hypothesized that arbitrage and mean

23 We consider alternative measures of storage capacity utilization in section 7. 24 We have argued that SCSt,t+v,t+s is generally less than SCt,t+v,t+s and therefore, expect the likelihood that

the lower bound is crossed when Dnegt-1=1 to be greater than the likelihood that the upper bound is crossed

when Dpost-1=1. This depends on the relative sizes of storage costs, SC, storage cost savings, SCS and the

convenience yield, CV, in equations 1 and 2. But, the fact that mean and median spreads are strongly positive

and that negative spreads are observed only 23.3% of the time suggests that storage costs and savings normally

exceed the convenience yield in this market so that the absolute value of the upper bound exceeds the absolute

value of the lower bound (which may even be positive).

21

reversion might be delayed because more time may be required to contract storage when

capacity is tight. This implies that even when other conditions for cash-and-carry arbitrage

are met, i.e. a further increase in the spread from already high spread levels, arbitrage and

mean reversion are less likely when storage levels are already high. Therefore, we expect less

mean reversion when DCap_Utilt-1*DPosHight-1=1.

Results for Model 1 in Table 3 are striking. 𝛽1̂ is insignificant and even positive in

sign, indicating that there is no evidence of mean reversion on day t when a positive spread at

time t-2 is followed by a negative change at time t-1 or a negative spread at time t-2 is

followed by a positive change on day t-1. This is consistent with our argument that in these

cases, the change on day t-1 is unlikely to cross the no-arbitrage bounds and set off arbitrage.

In summary, when the change in the spread in period t-1 has the opposite sign to the level of

the spread at the beginning of period t-1, making it unlikely that the change in period t-1

carried the spread outside the no-arbitrage bounds, there is no evidence of mean reversion or

arbitrage and weak form efficiency holds.

On the other hand, 𝛽2̂ = -.679 which is significant at the .0001 level based on Newey-

West standard errors. This implies that when a positive spread on day t-2 is followed by a

further increase in the spread on day t-1 or when a negative spread is followed by a further

decrease, approximately two-thirds of the day t-1 change is reversed on day t. This is

consistent with our argument that in these cases, the change on day t-1 is more likely to cross

one of the no-arbitrage bounds leading to arbitrage which partially reverses the day t change.

Needless to say 𝛽2̂ - 𝛽1̂ <0 and the difference is significant at the .0001 level confirming

hypothesis H1. This evidence also indicates that arranging arbitrage trades takes some time,

likely because storage takes time to arrange, so that temporary violations of the no-arbitrage

limits are observed.

Model 2 in Table 3 examines cases where the spread on day t-2 and change in spread

on day t-1 have the same sign, which are further separated into cases where the spread is

increasing or decreasing. While significantly negative in both cases, as expected, the

coefficient of DNegt-1*ΔSPt-1 is considerably larger in absolute terms than the coefficient of

DPost-1*ΔSPt-1. The difference between the two is significant at the .01 level. This pattern is

consistent with our argument that the no-arbitrage lower bound is smaller in absolute terms

22

than the upper bound, and may even be positive in some periods so that arbitrage is more

likely when the t-2 spread and the t-1 change are both negative than when both are positive.

Model 3 shows that, as hypothesized, when the time t-2 spread is positive, the

tendency for the spread to mean revert is much stronger when the beginning spread exceeds

$0.50 than when it is positive but less than $0.50. Thus, it appears that increases in the spread

are more likely to cross the no-arbitrage upper bound thus leading to cash-and-carry arbitrage

if the t-2 spread is more than approximately $0.50. This further confirms our hypothesis H1.

Model 4 confirms our hypothesis H2. The coefficient of DCap_Utilt-1* DPosHight-1*

ΔSPt-1 is positive and significant at the .05 level. Moreover, the coefficient of DPosHight-1*

ΔSPt-1 is a highly significant -0.6588. The latter result implies that when the spread increases

from a level exceeding $0.50, approximately 65.88% of the increase in the spread tends to be

reversed the next day if capacity utilization levels are below the top 20%. However, when

capacity utilization levels are in the top 20% of observed utilization levels, the estimated

reversal is only 0.6588-0.4705 = 18.83% which is insignificant at the 10% level. In other

words, when the spread already exceeds $0.50, a further increase in the spread tends to be

mostly reversed if capacity utilization is low but not reversed significantly if capacity

utilization is already high. This is consistent with our argument that available storage

capacity imposes a physical limit on cash-and-carry arbitrage.

Note that when capacity utilization is high and storage approaches its capacity limits,

the cost of storage increases causing the spread to increase. However, this effect works

against our finding evidence supporting H2. Specifically, when DPosHight- 1=1 and

DCap_Utilt-1=1, the mean of SPt-1 is $1.891 while when DPosHight- 1=1 and DCap_Utilt-1=0,

the mean of SPt-1 is $1.219. Thus, in the absence of storage limits, the incentive to undertake

cash and carry arbitrage that would reverse the spread increase would be even greater when

DCap_Utilt-1=1 implying a negative coefficient for DCap_Utilt- 1* DPosHight-1* ΔSPt-1.

Despite this, we find a positive coefficient, indicating that cash-and-carry arbitrage is less

likely to occur when actual oil in storage is close to capacity.

3.3 Volatility

In Table 4, we add measures of spread volatility to the regression. Movements of the

23

futures-spot spread are more likely to cross the no-arbitrage bounds and thus trigger C&C

arbitrage when spread volatility is high. In our regressions, we relate the spread change at

time t to the change in the spread at time t-1, but profitable arbitrage may also be more likely

if the changes in the spread at times t-2, t-3, t-4 etc. were large. Also, our measure of the

change based on settlement prices, misses intraday spread changes which might set off

arbitrage and these are likely larger when volatility is high. Finally, spread volatility tends to

be higher when Cushing storage tanks are almost full. Hence it is important to control for

volatility when testing for evidence of physical limits to arbitrage.

Our measure of spread volatility is the log of one plus the standard deviation of the

futures-spot spread over the 20 days prior to t-2. With the exception of ΔSPt-1, all our

independent variables in Table 3 are zero-one dummies. To facilitate interpretation of the

coefficient of the continuous volatility variable and comparison with the other coefficients,

we standardize this log volatility measure to a mean of zero and variance of one.25 We label

this variable: Volt-2. As explained and confirmed above, in the absence of physical limts,

violation of the upper no-arbitrage limit should be most likely when the spread increases at

time t-1 after already exceeding $0.50 at time t-2. To test whether violation of the no-

arbitrage limit and thus a reversal of the spread increase at time t-1 under these conditions is

more likely when spread volatility is high, we interact Volt-2 with DPosHight-1* ΔSPt-1 and

add this variable to Model 4 of Table 3. The hypothesis that C&C arbitrage is more likely

when volatility is high implies a negative coefficient on the new interaction variable.

*** Insert Table 4 about here ***

Results with this variable added are reported as Model 1 in Table 4. As expected the

coefficient of the interactive volatility variable is negative and significant at the 1% level

implying C&C arbitrage is more likely when volatility is high. The coefficients in Table 4

imply that when spread volatility is at its mean level, i.e. Volt-2=0, and storage is not

constrained, approximately 32.2% of the increase in the spread at time t-1 (from an already

high level a time t-2) tends to be reversed at time t. However, when volatility is one standard

deviation above its mean, approximately 55.2% (-0.3219-0.2296) of the increase in the

spread at time t-1 tends to be reversed.

25 Prior to standardization, the mean was .23213 and standard deviation .1955.

24

Importantly, controlling for volatility leads to a substantial increase in the coefficient

of the capacity utilization variable which rises from .4705 in Table 3 to .8180 and is now

significant at the .1% level. The implication is that C&C arbitrage is impeded when oil

storage approaches capacity.

Our evidence in Table 3 indicates that, consistent with reverse C&C arbitrage, a

further decline in the futures spot spread at time t-1 from an already negative level at time t-

2, tends to be partially reversed at time t. To test whether this tendency is even stronger

when volatility is high (as we would expect), in Model 2, we add a variable in which we

interact Volt-2 with DNegt-1* ΔSPt-1. The hypothesis that reverse C&C arbitrage is more likely

when volatility is high implies a negative coefficient.

We found no significant evidence in Table 3 that increase in the spread at time t-1

from a positive but low level at time t-2 (specifically < $0.50) sets off C&C arbitrage. To

test whether spread volatility impacts the probability of a spread reversal in this case, we

interact Volt-2 with DPosLowt-1*ΔSPt-1. We have no strong prior for this coefficient sign. It

is possible that while the conditions for C&C arbitrage are not normally met in this case, they

might be when volatility is high implying a negative coefficient. On the other hand, it may

be that volatility has little impact when the other conditions for arbitrage are not met

implying a coefficient insignificantly different from zero.

Finally, in Model 2 in Table 4 we include Volt-2 un-interacted. We do not expect a

significant coefficient for this variable since our theory implies that the impact of volatility

on the change in the spread at time t should be conditional on the prior level and change in

the spread. Nonetheless, it seems prudent to put this expectation to the test and to determine

if our interacted volatility variables are actually just proxying for an un-interacted effect.

The results in Model 2 confirm our expectations. Consistent with our reverse C&C

arbitrage hypothesis, the coefficient of Volt-2*DNegt-1* ΔSPt-1is negative and significant

implying that when the spread at time t-1 declines further from an already negative level, the

spread reversal tendency at time t is stronger when volatility is high. As expected the

coefficient of the un-interacted volatility measure is insignificant and the coefficient of

Volt-2*DPosLowt-1* ΔSPt-1 is also insignificant. In Model 3, we drop Volt-2 and

Volt-2*DPosLowt-1* ΔSPt-1 since these variables both lack theoretical justification and are

25

insignificant. Model 3 is used as our base model for a number of subsequent estimations

3.4 Summary

In summary, the behavior of the futures-spot spread in the crude oil market shows

evidence of considerable arbitrage activity that is sometimes limited by significant physical

constraints. We argued that an increase (decrease) in the spread is unlikely to set off

arbitrage if the spread is negative (positive) prior to the increase (decrease) and there is no

evidence of mean reversion in the spread when this is the case. Likewise, when the spread is

positive and storage costs are substantial, minimal arbitrage and mean reversion are expected

when the beginning spread is positive but small. Consistent with this expectation, we find

evidence of only a slight mean reversion when the beginning spread is positive but below

$0.50. In other words, when the past spread pattern makes it unlikely that the no-arbitrage

bounds are crossed, there is little evidence of mean reversion or arbitrage, and weak form

efficiency holds.

However, as hypothesized, we find evidence of strong mean reversion when an

already negative spread declines further. Likewise, in the absence of capacity constraints, we

find evidence of strong mean reversion when the spread increases after it already exceeds

$0.50. These mean reversion tendencies are especially strong when futures-spot spread

volatility is high. However, when capacity utilization rates are relatively high, the mean

reversion tendency is non-existent or much weaker, indicating that due to unavailability or

high cost of storage, arbitrage to reverse the spread is limited.

The spread’s mean reversion tendency could be due to other factors such as

inefficient markets. However, the fact that it is not observed (or is much weaker) when the

spread and the change in the spread are of different signs or when the beginning spread is

positive but small, and is substantial when the spread falls from a negative level or increases

from an already high level seem most consistent with arbitrage. In short, strong mean

reversion is observed when the arbitrage hypothesis implies it should be observed and not

observed when arbitrage is not expected. Moreover, the fact that this is observed in daily

data indicates that storage cannot normally be contracted immediately but takes some time to

arrange.

26

This evidence of physical limits to arbitrage has serious economic efficiency

implications. Cash-and-carry arbitrage tends to allocate assets across time in an efficient

manner. If traders foresee a future crude oil shortage and thus bid up the futures price

relative to the spot price, the resulting arbitrage leads to oil being taken off the market in the

current time of relative plenty and coming back on the market during the foreseen period of

relative scarcity. If storage capacity is limited, this reallocation cannot take place.

4. Minimum Inventories and Reverse Cash-and-Carry Arbitrage

In section 3, we examined the possible effects of storage limits on cash-and-carry

arbitrage, finding that when the market is in contango, the futures-spot spread is already high,

and there is excess storage capacity, further increases in the spread tend to be reversed

consistent with cash-and-carry arbitrage. However, when the futures-spot spread is already

high but there is little or no excess storage capacity, further increases in the spread are not

reversed (or the reverse is much smaller) indicating that cash-and-carry arbitrage is limited

by the lack of storage.

As discussed above, the existing theory of storage has focused more on the

consequences of inventory depletions than storage limits. If inventories are quite low, the

risk of a stockout is high, leading to a convenience yield for physical holdings of the

commodity. In addition, if there are no inventories, reverse cash-and-carry arbitrage in which

the arbitrageur simultaneously sells from inventory in the spot market and buys (or goes

long) in the futures market cannot take place. Accordingly, the arbitrage enforcing a lower

limit on the futures-spot spread cannot occur. This suggests that just as we find evidence of a

lack of available storage capacity inhibiting the ability of cash-and-carry arbitrage to return

the futures-spot spread to below the no-arbitrage upper bound when oil inventories approach

capacity, we might find evidence that an inventory shortage inhibits the ability of reverse

cash-and-carry arbitrage to enforce the no-arbitrage lower spread bound.

Determining the level at which inventories might be so low as to limit reverse cash-

and-carry arbitrage is difficult. Oil is an economically essential commodity and inventories

are never zero. Indeed, over our 2004-2015 period, actual oil storage levels at Cushing never

fall below 30.5% of our capacity proxy. Still, it is possible that inventories could fall below

27

the minimum level required for operational purposes, leaving no inventories for reverse C&C

arbitrage, but it is hard to determine what this minimum level might be.26 Testing for possible

inventory limits on reverse C&C arbitrage is also complicated by the relative scarcity of days

in our sample (10.3% of all days) when the spread declines from an already negative level so

that reverse C&C arbitrage might be profitable.

Despite these limitations, we nonetheless explore the possibility of lower inventory

limits on reverse C&C arbitrage. Analogous to our definition of DCap_Utilt, we first define

DLow_Inv_10t =1 when the ratio of actual inventories to our capacity proxy is in the lower

10% of observed capacity utilization ratios.27 This translates to capacity utilization rates

below 52%. Since this inventory cutoff level is high relative to the minimum observed of

30.5%, we also define DLow_Inv_5t =1, and DLow_Inv_3t =1 when the ratio of actual

inventories to our capacity proxy is in the lower 5% and 3% of observed capacity utilization

ratios, i.e., below 40% and 35%, respectively.

In the previous section, we made a distinction between cases when the spread at time

t-2 was positive but less than $0.50 and cases when it was more than $0.50, since cash-and-

carry arbitrage should be more profitable when the spread is larger (as our evidence

confirmed). Likewise, we would expect reverse cash-and-carry arbitrage to be more

profitable when the spread is negative and large in absolute terms than when negative but

small. However, we do not split these observations into spreads above and below -$0.50 due

to several reasons. First, the number of backwardation observations (DNegt-1) is quite low in

our sample. 28 Second, while +$0.50 is an often-quoted storage cost, there is no such guide as

to where to draw the lower bound and it might not even be negative. Also, while $0.50 is

quite important in the case of C&C arbitrage as storage will need to be secured, in the case of

reverse C&C there are may be many instances where the $0.50 storage cost is sunk.

26 While we refer to a minimum operational level for ease of exposition, it should be made clear that this is

likely a continuum since the risk of a stock-out increases as the level of operational inventories is lowered. Thus

it is more accurate to say that as inventories decline, obtaining oil for reverse cash and carry arbitrage becomes

more difficult. 27While we use the upper 20% of observed levels of capacity utilization to define the upper bound, it is

equivalent to storage being 92% full, which is a likely physical capacity constraint. However, the smallest 20%

of observations happen when storage is over 63% full, which is hardly a shortage. We therefore consider the

bottom 10% of observations when defining the lower bound. 28 Unfortunately, as noted above, cases when the spread is negative make up only 23.5% of our sample so the

number of cases when SPt-2< -$0.50 and ΔSPt-1<0 is only 5.3%.

28

To test the ability of reverse cash-and-carry arbitrage to reverse the spread (i.e., pull

up the spread following the further decline of a negative spread that is more than $0.50 in

absolute terms) we include in the regression the interaction variable DLow_Inv_10t-1* DNegt-

1* ΔSPt-1. The hypothesis that reverse cash-and-carry arbitrage is impeded when inventories

are low (specifically in the bottom 10% of capacity utilization) implies less mean reversion

when DLow_Inv_10t-1* DNegHight-1=1.

Results are shown in Model 1 in Table 5 where we repeat the Model 3 regressions

from Table 4 adding DLow_Inv_10t-1* DNegt-1* ΔSPt-1. The significant negative coefficient

for DNegt-1*ΔSPt-1 indicates again that when the spread is negative on day t-2 and declines

further on day t-1, reverse cash-and-carry arbitrage tends to pull the spread back up on day t.

***Insert Table 5 about here***

In Model 1, there is no evidence that reverse cash-and-carry arbitrage is impeded by

lower than normal inventories in our data period. Indeed, the coefficient of DLow_Inv_10t-

1* DNegt-1* ΔSPt-1 is negative and significant at the 1% level while the hypothesis that

reverse C&C arbitrage is impeded by inventories in the bottom 10% of observed capacity

utilization levels implies a positive coefficient. This negative coefficient is likely due to the

correlation between inventory levels and the spread. Also, DLow_Inv_10t-1 still includes a

number of relatively high inventory levels, i.e., up to 52% of capacity, levels at which

stockouts are less likely.

Since DLows_Inv_10t-1 includes many days with relatively high oil inventories (up to

52% of capacity) in Models 2 and 3 in Table 5, we replace DLow_Inv_10t-1 with

DLow_Inv_5t-1 and DLow_Inv_3t-1, respectively. Results in Models 2 and 3 are somewhat

different from those in Model 1. The coefficient of DLow_Inv_5t-1* DNegt-1* ΔSPt-1 is

positive although statistically insignificant, and similarly for DLow_Inv_3t-1* DNegt-1* ΔSPt-

1. This provides some indication that reverse cash-and-carry may be constrained by low

inventory levels in the bottom 5% of observed capacity utilization levels. However, a caveat

is that the insignificance of these variables could be due to the small number of observations

where both the level and change in the spread are negative and inventories are in the bottom

5% of our sample.

In summary, we find that when the futures-spot spread is negative on day t-2 and

29

declines further on day t-1, it tends to rise or reverse on day t, which is what we would expect

if the further decline on day t-1 makes reverse C&C arbitrage profitable. In contrast to our

findings for C&C arbitrage and storage limitations, we find no evidence that this reverse

C&C arbitrage is constrained by inventory levels in the bottom 10% of the sample. When we

examine lower inventory levels (bottom 5% and 3% of the sample) there is some suggestion

that the spread does not narrow, which would be consistent with reverse C&C arbitrage being

impeded by low inventories. However, the relation is not statistically significant.

5. Financial Constraints and Arbitrage

Next, we consider possible financial limits on cash-and-carry arbitrage. In cash-and-

carry arbitrage, the arbitrageur must finance both the spot market purchase and the storage as

well as post margin for the futures sale. Thus, any financial constraints faced by the

arbitrageur could impede the arbitrage, as Brunnermeier and Pedersen (2009) note.

In this section, we examine how, in addition to physical storage constraints, financial

(funding) constraints influence cash-and-carry arbitrage. The spread data that form the basis

for our examination to this point are measured at the daily frequency. As a measure of the

extent to which financial constraints are influential we, therefore, begin by utilizing a market-

based variable also measured at the daily frequency that is in the spirit of several measures

that have been used in the literature but which are measured at a quarterly frequency. For

completeness, however, we also examine the influence of the financial constraint measures

that have been commonly used in the literature as indicators of funding constraints for

arbitrage trades. We focus on cash-and-carry arbitrage since some reverse cash-and-carry

arbitrage trades may involve arbitrageurs selling oil from inventory and therefore impose less

need for new capital.

Several measures of financial constraints have been employed in the literature. Which

of these measures are most relevant for our purposes depends on the types of firms

conducting cash-and-carry arbitrage in the crude oil markets. While hard data on the firms

conducting arbitrage trades is unavailable, the trading desks of both oil firms and financial

firms are known to lease storage at Cushing for arbitrage purposes.29 Consequently, we

29 See, for example, http://www.reuters.com/article/us-oil-storage-houston-analysis-

idUSKBN0LU2DL20150226.

30

employ financial constraint measures that are applicable to both financial and oil-related

firms, as well as aggregate measures for the economy as a whole. We follow Acharya,

Lochstoer, and Ramadorai (2013) and examine measures of financial constraints for the U.S.

financial industry and for U.S. energy firms.30

Our primary daily measure of external financial constraints for a company is the

expected default frequency (EDF) computed by Moody’s. Quoting from Moody’s, “EDF9

begins with three drivers (asset value, asset volatility, and default point), computes the

Distance-to-Default (DD), and uses an empirical mapping to compute a probability of

default, known as the EDF credit measure. The EDF credit measure is a point-in-time

probability of default.” (Moody’s, 2015, pg. 9). When viewed from the perspective of the

firm seeking to borrow to fund an arbitrage transaction, the measure is an indicator of credit

worthiness. When applied as an indicator of the credit stress faced by a potential lender it

serves as an indicator of credit tightness. At the heart of the calculation is the distance to

default as defined by the Merton (1974) model for valuing risky debt.

We compute two aggregate EDF measures. The first is computed as an average over

all U.S. financial firms and serves as an indicator of the reluctance of lenders to fund

arbitrage transactions.31 The second measure is the EDF average across U.S. energy firms

and serves as an indicator of the credit worthiness of firms seeking to engage in arbitrage

strategies. Fifty percent (50%) of the companies included in the set of energy companies are

from SIC 1311 (Crude, Petroleum, and Natural Gas). The remainder are from the oil

exploration, drilling, distribution, and exploration sectors (SIC 1381, 1389, 2911, 4922-

4924). One of the advantages from using the EDF score is the daily frequency match to our

data and the ability to examine variations within quarters. Some of the other measures of

financial constraints that we use are measured at a quarterly frequency due to data

availability (BDLeverage and Zmijewski (1984) score) and therefore, cannot pick up within-

quarter variations. We interact the EDF measures with the spread change variables as in our

tests of the impact of physical storage constraints. We test the hypothesis that cash-and-carry

arbitrage is more restricted in the presence of financial constraints by interacting the EDF

30 We thank Sue Zhang and Robert Tran of Moody’s Analytics for providing us daily EDF values for our

sample period. 31 For brevity we do not list all the SIC codes included. A list is available from the authors upon request.

31

measures with our spread measures in a fashion similar to our tests for the influence of

storage capacity constraints.

We also consider the measure of broker-dealer capital constraints studied by Acharya,

Lochstoer, and Ramadorai (2013), Etula (2013), Adrian, Etula, and Muir (2014) and Adrian

and Shin (2010). As noted by Etula (2013), broker-dealers are marginal investors on the

speculative side of the commodities market and their leverage can be a measure of their ease

of access to capital. We use the Adrian, Etula and Muir (2014) formulation for broker-dealer

leverage:

𝐵𝐷𝐿𝑒𝑣𝑒𝑟𝑎𝑔𝑒𝑡 = 𝐵𝐷 𝑇𝑜𝑡𝑎𝑙 𝐹𝑖𝑛𝑎𝑛𝑐𝑖𝑎𝑙 𝐴𝑠𝑠𝑒𝑡𝑠𝑡

𝐵𝐷 𝑇𝑜𝑡𝑎𝑙 𝐹𝑖𝑛𝑎𝑛𝑐𝑖𝑎𝑙 𝐴𝑠𝑠𝑒𝑡𝑠 𝑡−𝐵𝐷 𝑇𝑜𝑡𝑎𝑙 𝐿𝑖𝑎𝑏𝑖𝑙𝑡𝑖𝑒𝑠𝑡 (3)

We collect BDLeverage data from Table L.129 of the Federal Reserve Flow of Funds. While

the EDF measure described above is measured on a daily basis, the BDLeverage measure is

computed on a quarterly basis due to data availability. Since this is an inverse measure of

leverage, decreases in BDLeverage indicate a more risky broker dealer balance sheet and

thus a possible environment in which potential ‘arbitrage’ borrowers have reduced access to

capital. Consequently, cash-and-carry arbitrage should be more constrained when

BDLeverage is low implying a negative coefficient on variables involving BDLeverage.

Following Acharya, Lochstoer, and Ramadorai (2013), we also develop an aggregate

measure for oil and gas producer default risk by calculating the Zmijewski (1984) default risk

score. Acharya, Lochstoer, and Ramadorai (2013) use the Zmijewski score as a proxy for

hedging demand, since it measures the default risk of oil and gas firms, and therefore, the

extent to which speculators in the market are financially constrained due to producer hedging

demand. It could also proxy for producer access to capital. The Zmijewski score is calculated

for each firm in the SIC 1311 sector using quarterly accounting information from Compustat

in the following manner:

𝑍𝑚𝑖𝑗𝑒𝑤𝑠𝑘𝑖𝑡 = −4.3 − 4.5 ∗𝑁𝑒𝑡𝐼𝑛𝑐𝑜𝑚𝑒

𝑇𝑜𝑡𝑎𝑙𝐴𝑠𝑠𝑒𝑡𝑠 + 5.7 ∗

𝑇𝑜𝑡𝑎𝑙𝐷𝑒𝑏𝑡

𝑇𝑜𝑡𝑎𝑙𝐴𝑠𝑠𝑒𝑡𝑠− 0.004 ∗

𝐶𝑢𝑟𝑟𝑒𝑛𝑡𝐴𝑠𝑠𝑒𝑡𝑠

𝐶𝑢𝑟𝑟𝑒𝑛𝑡𝐿𝑖𝑎𝑏𝑖𝑙𝑖𝑡𝑖𝑒𝑠 (4)

We then calculate the Aggregate_Zmijewski (equally-weighted) score for all the SIC

32

1311 firms.32 When the Zmijewski score is high, oil and gas firms conducting cash-and-

carry arbitrage could have difficulty obtaining the needed financing implying a positive

coefficient.

Like the BDLeverage measure, the Zmijewski score is computed on a quarterly basis.

As such we cannot observe these values on a daily basis but must make the assumption that

the beginning of quarter values prevail for each day of the following quarter. If the true

measures (if observable on a daily basis) actually vary over the quarter, this restriction makes

it more likely that we will fail to identify a relation between our daily spread change measures

and the BDL or Zmijewski measures. This situation however is not present when we use the

daily EDF measures.

We also examine two additional measures of potential financial constraints.

Brunnermeier and Pedersen (2009) employ the VIX (CBOE Volatility Index) as a measure of

arbitrageurs’ capital availability.33 The VIX is available on a daily basis. Finally we use an

indicator variable for the financial crisis period from September 2008 through December

2009 since financing was abnormally restricted during this period.

In the context of our data and model, financial constraints should manifest themselves

in the same manner as physical constraints. To wit, if arbitrage is not limited by financial

constraints, then a move in the futures-spot spread above the no-arbitrage limit should set off

cash-and-carry arbitrage, which should return the spread to within the no-arbitrage limit. If

however, financial constraints impede arbitrage, this spread change reversal cannot take

place. Thus, our approach to testing for financial constraints is similar to our approach for

testing the effect of physical constraints: we focus on cases when the spread is already high

on day t-2 and increases further on day t-1 and test if a spread reversal on day t is less likely

when financial constraints seem most likely to be effective. Hence, as we did to test for

32 The rationale for the use of the Zmijewski (1984) score by Acharya, Lochstoer, and Ramadorai (2013) stems

from the positive relationship between financial distress and hedging by oil and gas producers, and the

likelihood that greater hedging pressure by producers will be more likely to financially constrain speculators

who take the other side of hedging trades. To capture this relationship more precisely, Acharya, Lochstoer, and

Ramadorai (2013) also calculate and use the aggregate Zmijewski score for the subset of firms in their sample

(around 70%) that actually hedge but demonstrate in their robustness checks that their findings remain

unchanged. However, their findings for the hedging and non-hedging samples when examined separately show

that the results using the aggregate Zmijewski score for the full sample are driven by the subset of firms in the

full sample that actually hedge. 33 See http://www.cboe.com/micro/vix/part2.aspx for a complete description of how the VIX is computed.

33

physical constraints, we form the interactive variable: FCt-1*DPosHight-1* ΔSPt-1 where FCt-1

designates the financial constraint variable. The hypothesis that financial constraints impede

arbitrage implies a positive coefficient on the variables interacted with the EDF values as

well as all other financial constraint proxies except the BDLeverage variable, which should

have a negative coefficient because it measures the inverse of a leverage variable. In other

words, when financial constraints are binding, increases in the spread above the normal no-

arbitrage limit are less likely to be reversed.

Table 6 reports regression results including the financial constraint measures. Models

1-6 include EDF scores for the U.S. financial industry, EDF scores for energy firms,

BDLeverage, VIX, 2008 Crisis dummy, and the Zmijewski score, respectively. We

standardize all the financial constraint variables. We find evidence that cash-and-carry

arbitrage was restricted by financial constraints during our 2004-2015 data period. The

strongest evidence involves the results based upon the interaction variable involving the daily

EDF scores for lenders, which is significantly different from zero at the 1% level. These

results suggest that when the environment is such that lenders face funding difficulties, they

are more apprehensive about lending and restrict lending to fund arbitrage activities. All the

other physical constraint coefficient signs are consistent with arbitrage being limited by

physical constraints and the interacted EDF for energy firms, VIX, 2008 Crisis dummy, and

Zmijewski score coefficients are significant at the 10% level, while the coefficient of the

EDF interaction variable for US financial firms is significant at the 1% level. In interpreting

these results it should be kept in mind that BDLeverage and the Zmijewski score are

quarterly and thus cannot pick up daily or weekly variations in financial constraints within

the quarter, which may explain the lack of statistical significance for the former. Last, and

critical for our previous results, is that the interacted DCap_Util variable is significant and

positive in all models so our physical constraint results are clearly robust to controlling for

possible financial constraints.

***Insert Table 6 about here***

In summary, we find evidence that financial constraints may have limited cash-and-

carry arbitrage over our data period. One thing that is clear, however, is that the physical

constraint variable remains significant when these financial constraint variables are included

34

in the models estimated, reaffirming the importance of physical constraints in limiting

arbitrage.

6. Spread components

In previous sections and Tables 3-6, we present evidence supporting the arbitrage

relationship that a further decline in a negative spread or a further increase in a spread

exceeding $0.50 tends to be reversed the following day except when oil in storage

approaches capacity, i.e., cash-and-carry arbitrage is hampered by storage limits. In

conditions when arbitrage would not be expected, specifically following a decrease in a

positive spread or increase in a negative spread or when the spread is positive but small, there

is little or no reversal. In this section, we consider which component of the spread -- spot

price or futures price -- tends to adjust more in conditions conducive to arbitrage. A priori,

we would expect an arbitrage in response to a movement in the spread outside the no-

arbitrage bounds to impact both prices. For instance, if the spread rises to the point where

cash-and-carry arbitrage is profitable, the spot price should rise as arbitrageurs long the spot

contract and the futures price should fall as they short the futures contract. However, it is

possible that speculators step in to keep either price from adjusting so that the adjustment

occurs more in one price than the other.

In Table 7 we re-estimate the primary regression model examined in the previous

tables separating the change in the spread on the following day into its two components. In

other words, we estimate one regression with the day t change in the futures price as the

dependent variable in Model 1 and a second with the day t change in the spot price as the

dependent variable in Model 2. Note that since the change in the spread = futures-spot = (Ft-

St)-(Ft-1-St-1) = (Ft- Ft-1)-(St- St-1) the hypothesized signs in the spot price regression are

opposite to those in the futures and spread regressions.

***Insert Table 7 about here***

As shown in Model 1 of Table 7, there is little evidence that arbitrage leads to a

sizable adjustment in the futures price. The adjusted R-square is a miniscule .003 and most of

the variables are insignificant. The coefficient of Volt-2* DNegt-1* ΔSPt-1 is significantly

different from zero at the .01 level but the coefficient is opposite to the sign implied by

35

arbitrage.

In contrast, in Model 2, there is evidence that, in the absence of physical constraints,

the spot price adjusts as implied by arbitrage. Consistent with the expected impact of cash-

and-carry arbitrage, the coefficients of DNegt-1* ΔSPt-1, Volt-2* DNegt-1* ΔSPt-1, and Volt-

2*DPosHight-1* ΔSPt-1 are all significant at the .01 level with the positive signs implied by

arbitrage. The evidence in Table 7 further indicates that as the oil in storage approaches

capacity, this spot price adjustment is impeded since the coefficient of DCap_Utilt-1*

DPosHight-1* ΔSPt-1 is negative and significant. In other words, when storage is not

constrained, a rise in the spread above the upper no-arbitrage limit normally leads to cash-

and-carry arbitrage in which arbitrageurs buy and store oil causing the spot price to rise but

as storage capacity is exhausted this arbitrage is apparently impeded as the spot price does

not rise.34 The adjusted R-square is much smaller than that for the spread regressions in Table

4 but this is to be expected since many non-arbitrage factors tend to impact the spot and

futures prices similarly and thus have much less impact on the spread than on either the spot

or futures prices individually.

In summary, our evidence indicates cash-and-carry arbitrage and reverse cash-and-

carry arbitrage impact the spot price much more than the futures price. In other words, when

the spread rises above, or falls below, the no-arbitrage bounds, it is primarily the spot price

which adjusts to bring the spread back within the bounds. However, when storage levels are

at or near capacity, the spot price adjustment is impeded.

7. Robustness Checks

In the above estimations, we separate DPosLow and DPosHigh at a spread of $0.50.

Also, we use days when storage capacity utilization is in the top 20% of our sample as our

measure of periods when storage is likely constrained. In this section, we examine the

robustness of our results to variations in these measures. Also, since October 2010, the EIA

has provided measures of actual storage capacity at Cushing on a semi-annual basis and we

examine how these more discrete measures of storage capacity utilization affect our results.

34 One of the explanations could be that as storage gets close to its operational capacity, then more oil would be

forced onto the spot market, pushing the spot price lower.

Brennan, Morgan and Justin Solomon, “Cushing, Oklahoma: Small town is holding billions in black gold,”

CNBC, March 7, 2015.

36

First, we re-estimate model 3 from Table 4, dividing DPosHigh and DPosLow at

$0.60 and $0.40 instead of $0.50. In column 2 of Table 8, we present results when

DPosLowt-1=1 if 0<=SPt-2<0.60 and ΔSPt-1>0 and DPosHight-1=1 if SPt-2>=0.60 and ΔSPt-

1>0. The results are little changed from the model 3 regression estimation results reported in

Table 4 that divided PPosLow and DPosHigh at $0.50. In column 3, we re-estimate the

relations separating DPosHight-1 and DPosLowt-1 at $0.40 values of SPt-2. Again, the

estimates are basically the same except that the coefficient of DPosLowt-1* ΔSPt-1 is now

significant at the .05 level. In both regressions the coefficient of DCap_Utilt-1* DPosHight-1*

ΔSPt--1 is significant at the 1% level so our physical constraint results are robust to changes in

this parameter.

***Insert Table 8 about here***

In previous tables, we set DCap_Util =1 when our capacity utilization measure was in

the top quintile. It is likely that storage opportunity costs, and therefore the no arbitrage

upper bound, rise before this level is reached and that storage constraints are even more

severe as the capacity utilization rate rises further. Thus, in column 4 (5) in Table 8 we re-

estimate the regression with DCap_Util =1 when the capacity utilization measure is in the top

30% (top 10%) of observed levels. As shown in column 4, the results are basically the same

whether we use the top 20% or the top 30% to define DCap_UTIL. The coefficient of

DCap_Utilt-1* DPosHight-1* ΔSPt--1 declines from .8185 in Table 4 model 3 to .6179 but is

still significant at the 5% level. Again, the implication is that when the spread rises on day t-

1 from an already high level it tends to reverse on day t if capacity utilization is below 20%

or 30% but much less if (if at all) when capacity utilization exceeds these levels.

As shown in column 5, the estimates are also essentially the same when DCap_Util is

set equal to 1, i.e., when capacity utilization is in the top 10% of observed levels. The

coefficient of DCap_Utilt-1* DPosHight-1* ΔSPt-1 is roughly the same as in column 4 and

significant at the 1% level. In other words, our evidence indicates that cash-and-carry

arbitrage is constrained by storage constraints whether we use the top 30%, top 20%, or top

10% of capacity utilization levels to define periods of high capacity utilization

To this point we have defined periods of likely capacity constraints based on our

estimated capacity utilization measure calculated from historical highs in Cushing storage

37

levels as described in section 2.2. In October 2010, the EIA began collecting and reporting

semi-annual measures of storage capacity at Cushing, as well as throughout the U.S. The

EIA reports total “shell capacity” as well as “working capacity,” with the latter accounting

for the fact that the tanks can neither be filled to capacity nor fully depleted of oil.35 The EIA

notes that effective capacity is below their working capacity measure due to operational

constraints and necessary maintenance. While the EIA does not estimate this effective

capacity level, some industry analysts estimate it at 80% of capacity.36 According to the EIA

working capacity at Cushing rose from 46.1 million barrels in October 2010 to 71.4 million

barrels in April 2015.

We next estimate the model for the October 2010 – May 2015 period using the EIA

figures to define periods of likely storage constraints. To convert EIA’s semi-annual

capacity figures (April and October) to a daily basis, we interpolate daily levels between

actual observations assuming that working capacity grew at a constant rate over the six

months between EIA reports. We then define DCap_Utilt-1=1 if the ratio of actual storage at

Cushing divided by the interpolated daily EIA working capital estimate on day t-1 was in the

top 20% (or top 10%) of observed daily levels (and =0 otherwise)

Results where DCap_Util=1 for observations in the top 20% are shown in column 6

of Table 8 and results for DCap_Util=1 for observations in the top 10% are shown in column

7. While results for the entire period using our capacity proxy based on historical peaks in

storage levels were basically the same whether we defined high capacity utilization as in the

top 20% or top 10%, sub-period results based on EIA capacity figures are stronger when

DCap_Util denotes observations in the top 10% of capacity utilization rather than the top

20%. In both columns 6 and 7, the signs of the estimated coefficients imply that further

increases in the spread after it reaches about $0.50 tend to be reversed if capacity utilization

is moderate or low but not if capacity utilization is high. However, while the coefficient of

DCap_Utilt-1* DPosHight-1* ΔSPt-1 is roughly the same in both regressions, it is only

35 “Tank bottoms are volumes below the normal suction lines of a storage tank that may include water and

sediment and are difficult to access” (EIA, March 4, 2014, Today in Energy,

https://www.eia.gov/todayinenergy/detail.cfm?id=20212 ) 36 “Cushing Full by June, Then What?” Oil and Gas 360.com, March 14, 2016 by EverCom.

Brennan, Morgan and Justin Solomon, “Cushing, Oklahoma: Small town is holding billions in black gold,”

CNBC, March 7, 2015: “If you look at historically what is available at Cushing….we have never seen

utilization over 80%.”

38

significant at the 1% level when DCap_Util =1, i.e., when capacity utilization is in the top

10% of observed utilization levels.

In both regressions using EIA capacity figures, the coefficient of DNegt-1*ΔSPt-1 is

much lower than in the earlier regressions and the coefficient of DDifft-1 is negative and

significant. These differences appear to be largely a function of the time period. In the final

two columns in Table 8, we present estimations over the October 2010 – May 2015 sub-

period with the original measures of DCap_Util based on our capacity proxy. As in

columns 6 and 7, the coefficient of DNegt-1*ΔSPt-1 is smaller over the 2010-2015 sub-period

than over 2004-2015 period though still statistically significant at the 1% level. This size

reduction may be due to the fact that there were few large negative spreads in the latter sub-

period. In absolute terms, the largest negative spread in the October 2010 – May 2015 period

was -$2.66 versus -$11.55 in the April 2004 – September 2010 sub-period. In the final four

columns, the coefficient of DDifft-1*ΔSPt-1 is negative and significant though considerably

smaller than the coefficients of DNegt-1*ΔSPt-1, and DPosHight-1*ΔSPt-1. In contrast to the

results based on the EIA capacity measure, the results for DCap_Utilt-1* DPosHight-1* ΔSPt-1

are stronger when DCap_Util is defined in terms of capacity utilization in the top 20%.

In summary, all the estimated relations in Table 8 basically imply the same spread

behavior reported in prior tables though some significance levels differ. We find that spread

change reversals are nil or relatively small when little arbitrage is expected – to wit when: (1)

a positive spread falls or a negative spread rises, or (2) when the beginning spread is positive

but small. On the other hand, reversals are considerably larger when arbitrage is expected –

to wit following a further decline in a negative spread or following a further increase in a

large positive spread if capacity utilization levels are moderate or low. However, if capacity

utilization levels are high, the tendency for further increases in a large positive spread to be

reversed is considerably reduced indicating arbitrage is constrained by the physical storage

constraint. These physical constraint results are very robust to different measures of the

presence of physical constraints and to different dividing lines between high and low t-2

spreads

39

8. Conclusions

While there is considerable evidence in the literature documenting the effects of

financial constraints on arbitrage and pricing, there is considerably less attention paid to

physical constraints, which can be equally important when the value of financial securities is

derived from physical assets. We extend the literature by studying the effect of crude oil

storage constraints on arbitrage activity in the U.S. crude oil market. We find that when

actual crude oil storage levels are well short of available storage capacity, the behavior of the

crude futures-spot spread is consistent with arbitrage returning the spread to within

unobservable no-arbitrage bounds – albeit with a lag. When arbitrage appears unlikely – i.e.,

when a positive spread declines or a negative spread rises or when the beginning spread is

small – the change in the spread on day t is basically uncorrelated with the change in the

spread on day t-1. However, when arbitrage appears more likely – i.e., when an already

negative spread declines further on day t-1 or a large positive spread increases more on day t-

1 -- the change on day t is strongly negatively correlated with the change on day t-1 as one

would expect if the arbitrage is pulling the spread back within no-arbitrage bounds. This

spread change reversal is even stronger when volatility is high. In summary, we observe

strong negative autocorrelations in the futures spot spread when the arbitrage hypothesis

predicts we should, and little or no autocorrelation when the arbitrage hypothesis predicts we

should. The fact that these autocorrelations are observable in daily data is consistent with

arbitrageurs often needing time to contract for storage at the hub. Interestingly, the evidence

indicates that the spot price adjusts more than the futures price to bring the spread back

within no-arbitrage bounds suggesting that the futures price is set more by speculators and

other traders.

In contrast, when storage levels approach full capacity, further increases in an already

large positive crude oil spread on day t-1 do not tend to be reversed on day t, or are reduced

much less than when storage is plentiful. Thus, our evidence indicates that while normally

cash-and-carry arbitrage operates to return the futures-spot spread to within the no-arbitrage

upper bound, this breaks down in periods of very high capacity utilization. On the other

hand, we find little evidence that oil inventories are ever so low as to interfere with reverse

cash-and-carry arbitrage. We further find evidence that financial constraints also hindered

40

arbitrage in the crude oil market over our sample period. Nonetheless, our physical

constraint results are robust to accounting for possible financial constraints as well as to

different measures of when physical constraints are likely operative. Our findings highlight

the importance of accounting for physical arbitrage limits in the pricing of commodity

futures, and also contribute to the Theory of Storage literature by highlighting the

consequences when inventories approach storage capacity limits.

41

References

Acharya, V.V., Lochstoer, L.A., Ramadorai, T., 2013. Limits to arbitrage and hedging:

Evidence from commodity markets. Journal of Financial Economics 109, 441-465.

Adrian, T., Etula, E., Muir, T., 2014. Financial Intermediaries and the Cross-Section of Asset

Returns, Journal of Finance 69, 2557-2596.

Adrian, T., Shin, H.S., 2010. Liquidity and leverage. Journal of Financial Intermediation 19,

418-437.

Birge, J., Hortacsu, A., Mercadal, T., 2016, Limits to arbitrage in electricity markets.

Working Paper, University of Chicago

Blas, J., 2015, In a World Awash With Crude Oil, Storage Companies Are Kings,

Bloomberg, March 18.

Brennan, M.J., 1958. The Supply of Storage. The American Economic Review 48, 50-72.

Brunnermeier, M.K., Pedersen, L.H., 2009. Market Liquidity and Funding Liquidity. Review

of Financial Studies 22, 2201-2238

“Cushing Full by June, Then What?”, Oil and Gas 360.com, March 14, 2016 by EverCom

Cheng, I., Kirilenko, A., Xiong W., 2015. Convective risk flows in commodity futures

markets. Review of Finance, 1733-1781.

Deaton, A., Laroque, G., 1992. On the Behaviour of Commodity Prices. The Review of

Economic Studies 59, 1-23.

Duffie, D., 2010. Presidential Address: Asset Price Dynamics with Slow-Moving Capital.

The Journal of Finance 65, 1237-1267.

Ederington, L., Fernando, C., Holland, K., Lee, T., and Linn, S., 2016. Arbitrage and the

Financial-Physical Nexus in Commodity Markets, Working paper, University of Oklahoma.

Einloth, J.T., 2009. Speculation and Recent Volatility in the Price of Oil. Working paper,

SSRN eLibrary.

Etula, E., 2013. Broker-dealer risk appetite and commodity returns. Journal of Financial

Econometrics 11: 486-521.

Gabaix, X., Krishnamurthy, A., Vigneron, O., 2007. Limits of Arbitrage: Theory and

Evidence from the Mortgage-Backed Securities Market. The Journal of Finance 62, 557-595.

42

Gorton, G.B., Hayashi, F., Rouwenhorst, K.G., 2013. The Fundamentals of Commodity

Futures Returns. Review of Finance 17, 35-105.

Gromb, D., Vayanos, D., 2002. Equilibrium and welfare in markets with financially

constrained arbitrageurs. Journal of Financial Economics 66, 361-407.

Hong, H., Stein, J.C., 2003. Differences of Opinion, Short-Sales Constraints, and Market

Crashes. Review of Financial Studies 16, 487-525.

Hong, H., Yogo, M., 2012, What does futures market interest tell us about the

macroeconomy and asset prices? Journal of Financial Economics 105, 473–490.

Kaldor, N., 1939, Speculation and Economic Stability, The Review of Economic Studies,

7(1), 1-27.

Kaminski, V., 2012. Energy Markets. Risk Books (Incisive Media), London, UK.

Kaufman, M., 2015, Why is Demand for Oil Storage Increasing? Bidness Etc, February 10.

Kent, S., Kantchev, G., 2015, The World’s Largest Traders Use Offshore Tankers to Store

Oil, Wall Street Journal, January 20.

Leff, J., 2015, US oil tanks barely one-third full beckon crude contango play, Reuters,

January 12.

Merton, R., 1974, On the Pricing of Corporate Debt: The Risk Structure of Interest Rates.

The Journal of Finance 29, 449–470.

Mitchell, M., Pulvino, T., Stafford, E., 2002. Limited Arbitrage in Equity Markets. The

Journal of Finance 57, 551-584.

Moody’s Analytics, 2015, Modeling Methodology, EDF9: Introduction and Overview.

Moody’s Corporation.

Morgan, B., Solomon, J., 2015, Cushing, Oklahoma: Small town is holding billions in black

gold, CNBC, March 7

Mou, Y., 2010. Limits to Arbitrage and Commodity Index Investment: Front-Running the

Goldman Roll. SSRN eLibrary

Routledge, B.R., Seppi, D.J., Spatt, C.S., 2000. Equilibrium Forward Curves for

Commodities. The Journal of Finance 55, 1297-1338.

Schwartz, E.S., 1997. The Stochastic Behavior of Commodity Prices: Implications for

Valuation and Hedging. The Journal of Finance 52, 923-973.

43

Shleifer, A. and R. W. Vishny, 1997, The Limits of Arbitrage, Journal of Finance 52: 35-55.

Working, H., 1949. The Theory of Price of Storage. American Economic Review 39, 1254-

1262.

Zmijewski, M. E., 1984, Methodological Issues Related to the Estimation of Financial

Distress Prediction Models, Journal of Accounting Research, Supplement, 59-82.

44

9.2

9.6

10.0

10.4

10.8

11.2

04 05 06 07 08 09 10 11 12 13 14 15

Effective Capacity at Cushing, OK (estimate) - logarithmic scaleStorage at Cushing, OK (actual) - logarithmic scale

Year, 20xx

Figure 1 – Estimated Storage Capacity and Actual Storage at Cushing (April 2004-April 2015)

This figure presents the actual weekly crude oil storage and estimated effective crude oil capacity

from April 2004 to April 2015 at Cushing, Oklahoma, which is the NYMEX physical settlement

point for the crude oil WTI future contract. Crude oil storage data is collected from the U.S. Energy

Information Administration (EIA). The estimation of the proxy for effective storage capacity is

described in section 2.2.

45

$-2.00

$-1.00

$0.00

$1.00

$2.00

$3.00

$4.00

$5.00

$6.00

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1.0

1.1

04 05 06 07 08 09 10 11 12 13 14 15

Future-Spot Spread (10 day moving average)

Capacity Utilization at Cushing, OK (10 day moving average)

Year, 20xx

Ca

pa

city

Utiliz

atio

n

Fu

ture

-Sp

ot

Sp

rea

d (

$)

Figure 2 – Cushing, OK Capacity Utilization and the Crude Oil WTI Future-Spot Spread (10-

day moving averages) (April 2004-April 2015)

This figure presents the 10-day moving averages of both the future-spot spread and the capacity

utilization at Cushing, OK from April 2004 to April 2015. Capacity utilization is measured as the

ratio of the actual level of crude oil stocks as reported by the EIA to the proxy for effective storage

capacity. The futures-spot spread is the difference between the price of the second and nearby WTI

crude oil futures contracts.

46

Table 1 - Descriptive Statistics

Attributes of the crude oil futures-spot spread are presented based on daily prices from April 6, 2004

through May 5, 2015. The futures price is the price of the second monthly futures contract and the spot

is measured as the price of the nearby contract. *, **, *** designate coefficients significantly different

from zero at the .10, .05, and .01 levels, respectively, in two-tailed tests.

Panel A - Descriptive statistics

Futures-spot

spread

ΔFutures-

spot spread

Mean $0.5196 $0.0006

Median $0.4500 $0.0000

Standard deviation $1.0056 $0.3892

Skewness 1.0141 -0.3454

Kurtosis 18.2203 350.7426

Panel B - Partial autocorrelation coefficients

Futures-spot

spread

ΔFutures-

spot spread

ΔFutures ΔSpot

First .925*** -.246*** -.045** -.053***

Second .216*** -.141*** .000* -.025***

Third .119*** -.077*** -.009 .004**

Fourth .059*** -.044*** .037** .040***

47

Table 2 – Dummy variable means

This table provides definitions and means for variables based on the crude oil future-spot spread (SP).

SP is measured as the difference between the prices of the second and nearby month WTI crude oil

contracts.

Variable Definition Mean

DDifft-1

=1 if SPt-2 and ΔSPt-1, have different (or zero) signs.

This variable identifies times when changes in spread move in

different directions on two consecutive days.

0.5221

DSamet-1

=1 if SPt-2 and ΔSPt-1 and have the same sign.

This variable identifies times when changes in spread move in the

same direction on two consecutive days.

0.4779

DNegt-1

=1 if ΔSPt-1<0 and SPt-2<0.

This variable splits DSame variable to only show consecutive spread

declines from an already negative spread.

0.103

DPost-1

=1 if ΔSPt-1>0 and SPt-2>0.

This variable splits DSame variable to only show consecutive spread

increases from an already positive spread.

0.3749

DPosLowt-1

=1 if 0<SPt-2<=0.50 and ΔSPt-1>0.

This variable splits DPos variable to only show consecutive spread

increases from an already small positive spread of below $0.50.

0.1532

DPosHight-1

=1 if SPt-2>0.50 and ΔSPt-1>0

This variable splits DPos variable to only show consecutive spread

increases from an already high positive spread of over $0.50. We

deem such conditions to be most profitable for C&C arbitrage and

therefore where such arbitrage is most likely.

0.2217

DCap_Utilt-1

=1 if the ratio of the storage level at Cushing to the estimated

effective storage capacity is in the top 20% of capacity utilization

ratios, or about 92% full. This variables signals when storage is

close to being exhausted.

0.2002

DCap_Utilt-1 *

DPosHight-1

=1 at the intersection of high storage capacity utilization and spread

changes where arbitrage is most likely. 0.0736

48

Table 3 – Testing for Evidence of Arbitrage and Physical Limits in Spread Behavior

The change in the futures-spot spread on day t, ΔSPt, is regressed on functions of the change in the

spread on day t-1 and the level of the spread at the end of day t-2. SP is measured as the difference

between the prices of the second and nearby month WTI crude oil contracts. Independent variables are

defined in Table 2. Model 4 tests whether there is evidence that almost exhausted storage capacity

impedes the ability of cash-and carry arbitrage to return SP to within the no-arbitrage bound.

The regression is estimated using daily data from 4/6/2004 through 5/5/2015. The t-values shown in

parentheses are based on Newey-West standard errors. ***, **, * designate coefficients significantly

different from zero at the .10, .05, and .01 levels respectively in two-tailed tests.

Model 1 Model 2 Model 3 Model 4

Intercept 0.0217***

(3.090)

0.0048

(0.406)

0.0037

(0.372)

0.0041

(0.399)

DDifft-1*ΔSPt-1 0.0490

(1.226)

0.0436

(1.089)

0.0433

(1.146)

0.0434

(0.995)

DSamet-1*ΔSPt-1 -0.6792***

(-5.623)

DNegt-1* ΔSPt-1

-0.8336***

(-15.556)

-0.8341***

(-9.273)

-0.8339***

(-15.508)

DPost-1* ΔSPt-1

-0.3841***

(-2.825)

DPosLowt-1* ΔSPt-1

-0.1551

(-1.619)

-0.1561

(-1.656)

DPosHight-1* ΔSPt-1

-0.4437**

(-2.284)

-0.6588***

(-3.432)

DCap_Utilt-1*

DPosHight-1* ΔSPt-1

0.4705**

(2.022)

Observations 2786 2786 2786 2786

Adjusted R-square 0.186 .202 .204 .209

49

Table 4 – Regressions with volatility variables

Model 4 from Table 3 is re-estimated with the addition of interactive measures of futures-spot spread

volatility. Volt-2 is the log of 1 plus the standard deviation of the spread on days t-22 through t-3 standardized

to a mean of 0 and variance of 1. Independent variables are defined in Table 2. The regression is estimated

using daily data from 4/6/2004 through 5/5/2015. The t-values shown in parentheses are based on Newey-

West standard errors. ***, **, * designate coefficients significantly different from zero at the .10, .05, and

.01 levels respectively in two-tailed tests.

Model 1 Model 2 Model 3

Intercept -0.0040

(-0.487)

-0.0044

(-0.476)

-.0043

(-0.514)

DDifft-1*ΔSPt-1

0.0409

(0.878)

0.0435

(0.819)

0.0408

(0.877)

DNegt-1* ΔSPt-1 -0.8377***

(-16.283)

-0.7936***

(-12.274)

-0.7903***

(-12.772)

DPosLowt-1* ΔSPt-1

-0.1335

(-1.424)

-0.0966

(-1.455)

-0.1325

(-1.602)

DPosHight-1* ΔSPt-1 -0.3219***

(-2.661)

-0.3197***

(-2.710)

-.3206***

(-2.657)

DCap_Utilt-1* DPosHight-1* ΔSPt-1 0.8180***

(3.540)

0.8082***

(3.515)

0.8185***

(3.554)

Volt-2*DPosHight-1* ΔSPt-1

-0.2296***

(-3.218)

-0.2358***

(-2.981)

-0.2300***

(3.218)

Volt-2*DPosLowt-1* ΔSPt-1

-0.0738

(-1.428)

Volt-2*DNegt-1* ΔSPt-1

-0.3646**

(-2.094)

-0.3961***

(-2.662)

Volt-2

0.0073

(0.446)

Observations 2786 2786 2786

Adjusted R-square 0.220 .223 .223

50

Table 5 – Minimum Inventories and Reverse Cash-and-Carry Arbitrage

We test whether there is evidence that low inventory levels impede the ability of reverse cash-and

carry arbitrage to return the futures-spot spread to within the no-arbitrage lower bound. The

change in the futures-spot spread on day t, ΔSPt, is regressed on the variables in Table 4 plus

measures of unusually low inventories. Independent variables are defined in Table 2. Volt-2 is the log

of 1 plus the standard deviation of the spread on days t-22 through t-3. DLow_Inv_10t =1,

DLow_Inv_5t =1, and DLow_Inv_3t =1 when the capacity utilization ratio is in the bottom 10%,

5%, and 3% of observed levels, or about 52%, 40%, 35% full respectively. The regression is

estimated using daily data from 4/6/2004 through 5/5/2015. The t-values shown in parentheses

are based on Newey-West standard errors. ***, **, * designate coefficients significantly different

from zero at the .10, .05, and .01 levels respectively in two-tailed tests.

Model 1 Model 2 Model 3

Intercept 0.0002

(0.021)

-0.0031

(-0.433)

-0.0034

(-0.475)

DDifft-1*ΔSPt-1 0.0422

(1.115)

0.0412

(1.089)

0.0411

(1.087)

DNegt-1* ΔSPt-1 -0.3164***

(-2.961)

-0.8020***

(-7.553)

-0.7964***

(-7.495)

DPosLowt-1* ΔSPt-1 -0.1450

(-1.534)

-0.1360

(-1.437)

-0.1352

(-1.429)

DPosHight-1* ΔSPt-1

Volt-2*DNegt-1*ΔSPt-1

Volt-2*DPosHight-1*ΔSPt-1

-0.3361**

(-2.023)

-0.3587

(-1.045)

-0.2254**

(-2.390)

-0.3250*

(-1.901)

-0.4170

(-1.132)

-0.2287**

(-2.304)

-0.3239*

(-1.898)

-0.4392

(-1.140)

-0.2290**

(-2.313)

DCapUtilt-1*DPosHight-1 * ΔSPt-1 0.8124***

(2.637)

0.8168**

(2.532)

0.8172**

(2.539)

DLow_Inv_10t-1*DNegt-1* ΔSPt-1 -0.5078***

(-3.464)

DLow_Inv_5t-1*DNegt-1* ΔSPt-1 0.4100

(1.245)

DLow_Inv_3t-1*DNegt-1* ΔSPt-1

0.4084

(0.987)

Observations 2786 2786 2786

Adjusted R-square 0.227 0.224 0.224

51

Table 6 – Physical and Financial Constraints to Arbitrage

The change in the futures-spot spread on day t, ΔSPt, is regressed on the variables from previous tables (defined

in Table 2) plus possible measures of financial constraints. EDFFin is the mean EDF score for the U.S. financial

industry. EDFEnergy is the mean EDF score for the U.S. energy firms. BDLeverage is the Adrian, Etula, and

Muir (2014) measure of broker dealer (BD) leverage calculated as BD Total Financial Assets / (BD Total

Financial Assets-BD Total Liabilities). VIX is the Brunnermeier and Pedersen (2009) measure of arbitrageurs’

capital availability and is the adjusted daily closing volatility of S&P 500. The 2008 Crisis dummy captures

potential financial constraints for the time period between September 2008 and December 2009 due to the

2008-09 financial crisis. The Aggregate Zmijewski score is a measure of the aggregate default risk of firms in

the oil and gas industry (SIC 1311). Continuous financial constraint variables are standardized. The regression

is estimated using daily data from 4/6/2004 through 5/5/2015. The t-values shown in parentheses are based

on Newey-West standard errors. ***, **, * designate coefficients significantly different from zero at the .10,

.05, and .01 levels respectively in two-tailed tests.

Model 1 Model 2 Model 3 Model 4 Model 5 Model 6

Intercept -0.0042 -0.0034 -0.0042 -0.0036 -0.0042 -0.0044

(-0.469) (-0.381) (-0.492) (-0.417) (-0.473) (-0.52)

DDifft-1*ΔSPt-1 0.0408 0.0411 0.0408 0.0410 0.0408 0.0407

(0.822) (0.796) (0.822) (0.762) (-0.842) (0.832)

DNegt-1* ΔSPt-1 -0.7902*** -0.7899*** -0.7903*** -0.7900*** -0.7902*** -0.7904***

(-12.33) (-12.261) (-12.342) (-12.362) (-12.297) (-12.368)

DPosLowt-1* ΔSPt-1 -0.1330 -0.1351 -0.1328 -0.1344 -0.1329 -0.1322

(-1.551) (-1.58) (-1.565) (-1.566) (-1.549) (-1.56)

DPosHight-1* ΔSPt-1 -0.3473*** -0.3749*** -0.3424*** -0.3420*** -0.4611*** -0.3207***

(-2.955) (-3.191) (-2.867) (-2.629) (-2.858) (-2.769)

Volt-2*DNegt-1*ΔSPt-1 -0.3961** -0.3959** -0.3961*** -0.396** -0.3961** -0.3961**

(-2.382) (-2.364) (-2.384) (-2.423) (-2.381) (-2.386)

Volt-2*DPosHight-1*ΔSPt-1 -0.4882*** -0.4049** -0.2255*** -0.5331** -0.4197** -0.2402***

(-3.454) (-2.418) (-3.264) (-2.157) (-2.240) (-3.451)

DCap_Utilt-1* DPosHight-1*

ΔSPt-1

1.044***

(4.046)

1.0556***

(2.754)

0.8215***

(3.623)

1.1724**

(2.378)

1.0654**

(-2.391)

0.8419***

(3.797)

EDFFin t-1* DPosHight-1*

ΔSPt-1

0.2691***

(2.739)

EDFEnergy t-1* DPosHight-1*

ΔSPt-1

0.1885*

(1.837)

BDLeveraget-1* DPosHight-

1* ΔSPt-1

-0.1195

(-1.262)

VIXt-1* DPosHight-1* ΔSPt-1 0.3125*

(1.692)

2008Crisist-1* DPosHight-1*

ΔSPt-1

0.6525*

-1.6720

Aggregate_Zmijewskit-1*

DPosHight-1* ΔSPt-1

0.1327*

(1.959)

Observations 2785 2785 2785 2785 2786 2785

Adjusted R-squared 0.229 0.226 0.223 0.231 0.227 0.224

52

Table 7 – Changes in spot and futures prices in response to changes in the spread

The dependent variable is the change in the futures and spot prices on day t, ΔFt and,ΔSt. The futures price

is measured as the second month WTI futures price and the spot as the nearby month contract. Independent

variables are defined in Table 2. Volt-2 is the log of 1 plus the standard deviation of the spread on days t-22

through t-3. DCap_Utilt-1=1 if the ratio of the storage level at Cushing to the estimated effective storage

capacity is in the top 20% of capacity utilization ratios. The regression is estimated using daily data from

4/6/2004 through 5/5/2015. The t-values shown in parentheses are based on Newey-West standard errors.

***, **, * designate coefficients significantly different from zero at the .10, .05, and .01 levels respectively

in two-tailed tests.

Change in futures price Change in spot price

Intercept 0.0166

(0.533)

0.0210

(0.650)

DDifft-1*ΔSPt-1 -0.1732*

(-1.705)

-0.2140*

(-1.932)

DNegt-1* ΔSPt-1 0.1107*

(1.754)

0.9010***

(20.059)

DPosLowt-1* ΔSPt-1 -0.3013

(-0.570)

-0.1689

(-0.335)

DPosHight-1* ΔSPt-1

Volt-2*DNegt-1*ΔSPt-1

Volt-2*DPosHight-1*ΔSPt-1

-0.0542

(-0.149)

1.5072***

(3.449)

0.0920

(0.790)

0.2664

(0.642)

1.9033***

(5.445)

0.3220*

(2.658)

DCap_Utilt-1* DPosHight-1* ΔSPt-1 -0.2060

(-0.445)

-1.0245**

(-2.197)

Observations 2786 2786

Adjusted R-square .0033 .0240

Table 8 - Robustness Checks

We perform various robustness checks using the Model 3 regression from Table 4. In columns 2 and 3, the dividing line between DPosLowt-1 and

DPosHight-1 is a t-2 spread of $0.60 and $0.40 respectively versus $0.50 in Table 4. In columns 4 and 5, DCap_Utilt-1=1 if on day t-1 the ratio of

the storage level at Cushing to the estimated effective storage capacity is in the top 30% and top 10% respectively of capacity utilization ratios versus

20% in Table 4. In the final four columns, the regressions are estimated over the October 2010 – May 2015 subperiod when storage capacity figures

are available from EIA. In columns 6 (7) 7 DCapUtilt-1=1 if the ratio of the storage level at Cushing to the EIA figure for total working capacity is

in the top 20% (10%) of observed capital utilization ratios. For comparison, in the final two columns results are presented for the same period using

the capacity utilization ratio based on historical storage highs as in previous columns and tables.

April 2004 – May 2015.

October 2010 – May 2015

EIA storage capacity data Estimated storage capacity

DPosHigh at

$0.60

DPosHigh at

$0.40

DCap_Util

at 30%

DCap_Util at

10%

DCap_Util at

20%

DCap_Util at

10%

DCap_Util at

20%

DCap_Util at

10%

Intercept -0.0047

(-0.558)

-0.0041

(-0.488)

-0.0029

(-0.329)

-0.0026

(-0.278)

0.0018

(0.440)

0.0020

(0.503)

0.0014

(0.371)

0.0016

(0.370)

DDifft-1*ΔSPt-1 0.0407

(0.876)

0.0408

(0.875)

0.0412

(0.867)

0.0413

(0.968)

-0.1708***

(-3.407)

-0.1707***

(-3.720)

-0.1710***

(-3.720)

-0.1709***

(-3.416)

DNegt-1* ΔSPt-1 -0.7904***

(-12.822)

-0.7902***

(-12.874)

-0.7897***

(-12.631)

-0.7896***

(-12.574)

-0.3212***

(-3.073)

-0.3205***

(-3.984)

-0.3222***

(-4.018)

-0.3218***

(-3.081)

DPosLowt-1* ΔSPt-1 -0.1111

(-1.443)

-0.1992**

(-2.270)

-0.1364

(-1.640)

-0.1372*

(-1.650)

-0.2955*

(-1.947)

-0.2964*

(-1.930)

-0.2941*

(-1.919)

-0.2947*

(-1.939)

DPosHight-1* ΔSPt-1 -0.3244**

(-2.506)

-0.2956***

(-2.772)

-0.4235***

(-3.273)

-0.2038*

(-1.755)

-0.5241

(-1.516)

-0.4570***

(-2.902)

-0.2592

(-1.529)

-0.1938

(-0.782)

Volt-2*DNegt-1*ΔSPt-1

-0.3962***

(-2.669)

-0.3961***

(-2.697)

-0.3958***

(-2.622)

-0.3957***

(-2.603)

-0.7952***

(-2.876)

-0.7953***

(-3.167)

-0.7951***

(-3.177)

-0.7952***

(-2.879)

Volt-2*DPosHight-

1*ΔSPt-1

-0.2319***

(-3.060)

-0.2226***

(-3.170)

-.1587***

(-2.347)

-0.2142***

(-2.826)

-0.5388***

(-3.054)

-0.3301***

(-2.637)

-0.4316***

(-2.998)

-0.4240*

(-1.935)

DCap_Utilt-1*

DPosHight-1* ΔSPt-1

0.8327***

(3.533)

0.7634***

(3.015)

0.6179**

(2.567)

0.6426***

(3.261)

0.5438

(1.487)

0.5383***

(3.322)

0.3973**

(2.207)

0.2052

(0.405)

Adjusted R-square .223 .221 .218 .216 .134 .136 .131 .127