Analysis of the Effects on Housing Values of …faculty.chas.uni.edu/~ecker/Lusts.docx · Web...

111Equation Chapter 1 Section 1

Sorting in the Housing Market Surrounding Disamenities: The Case of Leaking

Underground Storage Tanks

By

Hans R. Isaksona*

And

Mark D. Eckerb

March 29, 2012

a Department of EconomicsUniversity of Northern Iowa

b Department of MathematicsUniversity of Northern Iowa

*Contact Author: Hans R. IsaksonProfessorDepartment of EconomicsUniversity of Northern IowaCedar Falls, Iowa [email protected]: 319-273-2950Fax: 319-273-2922

Keywords: sorting, housing markets, disamenities, leaking underground storage tanks, spatial correlation, maximum likelihood

JEL Codes: Q51, Q53, R21

Running Title: Housing Market Sorting

1

mailto:[email protected]

Abstract

This study examines the housing market surrounding sites with leaking underground

storage tanks (LUSTs) for the presence of sorting by the degree of households’ aversion

to living near a LUST. Using housing sales, Census, and LUST data, we estimate the

coefficients of a Rosen/Lancaster hedonic model, correcting for spatial correlation by

using a maximum likelihood regression technique. The results reveal that households

located within ¼ miles of a LUST have little or no aversion to living near a LUST, while

households located more than ¼ mile from a LUST are willing to pay a premium to live

further away from it.

2

Sorting in the Housing Market Surrounding Disamenities: The Case of Leaking

Underground Storage Tanks

1. Introduction

The notion of sorting in housing markets was introduced by Tiebout (1956) in his well-

known answer to Samuelson’s (1954 and 1955) observation that, because people will not

reveal their true preferences for public goods (they would rather be “free-riders”), public

goods will be undersupplied by government. In the Tiebout model, households reveal

their preferences for local public goods by where they choose to live. Hamilton (1975)

extended the Tiebout model to include the use of zoning to solve a free-rider problem in

the Tiebout model when the local public goods are financed using a property tax

(households consuming low amounts of housing to lower their property taxes). Others

have presented similar models of sorting in which household locational decisions reveal

their preferences in markets for public education (Bogert and Cromwell (1997)), labor

(Wheeler (2001)), credit (Besanko and Thakor (1987), and insurance (Borenstein (1989).

Sorting happens in housing markets because when buyers purchase a house, they are also

buying into a particular location (school district, neighborhood, etc.).

Housing markets surrounding disamenities have been studied by many. In their

meta-analysis of 58 peer reviewed journal articles that focus on housing markets

surrounding disamenities, Simons and Saginor (2006) report that nearly all of these

studies find a positively sloped house-price gradient with increasing distance away from a

3

disamenity. That is, house values, holding other things constant, increase with increasing

distance away from the disamenity. In locational equilibrium, households are indifferent

with respect to how far away they live from a particular disamenity, because they are

compensated for living near a disamenity with lower house prices, other things held

constant. These studies implicitly assume that the house-price gradient (change in price

per unit distance away from the disamenity) is the same for all households.

This study allows households’ tastes for living near a disamenity to vary.

Specifically, we hypothesize that some households may have a very low aversion, while

others have a high aversion, to living near a disamenity. If low aversion households are

sufficient in number, then the house-price-gradient will contain a break-point

corresponding to the point at which the high aversion households out-bid the low

aversion households for housing. In addition, the house-price-gradient for the low

aversion households will lie above the house-price gradient for the high aversion

households at locations near the disamenity. When these conditions are met, competition

sorts the households according to their aversion to living near the disamenity, with low

aversion households living closer to the disamenity, and households with high aversion

live further away from it.

We expect that sorting around relatively mild disamenities will be more evident

than around severe disamenities, because it is more likely to find a large number of

households willing to live near a mild disamenity than a severe one. Thus, the logical

place to look for sorting of the type described above is around relatively mild

disamenities. Simons, Bowen and Sementelli (1997 and 1999) and Simons and

Sementelli (1997) have previously reported finding mild disamenity effects near leaking

4

underground storage tanks (LUST) sites, so LUST sites can represent the type of mild

disamenity around which sorting might occur1. In addition, LUST sites are typically not

found in clusters along with other types of commercial/industrial land uses. An

interesting feature of LUSTs in urban areas is that that they are almost always found

under aging, poorly maintained, commercial buildings, especially gasoline stations.

Indeed, households might consider an aging, poorly maintained, commercial building a

disamenity all by itself. Although some types of commercial land uses cluster

(automobile dealerships, restaurants), gasoline stations (selling a convenience good) tend

not to cluster in particular locations. Because the observed condition of the building

might be a proxy for the unobservable condition of the LUST under it, a disamenity

affect might be due to the condition of the LUST or the condition of the building above it

or some combination of both. Separating these two effects would be difficult, at best.

But, for the purposes of this study, this separation is not necessary. We identify mild

disamenities (LUSTS) that are located in residential neighborhoods around which sorting

might occur. LUSTs, especially gasoline stations, represent an excellent type of

disamenity for testing the theory that households reveal the extent of their aversion to a

disamenity by the price they are willing to pay to live closer or farther from it.

2. Previous Studies of LUSTs

Others have found disamenity effects near LUSTs in housing sales data. Simons, Bowen

and Sementelli (1997 and 1999) provide insight into the impact of LUSTs (and non-

leaking USTs) on property values in Cuyahoga County, Ohio (Cleveland). Although they

do not look at the impact on nearby property values, Simons and Sementelli (1997) report

5

that (commercial) LUST sites suffer in terms of salability and financing, making them

less than one-half as liquid (ability to convert into cash) than clean, comparable

commercial sites in Cleveland, Ohio. In another study, Simons, et al (1997) find about a

17% decline in the selling prices of 83 houses within 300 feet of a LUST site in

Cleveland, Ohio. Using the same data as their earlier study, Simons, et al (1999) report a

14% to 16% reduction in houses values within 300 feet of a gas station LUST site.

In a National Center for Environmental Economics (NCEE) study, Zabel and

Guignet (2010) test for a disamenity effect in houses that use well water near LUSTs and

report that proximity to a more publicized and more contaminated site can impact

surrounding home values by more than 10 percent. But, they also find that many LUSTs,

especially the less contaminated sites, have little (negative) impact on surrounding house

values.

Although they do not study LUSTs, Page and Rabinowitz (1993) study the effects

of groundwater contamination on property values. They compare rural residential

properties using well water with and without groundwater contamination and find no

difference in their prices. Dotzour (1997) examines the sales of houses near Wichita,

Kansas before and after the discovery of groundwater contamination and report no

significant differences in property values.

The findings from Zabel and Guignet, Page and Rabinowitz, and Dotzour suggest

that sometimes proximity to a disamenity may not be capitalized into lower house values,

while the Simon, et. al. studies suggest otherwise. Sorting can explain why an adverse

effect associated with proximity to a disamenity is not observed in areas close to it, yet an

adverse effect is found further away from it. If no allowances are made for the sorting,

6

the influence of one group of households can weaken or even wash-out the influence of

another group of households.

3. The Theoretical Framework

Suppose that the households in a particular city (We focus on a single city to hold any

differences in preferences for local public goods and property tax rates constant.) are of

two types (1) those with an high aversion to living near a LUST, which we shall call High

Aversion Households (HAH) and (2) those with a low aversion to living near a LUST,

which we shall call Low Aversion Households (LAH). We will also assume that the

housing market in this city is competitive, so that houses sell to the highest bidder.

Now, consider the maximum price a household is willing to pay for a

standardized house2 located a particular distance from the nearest LUST. We shall call

the relationship between this maximum price and proximity to a LUST the house price-

bid function or HPBF. As long as proximity to a LUST is undesirable, we expect in

general that the HPBF will increase with distance away from a LUST. That is, in

general, households bid less on houses that are close to a LUST than houses that are

further away from a LUST. Virtually all studies of housing prices near a potential point

source disamenity employ this type of theoretical framework.

We refine this framework by postulating that LAHs will outbid HAHs for houses

located near a LUST. That is, near a LUST, the LAH HPBF lies above the HAH HPBF.

But, at some point, which we shall call d*, HAH will outbid LAH for housing. That is,

eventually all of the LAH will have purchased a house near a LUST, and HAH will start

to compete with each other for houses further away from the LUST. Thus, the HPBF for

7

LAH will be flatter than the HPBF for LAH and the HPBFs will intersect at some

specific distance (d*) away from the nearest LUST.

Whether the HPBF intersect depends upon two factors: (1) the number of the

LAH and (2) the extent of the difference in the slopes of their HBPFs. Of course, the

second factor is driven by the difference in the aversion between the LAH and HAH.

The larger this difference, the greater the difference in their respective HBPF slopes.

Given a sufficient number of LAH and a significant difference in their degree of

aversion to living near a LUST, we can portray their HPBFs as shown in Figure 1.

Because HAH are much more averse to living near a LUST, LAH outbid HAH for any

houses that are adjacent to a LUST and as far away as d* from the LUST. Beyond d*,

HAH outbid LAH for houses, because LAH are not willing to pay the premium that HAH

are willing to pay to live further away from a LUST. The result is that competition in the

housing market will sort households around LUSTs with the LAH living nearer to a

LUST than the HAH.

Unfortunately, the entire HPBFs of the LAH and HAH cannot be observed. All

we can observe is the house-price gradient created by the highest HPBFs. If the sorting

process described above is actually present, then we should find a break-point in this

house-price gradient at d*. So, the question of whether or not sorting occurs is an

empirical question. If sufficient LAH are present in the market, then the effects of

sorting should be observable in the house-price gradients around LUSTs. But, it turns out

that formally testing for the presence of a break-point in the house-price gradient around

LUSTs is not trivial. The challenges of testing for the presence of this break-point are

discussed in the Empirical Model section.

8

4. The Study Area and Data

This study uses data from Cedar Falls, Iowa, a medium sized, Midwestern college town.

The college (University of Northern Iowa) has about 13,000 students, over 95% of which

live on or very near campus. The city has two junior high schools and several elementary

schools of nearly equivalent quality that feed students into the city’s only public high

school. The average commute time is only 14 minutes. The city is highly decentralized.

There is no dominant destination point, such as a central business district. Instead, jobs

and retail shops are disbursed throughout the city. Preliminary analysis of the data

reveals that premiums are not paid to gain proximity to any particular site in the city

(including the college), largely due, we suspect, to the short commute times from one

point to another within the city. Several small retail strip malls exist as well as a

downtown area containing several restaurants, retail shops, banks, a small hotel, and a

small theater. The downtown area is within a mile of the city’s sewage treatment plant

(STP), a source of occasional noxious odors depending on weather conditions. The city

has a newer (1995) industrial park containing primarily warehouses on its far south side

that is home to various, very low polluting industries.

4.1 Housing Sales and Census Data

Initially, the housing sales dataset contained every sale of a single-family house in the

city of Cedar Falls from January 2000 to November 20043; these sales were parsed by

using only those identified as “arm’s length transactions” by the county tax assessor’s

office. The time period (2000 to 2004) was selected in order to avoid the contamination

effects of the recent housing bubble and subsequent crash. Additional refinement

9

consists of choosing only those sales with a selling price greater than $32,000 or less than

$400,000, houses with at least three but less than 12 rooms, at least 500 square feet of

living area, and a lot size less than 3 acres.4 Several homes were repeatedly sold in this

time frame; we use only the most recent sale because the house characteristics data

corresponds to the most recent sale.

The number and type of property specific variables included in previous studies

of proximity to externalities vary widely and are driven primarily by the character of the

urban area from which the sales data is drawn. Typically, these studies contain data, in

addition to the selling price of a house, on the characteristics of the house, the

characteristics of the neighborhood in which the house is located, and the distance of the

house to important points of influence. The sales data in this study include all of the

property characteristics information available from the public records, which include the

following information (variables) for each of the 2,078 transactions: selling price, date of

sale, state-plane coordinates of the centroid of the property, year built, lot size, living

area, and number of rooms. In addition to these variables, we calculate the Euclidian

distance from each property to the STP, the nearest highway, the Cedar River, and all 50

of the LUST sites in Cedar Falls. In addition, neighborhood data was collected from the

2000 census for all 21 census block groups in the city. The neighborhood data include

the following: median rent, median year that houses were built, percent of housing units

that are owner occupied, the number of housing units, and median household income.

Table 1 contains summary statistics of the variables in the model.

4.2 LUST Data

10

Information for the 50 LUST sites in Cedar Falls was obtained from the Iowa Department

of Natural Resources (http://www.iowadnr.gov/land/ust/ustdbindex.html). Registration

of underground storage tanks (USTs) is required by federal (40 CFR parts 280 and 281)

and Iowa (Iowa Code 455.B.474) laws. In Iowa all USTs, except certain small,

residential USTs, must be registered with the Iowa DNR. These tanks must also be

inspected and assessed for their likelihood of contamination. If the site containing the

UST has evidence of contaminants that do not exceed certain delimited levels, then it is

classified as requiring no further action. If the onsite contaminants exceed the delimited

levels, but no receptors (water wells, aquifer, plastic water lines, basement, sewers, or

surface water) are within the transport plum of the contaminants, then it is rated as a low

risk LUST and must be tested annually. If receptors exist within the transport plum of

the contaminants, then it is classified as a high risk LUST and faces more intensive

monitoring, testing, and possible remediation. Figure 2 shows the location of all 2,078

sales together with the STP, the 50 LUST sites, the Cedar River, major highways, and

Census block group boundaries. The classification of each LUST is indicated in Figure 2

using different sized boxes.

These LUSTs are especially useful for this study, because they include mostly

neighborhood gasoline stations, and none of them are located near heavy industrial sites

(see Deaton and Hoehn (2004).

The potential effects of LUSTs clustering are managed using a count variable (the

number of nearby LUSTs), similar to what Ihlanfeldt and Taylor (2004) use in their

model. A distance of ¼ mile is chosen to represent “nearby.” That is, the count variable

is equal to the number of LUSTs within ¼ mile of the house (about three city blocks).

11

This particular distance is somewhat arbitrary, but it is also reasonable. If clustered

LUSTs are going to have an influence on house prices, we would expect that the multiple

LUSTs would be relatively close to nearby houses as well as each other.

4.3 The LUSTs Variables

Previous studies (Simons, Bowen, and Sementelli (1997, 1990) and Simons and

Sementelli (1997) make use of a series of three dichotomous variables representing the

three classifications of LUSTs: (1) sale within 300 feet of a leaking, unregistered tank,

(2) sale within 300 feet of a registered UST, not leaking, and (3) sale within 300 feet of a

leaking and registered tank. Simon et al (1997) report that, although the extent of the

leaks in a LUST is important, information about the extent of the leaks were not available

to them.

The percent of LUST sites in close proximity (within 300 feet) of a house that

sold during the time period in this study is higher than in the Simon, et al (1997) study.

Simon, et al (1997) report 83 out of 16,990 (0.49%) sales occurring over a five year

period being within 300 feet of the nearest UST. In this study, we find 19 out of 2,079

(0.91%) of the sales within 300 feet of the nearest LUST. This study has a greater

percentage of sales in close proximity to a LUST site, because the LUST sites in this

study are predominately neighborhood gasoline stations. Unfortunately, 19 sales is far

too small a number to replicate the Simon, et al (1997) LUST binary variable definitions.

Therefore, we use a different approach to modeling the LUST effect. Specifically, we

use the Euclidian distance to the nearest LUST site, a version of the IDNR classification

of that LUST, and the number of LUSTs within ¼ mile of the house.

12

Six of the LUST sites are classified by IDNR as High Risk (receptors exist within

the transport plum of the contaminants), four are classified as Low Risk (no receptors

within the transport plum of the contaminants), and the remaining 40 are classified as No

Further Action Required (contaminants do not exceed delimited levels). We create a

binary variable to capture the classification of the LUSTs as follows: Condition equals

one if the LUST is classified as low risk or high risk; zero if it is classified as no further

action required. We combine the low and high risk LUSTs because there are so few

(four) low risk LUSTs.

For each sale, we also include a LUST proximity variable (the Euclidian distance

to the nearest LUST). The coefficient of this variable represents the slope of the house-

price gradient and is the primary focus of this study.

In addition to the condition and the proximity variables, as mentioned above we

also include the count of the number of LUST sites within ¼ mile of each house in the

sales data set. Table 1 reveals that 597 of the 2,078 (28.7%) Cedar Falls sales have at

least one LUST site within a quarter mile and one home had eight LUST sites within a

quarter mile.

4.4 The Selection of d*

Finally, we select ¼ mile from the nearest LUST as our hypothetical value of d*.

Preliminary analysis of the data suggests that 1,320 feet, or about three city blocks,

provides a sample size of sold houses (597) sufficient to estimate the parameters of our

empirical model. Decreasing d* rapidly decreases our sample size of sold houses within

d* of a nearby LUST. Increasing d* may include so many HAH that any evidence of

13

sorting gets washed-out. So, we hypothesize that if sorting occurs in our data, then the

LAH will out-bid the HAH for housing within ¼ mile of a nearby LUST, and HAH will

out-bid LAH for housing more than ¼ mile from a nearby LUST, holding other things

that affect house values constant.

5. The Empirical Model

Ideally, an empirical test of the sorting model would consist of testing the hypotheses that

the LAH HPBF lies above and is flatter than the HAH HPBF within a distance of d* to

the nearest disamenity. A similar test could be constructed for sales beyond d*, where

the HAH HPBF should lie above and be steeper than the LAH HPBF. Unfortunately,

these ideal tests are not possible, because we cannot observe the lower HPBFs. We can

only observe the house-price gradient illustrated by the bold line in Figure 3.

Recall that the model is predicated on households between a disamenity and d*

having different tastes for living near the disamenity than households living beyond d*.

Consequently, these two distinct groups of households could also have different tastes for

the characteristics of houses, the characteristics of neighborhoods, and the distance to

points of influence. So, the statistical estimates of the coefficients of a standard

Rosen/Lancaster type model of house prices could differ for the two types of households.

Consequently, applying a simple binary variable to the proximity to a disamenity variable

will yield unreliable results, because the binary variable should have been applied to

additional or even all of the independent variables in the model.

To formally test for differences in the coefficients of the independent variables

(including the coefficient of proximity to a LUST) for LAH and HAH households, we

14

conduct an Analysis of Covariance.5 The results of this analysis demonstrate that we

must not pool the two data sets (LAH and HAH) into a single data set. Therefore, we

estimate the coefficients of the model using two separate analyses. We test whether the

proximity-to-a-LUST coefficient within each data set is greater than zero (meaning

proximity to a LUST is a disamenity) and we compare the signs and statistical

significance of the proximity-to-a-LUST coefficients from each regression to see if they

are consistent with the sorting model.

We use the popular Lancaster/Rosen hedonic type of empirical model to estimate

the coefficients of our model within the two data sets. Three key aspects of the hedonic

model used to examine housing sales data for sorting near LUSTs must be addressed,

namely, (1) the set of variables (dependent and independent) used in the model, (2) the

functional relationship among these variables, and (3) the structure of the error term of

the model. Each of these aspects of the empirical model is discussed below.

We select the selling price of a house as the dependent variable in our empirical

model. Selling price is a popular choice in hedonic models, but selling price per unit area

(dollars per square foot) is also common. The only difference between these two choices

is the role that area plays in the set of independent variables. Using selling price per

square foot of living area, for example, is equivalent to using selling price as the

dependent variable and then multiplying every independent variable by the square

footage of living area. We cannot justify using a set of independent variables implicitly

or explicitly weighted by living area. Therefore, we choose to use the selling price of the

house as our dependent variable.

15

We select independent variables that reflect the (1) structural characteristics of the

property, (2) proximity of the property to points of influence (other than a LUST), (3)

characteristics of the neighborhood in which the property is located, (4) date at which the

sale occurred, and, of course, and (5) LUST specific data. The specific independent

variables included in the model are discussed in greater detail in the next section.

Various functional forms for hedonic models are possible. The log-linear

specification is popular, in which the log of selling price is the dependent variable and the

independent variables are entered in a linear fashion. In the log-log form, all of the

variables are entered in the log form. We use a combination of the log-log form for some

independent variables and the log-linear form for the rest of the independent variables.

The manner in which the distance to the nearest LUST is entered into the model is

not a trivial matter. The functional form of this important variable must be such that it

diminishes with increasing distance from a LUST. The shape of the curves in Figure 1

can be achieved using the square root of distance to the nearest LUST (d1/2), or in general

for 0 < x < 1, using d x. The point estimate of x, together with its associated statistical

inference, will indicate the strength of aversion to living near a LUST.

The mean structure of the empirical model contains several types of explanatory

variables typically thought to explain house prices. Most broadly, these variables include

(1) site level non-spatial characteristics of the home (house size characteristics including

living area, number of rooms, parcel size, year built, and time of sale), (2) spatial

characteristics (distance to a sewage treatment plant (STP), a river, and the nearest

highway), (3) LUST level factors (such as distance to the nearest LUST; classification of

the nearest LUST, and number of LUSTs within ¼ mile), and (4) neighborhood data

16

(median rent, median year that houses were built, percent of housing units that are owner

occupied, the number of housing units, and median household income). Specifically, let

P = the selling price of the house,

S = lot size in acres,

H = the living area of the house,

t = the time of the sale,

C = number of rooms in the house,

D = a vector of variables measuring distance to points of influence,

N = a vector of neighborhood level variables,

d = distance to the nearest LUST, and

L = a vector of other LUST level variables (the classification of the nearest LUST, and

the number of LUSTSs within 0.25 miles of the sale).

We model the mean structure using the following non-linear functional form,

P=α S β 1 H β 2 dβ 3 eδt+ λC +κD+ωL+θN (1)

where the Greek letters represent parameters of the model which will be estimated using

the sales, distances, census and LUSTs data. On the natural log scale, the empirical

model (1) becomes:6

ln P=ln α+β 1 ln S+ β 2 ln H+β 3d+δt+ λC+κD+ωL+θCT (2)

The lot size (S) and house size (H) parameters, β1 and β2, represents the (constant)

elasticity of price with respect to lot size and house size, respectively (see Isakson and

Ecker (2001) and Ecker and Isakson (2005) for discussion). The key parameter, β3,

represents the slope of the house-price gradient on the log-log scale. The parameters of

the rest of the explanatory variables represent the percentage change in price given a one

unit change in the variable. Based upon our preliminary Analysis of Covariance, we

estimate the parameters of (2) for the LAH and HAH separately.

17

To estimate the parameters of (2), we must include an error or disturbance term

(ε) in it. Standard Ordinary Least Squares (OLS) regressions assume that ε ~ N (0 , τ 2)

where normality and constant variance (τ2

is a variability parameter, after accounting for

all values of the covariates) is assumed. When dealing with spatial data, such as housing

sales, the potential exists for the OLS parameter estimates to be biased, especially any

spatial distance parameter estimates in the mean structure, such as distance to the STP

and distance to the nearest LUST site. Visual examination of an empirical variogram

(Cressie, 1993) of the OLS residuals in (2) reveals the presence of spatial correlations

(see Figure 4). Therefore, efforts to remove this source of bias could be beneficial.

Spatial linear models assume that the errors are not independent, that is, two

comparable homes that are closer in space sell for a more similar price than two

comparable homes farther apart. For example, houses located near each other are also

near the same neighborhood amenities/disamenities. In this case, the selling prices of

nearby, comparable houses tend to be more highly correlated than comparable houses

further apart. We build this spatial correlation into our model by assuming that

ε ~ N (0 , τ 2+σ2 ) (3)

where τ2

is called the “nugget” , i.e. a micro-scale or measurement error variability, in

the geostatistical literature (Cressie, 1993). The sum τ2+σ2

in equation (3) is termed

the spatial variability of the spatial process or “sill” (the variability of the home prices

after adjusting for individual home characteristics). Finally, for two home sales with

errors ε i and ε j , their spatial correlation is modeled as a function of their Euclidean

distance apart, d ij . Specifically, we adopt the exponential correlation structure, i.e,

18

. (4)

The parameter φ directly controls the spatial correlation in the dataset and is termed the

“range” (technically 3φ is the exact value of the range in equation (4) (see Ecker, 2003)

for the exponential correlation structure). Thus, any two homes that are separated by a

distance of more than the range, have selling prices that are essentially uncorrelated.7

We make use of a Simultaneously-fitted Spatial Linear model (SFSL) by

combining the traditional hedonic model, (2), together with a spatial correlation

disturbance term, (3) and (4). All parameters of the SFSL model are jointly estimated via

SAS. The spatial correlation terms (3) and (4) are random effects models designed to

capture extra spatial variability not explained in the mean structure of the model. Spatial

correlation models can also help mitigate the impact of missing variables, such as

proximity to neighborhood amenities/disamenities, and incorrectly specified variables in

the model (Brasington and Hite. 2005).

6 Model Fitting and Results

Any OLS parameter estimates might be biased due to potential spatial correlation in the

data. To determine if the model given by equations (2), (3), and (4) is appropriate for the

Cedar Falls home sales data, we generate an empirical variogram from the residuals of

the OLS regression for HAHs and for LAHs. The sharp drop-off near the origin in both

empirical variograms in Figure 4 indicate that houses are more similar (less variability) as

separation distances between a pair of houses decreases, i.e., spatial correlation is present

in the data. Thus, the OLS model is inferior to the SFSL model because the SFSL model

directly incorporates the spatial correlation.

19

To fit the model given by equations (3) and (4) simultaneously with the mean

structure coefficients in equation (2), seed values are required for the range, sill and

nugget spatial parameters. We use S-Plus to fit a theoretical variogram to the empirical

variogram from the residuals from the OLS regression to obtain these seed values. The

coefficients of the model are then simultaneously estimated using Proc Mixed in SAS,

and the results are reported in Table 2.

The size variables of Living Area, Number of Rooms and Parcel Size are all

positive and strongly significant for both LAH and HAH sales. As expected, bigger

houses sell at higher prices. Year built is positive and significant for both LAH and HAH

homes showing that newer houses sell for higher prices than older houses. Date sold is

positive and significant, indicating a 5.2% annual rate of appreciation over the 4 years of

sales for HAH sales and 4.9% for LAH homes.

Distance to the STP is positive and significant at the 0.05 significance level, for

both LAH and HAH homes, suggesting that no one wants to live near the sewage

treatment plant. Distance to the nearest school is significant and negative for LAH homes

only, indicating that only LAH are willing to pay a premium to live closer to a school.

Distance to the nearest highway is marginally significant (p-value=0.0808) for LAH sales

and moderately significant (p-value=0.0323) for HAH homes. Neither group desires to

live very close to a highway. Also, HAH homes are willing to pay a premium to be close

to the river.

For the census block variables, household income is positive and statistically

significant for both LAH and HAH, but very weak in magnitude. Prices and incomes are,

not surprisingly, highly correlated. People with higher incomes tend to buy slightly more

20

expensive houses. The number of occupied units and median rent are not significant for

any of the households. Median year built is strongly significant only for HAH,

suggesting that HAH prefer newer houses, while LAH do not.

Overall, the parameter estimates for the LUST variables are consistent with the

sorting model presented earlier. The positive but non-significant distance to the nearest

LUST variable for LAH sales suggests that these buyers do not have an aversion to living

close to a LUST. The positive and strongly significant distance to the nearest LUST

variable for HAH indicates that these households are willing to pay a premium to farther

away from the nearest LUST. In particular, the coefficient for distance to the nearest

LUST for HAH sales are about three times the size of that for the LAH sales.

Specifically, HAH are willing to pay 1.037% more for a house that is 10% further away

from the nearest LUST.

In addition, the count and condition LUST variables reveal an interesting pattern.

The count variable is only relevant to the LAH, because all of the HAH are more than ¼

away from the nearest LUST. But, the LAH do not seem to mind if they live near one or

multiple LUSTs. This finding is consistent with the statistical insignificance of the

proximity-to-LUST parameter. The condition of a LUST does not seem to matter to

either group of households. Nothing about LUSTs seems to matter to LAH, while all that

matters to the HAH is their proximity to a LUST.

The spatial association parameters for the simultaneously fit mean structure in

equation (2) and spatial components in equations (3) and (4) are also reported in Table 2.

Notice that the nugget and sill estimates for the LAH and HAH sales are close while

21

range parameter is 189 feet (0.0189 times ten thousand feet) for LAH sales, compared to

330 feet for HAH sales.

7. Summary and Conclusions

This study presents the theoretical framework for a model of household sorting around

mild disamenities (LUSTs). It estimates the parameters of a Rosen/Lancaster type

hedonic model modified to include spatial correlation in the error term. Using housing

sales data from a medium sized, Midwestern city, the parameters (mean structure and

error terms) are estimated simultaneously using a maximum likelihood regression

technique. We find evidence that households within ¼ of a mile of a LUST do not have a

statistically significant aversion to living near the LUST, in sharp contrast to a significant

aversion for households living more than ¼ mile from a LUST. Neither the number of

LUSTs nearby, nor the condition of the nearest LUST seems to impact either LAH or

HAH households.

Researchers should be cautious not to pool housing sales data near disamenities,

because doing so may very well wash out any sorting effects. For example, pooling the

LAH and HAH sales together in this study yields the erroneous conclusion that proximity

to a LUST matters for all households, when in fact, it matters only for a fraction of them.

Opportunities for additional research in this area abound. For example, the

sorting process presented in this study may also be found around mild amenities. The

intensity of the disamenity also deserves attention. For example, stronger disamenities

may not show signs of sorting. Lastly, more direct statistical tests to directly compare the

proximity-to-disamenity parameters also deserve additional attention.

22

8. References

Brasington, D.M. and D. Hite (2005). Demand for environmental quality: a spatial

hedonic analysis. Regional Science and Urban Economics, 35(1), 57-82.

Besanko D. and A. V. Thakor (1987). Collateral and rationing: sorting equilibria in

monopolistic and competitive credit markets. International Economic Review,

28(3), 671-689.

Borenstein, S. (1989). The economics of costly risk sorting in competitive insurance

markets. International Review of Law and Economics, 9, 25-39.

Cressie, N (1993). Statistics for Spatial Data. New York: John Wiley and Sons.

Deaton, B.J. and J.P. Hoen (2004). Hedonic analysis of hazardous waste sites in the

presence of other urban disameties. Environmental Science and Policy, 7, 499-408.

Dotzour, M. (1997). Groundwater contamination and residential property values. The

Appraisal Journal, 65(3), 279-85.

Ecker, M.D. (2003). Geostatistics: Past, Present and Future. In Encyclopedia of Life

Support Systems (EOLSS) . Developed under the Auspices of the UNESCO,

EOLSS Publishers, Oxford, UK. [available at: www.eolss.net].

23

Ecker, M.D. and De Oliveira, V. (2008). “Bayesian Spatial Modeling of Housing Prices

Subject to a Localized Externality”. Communications in Statistics. 37, 2066-2078

Ecker, M.D. and Isakson, H. (2005). A unified convex-concave model of urban land

values. Regional Science and Urban Economics, 35, 265-277.

Ferreya, M. M. (2007). Estimating the effects of private school vouchers in multidistrict

economies. American Economic Review, 97:3, 789-817.

Figlio D. N. and M. E. Lucas (2004). What’s in a Grade? School report cards and the

housing market. American Economic Review, 94(3), 591-604.

Guasch, J.L. and A. Weiss (1980). Wages as sorting mechanisms in competitive markets

with asymmetric information: a theory of testing. Review of Economic Studies,

XLVII, 653-664.

Hamilton, B.W. (1975). Zoning and property taxation in a system of local governments.

Urban Studies 12:205-211.

Henderson, J.V. (1991). Separating Tiebout equilibrium. Journal of Urban Economics,

29:128-152.

24

Ihlandfeldt, K.R. and L.O. Taylor (2004). Externality effects of small-scale hazardous

waste sites: evidence from urban commercial property markets. Journal of

Environmental Economics and Management, 47, 117-139.

Isakson, H. and Ecker, M.D. (2008). An Analysis of the Impact of Swine CAFOs on the

Value of Nearby Houses. Agricultural Economics, 39, 1-8.

Isakson, H. and Ecker, M.D. (2001). An Analysis of the Influence of Location in the

Market for Undeveloped Urban Fringe Land. Land Economics, 77(1): 30-41.

Jackson, T.O. (2001). The effects of environmental contamination on real estate: A

literature review. Journal of Real Estate Literature, 9(2): 91-116.

Kiel, K. (1995). Measureing the impact of the discovery and cleaning of identified

hazardous waste sites on house values. Land Economics, 71, 428-435.

Kendall, T.D. (2003). Spillovers, complementarities, and sorting in labor markets with

an application to professional sports. Southern Economic Journal, 70(2).

Kohlhase, J. (1991). The impact of toxic waste sites on housing values. Journal of Urban

Economics, 30, 1-26.

25

Michaels, R. and V.K. Smith (1990). Market segmentation and valuing amenities with

hedonic models: the case of hazardous waste sites. Journal of Urban Economics,

28, 223-242.

Page, G.W. and H. Rabinowitz (1993). Groundwater contamination: its effects on

property values and cities. Journal of the American Planning Association 59(4),

473-81.

Palmquist, R. (1992). Valuing localized externalities. Journal of Urban Economics, 31,

59-62.

Simons, R.A. and Saginor, J.D. (2006). A meta – analysis of the environmental

contamination and positive amenities on residential real estate values. Journal of

Real Estate Research, 28 (1/2): 71-104.

Simons, R.A., Bowen, W.M, and Sementelli, A.J. (1997). The effect of underground

storage tanks on residential property value in Cuyahoga County, Ohio. Journal of

Real Estate Research, 14 (1/2): 29-42.

Simons, R.A. and Sementelli, A.J. (1997). Liquidity loss and delayed transactions with

leaking underground storage tanks. Appraisal Journal, July, 255-260.

26

Simons, R.A., Bowen, W.M. and Sementelli, A.J. (1999). The price and liquidity effects

of UST leaks from gas stations on adjacent contaminated property. Appraisal

Journal, April, 186-194.

Tiebout, C.M. (1956). A pure theory of local expenditures. Journal of Political

Economy, 64:416-424.

Tiemann, M, (2008). Leaking underground storage tanks: Prevention and cleanup.

Congressional Research Service (RS21201), The Library of Congress.

Wheeler, Christopher H. (2001). Search, sorting, and urban agglomeration. Journal of

Land Economics, 19(4), 879-899.

Zabel, J. and D. Guignet (2010). A hedonic analysis of the impact of LUST sites on

house prices in Frederick, Baltimore, and Baltimore City counties. National Center

for Environmental Economics Working Paper #10-01, U.S. Environmental

Protection Agency. (available at

yosemite.epa.gov/ee/epa/eed.nsf/WPNumber/2010-01/$File/2010-01.PDF).

27

Figure 1: The Conceptual Model

28

Figure 2 : Cedar Falls sales (o) and LUSTs (■) locations.

Note: Plot is in State Plane Coordinates

29

Figure 3: The Empirical Model

30

Table 1 Summary Statistics of Sales Data

ALL SALES LAH HAH N=2,078 N=597 N=1481

Variable Mean Std Dev Mean Std Dev Mean Std Dev

Sale Price $144,995 71,415 $104,831 36,481 $161,142 75,575

Structural VariablesLiving Area 1332.8 489.5 1138.6 382.2 1411.1 506.1Year Built 1960.6 28.5 1940.5 25.2 1968.7 25.6Number of Rooms 5.9 1.4 5.7 1.3 5.9 1.5Lot Size (acres) 0.27 0.16 0.22 0.12 0.28 0.16Date of Sale (0=1/1/2000)

2.56 1.39 2.55 1.42 2.57 1.37

Distances (miles)Distance to STP 3.43 1.17 3.10 0.78 3.56 1.27Distance to River 1.20 0.72 0.84 0.36 1.35 0.77Distance to School 0.44 0.27 0.36 0.16 0.47 0.43Distance to Highway 0.73 0.46 0.65 0.50 0.75 0.43

LUST VariablesDistance to LUST (miles)

0.48 0.31 0.17 0.06 0.61 0.28

Count 0.64 1.29 2.23 1.50 0 0Condition 0.11 0.31 0.16 0.37 0.09 0.28

Census Block VariablesMedian Household Income

40,817 12,015 38,152 9,718 41,892 12,670

Number Occupied Units

695.3 423.3 487.9 304.4 778.9 435.6

Percent Owner Occupied

59.3 22.4 58.0 23.9 59.8 21.7

Median Year Build 1962.3 12.8 1953.9 10.5 1965.7 12.1Median Rent 474.3 189.9 464.9 164.1 478.0 199.2Difference Median Year

-1.7 20.9 -13.4 21.7 3.0 18.5

31

Table 2: SAS results for Cedar Falls sales data including Parameter Estimate (P-value).

Note: Numbers in parentheses are p- values.

Figure 4 : Empirical Variograms created from the OLS residuals for the Cedar Falls LAH sales in a) and HAH sales in b). Note that distance is measured in 10,000’s feet.

32

LAH HAHIntercept 7.7291

(0.0001)-4.9447(0.0076)

Proximity to LUST 0.0387 (0.2527)

0.1037 (0.0086)

Count 0.0117 (0.2498)

N/A

Condition 0.0182 (0.6416)

-0.0037 (0.9425)

Distance to STP 0.1050 (0.0376)

0.09007 (.0001)

Distance to School -0.00004(0.0265)

.0000008(0.4371)

Distance to Highway .00002(0.0808)

0.00002(0.0323)

Distance to River .00001(0.4457)

-0.00002(0.0144)

Ln Living Area 0.4043 (0.0001)

0.3027 (0.0001)

Date Sold .05162 (0.0001)

0.0489 (0.0001)

Number of Rooms 0.0726 (0.0001)

0.0632 (0.0001)

Ln Parcel Size 0.1417 (0.0001)

0.1195 (0.0001)

Yr Blt – Median Yr Blt .0044(.0001)

0.0060(0.0001)

Median HH Income .00000050.0301

.0000008(0.0001)

Num Occ Units 0.0001(0.0578)

.00002(0.6349)

Pct Owner Occ -0771(0.4519)

-0.2805(0.0050)

Median Yr Blt .0001(0.9628)

0.0071(0.0005)

Median Rent -0.00005(0.6425)

0.00003(0.6786)

Nugget 0.0133 0.0086Sill 0.0346 0.0431Range 0.0189 0.0330

1 LUSTs can also be severe disamenities. In July, 2011, a Maryland jury awarded 154 property owners $1.495 billion for MTBE contamination of their drinking water from a nearby LUST (Allison v. Exxon).

2 A standardized house is one with a given set of characteristics, such as lot size, living area, number of bedrooms, etc. The term house in the remainder of this study will refer to a standardized house.

3 The authors thank the Black Hawk County Board of Supervisors for providing the housing sales data used in this study. Of course, any opinions expressed in this study are strictly those of the authors.

4 Because the original data set contained all transactions of houses, it included a few very small structures (as small as 100 square feet) that sold for very low prices (as low as $7,500) as well as a few very large houses with large amounts of land that sold for as much as $2.5 million. These outliers were excluded because they were not representative of the typical house in Cedar Falls, Iowa.

5 The results of this analysis of covariance are available from the authors upon request.

6 Equations 1 and 2 do not yet contain an error term. The structure of the error term is discussed later in this section.

7 Ihlanfeldt and Taylor (2004) use a SAR for the error term in their model.

Analysis of the Effects on Housing Values of …faculty.chas.uni.edu/~ecker/Lusts.docx · Web...

Documents

Transcript of Analysis of the Effects on Housing Values of …faculty.chas.uni.edu/~ecker/Lusts.docx · Web...