gis spatial estimation - Spatial Hydrology - Home · following preliminary sampling!12. ......

Spatial estimation

How is spatial estimation

commonly used?

��1

Spatial estimation: Outline

Introduction

Sampling

Spatial interpolation methods

Spatial prediction methods

Core area mapping

��2

Spatial estimation: Intro

��3

What are examples of each?

Spatial Interpolation

Prediction of variables at unmeasured locations based on sampling of same variables at known locations

Spatial Prediction

Prediction of a variable at unmeasured locations based in part on other variables

��4

Spatial estimation: Intro

Core area delineation:

Delineate areas of high utilization, density, intensity, probability of occurrence

Core areas may consist of a home range, center of criminal activity, or center of business activity

��5

Spatial estimation: Sampling

We control location and number of samples

With additional samples the law of diminishing returns

Diminishing returns not usually a problem

Funding limits samples

Few established guidelines for determining number of samples needed for interpolation

��6

Interpolating ETr for irrigation management

For agricultural water management or climate change studies weather station placement is of paramount importance

If the sampling scheme includes anomalous areas then the resulting interpolation could be very biased

��7

Sampling

What is each sampling scheme called?

��8

Spatial estimation: Systematic sampling

Not appropriate if there is greater interest in specific areas within the study area

Travel to each point can be difficult or costly

Systematic can be problematic

Roads are laid out systematically

Geologic features can vary systematically

��9

Spatial estimation: Random Sampling

Random samples are less likely to correspond to systematic landscape variation

Little chance of inaccurate predictions due to biased sampling

Random sampling can cause more sampling than necessary in uniform areas and undersample variable areas

Difficult to implement over large areas

��10

Spatial estimation: Cluster sampling

Cluster centers located randomly or systematically

Reduce travel time between sample locations

Used for mineral exploration and by U.S. Forest Service to conduct forest productivity and condition surveys

Good for natural resources applications requiring significant off-road travel

��11

Spatial estimation: Adaptive samplingHigher sampling densities where the feature of interest is more variable

Increases sampling efficiency

In some cases it is obvious where an attribute is variable (elevation)

Otherwise sampling intensity decisions are made on the fly or after lab results following preliminary sampling

��12

Other aspects of sampling design

Other sampling design strategies are appropriate for specific applications

Sampling can be performed with or without replacement

This is only a problem if linear or area features are the sampling units

��13

Spatial Interpolation Methods

Combine sampled values and their positions to estimate values at unsampled points

Spatial interpolation method variables:

Observation weights and how we choose these weights

Use all or only some observations

Use different mathematical functions

��14

Sample dataset: What is unrealistic?

��15

Thiessen polygon interpolation

��16

Nearest neighbor / Thiessen polygon

Assign value of unmeasured location to nearest sample point value

Polygon output

Abrupt transition between polygon edges

Exact interpolator

Interpolated surface equals sampled values at each sample point

��17

Fixed radius Local averaging

��18


��19

Fixed radius interpolation

��20


Moving average of samples within radius

Radius is centered sequentially at each raster cell center

No value assigned if no samples fall in search radius

Neighboring cell values usually similar

Smooths sample data

��21


Extreme values are maintained only if no other values are present in the search radius

The search radius chosen strongly affects the resultant surface

Not exact interpolator

Too large of a search radius will smooth the surface too much

��22

Inverse distance weighting (IDW)

Estimates values at unknown locations based on sample values and distance to nearby points

Closest points have strongest influence

��23

n and i for IDW

n = distance exponent

i = number of sample points used to estimate a raster cell center value

What effects do n and i have on interpolated surface?

What does this equation become when d approaches zero?

zj =

zidniji

∑1dniji

∑

��24

IDW

An exponent (n) can be applied to the distance to influence the rate at which the influence of a point wanes with distance

IDW is an exact interpolator

As distance becomes very small, its inverse approaches infinity, eliminating the influence of any other points

Creates smooth raster surfaces

��25

IDW

Larger exponent (n):

Increases the influence on closer points

Higher peaks, lower valleys, steeper gradients near sampled points

��26

IDW

• Lower exponent (n) reduces the influence of proximal points, smoothing resultant surface

Spatial interpolation: Splines

Fit polynomial function to measured values:

Minimizes surface curvature—the sum of the squares of the second derivative terms

Smooth surface that passes through input points

Interpolated surface is differentiable

��28

Spatial prediction

Statistical fitting process using coordinate locations and independent variables

Waldo Tobler: “everything in the universe is related to everything else, but closer things are more related.”

Spatial autocorrelation is the tendency of nearby values to be more similar than distant values

��29

Spatial prediction: Autocorrelation

��30

Moran’s I measures spatial autocorrelation

0 low correlation

1 positive autocorrelation

-1 anti-correlated

��31

4

Spatial cross-correlation

In the absence of spatial autocorrelation and cross correlation what is the best estimate of values at unmeasured locations?

��32

Spatial prediction: Correlation

The degree of spatial autocorrelation can vary over space

Spatial prediction methods incorporate knowledge of the correlation structure of the data into predicting unmeasured locations

Cross-correlation is the tendency for two variables to change in concert

��33

Spatial prediction: Regression

Predict output variable distribution as a function of several inputs

Combine measurements of temperature with elevation, latitude, and longitude to predict temperature in a statistical model

��34

Spatial prediction: Trend surface

Not exact predictors

Most accurate when fitting smoothly varying surfaces

Mean daily temperature

No Bulls-Eyes

Not appropriate for modeling complex surfaces (elevation in a mountainous region)

��35

Spatial prediction: Trend surface

��36

Spatial prediction: Kriging

Developed in early 1900s by D.G. Krige and Georges Matheron for mining

They aimed to develop a spatial prediction method that:

accounts for trends, spatial autocorrelation, and random (stochastic) variation

estimates local prediction variance

��37

Spatial prediction: Kriging and Co-Kriging

Based on regionalized variable theory

Model components:

Spatial trend

Local spatial autocorrelation

Random (stochastic) variation

Weights are chosen to be statistically optimal

��38

Kriging

h: lag distance

lag distance is the distance between sample points

��39

Kriging

Lag tolerance is a distance between points that is considered small so that pairs of points can be grouped together for spatial covariance calculations

��40

Semi-variogram anatomy

Nugget: Semivariance when autocorrelation is highest, at a lag distance of zero

Sill: Point where variogram levels off

Inherent or background variation when there is little spatial autocorrelation

Range: Lag distance where the sill is reached

��41

Semi-variogram

Semi-variance measures spatial autocorrelation for a given lag distance

��42

Semi-variogram fit to data

Semivariance is smaller at small lag distances and plateaus as lag distance increases

��43

Determining kriging weights

Optimal weights sum to zero and are calculated to minimize variance using a system of equations

��44

Kriging

��45

Co-kriging

Kriging with another correlated predictive variable in addition to the variable of interest

Good for situations where one variable is widely or easily measured and measurements of the other variable are more scarce or expensive

Temperature (primary), Elevation (secondary)

��46

Spatial estimation: Interpolation accuracy

Difference between measured and interpolated values when several sample points are withheld

No single method is more accurate than any other for all applications

Each has advantages and disadvantages

Test several methods before choosing one

��47

Spatial prediction: Interpolation accuracy

��48

Spatial prediction: Interpolation accuracy

RMSE is very common

Squaring errors magnifies the effects of outliers possibly leading to an overly pessimistic error estimate in some cases

Mean absolute error is less common

Less sensitive to outliers

��49

Interpolation accuracy

How accurately does the estimated surface match the actual surface?

Generally, use the most accurate model

A slightly less accurate model may be preferable if it is simpler or easier to implement

��50

Cross-validation

Accuracy assessment requires collection of observed and predicted values for a set of points

Cross-validation is the process of fitting a surface iteratively each time withholding one point to derive an accuracy estimate

A statistical package such as R can be used for advanced geostatistical analysis

��51

Spatial estimation: Sampling

How can we determine if sampling is too sparse to interpolate a surface in between measurement locations?

Why might it be beneficial to have two measurement locations very nearby?

��52

Why might we develop a core area map?

Core area is a primary area of activity for an organism (endangered species), humans (criminals), objects, or resources

Delineate area features from a set of point or line observations

Additional resources or protections may be devoted to this area or habitat requirements can be inferred

��53

What limitations exist in using mean circles?Mean center is the average x, y coordinate

Circles of maximum distance, standard error, or mean distance can define variability around the center

��54

Disadvantages of mean circles

Distributions may exhibit irregular shapes

Outliers can bias both the mean center and parametric mean circles

Core area can be over-estimated

��55

Spatial estimation: Convex Hulls

Identify irregularly shaped core areas

Smallest polygon created by edges that:

completely enclose a set of points

all exterior angles between edges >=180

Outliers will cause convex hulls to overestimate core areas

��56

Concave and Convex hulls

Simple

No subjectivity

Ignore clustering

Information on frequency and density is lost

��57

Convex Hull algorithm

Identify an extreme point

Largest or smallest x or y

Find the point with the largest angle of deflection

��58

Characteristic Hull Polygons

Delaunay triangulation to develop a set of minimum spanning triangles

Remove largest area or longest perimeter subset

Reduces influence of outliers

Allows for holes within a core area

Depends on threshold value chosen

��59

��60

Kernel Density Mapping can identify core areas

Continuous density surface estimated from a set of points

Mathematically flexible

Easy to implement

Robust to outliers and irregular shapes

Incorporates clustered samples

��61

Kernel density

��62

Kernel density: bandwidth effects

��63

Kernel density

Collect samples

User selects bandwidth (h) that defines spread

Narrower bandwidths create higher, narrower peaks

Choose kernel density function and sum

��64

Kernel density with Gaussian density distrib.

��65

Kernel density

Density or probability of occurrence of the variable

Crime density

Habitat utilization density

Defect density

Core areas delineated from kernel density depend on bandwidth selection

��66

Bandwidth effects on core area delineation

��67

Gaussian density

distribution by

bandwidth

Core area

Time-Geographic Density Estimation

Home range analysis

Combines a sequential set of control (GPS) points with maximum velocity information to estimate spatial occurrence probability

��68

Time-Geographic Density Estimation

A geoellipse describes the furthest an object could have reached given known end points and known maximum velocity

��69

What controls ellipse shape?

��70

Controls on Ellipse Shape

As more time passes ellipse becomes larger

Ellipse will approach a straight line as the distance between points approaches the maximum set by the maximum velocity

Ellipses approach circular shape if object moves short distance over long time interval

Densities highest where points cluster or paths intersect

��71

From b to d maximum velocity increasesThis figure assumes uniform distribution

We could assume a linear decay function, that occurrence probability is highest near the connecting line

��72

Primary sources

http://www.paulbolstad.net/gisbook.html

��73

http://www.paulbolstad.net/gisbook.html

gis spatial estimation - Spatial Hydrology - Home · following preliminary sampling!12. ......

Documents

Transcript of gis spatial estimation - Spatial Hydrology - Home · following preliminary sampling!12. ......