gis spatial estimation - Spatial Hydrology - Home · following preliminary sampling!12. ......
Transcript of gis spatial estimation - Spatial Hydrology - Home · following preliminary sampling!12. ......
Spatial estimation
How is spatial estimation
commonly used?
���1
Spatial estimation: Outline
Introduction
Sampling
Spatial interpolation methods
Spatial prediction methods
Core area mapping
���2
Spatial estimation: Intro
���3
What are examples of each?
Spatial Interpolation
Prediction of variables at unmeasured locations based on sampling of same variables at known locations
Spatial Prediction
Prediction of a variable at unmeasured locations based in part on other variables
���4
Spatial estimation: Intro
Core area delineation:
Delineate areas of high utilization, density, intensity, probability of occurrence
Core areas may consist of a home range, center of criminal activity, or center of business activity
���5
Spatial estimation: Sampling
We control location and number of samples
With additional samples the law of diminishing returns
Diminishing returns not usually a problem
Funding limits samples
Few established guidelines for determining number of samples needed for interpolation
���6
Interpolating ETr for irrigation management
For agricultural water management or climate change studies weather station placement is of paramount importance
If the sampling scheme includes anomalous areas then the resulting interpolation could be very biased
���7
Sampling
What is each sampling scheme called?
���8
Spatial estimation: Systematic sampling
Not appropriate if there is greater interest in specific areas within the study area
Travel to each point can be difficult or costly
Systematic can be problematic
Roads are laid out systematically
Geologic features can vary systematically
���9
Spatial estimation: Random Sampling
Random samples are less likely to correspond to systematic landscape variation
Little chance of inaccurate predictions due to biased sampling
Random sampling can cause more sampling than necessary in uniform areas and undersample variable areas
Difficult to implement over large areas
���10
Spatial estimation: Cluster sampling
Cluster centers located randomly or systematically
Reduce travel time between sample locations
Used for mineral exploration and by U.S. Forest Service to conduct forest productivity and condition surveys
Good for natural resources applications requiring significant off-road travel
���11
Spatial estimation: Adaptive samplingHigher sampling densities where the feature of interest is more variable
Increases sampling efficiency
In some cases it is obvious where an attribute is variable (elevation)
Otherwise sampling intensity decisions are made on the fly or after lab results following preliminary sampling
���12
Other aspects of sampling design
Other sampling design strategies are appropriate for specific applications
Sampling can be performed with or without replacement
This is only a problem if linear or area features are the sampling units
���13
Spatial Interpolation Methods
Combine sampled values and their positions to estimate values at unsampled points
Spatial interpolation method variables:
Observation weights and how we choose these weights
Use all or only some observations
Use different mathematical functions
���14
Sample dataset: What is unrealistic?
���15
Thiessen polygon interpolation
���16
Nearest neighbor / Thiessen polygon
Assign value of unmeasured location to nearest sample point value
Polygon output
Abrupt transition between polygon edges
Exact interpolator
Interpolated surface equals sampled values at each sample point
���17
Fixed radius Local averaging
���18
Fixed radius Local averaging
���19
Fixed radius interpolation
���20
Fixed radius Local averaging
Moving average of samples within radius
Radius is centered sequentially at each raster cell center
No value assigned if no samples fall in search radius
Neighboring cell values usually similar
Smooths sample data
���21
Fixed radius Local averaging
Extreme values are maintained only if no other values are present in the search radius
The search radius chosen strongly affects the resultant surface
Not exact interpolator
Too large of a search radius will smooth the surface too much
���22
Inverse distance weighting (IDW)
Estimates values at unknown locations based on sample values and distance to nearby points
Closest points have strongest influence
���23
n and i for IDW
n = distance exponent
i = number of sample points used to estimate a raster cell center value
What effects do n and i have on interpolated surface?
What does this equation become when d approaches zero?
zj =
zidniji
∑1dniji
∑
���24
IDW
An exponent (n) can be applied to the distance to influence the rate at which the influence of a point wanes with distance
IDW is an exact interpolator
As distance becomes very small, its inverse approaches infinity, eliminating the influence of any other points
Creates smooth raster surfaces
���25
IDW
Larger exponent (n):
Increases the influence on closer points
Higher peaks, lower valleys, steeper gradients near sampled points
���26
IDW
• Lower exponent (n) reduces the influence of proximal points, smoothing resultant surface
Spatial interpolation: Splines
Fit polynomial function to measured values:
Minimizes surface curvature—the sum of the squares of the second derivative terms
Smooth surface that passes through input points
Interpolated surface is differentiable
���28
Spatial prediction
Statistical fitting process using coordinate locations and independent variables
Waldo Tobler: “everything in the universe is related to everything else, but closer things are more related.”
Spatial autocorrelation is the tendency of nearby values to be more similar than distant values
���29
Spatial prediction: Autocorrelation
���30
Moran’s I measures spatial autocorrelation
0 low correlation
1 positive autocorrelation
-1 anti-correlated
���31
4
Spatial cross-correlation
In the absence of spatial autocorrelation and cross correlation what is the best estimate of values at unmeasured locations?
���32
Spatial prediction: Correlation
The degree of spatial autocorrelation can vary over space
Spatial prediction methods incorporate knowledge of the correlation structure of the data into predicting unmeasured locations
Cross-correlation is the tendency for two variables to change in concert
���33
Spatial prediction: Regression
Predict output variable distribution as a function of several inputs
Combine measurements of temperature with elevation, latitude, and longitude to predict temperature in a statistical model
���34
Spatial prediction: Trend surface
Not exact predictors
Most accurate when fitting smoothly varying surfaces
Mean daily temperature
No Bulls-Eyes
Not appropriate for modeling complex surfaces (elevation in a mountainous region)
���35
Spatial prediction: Trend surface
���36
Spatial prediction: Kriging
Developed in early 1900s by D.G. Krige and Georges Matheron for mining
They aimed to develop a spatial prediction method that:
accounts for trends, spatial autocorrelation, and random (stochastic) variation
estimates local prediction variance
���37
Spatial prediction: Kriging and Co-Kriging
Based on regionalized variable theory
Model components:
Spatial trend
Local spatial autocorrelation
Random (stochastic) variation
Weights are chosen to be statistically optimal
���38
Kriging
h: lag distance
lag distance is the distance between sample points
���39
Kriging
Lag tolerance is a distance between points that is considered small so that pairs of points can be grouped together for spatial covariance calculations
���40
Semi-variogram anatomy
Nugget: Semivariance when autocorrelation is highest, at a lag distance of zero
Sill: Point where variogram levels off
Inherent or background variation when there is little spatial autocorrelation
Range: Lag distance where the sill is reached
���41
Semi-variogram
Semi-variance measures spatial autocorrelation for a given lag distance
���42
Semi-variogram fit to data
Semivariance is smaller at small lag distances and plateaus as lag distance increases
���43
Determining kriging weights
Optimal weights sum to zero and are calculated to minimize variance using a system of equations
���44
Kriging
���45
Co-kriging
Kriging with another correlated predictive variable in addition to the variable of interest
Good for situations where one variable is widely or easily measured and measurements of the other variable are more scarce or expensive
Temperature (primary), Elevation (secondary)
���46
Spatial estimation: Interpolation accuracy
Difference between measured and interpolated values when several sample points are withheld
No single method is more accurate than any other for all applications
Each has advantages and disadvantages
Test several methods before choosing one
���47
Spatial prediction: Interpolation accuracy
���48
Spatial prediction: Interpolation accuracy
RMSE is very common
Squaring errors magnifies the effects of outliers possibly leading to an overly pessimistic error estimate in some cases
Mean absolute error is less common
Less sensitive to outliers
���49
Interpolation accuracy
How accurately does the estimated surface match the actual surface?
Generally, use the most accurate model
A slightly less accurate model may be preferable if it is simpler or easier to implement
���50
Cross-validation
Accuracy assessment requires collection of observed and predicted values for a set of points
Cross-validation is the process of fitting a surface iteratively each time withholding one point to derive an accuracy estimate
A statistical package such as R can be used for advanced geostatistical analysis
���51
Spatial estimation: Sampling
How can we determine if sampling is too sparse to interpolate a surface in between measurement locations?
Why might it be beneficial to have two measurement locations very nearby?
���52
Why might we develop a core area map?
Core area is a primary area of activity for an organism (endangered species), humans (criminals), objects, or resources
Delineate area features from a set of point or line observations
Additional resources or protections may be devoted to this area or habitat requirements can be inferred
���53
What limitations exist in using mean circles?Mean center is the average x, y coordinate
Circles of maximum distance, standard error, or mean distance can define variability around the center
���54
Disadvantages of mean circles
Distributions may exhibit irregular shapes
Outliers can bias both the mean center and parametric mean circles
Core area can be over-estimated
���55
Spatial estimation: Convex Hulls
Identify irregularly shaped core areas
Smallest polygon created by edges that:
completely enclose a set of points
all exterior angles between edges >=180
Outliers will cause convex hulls to overestimate core areas
���56
Concave and Convex hulls
Simple
No subjectivity
Ignore clustering
Information on frequency and density is lost
���57
Convex Hull algorithm
Identify an extreme point
Largest or smallest x or y
Find the point with the largest angle of deflection
���58
Characteristic Hull Polygons
Delaunay triangulation to develop a set of minimum spanning triangles
Remove largest area or longest perimeter subset
Reduces influence of outliers
Allows for holes within a core area
Depends on threshold value chosen
���59
���60
Kernel Density Mapping can identify core areas
Continuous density surface estimated from a set of points
Mathematically flexible
Easy to implement
Robust to outliers and irregular shapes
Incorporates clustered samples
���61
Kernel density
���62
Kernel density: bandwidth effects
���63
Kernel density
Collect samples
User selects bandwidth (h) that defines spread
Narrower bandwidths create higher, narrower peaks
Choose kernel density function and sum
���64
Kernel density with Gaussian density distrib.
���65
Kernel density
Density or probability of occurrence of the variable
Crime density
Habitat utilization density
Defect density
Core areas delineated from kernel density depend on bandwidth selection
���66
Bandwidth effects on core area delineation
���67
Gaussian density
distribution by
bandwidth
Core area
Time-Geographic Density Estimation
Home range analysis
Combines a sequential set of control (GPS) points with maximum velocity information to estimate spatial occurrence probability
���68
Time-Geographic Density Estimation
A geoellipse describes the furthest an object could have reached given known end points and known maximum velocity
���69
What controls ellipse shape?
���70
Controls on Ellipse Shape
As more time passes ellipse becomes larger
Ellipse will approach a straight line as the distance between points approaches the maximum set by the maximum velocity
Ellipses approach circular shape if object moves short distance over long time interval
Densities highest where points cluster or paths intersect
���71
From b to d maximum velocity increasesThis figure assumes uniform distribution
We could assume a linear decay function, that occurrence probability is highest near the connecting line
���72
Primary sources
http://www.paulbolstad.net/gisbook.html
���73