Statistical Bases for Map Reconstructions and Comparisons

57
1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

description

Statistical Bases for Map Reconstructions and Comparisons. Jerry Platt May 2005. Preliminaries. Motivation Do Different Maps “Differ”? Methods Singular-Value Decomposition Multidimensional Scaling and PCA Mantel Permutation Test Procrustean Fit and Permu. Test - PowerPoint PPT Presentation

Transcript of Statistical Bases for Map Reconstructions and Comparisons

Page 1: Statistical Bases for Map Reconstructions and Comparisons

1

Statistical Bases for Map Reconstructions and Comparisons

Jerry Platt

May 2005

Page 2: Statistical Bases for Map Reconstructions and Comparisons

2

Preliminaries

Page 3: Statistical Bases for Map Reconstructions and Comparisons

3

Outline• Motivation

– Do Different Maps “Differ”?

• Methods

– Singular-Value Decomposition

Multidimensional Scaling and PCA

Mantel Permutation TestProcrustean Fit and Permu. TestBidimensional Regression

• Working Example– Locational Attributes of Eight URSB Campuses

Page 4: Statistical Bases for Map Reconstructions and Comparisons

4

Motivation• Comparing Maps Over Time

Accuracy of a 14th Century MapLeader Image Change in Great BritainWhere IS Wall Street, post-9/11?

• Comparing Maps Among Sub-samplesThings People Fear, M v. F Face-to-Face Comparisons

• Comparing Maps Across AttributesCompetitive Positioning of FirmsChinese Provinces & Human Dev. Indices

Page 5: Statistical Bases for Map Reconstructions and Comparisons

5

Accuracy of a 14th Century Map

http://www.geog.ucsb.edu/~tobler/publications/pdf_docs/geog_analysis/Bi_Dim_Reg.pdf

Page 6: Statistical Bases for Map Reconstructions and Comparisons

6http://www.mori.com/pubinfo/rmw/two-triangulation-models.pdf

Page 7: Statistical Bases for Map Reconstructions and Comparisons

7http://igeographer.lib.indstate.edu/pohl.pdf

Page 8: Statistical Bases for Map Reconstructions and Comparisons

8

http://www.analytictech.com/borgatti/papers/borgatti%2002%20-%20A%20statistical%20method%20for%20comparing.pdf

Things People Fear, F v. M

Page 9: Statistical Bases for Map Reconstructions and Comparisons

9http://www.multid.se/references/Chem%20Intell%20Lab%20Syst%2072,%20123%20(2004).pdf

Face-to-Face Comparisons

Page 10: Statistical Bases for Map Reconstructions and Comparisons

10http://www.gsoresearch.com/page2/map.htm

Page 11: Statistical Bases for Map Reconstructions and Comparisons

11

Page 12: Statistical Bases for Map Reconstructions and Comparisons

12

MethodsEigen-Analysis and Singular-Value Decomposition

Multidimensional Scaling & Principal Comps.

Mantel Permutation Test

Procrustean Fit and Permutation Test

Bidimensional Regression

Page 13: Statistical Bases for Map Reconstructions and Comparisons

13

Eigen-analysis

• C = an NxN variance-covariance matrix

• Find the N solutions to C = = the N Eigenvalues, with 1 ≥ 2 ≥ …

= the N associated Eigenvectors

• C = LDL’, where

L = matrix of s

D = diagonal matrix of s

Page 14: Statistical Bases for Map Reconstructions and Comparisons

14

Singular Value Decomposition

• Every NxP matrix A has a SVD

• A = U D V’

• Columns of U = Eigenvectors of AA’

• Entries in Diagonal Matrix D = Singular Values

= SQRT of Eigenvalues of either AA’ or A’A

• Columns of V = Eigenvectors of A’A

Page 15: Statistical Bases for Map Reconstructions and Comparisons

15

SVD

Page 16: Statistical Bases for Map Reconstructions and Comparisons

16

Principal Component Analysis

• A is a column-centered data matrix

• A = U D V’

• V’ = Row-wise Principal Components

• D ~ Proportional to variance explained

• UD = Principal Component Scores

• DV’ = Principle Axes

Page 17: Statistical Bases for Map Reconstructions and Comparisons

17

Multidimensional Scaling• A is a column-centered dissimilarity matrix

• B =

• B = U D V’

• B = XX’, where X = UD1/2

• Limit X to 2 Columns Coordinates to 2d MDS

'

1'

1

2

1 2 iiN

IAiiN

I

Page 18: Statistical Bases for Map Reconstructions and Comparisons

18

A RandomPermutation

Test

Given DissimilarityMatrices A and B:

N! Permutations37! = 1.4*E+43 8! = 40,320

Page 19: Statistical Bases for Map Reconstructions and Comparisons

19

Permutation Tests

PermuteList & rerun

ObservedTestStatisticTS = 25# CorrectOf 37 SB.

Is 25Significantly> 18.5?

Ho: TS = 18.5HA: TS > 18.5

P = .069P > .05Do NotReject Ho

Page 20: Statistical Bases for Map Reconstructions and Comparisons

20

Page 21: Statistical Bases for Map Reconstructions and Comparisons

21http://www.entrenet.com/~groedmed/greekm/mythproc.html

Page 22: Statistical Bases for Map Reconstructions and Comparisons

22

http://www.zoo.utoronto.ca/jackson/pro2.html

Centering &Scaling

MirrorReflection

Rotation &Dilation toMin ∑(є2)

Page 23: Statistical Bases for Map Reconstructions and Comparisons

23

Procrustean Analysis

• Two NxP data configurations, X and Y• X’Y = U D V’• H = UV• OLS Min SSE = tr ∑(XH-Y)’(XH-Y)

= tr(XX’) + tr(YY’) -2tr(D)

= tr(XX’) + tr(YY’) – 2tr(VDV’)

Page 24: Statistical Bases for Map Reconstructions and Comparisons

24

OLS Regression

• Y = X + • Y = Xb + e• X = UDV’• b = VrD-1Ur’Y, where r = first r columns (N>P)

• b = (X’X)-1X’Y

• b = VrVr’ • Estimated Y values = Ur Ur’Y

Page 25: Statistical Bases for Map Reconstructions and Comparisons

25

Bidimensional Regression• (Y,X) = Coordinate pair in 2d Map 1

Y = 0 + 0X

• (A,B) = Coordinate pair in 2d Map 2

E[A] 1 1 -2 X 1

E[B] 1 2 1 Y 2

1 = Horizontal Translation

2 = Vertical Translation

= Scale Transformation = SQRT(12

+ 22)

= Angle Transformation = TAN-1(2 / 1 ) +1800

= + +

Iff 1 < 0

Page 26: Statistical Bases for Map Reconstructions and Comparisons

26

Althoughr = 1,differ inlocation,scale, andangles ofrotationaroundorigin (0,0)

Horizontal& VerticalTranslation

Angle ofrotationaroundorigin (0,0)

Scaletransform,with < 1 ifcontration,& > 1 ifexpansion

Page 27: Statistical Bases for Map Reconstructions and Comparisons

27

Working Example

• Eight URSB Campuses– RD, BK, TO, RC, SA, RV, SD, TA

• Data Sources– Locations– Housing Attributes– Tapestry Attributes

• Data Analyses

Page 28: Statistical Bases for Map Reconstructions and Comparisons

28

Eight URSB Campuses

Page 29: Statistical Bases for Map Reconstructions and Comparisons

29

87.5 miles

88.1 miles

Page 30: Statistical Bases for Map Reconstructions and Comparisons

30

Page 31: Statistical Bases for Map Reconstructions and Comparisons

31

Page 32: Statistical Bases for Map Reconstructions and Comparisons

32

Page 33: Statistical Bases for Map Reconstructions and Comparisons

33

EXAMPLE: Eight URSB Campuses

Page 34: Statistical Bases for Map Reconstructions and Comparisons

34

Page 35: Statistical Bases for Map Reconstructions and Comparisons

35

SD

TA

RDRVRCBK

TOSA

Page 36: Statistical Bases for Map Reconstructions and Comparisons

36

… and if DISTANCES available, but COORDINATES Unavailable?

• Treat Distance Matrix as Dissimilarity Matrix

• Apply Multidimensional Scaling

• Apply the two-dimension solution “as if” it represents latitude and longitude coordinates

Page 37: Statistical Bases for Map Reconstructions and Comparisons

37

Distance Estimates Vary

… But Not “Significantly”

Page 38: Statistical Bases for Map Reconstructions and Comparisons

38

MDS RepresentationInput = D; Output = 2d

D8x8

Page 39: Statistical Bases for Map Reconstructions and Comparisons

39

SD

TA

RD

RVRC

BK

TO

SA

Errors“appear”

to bequitesmall

…BUT

is therea wayto test

if errorsare

“STATSIGNIF”

?

Page 40: Statistical Bases for Map Reconstructions and Comparisons

40

Mantel Test

Page 41: Statistical Bases for Map Reconstructions and Comparisons

41

Procrustean Test:MDS Map Recreation

CONCLUDE: Near-perfect Map Recreation

Page 42: Statistical Bases for Map Reconstructions and Comparisons

42

Driving Distances

Do these differ “significantly” from linear distances?

STATISTICAL PRACTICAL

Page 43: Statistical Bases for Map Reconstructions and Comparisons

43

DriveD = Driving DistancesEight URSB Locations

Multidimensional Scaling,with 2-dimension solution

Page 44: Statistical Bases for Map Reconstructions and Comparisons

44

SD

TA

RD

RVRC

BK

TO

SA

Page 45: Statistical Bases for Map Reconstructions and Comparisons

45

Page 46: Statistical Bases for Map Reconstructions and Comparisons

46

Bidimensional Regression:AB on YX

Page 47: Statistical Bases for Map Reconstructions and Comparisons

47

PROTEST Comparison

BidimensionalRegression

ProcrusteanRotation

Page 48: Statistical Bases for Map Reconstructions and Comparisons

48

Housing

Page 49: Statistical Bases for Map Reconstructions and Comparisons

49

Tapestry (ESRI)

Page 50: Statistical Bases for Map Reconstructions and Comparisons

50

Map Coordinates as Explanatory Variables in Linear Models

Page 51: Statistical Bases for Map Reconstructions and Comparisons

51

Incremental Tests

So Map Coordinates seem sufficient as predictors

Page 52: Statistical Bases for Map Reconstructions and Comparisons

52

Proxy Measures of lat-longin Linear Model

Translations& Transforms

Reduce 8

And ↑ R2

Page 53: Statistical Bases for Map Reconstructions and Comparisons

53

Robust criterionwould help here:

Min (Med(є2))

Page 54: Statistical Bases for Map Reconstructions and Comparisons

54

Bidimensional Regression

Is There a Linear RelationshipBetween Housing and Tapestry

Data?

r = 0.5449

MustStandardize

Data

Page 55: Statistical Bases for Map Reconstructions and Comparisons

55

Page 56: Statistical Bases for Map Reconstructions and Comparisons

56

It’s Still a 3-d World

Page 57: Statistical Bases for Map Reconstructions and Comparisons

57