Citing and understanding spatial references for eResearch: Spatial Identifier Reference Framework...
Transcript of Citing and understanding spatial references for eResearch: Spatial Identifier Reference Framework...
www.csiro.au
Citing and understanding spatial references for eResearch: Spatial Identifier Reference Framework (SIRF)Paul Box, Terry Rankine, Rob Atkinson, Simon CoxCSIROeResearch 2014 , Melbourne
DIGITAL PRODUCTIVITY FLAGSHIP
Understanding and citing spatial references - SIRF | Paul Box
Understanding and citing spatial identifiers
• Spatial identifiers – nature and use• Current challenges & future directions• Spatial Identifiers Reference Framework - SIRF
2 |
Understanding and citing spatial references - SIRF | Paul Box
Multi-disciplinary perspectives
3 |
The Critical ZoneEarth's permeable near-surface layer... from the tops of the trees to the bottom of the groundwater.
Hydrology, geology, soil science, biology, ecology, geochemistry, + anthropogenic.
Source: - US National Science Foundation Critical Zone Observatory (CZO)
• Multiple domains, organizations, scientific disciplines
• Information – multiple scales, sources, structure, semantics, format & quality
• Different phenomena & reporting geographies• Rapid information integration from multiple
sources • Reliably link spatial + O&M• Reliably link reporting geographies
The present - information silos
System 1
System 2
System 3
System 5 System
4
System n
System 7
$$
$$$
$ $
Discover Access Understand Extract, Transform, Load
Use
Time and effort
Everything Happens Somewhere
Understanding and citing spatial references - SIRF | Paul Box 4 |
Gazetteer?
• Subjective constructs of geography - e.g. Gazetteer, local government areas, catchment management authority area, ground water management unit
• Layer & feature based approach • Identity (name, code) • Type• Geometry • Topology
• Independent spatial layers + metadata in catalogues (SDI)
Representing location - spatial data
Concept
Relationship
Understanding and citing spatial references - SIRF | Paul Box 5 |
Key issues Dataset • Identity is ambiguous • Topology & geometry conflated
SDI• Duplication - multiple representations• Disconnection – spatial layers disconnected from:
• each other • from other information• Discovery - many catalogues, poor metadata @ dataset level • Change
Understanding and citing spatial references - SIRF | Paul Box 6 |
What’s in a name?
Official Country name lists• United Nations Statistics Division Country and
Region Codes for Statistical Use• United Nations Group of Experts on Geographic
Names - List of Country Names• Department for General Assembly and Conference
Management - Multilingual Terminology Database (UNTERM)
• ISO - ISO 3166: Codes for country names • UN FAO - Global Administrative Unit Layers (GAUL)
Figure 1: Variation rates in spelling for country names between the UN Statistical Division’s Country and Region Codes for Statistical Use and UN datasets from data.un.org
• One place - many names• Australia, Australie, أستراليا• Sydney, Syd, City of Sydney
• One name – many places
Understanding and citing spatial references - SIRF | Paul Box 8 |
One name – many places
Understanding and citing spatial references - SIRF | Paul Box 9 |
‘Leichhardt’
Leichhardt in Australia4 (52) Australian Gazetteer (GA) - NSW, QLD10 (12) ASGS (ABS) - NSW, VIC, QLD
Leichhardt in NSW3 (9) in NSW Place Names search (GNB NSW)
Leichhardt (Sub) NSW
Feature types• National Gazetteer• Suburb • Parish • Homestead
•ASGS • SA2 (NSW, QLD)• SA3 (NSW)• CED (QLD)• IARE (NSW) • State Suburb (VIC, NSW & QLD)• Gazetted locality
•GNB• County • Parish• Suburb (No LGA!)
One spatial object - multiple identities (& representations)
Geospatial information Other information resources
NSW Gaz (GNB NSW) 32679
Spatial Data Infrastructure (SDI)
Understanding and citing spatial references - SIRF | Paul Box 10 |
Geonames 2160386
ASGS (ABS) SSC11351
Nat Gaz (GA) NSW3267
Understanding and citing spatial references - SIRF | Paul Box
Identifiers in spatial data
• Ambiguous • system scoped and non-unique• multiple identifiers for same spatial object• Unstable – not well managed change over time• Not well governed
11 |
Separating geometry and topology
• Geometry used to encode topology• Geometry used for visualization, analysis• Most use cases need topology not geometry:
• Which Jurisdictions contains Leichhardt LGA• Which rivers are upstream of the Hawkesbury? • Geometry is – unreliable, inefficient for representing topology
Understanding and citing spatial references - SIRF | Paul Box 13 |
LGA boundaries & State boundaries
What do we need?• Unambiguously identify spatial objects using URIs• Reliably cite geospatial features e.g.
• Geographic keywords • Identify features used in workflow/model• Cross-reference spatial identifiers to each other
• link multiple identities• topology - touches, contains, adjacent • Link spatial features to stats/obs
Understanding and citing spatial references - SIRF | Paul Box 14 |
Understanding and citing spatial references - SIRF | Paul Box
Spatial Identifier Reference Framework - spatial linked data enabling of SDI
15 |
Geospatial information Statistical information (Implicitly geospatial)
GNB NSW 32679
Spatial Data Infrastructure (SDI)
Understanding and citing spatial references - SIRF | Paul Box 16 |
Geonames 2160386
ABS ASGS SSC11351
GA Aus Gaz NSW3267
Linked spatial data to bridge existing gaps
Spatial IdentifierReference Framework
Index Crosswalk Explore
How SIRF works – Indexing spatial references
SIRFSpatial
Information
User
Linked Data Web
Index identifiers
http://demo.sirf.net/siset/CGNA/NSW81093
http://demo.sirf.net/siset/AUS-FSDF/admin/2
Publish URIs
Understanding and citing spatial references - SIRF | Paul Box 17 |
(National) Spatial Data Infrastructure
owl:sameAs
Understanding and citing spatial references - SIRF | Paul Box
LinksetLinksetSISetSISet SISetSISet
Relationships between spatial objects
Australian National Gazetteer
FSDF Admin Boundary
‘Cross-walks’ between SISets
• Centralise the problem• Machine + manual processing• Demand driven• Cross-walks treated as first class citizens• Well managed and shareable
19 |
Understanding and citing spatial references - SIRF | Paul Box
http://id.sirf.net/siset/SA_SUBURBS/501001- SA suburb - SA gazetteer- National gazetteer
SIRF data network
20 |
SIRF - linking between productsANZ Foundation Spatial Data Framework
Understanding and citing spatial references - SIRF | Paul Box 21 |
OWL SameAs
NSW
Albury(LGA)
TowongLGA
VICGeo:sf-adjacent
Geo:sf-contains
Geo:sf-contains
Geo:sf-adjacent
Geo:sf-contains
FSDF Admin Boundary Product
Leichhardt (LGA)
S. Metro Region
Albert Park District
Geo:sf-contains
Geo:sf-contains
FSDF Place Name Product
Geo:sf-contains
NSW
Geo:sf-contains
Leichhardt
LambertPark Geo:sf-contains
OWL SameAs
Cross-walks registered in SIRF
Data set indexed in SIRF
Data set indexed in SIRF
Understanding and citing spatial references - SIRF | Paul Box
Identifier patterns
Collection http://demo.sirf.net/siset/CGNA/Instance http://demo.sirf.net/siset/CGNA/AHO34899 concept of Sydney
http://demo.sirf.net/siset/CGNA/AHO34899?_version=1.1.0 – representation of the concept at a snapshot in time
Alt ID patternshttp://environment.data.gov.au/water/id/catchment/100862
22 |
Linked data / Internet of th
ings – BOF
Key messages
• Different to Wikipedia/Geonames • NOT creating a new unauthoritative dataset • Building set of indexes to underlying authoritative spatial data -
with provenance - source metadata• Reliable unambiguous URI to reference to spatial object (concept
or representation) • Linked data to express identity of and linkage between spatial
objects• Bridge between native GIS representations and other data across
the web• URIs – pattern and content governance
Understanding and citing spatial references - SIRF | Paul Box 23 |
Paul Box
t: +61 2 93253122e: [email protected]: www.csiro.au
DIGITAL PRODUCTIVITY FLAGSHIP
Thank youCSIRO Digital Productivity Flagship
SIRF OpenSearch http://unsdi.csiro.au/UNSDIOpenSearch/searchSIRF API doco http://www.sirf.net