RDAP14: David Van Riper of Terra Populus
description
Transcript of RDAP14: David Van Riper of Terra Populus
David Van Riper
RDAP14, San Diego
March 27, 2014
TerraPop Goals
Lower barriers to conducting interdisciplinary human-environment interactions research by making data with different formats from different scientific domains easily interoperable
Provide an organizational and technical framework to preserve, integrate, disseminate, and analyze global-scale spatiotemporal data describing population and the environment.
TerraPop in ContextCollaborating Organizations
• Data integration expertise• Large census and survey
data collections & expertise• Institutional foundation
• Human-environment interactions research expertise
• Environmentally-oriented data collections & expertise
• Data preservation and sustainability expertise
• Social science data collections & expertise
• Major producers and distributors of data on both humans and their environment • Major producers of tools for integrating and transforming data across formats• Leaders in preservation and sustainability
TerraPop in ContextDataNet Cyberinfrastructure
Curated population and environment data collection
Exposed through DataONE, SEAD
Extracts exportable to DFC
Integration services Potentially available through
DFC, SEAD Open source components and
API
• T W O D O M A I N S : P O P U L AT I O N & E N V I R O N M E N T• T H R E E D ATA S T R U C T U R E S
• Microdata• Area-level data• Rasters
Source Data
Making disparate data formats interoperable
Microdata: Characteristics of individuals and households
Area-level data: Characteristics of places defined by boundaries
Raster data: Values tied to spatial coordinates
M I C R O D ATA A R E A - L E V E L R A S T E R
Location-Based Integration
Location-Based IntegrationMicrodata
Area-level dataRasters
Mix and match variables originating in
any of the data structures
Obtain output in the data structure most
useful to you
Location-Based Integration
Individuals and households with their environmental
and social context
Microdata
Area-level dataRasters
Location-Based Integration
Summarized environmental
and population
Microdata
Area-level dataRasters
County IDG17003100001G17003100002G17003100003G17003100004G17003100005G17003100006G17003100007
County IDMean Ann. Temp.
Max. Ann. Precip.
G17003100001 21.2 768G17003100002 23.4 589G17003100003 24.3 867G17003100004 21.5 943G17003100005 24.1 867G17003100006 24.4 697G17003100007 25.6 701
County IDMean Ann. Temp.
Max. Ann. Precip.
Rent, Rural
Rent, Urban
Own, Rural
Own, Urban
G17003100001 21.2 768 3129 1063 637 365G17003100002 23.4 589 2949 1075 1469 717G17003100003 24.3 867 3418 1589 1108 617G17003100004 21.5 943 1882 425 202 142G17003100005 24.1 867 2416 572 426 197G17003100006 24.4 697 2560 934 950 563G17003100007 25.6 701 2126 653 321 215
characteristics for administrative
districts
Location-Based Integration
Rasters of population and environment data
Microdata
Area-level dataRasters
Future Plans
Collection Expansion
Add more census/survey data Add more global environmental dataAdd more GIS mapping filesDevelop data curation protocol(s) for adding
additional data to TerraPopDevelop mechanisms for scholars to deposit
data/metadata into TerraPop Allow others’ data to be integrated with other TerraPop data
Upcoming Activities
Beta system public release – soon! User interface redesign for Fall 2014
Conference workshops IASSIST – Toronto – June 3, 2014 Population Association of America – Boston – April 30, 2014 More to come in last 2014/early 2015
Become a DataONE member nodeContinue working with SEAD and DFC on
potential collaborations
Keep in touch with TerraPop
-Website: http://www.terrapop.org
-Data access system: http://beta.terrapop.org
-Twitter: @TerraPopulus
-Email: David Van Riper – [email protected]