Data Mining for the Upper Colorado River Basin Climate Effects Network
Briefing for the Upper Colorado River Basin Pilot Update Meeting, February 24, 2011
Provide a “portal” to information about people, projects, publications, and data important to UCRB climate effects research
Add information to a framework that can be sustained by scientists and data managers over the long term
Include direct participation and data from science and resource management partners
Use USGS ScienceBase and related projects already under development
Conduct data mining and information cleanup, and make contacts with scientists and other data providers
Solicit input from stakeholders to refine the system, produce a better tool, and expand capabilities (visualization)
UCRB Footprint
UCRB resources with links to more info (access, abstract)
Spatially located UCRB resources
USGS BASIS+ projects, tasks, or subtasks identified as potential candidates
Over 418 reviewed and connected to UCRB in ScienceBase
Work ongoing to identify sources of other agency projects
Working on process for review and approval of “public project records” in ScienceBase
• UCRB citations harvested into ScienceBase Catalog
• abstract
• authors
• access methods
• original metadata
• Resources cataloged in centralized locale
• Refine information (add spatial footprints)
• Connect information (e.g., see all records with keyword “Utah”)
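The keyword connection described above can be pictured with a small sketch. It assumes a hypothetical in-memory record type (CatalogRecord) and a hypothetical helper (records_with_keyword); neither is part of ScienceBase, and the sample records are made up for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class CatalogRecord:
    """Hypothetical stand-in for a cataloged item; not the ScienceBase schema."""
    title: str
    keywords: list = field(default_factory=list)


def records_with_keyword(records, keyword):
    """Return the records tagged with `keyword`, ignoring case and surrounding spaces."""
    wanted = keyword.strip().lower()
    return [r for r in records if wanted in {k.strip().lower() for k in r.keywords}]


# Illustrative records only; titles and keywords are made up for the example.
catalog = [
    CatalogRecord("Dust deposition on snowpack", ["Utah", "dust", "snow"]),
    CatalogRecord("Land cover change", ["Colorado", "land use"]),
    CatalogRecord("Soil survey summary", ["utah", "soils"]),
]

utah_records = records_with_keyword(catalog, "Utah")
print([r.title for r in utah_records])
```

Matching on normalized keywords means that "Utah" and "utah" land in the same result set, which matters when citations come from several sources with inconsistent tagging.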
3,420 journal and other external citations cataloged and quality checked in Mendeley
Periodically, citations are being harvested from Mendeley and integrated with ScienceBase
546 USGS Numbered Series reports cataloged in ScienceBase
USGS projects reviewed for publication products with items cataloged
Contact Information
See all resources credited to person of interest
Over 100 USGS researchers with UCRB history identified and referenced through ScienceBase
Over 1,000 authors identified through publications record
Working on possibilities and best options for personal profiles and connections
• Add information to Catalog
• Describe information (metadata)
• Share and archive
Describe data using comprehensive fields:
what, where, from, who, when
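One way to picture the five descriptive fields is as a simple record with a completeness check. This is a sketch under stated assumptions: DatasetDescription and missing_fields are hypothetical names, the field meanings in the comments are an interpretation of the slide, and "from" is stored as source because "from" is a reserved word in Python.

```python
from dataclasses import dataclass, asdict


@dataclass
class DatasetDescription:
    """Hypothetical record covering the five descriptive fields named above."""
    what: str    # subject or content of the dataset
    where: str   # geographic extent or place name
    source: str  # the "from" field: originating project or agency
    who: str     # contact or author
    when: str    # time period covered or collection date


def missing_fields(description):
    """List the field names whose values are empty or whitespace only."""
    return [name for name, value in asdict(description).items() if not value.strip()]


# Placeholder values for illustration; not an actual catalog entry.
example = DatasetDescription(
    what="Soil type map",
    where="Upper Colorado River Basin",
    source="",
    who="Example Author",
    when="2010",
)
print(missing_fields(example))  # prints ['source']
```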
Major USGS and other corporate data assets identified and cataloged in ScienceBase
Project datasets under review from BASIS+ record
ScienceBase Uploader will be used to load these into a UCRB Project GeoDatabase
Data mining and identification through partner contacts
Developing ScienceBase to use data, maps, visualizations, and analytical tools to address science questions
UCRB question we are currently pursuing:
How will drought/land use affect dust production and deposition, and how will this affect water supplies?
Requirements to address question:
Cataloged and available data and information
Visualization tools
▪ Online mapper for quick views
▪ Advanced GIS capabilities for geospatial analysts
▪ Derivative product generation and packaging
Data handling capabilities
▪ Complex datasets (MODIS, ArcMap (.mxd), databases)
▪ Data services (GIS services, THREDDS)
Data sharing, archiving, documenting
Task: develop a dust source to deposition map at different scales (plot to regional across time)
Source data: soil type, geology, vegetation, land use, imagery, wind patterns
Deposition data: topography, wind patterns
Study how water supplies are affected during years of low to high dust deposition
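The comparison across low to high dust years could be sketched as a simple grouping step. This is only an illustration under stated assumptions: the function name, the three class labels, and the two cutoff parameters are hypothetical, and the briefing does not specify units, thresholds, or an analysis method.

```python
from statistics import mean


def mean_supply_by_dust_class(years, low_cutoff, high_cutoff):
    """Average a water-supply measure within low, medium, and high dust years.

    `years` is an iterable of (year, dust_deposition, water_supply) tuples.
    A year is "low" when dust_deposition < low_cutoff, "high" when
    dust_deposition >= high_cutoff, and "medium" otherwise. The cutoffs are
    assumptions supplied by the caller, not values from the briefing.
    """
    if low_cutoff > high_cutoff:
        raise ValueError("low_cutoff must not exceed high_cutoff")
    groups = {"low": [], "medium": [], "high": []}
    for _year, dust, supply in years:
        if dust < low_cutoff:
            label = "low"
        elif dust < high_cutoff:
            label = "medium"
        else:
            label = "high"
        groups[label].append(supply)
    # Classes with no years report None rather than a misleading zero.
    return {label: (mean(values) if values else None)
            for label, values in groups.items()}
```

Calling the function with per-year records and two cutoffs returns one average per dust class, which gives a first look before any spatial analysis of source and deposition areas.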
Soils
Land Cover
Imagery
Geology
Snowpack
Potential product
Acquire and integrate disparate datasets
Develop capabilities to catalog and display complex data
Develop Spatial Toolkit – online mapper
Visualize and lay out information
Establish visualization options (transparency)
Produce, share, catalog derivative data products
Plan for maintenance and sustainability
http://ucrb.sciencebase.gov
Natalie Latysh ([email protected]) or Sky Bristol ([email protected])