1
A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov
Brand NiemannSenior Enterprise Architect, US EPA
April 21, 2009PARS 2009 Mid-Year with Sanjib Chaki,
Acting Branch Chief, ITSPB, MISD, OTOP, Office of the CIO
2
Introduction• Senior Enterprise Architect
– http://epaenterprisearchitecture.wik.is/ (closed)• Web 2.0 Work Group, Web 2.0 for E-Rulemaking Work Group, Web
2.0/3.0 Pilots for CREM, GEOSS, Ontology for the National Map, etc.– http://geolob.wik.is/ (open)
• Interagency Working Group on Digital Data– http://gcn.com/articles/2009/03/25/nstc-report-digital-data.aspx?s=gcnd
aily_260309
• Semantic Community– http://semanticommunity.net
• Semantic Technology Conference Program Advisory Board– http://www.semantic-conference.com/
• World – Wide Web Consortium Member (Advisory Committee)– http://www.w3.org
3
Recent Activities• April 10, 2009: Architecture & Infrastructure Committee: Governance
Subcommittee Meeting Report on Compilation of DRM Comments by the DRM Community of Practice (Brand Niemann and Rick Murphy).
• April 15, 2009: Brainstorming Session with PPC and BAH on Target Data Architecture for US EPA.
• April 16, 2009: Suggestions for the Federal Enterprise Architecture Program at the ArchitecturePlus Seminar.
• April 16, 2009: Participation in the Interagency Working Group on Digital Data Report Preparation for the Whitehouse/OSTP May 21st.
• April 16, 2009: Invitation to Comment on Electronic State of the Environment Report (eROE) (Did a Web 2.0/3.0 version earlier as part of PARS 2008 and now as part of the John Shirey Semantic Web Pilot with Intervise).
• April 17, 2009: Participation in the World Bank – OASIS Global Community of Practice Workshop on Web 2.0/3.0 for E-Government.
• April 28, 2009: Invited to participate in the EPA Taxonomy Discussion at the EPA Web Work Group Conference.
4
DRM 3.0 Architecture Suggestions Highlights
• Data Description:– Uniform Resource Identifiers (URI)
• Data Context:– Taxonomy/Ontology:
• Information: Topic and Subtopic• Data: Data Table and Data Elements• Information and Data Modeling: Build on David Hay’s Work
• Data Sharing:– Data and Metadata “Travel Together”
Note: These have been implemented in a Web 2.0 Wiki at http://federaldata.wik.is
5
Target Data Architecture for US EPA
Role Request DRM 3.0 and Data.gov
Senior Advisor Value-added Data Yes, Use Statistical Abstract
Chief Architect SOA Data Services Layer
Yes, Use Web 2.0 Wiki with WOA
Data Architect Data Model Yes, Use David Hay Start
Data Standards Ontologies Yes, Use Kevin Keck Pilot
Information Architect Semantic Web Yes, Open Linked Data with RDF
6
Suggestions for the Federal Enterprise Architecture Program
Item Suggestion Action
FEA Segment Architecture Methodology (FSAM)
Decompose Segments into Patterns
7th
SOA for E-Gov Conference
(see next slide)
Recovery.gov Public wants raw data and tools
Web 2.0/3.0 pilots initiated
Data.gov Use Statistical Abstract Staff
Previously collaborated
7
Patterns
• SOA Patterns:SOA Patterns:– A proven solution to a common problem A proven solution to a common problem
individually documented in a consistent format individually documented in a consistent format and usually as part of a larger collection.and usually as part of a larger collection.• See See http://
www.soaglossary.com/design_pattern.asp
– Both SOA Design Patters and the NCOIC Both SOA Design Patters and the NCOIC Services WG have standard templates:Services WG have standard templates:• http://www.soapatterns.com/ • http://networkcentricity.wik.is/Practical_Guidance_f
or_Net-Centric_Patterns_Developers
8
Suggestions for the FEA Data.gov Program
Data.gov epadata.wik.is* CommentsCIO proposes Open to all Supports
transparency, openness, & collaboration.
Raw Value-Added Should do both.
FGDC Metadata Standard
Statistical Abstract Table Metadata
Much simpler-only 7 elements versus many more.
GSA Cloud Computing
Mindtouch/Amazon Web 2.0/3.0 Cloud Computing
Free or very low cost.
*Also epametadata.wik.is
9
Interagency Working Group on Digital Data
• The National Science and Technology Council (NSTC) released a report describing a strategy to promote preservation and access to digital scientific data. The report, Harnessing the Power of Digital Data for Science and Society, was produced by the NSTC's Committee on Science under the auspices of the Office of Science and Technology Policy (OSTP) in the Executive Office of the President by the IWGDD.– See
http://federaldata.wik.is/Interagency_Working_Group_on_Digitial_Data
10
IWGDD Report
• Interagency Working Group on Digital Data, May 21st Report Outline (Draft):– Subgroup on Agency Science Data Policies:
• Executive Summary• Preamble• Guiding Principles and Concepts• State of the Art• Key Components• Examples and Best Practices• Recommendations• Appendix A: Terminology and Definitions of Terms
– Topic Tree Being Developed and Implement in a Web 2.0 Wiki by David Wojick and Brand Niemann
11
IWGDD Report
• Subgroup on Agency Science Data Policies:– Compilation of Agency Data Management Polices and Plans:
• See http://federaldata.wik.is/Interagency_Working_Group_on_Digitial_Data/Agency_Data_Policies_and_Management_Plans
– Also see my EPA and Interagency work at http://federaldata.wik.is/Interagency_Working_Group_on_Digitial_Data/Agency_Data_Policies_and_Management_Plans#Documents_of_Relevance_or_Interest
– Appendix A: Terminology and Definitions of Terms• Topic Tree Being Developed and Implement in a Web 2.0 Wiki by
David Wojick and Brand Niemann (see next slide).• Also Suggest Using Dictionary of Data Management published in
conjunction with The DAMA-DMBOK Guide in 2009 (800 terms defined).
– See http://www.dama.org
12
Appendix A: Terminology and Definitions of Terms
http://federaldata.wik.is/Interagency_Working_Group_on_Digitial_Data/Policy_Issue_Tree
13
IWGDD Report
• Interagency Working Group on Digital Data, May 21st Report Outline (Draft):– Subgroup on Agency Data Management Plans:
• Executive Summary• Preamble• The Landscape of Data Management Planning• State of the Art• Examples and Best Practices• Recommendations• Appendix A: Sample Data Management Plans
14
IWGDD Report• Subgroup on Agency Data Management
Plans:– Section 3.1. Motivations (laws, policies,
interests)• See slide 15.
– 3.2 Objectives• See slide 5.
– 3.3 Standards (e.g. DRM, etc.)• See slide 4.
– 5.1 Models for Best Practices• See slide 16.
15
IWGDD ReportTopics Relationships Examples
Laws are interpreted as …
Clinger-Cohen (1996)
Policies which are implemented by …
Federal Enterprise Architecture Data Reference Model 3.0
Governance that is expressed in …
Federal CIO Vivek Kundra
Plans that produce results.
Data.gov
16
IWGDD Report
• Subgroup on Agency Data Management Plans:– Data Management Association Data Management Book of
Knowledge (DAMA DM BoK) 10 Functions:• Data Governance• Data Architecture Governance (see David Loshin book on Master
Data Management, page 15-21)• Data Development• Database Operations Management• Data Security Management• Reference and Master Data Management• Data Warehousing & BI Management• Document and Content Management (Section I reviewed and
recommended use of Web 2.0 Wiki)• Metadata Management• Data Quality Management
See http://edw2009.wilshireconferences.com/uploads/handouts/TUE_1015_Mosley_Mark_Henderson_Deborah_COlor_1216.pdf
Top Related