1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior...

16
1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009 Mid-Year with Sanjib Chaki, Acting Branch Chief, ITSPB, MISD, OTOP, Office of the CIO

Transcript of 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior...

Page 1: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

1

A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov

Brand NiemannSenior Enterprise Architect, US EPA

April 21, 2009PARS 2009 Mid-Year with Sanjib Chaki,

Acting Branch Chief, ITSPB, MISD, OTOP, Office of the CIO

Page 2: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

2

Introduction• Senior Enterprise Architect

– http://epaenterprisearchitecture.wik.is/ (closed)• Web 2.0 Work Group, Web 2.0 for E-Rulemaking Work Group, Web

2.0/3.0 Pilots for CREM, GEOSS, Ontology for the National Map, etc.– http://geolob.wik.is/ (open)

• Interagency Working Group on Digital Data– http://gcn.com/articles/2009/03/25/nstc-report-digital-data.aspx?s=gcnd

aily_260309

• Semantic Community– http://semanticommunity.net

• Semantic Technology Conference Program Advisory Board– http://www.semantic-conference.com/

• World – Wide Web Consortium Member (Advisory Committee)– http://www.w3.org

Page 3: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

3

Recent Activities• April 10, 2009: Architecture & Infrastructure Committee: Governance

Subcommittee Meeting Report on Compilation of DRM Comments by the DRM Community of Practice (Brand Niemann and Rick Murphy).

• April 15, 2009: Brainstorming Session with PPC and BAH on Target Data Architecture for US EPA.

• April 16, 2009: Suggestions for the Federal Enterprise Architecture Program at the ArchitecturePlus Seminar.

• April 16, 2009: Participation in the Interagency Working Group on Digital Data Report Preparation for the Whitehouse/OSTP May 21st.

• April 16, 2009: Invitation to Comment on Electronic State of the Environment Report (eROE) (Did a Web 2.0/3.0 version earlier as part of PARS 2008 and now as part of the John Shirey Semantic Web Pilot with Intervise).

• April 17, 2009: Participation in the World Bank – OASIS Global Community of Practice Workshop on Web 2.0/3.0 for E-Government.

• April 28, 2009: Invited to participate in the EPA Taxonomy Discussion at the EPA Web Work Group Conference.

Page 4: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

4

DRM 3.0 Architecture Suggestions Highlights

• Data Description:– Uniform Resource Identifiers (URI)

• Data Context:– Taxonomy/Ontology:

• Information: Topic and Subtopic• Data: Data Table and Data Elements• Information and Data Modeling: Build on David Hay’s Work

• Data Sharing:– Data and Metadata “Travel Together”

Note: These have been implemented in a Web 2.0 Wiki at http://federaldata.wik.is

Page 5: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

5

Target Data Architecture for US EPA

Role Request DRM 3.0 and Data.gov

Senior Advisor Value-added Data Yes, Use Statistical Abstract

Chief Architect SOA Data Services Layer

Yes, Use Web 2.0 Wiki with WOA

Data Architect Data Model Yes, Use David Hay Start

Data Standards Ontologies Yes, Use Kevin Keck Pilot

Information Architect Semantic Web Yes, Open Linked Data with RDF

Page 6: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

6

Suggestions for the Federal Enterprise Architecture Program

Item Suggestion Action

FEA Segment Architecture Methodology (FSAM)

Decompose Segments into Patterns

7th

SOA for E-Gov Conference

(see next slide)

Recovery.gov Public wants raw data and tools

Web 2.0/3.0 pilots initiated

Data.gov Use Statistical Abstract Staff

Previously collaborated

Page 7: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

7

Patterns

• SOA Patterns:SOA Patterns:– A proven solution to a common problem A proven solution to a common problem

individually documented in a consistent format individually documented in a consistent format and usually as part of a larger collection.and usually as part of a larger collection.• See See http://

www.soaglossary.com/design_pattern.asp

– Both SOA Design Patters and the NCOIC Both SOA Design Patters and the NCOIC Services WG have standard templates:Services WG have standard templates:• http://www.soapatterns.com/ • http://networkcentricity.wik.is/Practical_Guidance_f

or_Net-Centric_Patterns_Developers

Page 8: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

8

Suggestions for the FEA Data.gov Program

Data.gov epadata.wik.is* CommentsCIO proposes Open to all Supports

transparency, openness, & collaboration.

Raw Value-Added Should do both.

FGDC Metadata Standard

Statistical Abstract Table Metadata

Much simpler-only 7 elements versus many more.

GSA Cloud Computing

Mindtouch/Amazon Web 2.0/3.0 Cloud Computing

Free or very low cost.

*Also epametadata.wik.is

Page 9: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

9

Interagency Working Group on Digital Data

• The National Science and Technology Council (NSTC) released a report describing a strategy to promote preservation and access to digital scientific data. The report, Harnessing the Power of Digital Data for Science and Society, was produced by the NSTC's Committee on Science under the auspices of the Office of Science and Technology Policy (OSTP) in the Executive Office of the President by the IWGDD.– See

http://federaldata.wik.is/Interagency_Working_Group_on_Digitial_Data

Page 10: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

10

IWGDD Report

• Interagency Working Group on Digital Data, May 21st Report Outline (Draft):– Subgroup on Agency Science Data Policies:

• Executive Summary• Preamble• Guiding Principles and Concepts• State of the Art• Key Components• Examples and Best Practices• Recommendations• Appendix A: Terminology and Definitions of Terms

– Topic Tree Being Developed and Implement in a Web 2.0 Wiki by David Wojick and Brand Niemann

Page 11: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

11

IWGDD Report

• Subgroup on Agency Science Data Policies:– Compilation of Agency Data Management Polices and Plans:

• See http://federaldata.wik.is/Interagency_Working_Group_on_Digitial_Data/Agency_Data_Policies_and_Management_Plans

– Also see my EPA and Interagency work at http://federaldata.wik.is/Interagency_Working_Group_on_Digitial_Data/Agency_Data_Policies_and_Management_Plans#Documents_of_Relevance_or_Interest

– Appendix A: Terminology and Definitions of Terms• Topic Tree Being Developed and Implement in a Web 2.0 Wiki by

David Wojick and Brand Niemann (see next slide).• Also Suggest Using Dictionary of Data Management published in

conjunction with The DAMA-DMBOK Guide in 2009 (800 terms defined).

– See http://www.dama.org

Page 12: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

12

Appendix A: Terminology and Definitions of Terms

http://federaldata.wik.is/Interagency_Working_Group_on_Digitial_Data/Policy_Issue_Tree

Page 13: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

13

IWGDD Report

• Interagency Working Group on Digital Data, May 21st Report Outline (Draft):– Subgroup on Agency Data Management Plans:

• Executive Summary• Preamble• The Landscape of Data Management Planning• State of the Art• Examples and Best Practices• Recommendations• Appendix A: Sample Data Management Plans

Page 14: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

14

IWGDD Report• Subgroup on Agency Data Management

Plans:– Section 3.1. Motivations (laws, policies,

interests)• See slide 15.

– 3.2 Objectives• See slide 5.

– 3.3 Standards (e.g. DRM, etc.)• See slide 4.

– 5.1 Models for Best Practices• See slide 16.

Page 15: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

15

IWGDD ReportTopics Relationships Examples

Laws are interpreted as …

Clinger-Cohen (1996)

Policies which are implemented by …

Federal Enterprise Architecture Data Reference Model 3.0

Governance that is expressed in …

Federal CIO Vivek Kundra

Plans that produce results.

Data.gov

Page 16: 1 A Target Data Architecture for the US EPA: Implementing DRM 3.0 and Data.gov Brand Niemann Senior Enterprise Architect, US EPA April 21, 2009 PARS 2009.

16

IWGDD Report

• Subgroup on Agency Data Management Plans:– Data Management Association Data Management Book of

Knowledge (DAMA DM BoK) 10 Functions:• Data Governance• Data Architecture Governance (see David Loshin book on Master

Data Management, page 15-21)• Data Development• Database Operations Management• Data Security Management• Reference and Master Data Management• Data Warehousing & BI Management• Document and Content Management (Section I reviewed and

recommended use of Web 2.0 Wiki)• Metadata Management• Data Quality Management

See http://edw2009.wilshireconferences.com/uploads/handouts/TUE_1015_Mosley_Mark_Henderson_Deborah_COlor_1216.pdf