Data Grid Management Systems: Standard API - community development

12
© 2007 Open Grid Forum Data Grid Management Systems: Standard API - community development Arun Jagatheesan, San Diego Supercomputer Center & iRODS.org June, 2008 Barcelona, Spain

description

Data Grid Management Systems: Standard API - community development. Arun Jagatheesan, San Diego Supercomputer Center & iRODS.org. June, 2008 Barcelona, Spain. OGF IPR Policies Apply. “ I acknowledge that participation in this meeting is subject to the OGF Intellectual Property Policy. ” - PowerPoint PPT Presentation

Transcript of Data Grid Management Systems: Standard API - community development

Page 1: Data Grid Management Systems:  Standard API  - community development

© 2007 Open Grid Forum

Data Grid Management Systems: Standard API - community development

Arun Jagatheesan, San Diego Supercomputer Center & iRODS.org

June, 2008Barcelona, Spain

Page 2: Data Grid Management Systems:  Standard API  - community development

2© 2007 Open Grid Forum

OGF IPR Policies Apply

• “I acknowledge that participation in this meeting is subject to the OGF Intellectual Property Policy.”• Intellectual Property Notices Note Well: All statements related to the activities of the OGF and addressed to

the OGF are subject to all provisions of Appendix B of GFD-C.1, which grants to the OGF and its participants certain licenses and rights in such statements. Such statements include verbal statements in OGF meetings, as well as written and electronic communications made at any time or place, which are addressed to:

• the OGF plenary session, • any OGF working group or portion thereof, • the OGF Board of Directors, the GFSG, or any member thereof on behalf of the OGF, • the ADCOM, or any member thereof on behalf of the ADCOM, • any OGF mailing list, including any group list, or any other list functioning under OGF auspices, • the OGF Editor or the document authoring and review process

• Statements made outside of a OGF meeting, mailing list or other function, that are clearly not intended to be input to an OGF activity, group or function, are not subject to these provisions.

• Excerpt from Appendix B of GFD-C.1: ”Where the OGF knows of rights, or claimed rights, the OGF secretariat shall attempt to obtain from the claimant of such rights, a written assurance that upon approval by the GFSG of the relevant OGF document(s), any party will be able to obtain the right to implement, use and distribute the technology or works when implementing, using or distributing technology based upon the specific specification(s) under openly specified, reasonable, non-discriminatory terms. The working group or research group proposing the use of the technology with respect to which the proprietary rights are claimed may assist the OGF secretariat in this effort. The results of this procedure shall not affect advancement of document, except that the GFSG may defer approval where a delay may facilitate the obtaining of such assurances. The results will, however, be recorded by the OGF Secretariat, and made available. The GFSG may also direct that a summary of the results be included in any GFD published containing the specification.”

• OGF Intellectual Property Policies are adapted from the IETF Intellectual Property Policies that support the Internet Standards Process.

Page 3: Data Grid Management Systems:  Standard API  - community development

3© 2007 Open Grid Forum

Agenda

• Vision and Mission (Re-focusing)• Technical Background

• Reference Architecture • Concepts that need to be in Data Grids• Standard APIs

• WG initiation process Questions• Charter, 7 Questions• Software providers perspective (iRODS.org, Sybase)• Software users perspective

• Next steps

Page 4: Data Grid Management Systems:  Standard API  - community development

4© 2007 Open Grid Forum

Vision & Mission

• Data Grid Management Systems (DGMS)• Solve collaborative data management problems in

industry and academia that deal with distributed data management

• Standard Interface (OGF / SNIA / IETF ??)• Single interface for users, vendors and solution

providers/integrators

• Open-source software standard community• Open source software community effort along with

other organizations

Page 5: Data Grid Management Systems:  Standard API  - community development

5© 2007 Open Grid Forum

Data Grid or Data Cloud Reference Architecture (Vendor)

Data Grid or Data Cloud

Amazon

Storage Resource Interface

Storage Resource Interface

Page 6: Data Grid Management Systems:  Standard API  - community development

6© 2007 Open Grid Forum

Data Grid or Data Cloud Reference Architecture

Data Grid or Data Cloud

Amazon

Storage Resource Interface

Storage Resource Interface

Client Interface Storage Resource Interface

Page 7: Data Grid Management Systems:  Standard API  - community development

7© 2007 Open Grid Forum

Interfaces to be standardized

• Storage Resource Interface• Allow resources to be plugged into the grid• P2P communication amongst resources• Protocol used by middleware to communicate with

resources

• Data Grid Client Interface• Collaborative logical namespace of multiple file

storage systems• Client to manage, discover, and access file data• Control file’s life-cycle and the infrastructure usage

What should these interfaces do? - Next slideWhat should these interfaces do? - Next slide

Page 8: Data Grid Management Systems:  Standard API  - community development

8© 2007 Open Grid Forum

What should the interfaces do?

• Gateway to implement concepts• Provide Shared Collections as a logical

namespace • Provide Shared resources as an

infrastructure• Common functions necessary to discover,

manage and access files and their life-cycle.

What are these concepts? - Next slideWhat are these concepts? - Next slide

Page 9: Data Grid Management Systems:  Standard API  - community development

9© 2007 Open Grid Forum

Core Concepts of DGMS

• Logical shared collections • Logical shared resources• Logical global policies with local

autonomy• Collaborative communities

Page 10: Data Grid Management Systems:  Standard API  - community development

10© 2007 Open Grid Forum

Logical shared resource

• Identified in the entire system with a logical name (human readable)

• Not associated with the logical name (no mount point).

• Shared as a storage resource pool based on local and global policies

• Allows basic management operations to a trusted data grid management middleware

Page 11: Data Grid Management Systems:  Standard API  - community development

11© 2007 Open Grid Forum

Logical shared collection

• Similar to directory in file systems• Contains logical identifiers (logical file name) to files

that are distributed in heterogeneous data storage systems

• The same collection can have files from different locations. Each logical file could have multiple physical copies

• Meta data, data access etc controlled by the community (partners)

• Apart from data, data grid collection could also include behavior (services to manipulate data) as part of the collection.

Page 12: Data Grid Management Systems:  Standard API  - community development

12© 2007 Open Grid Forum

Next Steps

• WG initiation• Charter discussion• Deliverables discussion• The 7 Questions• Relevance and inter-operation with SNIA, IETF??

• Design and Development • Based on core data grid concepts• Open source software design and development

• Open questions• Chairs? Active Participants? Volunteers?