Overview

19

description

Overview. Earth System Grid Grid-enabled OPeNDAP Architecture - Server and Application access Framework experience Summary Plans for the coming year. Earth System Grid Overview. - PowerPoint PPT Presentation

Transcript of Overview

Fox ([email protected])

2

AISRP April 4-6, 2005

Earth System Grid

Grid-enabled OPeNDAP

Architecture - Server and Application access

Framework experience

Summary

Plans for the coming year

Overview

Fox ([email protected])

3

AISRP April 4-6, 2005

The goal of ESG is to make climate data – particularly climate model data – an easily accessible community resource. The project is funded by the SciDAC program: Scientific Discovery through Advanced Computing.

Enabling researchers to understand and make effective use of very large, distributed climate datasets is critical. The broad strategy is to develop a collection of server-side capabilities – minimize the amount of data movement.

Multiple interfaces to ESG will allow researchers to focus on science rather than issues of data transfer, format, and data set manipulation.

Foundation is Globus Grid technology

Earth System Grid Overview

Fox ([email protected])

4

AISRP April 4-6, 2005

Earth System Grid Portal

Fox ([email protected])

5

AISRP April 4-6, 2005

ESG: U.S. Collaborations & Development

ORNL: Climate storage &computational resources

ORNL: Climate storage &computational resources

LANL: Next generationcoupled models & computing

LANL: Next generationcoupled models & computing

ANL: Computational grids,& grid-based applications

ANL: Computational grids,& grid-based applications

USC/ISI: Computational grids,& grid-based applications

USC/ISI: Computational grids,& grid-based applications

NCAR: Climate changepredication and scenarios

NCAR: Climate changepredication and scenarios

LBNL: Climate storage facility

LBNL: Climate storage facility

LLNL: Model diagnostics& inter-comparison

LLNL: Model diagnostics& inter-comparison

Fox ([email protected])

7

AISRP April 4-6, 2005

ESG: ESG-II Architecture

Fox ([email protected])

9

AISRP April 4-6, 2005

NCAR

LBNL

LLNL

ISI

ANL

ORNL

GSI

GSIGSIGSI

GSI

GSI CAS server

CAS client

CAS client

CAS client

MyProxy client MyProxy server

TOMCAT

SECURITY services

GRAM

METADATA services

FRAMEWORK services

Auth metadata

RLSMySQL

RLSMySQL

RLSMySQL

RLSMySQL

NERSCHPSS

NCAR MSS

DISK

DISK

DISK

DISKORNLHPSS

DATA storage

The Earth System Grid

THREDDS catalogs Xindice

XindiceMySQL OGSA-DAISMCS

TRANSPORT services

gridFTP server/client

gridFTP server/client

gridFTP server/clientgridFTP server/client

HRM

HRM

HRMHRM

openDAPg server

openDAPg server

ANALYSIS & VIZ services

NCL openDAPg client LAS server

CDAT openDAPg client

MONITORING services

SLAMON daemon

SLAMON daemon

TOMCAT

AXIS

Fox ([email protected])

10

AISRP April 4-6, 2005

Metadata-centric view of ESG services

METADATASERVICES

METADATASERVICES

USER AUTHENTICATIONAND AUTHORIZATION

USER AUTHENTICATIONAND AUTHORIZATION

ACCESS AND AUTHORIZATION

METADATA

DATA TRANSPORTDATA TRANSPORT

LOCATIONMETADATA

SYSTEM MONITORINGAND CONTROL

SYSTEM MONITORINGAND CONTROL

LOGGINGMETADATA

DATA SEARCH & DISCOVERYDATA SEARCH & DISCOVERY

CONTENT METADATA

ANNOTATION & HISTORYMETADATA

DATA ANALYSIS & VISUALIZATION

DATA ANALYSIS & VISUALIZATION

AGGREGATION METADATA

DATA BROWSINGDATA BROWSING

CATALOGUINGMETADATA

Fox ([email protected])

11

AISRP April 4-6, 2005

OPeNDAP and Grid systems

DODS since ~ 1995 was based on http and cgi-style architecture

Two concernsApplication support and performance of HTTPHousekeeping abilities of cgi architecture

Solution: evolve OPeNDAP, the discipline neutral aspect of DODS

Fox ([email protected])

12

AISRP April 4-6, 2005

OPeNDAP ctd.

Data transport protocol and access protocol separated

Revised server architecture Address Grid-style authentication Memory management Exception handling All these changes and retain interoperation with

HTTP and cgi Advanced requirements: URL should support

more than one dataset, or object, i.e. aggregation

Fox ([email protected])

13

AISRP April 4-6, 2005

OPeNDAP 3.x vs OPeNDAP-g Architecture

• Simple and easy to install• One CGI process per

URL request• Limited memory

management – external• Limited scalability• Limited status reporting to

web server• Returns data stream from

one format

• Standalone server or httpd module

• Can manage multiple daemon processes

• Strong memory management – internal

• Reuse processes, scales• Coupled to OPeNDAP

server for status• Returns multiple formats

in a single stream, multiple protocols

Fox ([email protected])

14

AISRP April 4-6, 2005

Fox ([email protected])

15

AISRP April 4-6, 2005

Application development

Fox ([email protected])

16

AISRP April 4-6, 2005

Status

Operational/production release of standalone OPeNDAP server (no dependence on web server) for ESG

Run OPeNDAP server as a client to GridFTP or HTTP server Multi-protocol support: file, http, GridFTP, ftp, etc. File format support: netCDF, CDF, FITS, CEDAR, … Re-architected for aggregation support and performance Portal application client in production, netCDF client operational Authentication is handled outside OPeNDAP server framework URL syntax is more complex but more expressive Will become part of community OPeNDAP release very soon

Fox ([email protected])

17

AISRP April 4-6, 2005

ESG: Framework experience

ESG is a highly collaborative effort allowing users to quickly access data (petabytes of raw or processed data in an application independent manner).

Payoffs of this distributed collaborative infrastructure have included: Distributed data-sharing, RLS works! SRM/HRM work! OPeNDAP-g

works! Simplified data discovery of climate data, the work on metadata paid off!

Scalability? Large-scale climate data processing and analysis via highly integrated

portal Increased collaboration among climate research scientists, people use

it! Aid in climate assessments and estimates of future climate variability

and trends, IPCC!

Fox ([email protected])

18

AISRP April 4-6, 2005

ESG: Framework experience

Transport - GridFTP versus HTTP Server to server Very good performance Depends on a very specific version of GRIDftp server (stripped) Clients are not as capable due to ‘weight’ of globus, revert to

HTTP Scalability and response times (data AND metadata)

Framework architecture supports re-layered for tuning Service monitoring

to support the distributed collaborative infrastructure need lots or all services to really make a production environment

work Try out ESG by visiting the website at: http://www.

earthsystemgrid.org

Fox ([email protected])

19

AISRP April 4-6, 2005

Success?

Users are generally happy, developers are very happy Exploited new technology components

Integration - when and how does it work and scale? XML -> SQL DODS -> OPeNDAP and OPeNDAP-g

Globus provides a suite of framework components, some are easier to integrate than others, some just don’t fit our use-cases and architecture

Data framework - e.g. OPeNDAP has been extremely successful

Carrying this to space science (solar-terrestrial)

Fox ([email protected])

23

AISRP April 4-6, 2005

Summary

Basic success in both data systems and data frameworks

Satisfying user and sponsor needs (from ‘just’ to ‘outstanding’)

Experience with Globus ranges from very good, to not ready for our need

Experience with OPeNDAP is very good, esp. with core services

Scalability and performance require an adaptable architecture which is something system-level interfaces can still hide from the user

Challenge - to bring these attributes to a framework, i.e. in which the user is more exposed

Fox ([email protected])

24

AISRP April 4-6, 2005

Plans

IDL application level access to new OPeNDAP server framework

Outreach to NASA communities/data centers to install and test new capabilities (server and client)

Joint development of accompanying semantic catalogs for Sun-Earth Connection datasets within the OPeNDAP framework

SPDML-enabled OPeNDAP server