Virtual Laboratory: Exploring e-Science in CAS CNIC,CAS Jianjun Yu yujj@cnic

download Virtual Laboratory:  Exploring e-Science in CAS CNIC,CAS Jianjun Yu yujj@cnic

If you can't read please download the document

description

Virtual Laboratory: Exploring e-Science in CAS CNIC,CAS Jianjun Yu [email protected]. Outline. Motivation e-Science and Virtual Laboratory Current Work on VLAB Planning in Future. The Definition of e-Science. Three kinds of e-Science: - PowerPoint PPT Presentation

Transcript of Virtual Laboratory: Exploring e-Science in CAS CNIC,CAS Jianjun Yu yujj@cnic

Palette - WP0

European and Chinese Cooperation on Grid

Virtual Laboratory: Exploring e-Science in CAS

CNIC,CASJianjun [email protected]

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Second EchoGrid Workshop Beijing 29 & 30 October 2007
*
Outline
Motivation

e-Science and Virtual Laboratory

Current Work on VLAB

Planning in Future

Second EchoGrid Workshop Beijing 29 & 30 October 2007

The Definition of e-Science
Three kinds of e-Science: Computationally intensive science in highly distributed network environmentsScience that uses immense data sets that require grid computingDistributed collaboration, such as the Access Grid

Examples:social simulations, particle physics, earth sciences, bio-informatics,

Cited from wikipedia
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Why e-Science
Challenges in modern science research (from the view of scientific researchers ) Science problems are more complex than ever

Science research object is not isolated, but cross discipline and large-scale

Science data processing, simulation and computing become indispensable methods

need more and more communication, collaboration, coordination among them closer than ever
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Why e-Science
From the perspective of resources

Great total amount while with limited utilization

Lack of effective usage

Urgently requiring for more capabilities

People need to collaborate sharing their resources and knowledge
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Why e-Science
From the perspective of IT:What researchers needHuge demand for computing technologiesHuge demand for sharing the resource / collaborationWhat providesHigh speed network HPCLarge scale of scientific data How to providee-Science environment Give scientists a more EASY-TO-USE interface to help them using the infrastructure
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

What e-Science do
Provide a research collaborative environmentCombining all resources: including computing, network, data, human,.
Observation &Experiment
Computation & Simulation
Theory Analysis
Communication & Collaboration
Networks
Specimen Library
Database
Documentation
Storage Facility
Computing Facility
Supercomputer Center
Videoconference & On-line Forum
Experiments & Field Stations
Software Tools
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*
Cited from CAS 11th Five-year Informatization Program (2006- 2010)

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Outline
Motivatione-Science and Virtual LaboratoryCurrent Work on VLABPlanning in Future
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

CAS 11th Five-year Informatization Program (2006- 2010)
Continue to develop the infrastructure and existing applicationse-Science FacilityNetworks of field stations/instruments(60), Mobile equip., Digital library of natural resourcese-Science ApplicationsHEP, Astro, Bio, Geo, Chemistry, (about 10 or 16)Resource Integration PlatformSupporting Environment for Applications
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Our vision of e-Science
Virtual Laboratory based e-ScienceAn integrated collaboration environment to support e-ScienceComposed of shared and collaborative hardware, software, data, information, human, .An EASY-TO-USE interface for scientific researchers

A basic form and tool for e-Science activities
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

e-Science VS. e-Science Virtual Labs

e-Science would be applications-driven

Virtual Labs , the key position in our e-Science framework ,the core component to make e-Science a reality

Should emphasize that Virtual Labs dont mean make experiments or simulations online
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*
Cited from CAS 11th Five-year Informatization Program (2006- 2010)

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Virtual Labs Solves
Currently:Infrastructure may be (almost) ready, but e-Science is not yet.many existing resources in place, but just a few could be brought into public. bottleneck may be the gap between products by computer experts and end users of domain scientistsmuch more effort than expected to bridge this gap

Virtual Lab is proposed to bea basic unit of research activity in the e-Science environment a connector between Infrastructure/resources and domain scientiststhe right user interface between scientists and their e-Science environment
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Virtual Labs Goals
With Virtual Labs,

all kinds of resources could be integrated into a single access point

customized and flexible services would be provided according to the specific requirements of different domains in an easier way than ever before

Multidisciplinary, multi-organization collaboration could be carried out online.
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Features of Virtual Labs
Ease of usemuch easier to use than current systems

Resource integrationprovide the user with a single operating environmentmany kinds of resources, such as supercomputers, mass storage facilities, scientific databases, digital libraries, high bandwidth network, scientific equipment, etc. could be accessed in a seamless way.

Customized serviceprovide a user with what he or she wants completely and exactly. Each user may have a specific workbench individually. .

Ubiquitous researchbenefit from state-of-the-art technologies on mobile computing and related so that user could use the Virtual Lab at any time and anywhere.
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Features of Virtual Labs (cont)
Collaborative workenable a lot of scientists, who are from multiple independent institutions, from multiple sites across the world, and from different professional backgrounds, to work together on a collaborative project or a common problem.

Scalabilitysupport hundreds of users from tens of institutions, but should work just as well for three or five users.

Managementinteract with outer management systems, such as ARP in CAS, to help improve efficiency during the whole lifetime of a research project or other research activities.
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Virtual Labs Goals in 2006- 2010
Provide a virtual lab environmentScientific Virtual Organization management Scientists can access, organize and manage resource easily and transparently Integrate resources from CAS, such as computing, storage, database, libraryDocument share and collaborationOrganize Internet-based activities, such as project review, conferenceAt least three e-Science applications on Virtual LabsBiologyAstronomyHigh Energy Physics...
*
Second EchoGrid Workshop Beijing 29 & 30 October 2007

Second EchoGrid Workshop Beijing 29 & 30 October 2007

VLAB Projects
Exploration of e-Science Virtual Lab with our vision

A product under Virtual Lab concept

We focus more on how to develop an e-Science environment for real scientific application
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Architecture of VLAB
VO management
Document Collaboration
Activity Collaboration
Core toolkit
Resources & services
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

New technologies Advancing VLAB
SOAGrid computingWeb ServicesScientific workflowPortal (WSRP)
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Research/standards go ahead
Put forward theory, technology framework for VLAB project

Solve key technologies

Provide the guide of VLAB project

Publish Universal standards to integrate different kinds of resources
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

The core components of VLAB
Virtual Workbench

Core Toolkits

e-Science Security Infrastructure

Virtual Labs services
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Virtual Workbench
A universal portal for e-Science activities

Open, scalable, flexible integrated platform for different resources

Support flexible requirements from scientists

Support component reuse

Assure usable and accessible
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Virtual Workbench (cont)
Application centered portlet can be plugged into workbenchNow support :Resource addinCalendar management Instance messageDocument ShareVirtual CommunityMail subscribeapplication can be capsulated as portlet and added into the virtual workbench
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Core Toolkits
Virtual organization management tool

Document collaboration tool

Activity organization tool
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

e-Science Security Infrastructure
Construct a PKI based security infrastructureCAS e-Science CABe authorized by APGrid PMA, 2006Be trusted by IGTFBased on OpenCA package
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Virtual Labs Services
Work and develop with the front-line scientists

Applied to real scientific application

Biology

Astronomy

High energy physics

.
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Current Work on VLAB
SDG(Scientific Data Grid)Data processing centered e-Science

Virtual Lab Core Toolkit

Document Share Tool

VO Management Tool

Activity Management Tool

AVLAB (Astronomical Virtual Lab)e-Science application


Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Scientific Data Grid
Introduction of Scientific DatabaseSDB is a long-term project since 1983, in which there are multi-disciplinary scientific data accumulated through the course of science activities in CAS.many institutes involvedlong-term, large-scale collaborationdata from research, for research

Aimed at:connecting massive data resources in Scientific Databaserealizing effective sharing of those geographically distributed,heterogeneous and autonomousdata resources via grid computing technology, especially data grid technology
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

SDB status 45 institutes across 16 cities 503 databases 16.6TB total volume

Second EchoGrid Workshop Beijing 29 & 30 October 2007

What SDG provides
DASData Access ServiceIMSGrid Information ServiceSecurity InfrastructureStorage ServiceSDG PortalSDG Toolkit
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Scientific Database
Scientific Data Grid Middleware
Scientific Data Grid Applications
Virtual Observatory Grid
High Energy Physics Grid
Avian Flu Alert System
Other Grids
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

DAS
Goal:Access over forty scientific databases in CASIntegrate immense scientific data setsProvide universal interfaces for researchersFunctionsMetadata extraction Database schema mappingBlock data set GT4 compatible
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

SDG Portal
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Document Share Tool
FeatureShare files for VO teamsUser-labeled File tag for more exact classificationDocument Search engineAutomatic summary generation ACL or RBAC based privilege controlDucklingWiki based document publishing, editingA web-based Document share portalCLB: Document Sharing toolsFile upload, lock, unlock, updateREST APICLB-U: Document share clientWindows shell applicationOffice Add-in(support office xp/2003/2007)
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Duckling -Wiki
*
Second EchoGrid Workshop Beijing 29 & 30 October 2007

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Duckling-file upload/update
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Ducking-Search
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Duckling-Tag
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

VO Management Tool
UMTHierarchical Directed Acyclic Graph based modelGroup can has subgroups, and subgroups can has multiple parent groupMembers of one group would inherit the roles and privileges from its parentsACL or RBAC based resource access control
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

UMT
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Road Map
To develop Virtual Lab version 1.0Popularize the VLAB environment
2006.12 Vlab framworkdesign
2006.12 Vlab proposal
Vlab applications
ProteinAstronomy
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Planning in Future
Select typical science areas to deploy Virtual LabBiologyAstronomyHigh Energy Physics...

step by step, case by case, project by project and worldwide cooperation to actualize the e-Science of CAS

Need more international cooperations!


Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Summary
e-Science or science researches through cyberinfrastructure will be one of the main goals of CNIC,CAS in the next five years

e-Science need more international collaborations on cyberinfrastructure and e-Science applications

Merging scientific domain and IT, not only in IT technology and scientific knowledge, but also in human, e.g. e-scientist
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

Thanks!
Second EchoGrid Workshop Beijing 29 & 30 October 2007
*

Second EchoGrid Workshop Beijing 29 & 30 October 2007

*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
2006.1 proposal 2006.12 framework design 2007.5 develop vlab2007.9 v w &2007.12 1.0Proteinastronomy


*
*
*
*