Tony Doyle - University of Glasgow Introduction. Tony Doyle - University of Glasgow 31 August...
-
Upload
charity-douglas -
Category
Documents
-
view
214 -
download
1
Transcript of Tony Doyle - University of Glasgow Introduction. Tony Doyle - University of Glasgow 31 August...
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Expression of Interest
Universities of Aberdeen, Dundee, Edinburgh, Glasgow (lead),
Heriot-Watt, St. Andrews, Stirling, Strathclyde1. wide variety of disciplines 2. computing-enabled research3. portals4. training 5. accounting 6. infrastructure management7. distributed computing provision8. three-year project 9. approximately £5m from SFC 10.comparable equipment resources from
partners
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Objectives
• Added Value: a grid-enabled software infrastructure, providing added value to institutional hardware
• Integration: integrated with existing projects in the UK, Europe and internationally
• Leverage: standardised middleware will be exploited
• Virtual Communities: nurture and support• Outreach and Training: reach new
research communities (Scottish companies will also be able to benefit)
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Development Plans
• Pilot phase: prototype grid service (www.scotgrid.ac.uk)
• Boost: R&D at NeSC (www.nesc.ac.uk) • Prototype: e-infrastructure in operation for
four years• Growth: support for growing number of
activities• Evolution: technology development required• Challenges:
– Grid accessible to non-computing researchers– development of discipline-specific portals– information management– integration of different authentication systems
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Institutional Resources
• Substantial investments in computing resources
• ScotGrid will integrate theseDundee: £1m life sciences HPC facility (currently 200
processors, > 100TB), upgrade plannedEdinburgh: £2m Computing and Data Facility (ECDF)
plannedGlasgow: £1m dedicated grid facility (first
components arrived this week) 1 MSI2k, 100 TBStirling: £850k in central servers Aberdeen, Heriot-Watt, St. Andrews and Strathclyde
are supportive
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Accounting and Monitoring
• Ensure fairness and balance of the institutional resources between the different research communities
• Effective distributed accounting provides the basis by which partners can have confidence in resource sharing and in additional resource provision and consumption
• Annual ScotGrid accounts provided to SFC will enable each partner institute to determine their usage charges, providing a basis for long-term sustainability via research grants and the rolling equipment programmes in each university
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
SFC Support • Institutional equipment investments are comparable to the support
required from SFC for staff to develop and realise the full potential of ScotGrid
• In addition, support will be needed for meetings, travel, etc• The full economic cost over three years, starting July 2007, is
estimated at around £5m• Project Management: (1.2 FTE) A dedicated project director plus a
fraction of the PI to coordinate the complex technical developments and specialised discipline requirements
• System Management: (4 FTE) Local technical management of the key facilities forming part of ScotGrid
• Grid Operation: (6 FTE) A distributed team of technicians to achieve agreed levels of service across the entire ScotGrid infrastructure
• Accounting Support: (1 FTE) Development of accounting support to ensure fair and well-managed use of ScotGrid facilities
• Portal Development: (4 FTE) A team of specialists to develop the specialised portals required by client disciplines
• Training and Outreach: (2 FTE) A team of specialists to determine the development and training needs of client disciplines
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Sustainability
• There is a long-term demand for a Grid over the next ten years
• Medium-term sustainability will depend upon close co-operation
• The project is designed to be fully inclusive• The ScotGrid resource will help sustain the leading position
Scotland has in e-Science and Grid research, and ensure that the Scottish research community is well positioned for future research funding opportunities
• continued support via grant applications for new research growing out of the project and from the rolling equipment programmes in each University
• A summary report of research income generated and/or supported using the infrastructure will be compiled in 2009 to enable SFC and the partners to determine the most appropriate sustainable funding model from 2010 and beyond
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
SFC feedback (17/7/06)
• The proposal was considered outwith the discussion of proposals to the SRDG periodic competition
• The Panel considered that the proposal met a strategic need for Scotland
• It would be ideally suitable for consideration for funding under the Enhancing Excellence (pooling) funding stream of SRDG
• Two key issues for the proposal: (i) what parts of infrastructure are provided
already at the UK level(ii) be clear on the pan-Scottish dimension
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
SFC further feedback (26/7/06)
A. need to understand the relationship and synergy with the Informatics Pooling bid
B. Highlight the "service" aspects of the ScotGrid proposition
C. The outcome of the proposal would be for assessment next year
D. Institutions involved should consider how they can contribute
E. Scottish Enterprise could be involved particularly in knowledge transfer and core technology aspects
F. Approach Abertay w.r.t. their formal training (linked to informatics pooling)
G. New senior appointments in this area should be considered to build up the area of Grid research
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
A further set of points were identified:4. Industrial liaison should be incorporated -
testbed encouraged5. A gap analysis should be performed6. Investigate Grid-associated studentships 7. Service existing research pooling efforts8. An overview document should be prepared 9. Indicate how many users are anticipated
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Broader Issues
a. Technical issues should be discussed (in the appendices)
b. International review committee will judge
c. Develop links with co-funders (e.g. Research Councils)
d. EPSRC must be approached re: supporting academic investments. (PPARC are already aware).
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Benefits
Establishment of a Research e-Infrastructure leading to:
1. Expansion in terms of partners and services2. Robust operations and management procedures3. Interoperability with other national and
international e-infrastructures4. Integration of key data sources5. Improved understanding of data provenance6. Improved and measured ScotGrid reliability7. Supported research across a range of domains8. Access by non-Grid focused researchers9. Service definitions for a sustainable infrastructure10.Industrial partners testing/utilising Grid services
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Timeline
In terms of defining a timeline:i. need to identify contacts at all Institutesii. SFC (David Gani) will meet with partners on
November 6th iii. focus on enhanced service aspects and identifying
the core Grid technologies • Three meetings currently planned:
– Today Edinburgh– November 6th Glasgow– End-December TBC
Internal aim (for discussion): appendices by the end of the yearproposal by SFC-requested timescale
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Summary
• Expression of Interest submitted at end of May 2006
• Draft proposal developed up to mid-May 2006 was used to guide internal discussions
• Received prompt, positive feedback to EoI from SFC in July
• Need to build collaboration and review draft proposal in light of SFC feedback and today’s inputs
Talk Title
Enabling Grids for E-sciencE
EGEE-II INFSO-RI-031688
Grids and Business – key points
• Trust - Not used to sharing resources
• Security - Sensitive data with sensitive applications
• Business models – what can be charged for as a service
• Guaranteed QoS – Service Level Agreements
• Accounting - tracking resources usage in multi-admin context
• Standards – to encourage long-term investment
• Applications – need to support legacy applications
• Portability – across multiple platforms and implementations
• Open source support – robust reference implementation
• Software license management – how to generate revenue ina grid context
EGEE-II provides an excellent framework for collaborating with business on these subjects
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
News: First Meeting of ScotGrid Partners to develop Pan-Scottish Grid
News: Glasgow prepares with new Grid servers
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Accounting
June 2002 June 2006
Past CPU performance •3,061,049 CPU hours •383,858 completed jobs.•> 100 users, >30 VOsNeed to:•Publish data automatically •Agree the conversion rates £: [CPU, disk, manpower]•Agree internal charging
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
PP Grid OverviewAim: by 2008 (full year’s data
taking)- CPU ~100MSi2k (100,000
CPUs)- Storage ~80PB - Involving >100 institutes
worldwide
- Build on complex middleware being developed in advanced Grid technology projects, both in Europe (Glite) and in the USA (VDT)
1. Prototype went live in September 2003 in 12 countries
2. Extensively tested by the LHC experiments in September 2004
3. 197 sites, 13,797 CPUs, 5PB storage in September 2005
4. Today: 177 active sites, 28,574 CPUs, 45PB storage
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
MiddlewareRequirement
Storage Element
Basic File Transfer
Reliable File Transfer
Catalogue Services
Data Management tools
Compute Element
Workload Management
VO Agents
VO Membership Services
DataBase Services
Posix-like I/O
Application Software Installation Tools
Job Monitoring
Reliable Messaging
Information System
15 Baseline Services for a functional Grid
Scottish (and National) Grid Service will rely upon gLite components
This middleware (now at v3.0) builds upon VDT (Globus and Condor) and meets the requirements of all the low-level, basic scientific use cases:
1. Green (amber) areas are (almost) agreed as part of the shared generic middleware stack by each of the application areas
2. Red are areas where generic middleware competes with application-specific software.
www.glite.org
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Glasgow Edinburgh
Leicester Oxford
London
Netherlands
Publically Curated Data
Private data
Private data
Private data
Private data
Private data
Private data
CFG Virtual Organisation Ensembl
MGI
HUGO
OMIM
SWISS-PROT
… DATA HUB
RGD
Middleware
Generic User
Services
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
1. Glasgow/Edinburgh/Durham/ (Dundee) Project
2. Compute-intensive jobs performed at Glasgow and Durham
3. Data-intensive jobs performed at Edinburgh
4. First applications using the Grid. Meeting real requirements of Grid applications: currently Particle Physics, Bioinformatics,
Computing Science, Electrical Engineering
5. Glasgow leading R&D in Grid Data Management, Security, Portals
6. Edinburgh leading User Training, R&D in Networking and Storage
7. Management part of a worldwide Grid infrastructure through GridPP and EGEE
What was?
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
What are the Grid challenges?
Data Management, Security and
Sharing
1. Software process2. Software efficiency3. Deployment
planning 4. Link centres
5. Share data
6. Manage data7. Install software8. Analyse data9. Accounting
10. Policies
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Summary• ScotGrid is part of an evolving National
and International Grid• NGS, EGEE and OMII-UK are important
partner organisations• Deployment, middleware and application
support will each raise issues for ScotGrid• External developments place constraints
on the hardware and software• Timeline is ~right for spin-off
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Work Plan• We are developing a more detailed work plan…• International leadership of the participating departments, their
proven ability to collaborate successfully, their needs for large-scale computing, and their stated intentions to exploit a Scottish Grid
• Each discipline has its own application needs, pressing timescales, and expectations of large-scale computing
• The project will provide the framework needed for synergy and economies of scale
• Deployment plans to: – expand the user base– include new provider sites– create a training programme– enhance current service provision and establish user
support• This will be integrated with the work of the UK and European Grid
services • A top-down approach will draw on institutional strengths and
strategic plans for computing• A bottom-up approach will focus on requirements of each discipline,
building unified services across Scotland
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Stakeholders
Stakeholder Interest / stake Importance University Computing Services Road mapping and resource provision High Higher Education Colleges Accessibility of facilities Medium Researchers Standard interfaces and access to
resources of appropriate scale High
Middleware Providers Interoperability standards, established via deployment practice
Medium
Industrialists Using (open) standards to establish commercial systems
High
Research Councils Optimising use of infrastructure investments
High
Scottish Funding Council Developing national e-infrastructure
High
Community and project infrastructure projects Interoperability and sustainability High
UKERNA/JISC Grid as a service above JANET network provision
Medium
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Risk analysisName Likelihood
(1-4) Impact
(1-4) Risk
(L x I) Action to Mitigate Risk
Staffing 2 3 6 Build on existing expertise. Organisational 3 3 9 Define/build/agree collaboration
structure. Clarify the role of ScotGrid and its interactions.
Technical 3 2 6 Adopt a conservative approach to technology deployment.
External suppliers 3 1 3 Work with developers and partner infrastructures.
Insufficient support for a focussed e-infrastructure
2 4 8 Ensure funding is in place early and work to increase the scale and relevance of ScotGrid.
Service under utilised 3 2 6 Establish user board and survey initial requirements.
Service over subscribed 3 2 6 Work with partners and affiliates to increase hardware via annual review.
Inadequate support infrastructure
2 3 6 Monitor performance of support activities via pre-defined metrics.
Lack of international compatibility
1 4 4 Active engagement in NGF, GGF, EGEE.
Failure to integrate e-Infrastructure authentication
2 3 6 Review authentication early in the project – agree processes, recommend UK CA.
Security compromise 1 4 4 Work with other e-Infrastructure providers. Limit capability through portals. Key part of user training.
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
Middleware:
Baseline Services
Requirement OMII VDT/GT gLite Other Comment 1. Storage Element
Access to data/storage
through OGSA-DAI functionality
Yes SRM via dCache or
DPM
gLite includes Storage Resource Management capability
2. Basic File Transfer
Yes GridFTP Yes gLite and OMII include GridFTP
3. Reliable File Transfer
Remote File Transfer
File Transfer Service
FTS being tested by OMII
4. Catalogue Services
UDDI registry via
GRIMOIRES
RLS LCG File Catalogue
Central catalogues adequate, high throughput needed
5. Data Management tools
OMII incorporates OGSA-DAI
LCG tools (replica
management)
SRB gLite File Placement Service under development
6. Compute Element
OMII Job Service
Gatekeeper Yes gLite uses Globus (GT2)
7. Workload Management
GridSAM and manual job submission
Condor-G Resource Broker
RB builds on Globus, Condor-G. Migrate to GT4?
8. VO Agents Perform localised activities on behalf of VO
9. VO Membership Services
Tools for account
management, no
GridMapFile equivalent
CAS VOMS CAS does not provide all the needed functionality
10. DataBase Services
OGSA-DAI MySQL, PostgreSQL, ORACLE
Off–the-shelf offerings are adequate
11. Posix-like I/O
GFAL Xrootd Application access to data
12. Application Software Installation Tools
PACMAN Yes Tools already exist in EGEE
13. Job Monitoring
Monalisa, Netlogger
Logging & Bookkeeping
service, R-GMA
14. Reliable Messaging
Supported through FIRMS
Tools such as Jabber are used
15. Information System
MDS (GLUE)
Yes BDII EGEE based on BDII and GLUE schema
31 August 2006 ScotGrid Introductory Meeting Tony Doyle - University of Glasgow
External Relations
Applications
Application Specific Middleware
Generic Grid Middleware
Infrastructure
BRIDGES
OMII
GridPP
SBRN
Scot Grid
PPA
BIO
VOTES
CDF ATLAS
…
…
NGS
EEE
IR NW
EGEE …
nanoCMOS
Vis