GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader...

23
GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton, Middleware Coordinator Neasan O’Neill, Events Officer

Transcript of GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader...

Page 1: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

GridPP, The Grid & Industry

Who we are, what it is and what we can do.

Tony Doyle, Project LeaderSteve Lloyd, Collaboration Board ChairmanRobin Middleton, Middleware Coordinator

Neasan O’Neill, Events Officer

Page 2: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

19 UK Universities, CERN and

CCLRC (RAL & Daresbury) Funded by PPARC:GridPP1 2001-2004 (£17m)

“From Web to Grid”

GridPP2 2004-2007 (£16m)

“From Prototype to Production”

GridPP3 2007-2011 (proposed)“From Production to

Exploitation”

Who are GridPP?

Developed a working, highly functional Grid

Page 3: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

Web: information sharing

• Invented at CERN by Tim Berners-Lee

• Agreed protocols: HTTP, HTML, URLs

• Anyone can access information and post their own

• Quickly crossed over into public use

No. of

Inte

rnet

host

s (m

illio

ns)

Year

Page 4: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

4 Large Experiments

The CERN LHCThe world’s most powerful particle accelerator

Why do particle physicists need the Grid?

Page 5: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,
Page 6: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

Why do particle physicists need the Grid?Example from LHC: starting

from this event

We are looking for this “signature”

Selectivity: 1 in 1013

Like looking for 1 person in a thousand world populationsOr for a needle in 20 million haystacks!

• ~100,000,000 electronic channels

• 800,000,000 proton-proton interactions per second

• 0.0002 Higgs per second

• 10 PBytes of data a year

• (10 Million GBytes = 14 Million CDs)

Concorde(15 Km)

Mt. Blanc(4.8 Km)

One year’s data from

LHC would fill a stack of CDs 20km

high

Page 7: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

Solution – Build a Grid

• Share more than information• Efficient use of resources at many

institutes• Leverage over other sources of funding• Data, computing power, applications• Join local communities

Challenges:• share data between thousands of scientists with multiple interests• link major and minor computer centres• ensure all data accessible anywhere, anytime• grow rapidly, yet remain reliable for more than a decade• cope with different management policies of different centres• ensure data security• be up and running routinely by 2007

Page 8: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

Middleware is Everything

MIDDLEWARE

CPUDisks, CPU etc

PROGRAMS

OPERATING SYSTEM

Word/Excel

Email/Web

Your Progra

mGames

CPUCluste

r

UserInterfac

eMachine

CPUCluste

r

CPUCluste

r

Resource Broker

Information Service

Single PC

Grid

DiskServer

Your Progra

m

Middleware is the Operating System of a distributed computing system

Replica CatalogueBookkeepin

g Service

Page 9: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

GridPP Middleware Development

Workload Management

Storage Interfaces

Network Monitoring

SecurityInformation Services

Grid Data Management

Page 10: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

What you need to use the Grid

1. Get a digital certificate (UK Certificate Authority)

2. Join a Virtual Organisation (VO)

3. Get access to a local User Interface Machine (UI) and copy your files and certificate there

Authentication – who you are

Authorisation – what you are allowed to do

4. Write some Job Description Language (JDL) and scripts to wrap your programs

############# HelloWorld.jdl #################Executable = "/bin/echo";Arguments = "Hello welcome to the Grid ";StdOutput = "hello.out";StdError = "hello.err";OutputSandbox = {"hello.out","hello.err"};#########################################

Page 11: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

International Context

LHC Computing Grid (LCG)Grid Deployment Project for

LHC

EU Enabling Grids for e-Science (EGEE) 2004-2008Grid Deployment Project for all disciplines

GridPP LCG

EGEE

GridPP is part of EGEE and LCG (currently the largest Grid in the world)

UK National Grid ServiceUK’s core production computational and data Grid

Open Science Grid (USA)Science applications from HEP to biochemistry

NorduGrid (Scandinavia)Grid Research and Development collaboration

Page 12: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

The LCG Grid Status Worldwide

182 Sites

23,438 CPUs

9.2 PB Disk

2,200 Years of CPU time

UK

21 Sites

4,482 CPUs

180 TB Disk

593 Years of CPU time

Page 13: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

What GridPP Has Done So Far

•Reached transfer speeds of 1 Gigabyte per second in high speed networking tests from CERN – a DVD every 5 seconds•Simulated 500 million particle physics collisions with the BaBar experiment•Transformed the way particle physics computing problems are approached

•Analysed 300,000 possible drug components in the fight against the Avian Flu virus•Simulated 46 million molecules for medical research in 5 weeks, which would have taken over 80 years on a single PC

Page 14: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

Who else can use a Grid?• Astronom

y

• Healthcare

• Bioinformatics

• Gaming

• Engineering

• Commerce

Page 15: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

“UK contributes to EGEE's battle with malaria”

BioMedSuccesses/Day 1107Success % 77%

WISDOM (Wide In Silico Docking On Malaria)

The first biomedical data challenge for drug discovery, which ran on the EGEE grid production service from 11 July 2005 until 19 August 2005.

GridPP resources in the UK contributed ~100,000 kSI2k-hours from 9 sites

Number of Biomedical jobs processed by country

Normalised CPU hours contributed to thebiomedical VO for UK sites, July-August 2005

Page 16: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

"GridPP has been developed to help answer questions about the conditions in the Universe just after the Big Bang," said Professor Keith Mason, head of the Particle Physics and Astronomy Research Council (PPARC)."But the same resources and techniques can be exploited by other sciences with a more direct benefit to society."

Page 17: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

GridPP & IndustryWhat We Have To

Offer• Our Grid• Security tools• GridSite • R-GMA• APEL accounting

system

Page 18: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

Our Grid• The UK Grid (via

one of the individual university sites) can be used to run applications for areas such as finance and image processing.

Page 19: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

Security Tools & GridsiteGrid Security for the WebWeb platforms for Grids

• Digital Certificates• Certification Authority• Gridsite identifies users to websites with

the digital certificates• GridSiteWiki is an extension to the tool • GridSite is open source

(http://www.gridsite.org/)

Page 20: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

RGMA & APEL accounting system

• Relational Grid Monitoring Architecture– An information and monitoring system for

static and dynamic information about grid resources, applications and networks

• Accounting Processor for Event Logs– Provides a summary of the resources

consumed based on attributes such as CPU time, Wall Clock Time, Memory and grid user identity

Page 21: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

• HP are sponsoring a joint project with GridPP at Bristol.

• GridPP has an association with IBM through collaboration on ScotGrid and R-GMA.

• Specific sites also have close relationships with various industrial suppliers.

GridPP & IndustryCurrent Involvement

Page 22: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

• Posters at “Technology Opportunities from CERN: the impact of Big Physics on Industry”.

• Attended KITE club meetings on: – Healthcare, – Medical image processing– Film and computer games

• Speakers at a forum on Network and Grid Security organised for the IT industry.

GridPP & IndustryCurrent Involvement

Page 23: GridPP, The Grid & Industry Who we are, what it is and what we can do. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton,

Future

Plan to establish a small steering group to lead technology transfer activity. The group, working with various companies, would examine different methods of technology transfer and identify the GridPP activities that can be used in industry and business.