Progress of the Helix Nebula Science Cloud PCP Project


Page 1: Progress of the Helix Nebula Science Cloud PCP Project
Page 2: Progress of the Helix Nebula Science Cloud PCP Project

Helix Nebula – The Science Cloud with Grant Agreement 687614 is a Pre-Commercial Procurement Action funded by H2020 Framework Programme

Progress of the Helix Nebula Science Cloud PCP project

19 October 2016

Bob Jones

CERN IT department


This work is licensed under the Creative Commons Attribution-ShareAlike 4.0 International License. The content of this presentation is the sole responsibility of the authors and does not necessarily represent the views expressed by the European Commission or its services.

Page 3: Progress of the Helix Nebula Science Cloud PCP Project

The Helix Nebula Science Cloud public-private partnership

Page 4: Progress of the Helix Nebula Science Cloud PCP Project

D. Giordano HN GA8 21/09/2016

Series of short procurements of increasing size and complexity


Augmenting CERN’s scientific computing programme with commercial cloud services

Page 5: Progress of the Helix Nebula Science Cloud PCP Project

D. Giordano WLCG Workshop 9/10/2016

CERN cloud procurements 2015-2016


Page 6: Progress of the Helix Nebula Science Cloud PCP Project


The Helix Nebula Initiative

Brings together
• research organisations,
• data providers,
• publicly funded e-infrastructures,
• commercial cloud service providers

in a hybrid cloud with procurement and governance approaches suitable for the dynamic cloud market

In-house

Page 7: Progress of the Helix Nebula Science Cloud PCP Project

Major challenges

What if I get locked in?

Are there relevant standards I should be looking into?

What happens to my data?

How do I get a good deal?

What happens to my IT staff?

How can I compare contracts & SLAs?

What is PCP?

What are the others doing?

How can I allocate costs?

What services do I need?

1. Cloud computing is disrupting the way IT resources are provisioned
2. In-house resources, publicly funded e-infrastructure and commercial cloud services are not integrated to provide a seamless environment
3. Current organisational and financial models are not appropriate
4. The new way of procuring cloud services is also a matter of skills and education
5. Legal impediments exist

Page 8: Progress of the Helix Nebula Science Cloud PCP Project

• Provides a landscape of cloud procurement in the European public research sector
• Makes pragmatic recommendations for the procurement of cloud services by PROs in Europe
• Provides a guide to cloud procurement, supported by best practices adopted worldwide
• Proposes actions within pillar three of the Digital Single Market Strategy, which focuses on maximising the growth potential of the digital economy

The PICSE Roadmap


www.picse.eu/roadmap

Page 9: Progress of the Helix Nebula Science Cloud PCP Project

HNSciCloud Joint Pre-Commercial Procurement


Procurers: CERN, CNRS, DESY, EMBL-EBI, ESRF, IFAE, INFN, KIT, STFC, SURFSara

Experts: Trust-IT & EGI.eu

The group of procurers have committed
• Procurement funds
• Manpower for testing/evaluation
• Use-cases with applications & data
• In-house IT resources

Resulting services will be made available to end-users from many research communities

Co-funded via H2020 Grant Agreement 687614

Total procurement budget >5M€

Page 10: Progress of the Helix Nebula Science Cloud PCP Project

What will be procured

A hybrid cloud platform for the European research community


HNSciCloud PCP

Source: Cloud Computing for Govies, DLT Solutions, David Blankenhorn, Van Ristau and Caron Beesley

Combining services at the IaaS level to support science workflows

The R&D services to be developed are to be integrated with
• Resources in data centres operated by the buyers group
• European-scale publicly funded e-Infrastructures

Page 11: Progress of the Helix Nebula Science Cloud PCP Project

Challenges

Innovative IaaS level cloud services integrated with procurers' in-house resources and public e-infrastructure to support a range of scientific workloads

Compute and Storage
Support a range of virtual machine and container configurations, including HPC, working with datasets in the petabyte range

Network Connectivity and Federated Identity Management
Provide high-end network capacity via GEANT for the whole platform with common identity and access management

Service Payment Models
Explore a range of purchasing options to determine those most appropriate for the scientific application workloads to be deployed


Page 12: Progress of the Helix Nebula Science Cloud PCP Project

HNSciCloud project phases

Preparation

• Analysis of requirements, current market offers and relevant standards

• Build stakeholder group

• Develop tender material

Implementation

and sharing

Jan’16 Dec’18

Each step is competitive - only contractors that successfully complete the previous step can bid in the next


200+ downloads

70+ requests for clarifications

4 Designs, 3 Prototypes, 2 Pilots

Tender Jul’16

Call-off Feb’17

Call-off Oct’17

Page 13: Progress of the Helix Nebula Science Cloud PCP Project


Research Infrastructures are facilities, resources or services of a unique nature identified by European research communities to conduct top-level research activities in all fields

Interested Research Infrastructures:
• EPOS, ESA, ESS
• clusters: CORBEL, ASTERICS-OBELICS

Will form an observer group

Page 14: Progress of the Helix Nebula Science Cloud PCP Project

Launch event

ICRI 2016, Cape Town - South Africa

e-INFRASTRUCTURE

Research Infrastructure as key nodes of e-Infrastructure for Research

• Advanced e-Infrastructure of all Research Infrastructures

• Optimal interfaces between RIs and the external e-Infrastructure (Networks, Cloud, HPC, HTC)

• Data Quality assessment at RIs and setting quality standards for broad use

• Data access to “enabling data” i.e. data completed with adequate metadata, traceable origin, FAIR

• Long Term preservation of “useful data”

• Key role of “public” institutions and interplay with commercial clouds/repositories

Giorgio Rossi, Chair

Page 15: Progress of the Helix Nebula Science Cloud PCP Project

Helge Meinhard, CERN, 17 March 2016

• Foreseen users:
  - bioinformaticians who will do most of the large-scale processing
  - less tech-savvy end-users who will perform analysis

• Data types: genotype & phenotype information. Data can be assumed to be anonymised but it is still sensitive

Page 16: Progress of the Helix Nebula Science Cloud PCP Project

Helge Meinhard, CERN, 17 March 2016

Long tail of science

• Foreseen users: individual researchers/small labs in need of access to highly performant IT resources to analyse their data on

• Data types: Due to the nature of the use-cases, the exact type of datasets can’t be predicted upfront. However, the infrastructure will need to ensure that datasets are kept private to the single user, with the possibility to share them with other users or publicly, provided sufficient authorisation is granted.

Page 17: Progress of the Helix Nebula Science Cloud PCP Project

Helge Meinhard, CERN, 17 March 2016

• Foreseen users: the EuroBioImaging consortia through the representatives in EMBL

• Data types: private and public datasets consisting of images coming from human cells, Drosophila and fungi, with a plan to further extend the coverage adding more datasets and cell types. No sensitive data are currently foreseen.

Page 18: Progress of the Helix Nebula Science Cloud PCP Project

European Open Science Cloud


https://bit.ly/cloudeu

Page 19: Progress of the Helix Nebula Science Cloud PCP Project

Widening access (2/2): e-Infras as aggregators of demand


EOSC

Scientific Users

Commercial services

e-Infrastructures EU H2020 funding

€ Procurement

Grants

Augusto Burgueño Arjona, head of the Unit "eInfrastructure & Science Cloud", DG CNECT, EC, Sept’16

Page 20: Progress of the Helix Nebula Science Cloud PCP Project

Widening access (1/2): e-Infrastructures as service providers


EOSC

Scientific Users Industry Public Sector

e-Infrastructures

Augusto Burgueño Arjona, head of the Unit "eInfrastructure & Science Cloud", DG CNECT, EC, Sept’16

HNI 2.0: Building value chains with data intensive science