dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I...

17
Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005 dCache.org dCache The Big Picture Patrick Fuhrmann et al.

Transcript of dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I...

Page 1: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.org

dCache

The Big PicturePatrick Fuhrmann et al.

Page 2: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.orgPreliminary Notes

Tape Storage Media Migration @ DESY­HH

Capacity Transfer Speed (v) 

'Ready' Time (T)

20 GB 200 GB 500 GB

10 MB/sec 30 MB/sec 120 MB/sec

6 sec 60 sec > 60 sec

9840 9940 'titanuim'

efficiency = T * vn * Fs

1 +

1

T * v  60 1800 7800

Fs : average file sizen : average files / mount

9840 994030%

Page 3: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.org

International Project

Software Development

Service

ProductGrid Storage Element

Future

dCache

Page 4: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.orgInternational Project

dCache is a joined effort between the Deutsches Elektronen Synchrotron (DESY) and the Fermi  National Laboratory (FNAL)

DESYFnal

dCache Collaboration

Grid KAKarlruhe

LCG Deployment

CERNBrookhaven

LCG Tier I / II

CDF

US CMS

Page 5: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.org

✗ Software development and coordination

✗ New feature acquisition

✗ Product deployment

✗ Packaging

✗ Documentation

✗ Web Presentation (www.dCache.ORG)

✗ Bug report and developer feedback (ticket system)

✗ Installation and operation support ([email protected])

Project, Product

Page 6: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.orgSoftware, Features (Public Relations Part)

✗ Combines several hundred pool nodes and lets them look like a single huge file system space.

✗ Support multiple internal and external copies of a singe file system entry point.✗ Performs automatic pool to pool copies of datasets to flatten data access hot spots.✗ Fine grained pool selection (experiment, read­write, internal external, priority)✗ Cached data only removed if space is running short (no threshold)✗ Powerful administration interface via 'ssh' and GUI.✗ Scales due to multiple doors.

Core

✗ Takes care that at least 'n' but not more than 'm' copies of a single dataset exists within one dCache instance. 

✗ Takes care that this rule is still true if nodes go down (schedules or even unexpected)

Resilient Module

Page 7: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.orgSoftware, Features (cont.)

✗ Groups incoming datasets according to HSM specific sorting criteria and flushes them to one or more Tape systems, following certain rules.

✗ Removes 'old' files from disk, but only if space is running short.✗ Retrieves dataset from tape to disk if dataset is requested by dCap/Ftp/Srm open operation without user/administrator interaction.

HSM connection module (tape access optimization)

✗ Local name space operations via nfs 2/3✗ dCap protocol for local area posix like access (plain,kerberos,ssl,gsi)✗ Ftp protocol (plain,gsi)✗ Storage Resource Manager (SRM)

Supported Access Methods

Page 8: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.orgSoftware, Features (cont.)

✗ Supports optimized I/O and name space operations via URL like syntax✗ c­language library implementation including PRELOAD✗      ls ­l dcap://pnfs/desy.de/it/users/patrick✗ Supports linux (32 + 64 bit ), solaris, (limited windows)✗ automatic reconnect on pool or server failures✗ dCache interfaced to ROOT✗ dCache and non dCache I/O transparently handled by dCap library✗ dCap interfaced by GFAL (Grid File access library)✗ Read Ahead buffering and deferred write✗ Supports Gss(Kerberos), Gsi (Grid) and ssl as secure protocols.✗ Thread safe

dCap protocol/implementation details

✗ Prepares data transfers, checks certificates and permissions.✗ Negotiates transfer protocols (dCap,rfio,ftp,http)✗ Retries until transfer succeeds✗ Space reservation✗ Future : quotas

SRM (Storage Resource Manager) details

Page 9: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.orgGrid Storage Element (lcg 2 SE)

✗ Support of wide area protocol (GsiFtp)

✗ Support of local, posix like protocol (dCap) , incorporates with CERN GFAL.

✗ Support of Storage Resource Manager Protocol (SRM)

✗ Grid Resource Information Service (GRIS)

Requirements for lcg2 Storage Element

Page 10: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.org

Pools

GsiFtpDoor

SRM

Pools

GsiFtpDoor

SRM­dCache

Desy Main PNFS   /acsH1 scratch /acsdisk

h1 ­ farm

R

h1 resilient dCache

Pool

Pool Pool

Pool Pool

Pool Pool

Pool Pool

d 2 cache

Desy HH Main cache

Desy+H1 Installation

World

70 TBytes

4 Nodes = 5 TBHigh Performance

Page 11: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.org

H1

dCache disk storage

13 Tbytes    disk space

0

5000

10000

15000

20000

25000

30000

35000

40000

File Size vers. Data used

Days

GB

ytes

0

2500

5000

7500

10000

12500

15000

17500

20000

Restore vers. Transferred (H1 only)

Days

GB

ytes

Some H1 Statistics (Jan 2005)

211 Tbytes    tape space

Some other experiment31 Tbytes    disk space

244 Tbytes    tape space

Page 12: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.org

Compute Element

Storage Element

Local Cache

Tertiary Storage System

Prediction Module

Grid Scheduler Incoming Jobs

ScheduledJobs

Query current load

Query load prediction

Query “optimized submission time”

DGrid Proposal : Improved Grid Scheduling

(dCache)

Load / Cost

Load Info

“make ready”

Local Site

RED : improved modules

Local Scheduler

Page 13: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.org

✗ Nearly all Tape Data on Mirror Cache

✗ Mirror Cache has highest possible data density (lowest dollars/TBytes)

✗ Controlled number of high speed streams between Mirror Cache  and Regular Cache

✗ Mirror Cache behaves like HSM (except for  mount/dismount delays)

✗ Mirror Cache disks (or disk clusters) switched OFF if not accessed

✗ HSM to Mirror Cache transfers necessary only after disk replacement

The Mirror Cache

90 % of total Repository

Mirror CacheTape Repository

Regular CachesConsumer

Page 14: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.org

End of official Part

Page 15: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.org

Source : Michael Ernst 18/5/2004

Wide Area

Access

Physics Application

Grid File Access Library (GFAL)

ReplicaManager

Client ClientSRM Local

I/O I/OI/OrfiodCap rfio

rfioService Service

dCapService

SRMLRC

GridFTPService

MSSService

Local Disk

Posix like I/O

LCG Storage Element : File access

dCacheSE

Page 16: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.org

(gsi,kerberos) dCap Server

dCache Core Pnfs

dCap Client

Cell Package

Resilient Manager

Ftp Server (gsi, kerberos)

Storage Resource Manager (SRM)SRM Client

Storage Element

Remotely accessible

Resilient Cache

Basie Cache System

BSD

3 rd party

???

dCache Component License Model

Postgres Gdbm

GPL? (library LGPL ?) (BSD) (GPL)

Sun Java VM (Sun Binary Code L)

COG (GTPL)

COG (GTPL)

Globus,Cog (GTPL)

Page 17: dCache The Big Picture · 2/17/2005  · Grid KA Karlruhe LCG Deployment CERN Brookhaven LCG Tier I / II CDF US CMS. Patrick Fuhrmann h1 collaboration meeting 17 Feb 2005 dCache.org

Patrick Fuhrmann                              h1  collaboration meeting                                                 17 Feb 2005

dCache.orgdCache ­ LCG support model

LHC  Tier  I / II center

Web Pages Downloads Call Center Ticket System

LCG Deployment

Grid KA

dCache Deployment (DESY)

Web Pages Downloads Call CenterTicket System

DevelopersFERMI

DevelopersDESY

LCG dCache Evaluation Team

Support Tools

Support Tools