
Scalla: Back Through The Future

Andrew Hanushevsky

SLAC National Accelerator Laboratory, Stanford University

8-April-10
http://xrootd.slac.stanford.edu


Outline

Introduction: history, design points, architecture

Scalla Usage

Future projects involving Scalla

Conclusion


What is Scalla?

Structured Cluster Architecture for Low Latency Access:
Low latency access to data via xrootd servers
POSIX-style byte-level random access
Arbitrary data organized as files
Hierarchical directory-like name space
Protocol includes high-performance features
Structured clustering provided by cmsd servers
Exponentially scalable and self-organizing
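
To make "POSIX-style byte-level random access" concrete, the sketch below uses ordinary POSIX calls; the file path is a placeholder. With the POSIX preload library described later (see "How Scalla Is Accessed"), unmodified code of this form can also read data served by an xrootd cluster.

// Minimal sketch of POSIX-style byte-level random access.
// The path is a placeholder; with the xrootd POSIX preload library,
// the same unmodified code can read files served by xrootd.
#include <fcntl.h>
#include <unistd.h>
#include <cstdio>
#include <vector>

int main() {
    const char *path = "/data/example/events.dat";   // placeholder path
    int fd = open(path, O_RDONLY);
    if (fd < 0) { perror("open"); return 1; }

    // Sparse, small-block random read: 4 KB at an arbitrary offset.
    std::vector<char> buf(4096);
    off_t offset = 1 << 20;                           // 1 MiB into the file
    ssize_t n = pread(fd, buf.data(), buf.size(), offset);
    if (n < 0) { perror("pread"); close(fd); return 1; }

    std::printf("read %zd bytes at offset %lld\n", n, (long long)offset);
    close(fd);
    return 0;
}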


Brief History

1997 – Objectivity, Inc. collaboration: design & development to scale Objectivity/DB
First attempt to use a commercial DB for physics data; very successful but problematic

2001 – BaBar decides to use the ROOT framework instead of Objectivity
A collaboration with INFN Padova & SLAC is created to design & develop a high-performance data access system, based on what we learned with Objectivity

2003 – First deployment of the xrootd system at SLAC

2005 – Collaboration extended to the ROOT collaboration & the ALICE LHC experiment, CERN
Over 100 deployment sites across the world


The Scalla Design Point

Write once, read many times processing mode (in typical cases): capitalize on simplified semantics
Large-scale, small-block sparse random access: provide very low latency per request
Secure the large compute investment: provide a high degree of fault tolerance
Accommodate more data than disk space: integrate offline storage (Mass Storage System)
Adapt to disparate deployment environments: robust security framework, component-based replaceable subsystems, simple setup with no 3rd-party software requirements


Scalla Plug-in Architecture


The original slide shows the plug-in stack as a diagram; the replaceable components are:

Protocol Driver (XRD)
Protocol, 1 of n (xrootd, xproofd)
File System (ofs, sfs, alice, etc.)
Storage System (oss, hdfs, drm/srm, etc.)
Clustering (cmsd)
Authentication (gsi, krb5, etc.)
Authorization (name based)
lfn2pfn (prefix encoding)
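
Each of these boundaries is a C++ interface whose implementation can be loaded as a shared library at start-up. The sketch below is purely illustrative: the names StorageSystem, CreateStorageSystem, and the library path are hypothetical and do not correspond to the actual XRootD headers; it only shows the general pattern of a replaceable component resolved at run time.

// Illustrative plug-in pattern only; StorageSystem, CreateStorageSystem,
// and the library name are hypothetical, not the real XRootD interfaces.
#include <dlfcn.h>
#include <cstdio>

// Abstract interface a storage-system plug-in would implement.
struct StorageSystem {
    virtual int  Open(const char *lfn) = 0;
    virtual long Read(int fd, char *buf, long len, long long off) = 0;
    virtual ~StorageSystem() {}
};

// Factory symbol the server resolves from the shared library named
// in its configuration file.
using FactoryFn = StorageSystem *(*)();

StorageSystem *LoadStorageSystem(const char *libPath) {
    void *handle = dlopen(libPath, RTLD_NOW);
    if (!handle) { std::fprintf(stderr, "dlopen: %s\n", dlerror()); return nullptr; }
    auto factory = reinterpret_cast<FactoryFn>(dlsym(handle, "CreateStorageSystem"));
    return factory ? factory() : nullptr;
}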


Clustering

xrootd servers can be clustered
Increases access points and available data
Allows for automatic failover

Structured point-to-point connections
Cluster overhead (human & non-human) scales linearly
Cluster size is not limited (easily accommodates 262,144 servers)
I/O performance is not affected

Symmetric cookie-cutter management
Always pairs an xrootd & cmsd server
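
The 262,144 figure follows from the tree-structured clustering, assuming the commonly cited fan-out of 64 subscribers per cmsd and a manager / supervisor / server hierarchy three levels deep:

64^3 = 64 × 64 × 64 = 262,144 data servers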


How Scalla Is Accessed

The ROOT framework: used by most HEP experiments (MacOS, Unix, and Windows)

A mounted FUSE Scalla file system (Linux and MacOS only)

SRM and gridFTP: general grid access (Unix only)

POSIX preload library: any POSIX-compliant application (Unix only, no recompilation needed)

xrdcp: the copy command (MacOS, Unix, and Windows)

xprep: the redirector seeder command (MacOS, Unix, and Windows)

xrd: the admin interface for meta-data operations (MacOS, Unix, and Windows)
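
As a concrete example of the first access path above, the ROOT framework can open a file on an xrootd cluster directly by URL; the redirector host and file path below are placeholders.

// Access through the ROOT framework; host and path are placeholders.
#include "TFile.h"

void read_remote() {
    // TFile::Open understands root:// URLs and dispatches them to the
    // xrootd client, so remote files are used like any local TFile.
    TFile *f = TFile::Open("root://redirector.example.org//store/data/file.root");
    if (!f || f->IsZombie()) return;
    // ... read objects as usual ...
    f->Close();
}

The equivalent whole-file copy with xrdcp would be: xrdcp root://redirector.example.org//store/data/file.root /tmp/file.root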


Who Uses Scalla?

US ATLAS (SLAC, ANL, UWisc 160-node cluster, UTA, UVIC Canada; more to follow)
BaBar (SLAC, IN2P3 France, INFN Italy)
Fermi/GLAST (SLAC & IN2P3 France)
STAR (BNL 600-node cluster)

CERN (Switzerland): to support most LHC local-site data analysis

LHC ALICE (world-wide): a global cluster of over 90 sites

IN2P3 (France): over 12 unrelated physics, astronomy, & biomed experiments

There are many more, e.g., all sites running the Parallel ROOT Facility (PROOF)


Scalla Flexibility

Engineered to play well in different contexts:
Flexible plug-in architecture
Robust multi-protocol security (KRB4/5, GSI, SSL, shared secret, password, unix)
Highly scalable and fault-tolerant
Multi-platform: HP-UX, Linux, MacOS, Solaris, Windows (client only)

A good framework for the future


Future Scalla-Based Projects I

SSD-Based Multi-Tiered Storage
Use Scalla's automatic data migration facility to keep active data on expensive SSD.
Determine the usability of SSDs in this mode to improve performance and/or reduce overall storage costs.
LDRD proposal submitted.


Future Scalla-Based Projects II

Data Direct Networks Port
Port Scalla/xrootd to run natively on the DDN head node for decreased latency and perhaps increased throughput.
Waiting for DDN to supply hardware; requires a lot of accounting/support co-ordination.


Future Scalla-Based Projects III

PetaCache
Pair the SLAC-developed SOFI system (Mike Huffer's Sea Of Flash) with Scalla to determine its usability for high-performance data access in typical analysis environments.
In progress; waiting for software interface specifications and access to hardware.


Future Scalla-Based Projects IV

ExaScale MySQL
Use the Scalla framework to cluster large numbers of MySQL servers, providing fault-tolerant map/reduce functionality for relational databases.
ASCR proposal submitted to DOE; selection will be announced in the fall.
This is in support of LSST.


Future Scalla-Based Projects V

Secure Export of the Lustre Filesystem
Use Scalla to securely and efficiently export Lustre to "laptops" for remote researchers.
Can be done via FUSE: a working version exists for Linux and MacOS; the Windows xrootd client still needs to be paired with Dokan, which will provide equivalent functionality.
In support of LCLS researchers; the idea is currently being floated.


Future Scalla-Based Projects VI

Multi-Tiered Storage
Provide a mechanism to mix and match hardware (expensive disk, cheap disk, tape) while maintaining high I/O performance.
Already supported by the software; a matter of configuration.
In support of the ATLAS SLAC Tier 2: increase effective disk capacity at greatly reduced cost. In progress.


External Future Projects

Scalla + HDFS (Brian Bockelman, University of Nebraska)
Provides the best features of Hadoop and Scalla

Scalla + DPM (David Smith, CERN)
Enables Disk Pool Manager global participation

Scalla + Castor (Andreas Peters, CERN)
Overcomes the high latency of CERN's MSS; the near-term plan is to move toward pure Scalla

Scalla + SSD (Andreas Peters, CERN)
Provides block-level SSD caching; in alpha test at BNL


Conclusion

Scalla is a highly successful HEP software system:
It works as advertised
The feature set is tuned for large-scale data access
Collaboratively designed and developed
The only system of its kind that is freely available

It is widely used in the HEP community, with increasing use in the astro community

Being actively developed and adapted


Acknowledgements

Software Contributors
Alice: Derek Feichtinger
CERN: Fabrizio Furano, Andreas Peters
FZK: Artem Trunov
Fermi/GLAST: Tony Johnson (Java)
Root: Gerri Ganis, Bertrand Bellenot, Fons Rademakers
SLAC: Tofigh Azemoon, Jacek Becla, Andrew Hanushevsky, Wilko Kroeger, Daniel Wang
LBNL: Alex Sim, Junmin Gu, Vijaya Natarajan (BeStMan team)

Operational Collaborators
BNL, CERN, FZK, IN2P3, SLAC, UTA, UVIC, UWisc

Partial Funding
US Department of Energy, Contract DE-AC02-76SF00515 with Stanford University