Lustre Beyond HPC: Toward Novel Use Cases for Lustre?
Presented at LUG 2014
Robert Triendl, DataDirect Networks, Inc., 2014/03/31
This is a vendor presentation!
Lustre Today
• The undisputed file system of choice for HPC
– large-scale parallel I/O for very large HPC clusters (several thousand nodes or larger)
– applications that generate very large datasets but only a relatively limited amount of metadata traffic
• Alternative solutions
– are typically more expensive (since proprietary)
– and not nearly as scalable as Lustre
Lustre Market Overview: Data for the Japan Market
[Figure: stacked-bar chart of the Japan storage market by segment for 2008, 2013, and 2018. Segments: Cloud & Content, Business Data Analysis, HPC "Work" Corporate, Research Data Analysis, HPC Archive, HPC "Work".]
What Happened? Example: Storage Market Japan
• Lustre market expansion
– From a relatively small player to the dominant HPC file system
– From tens of OSSs to hundreds of OSSs (and, when including the "K" system, literally thousands of OSSs)
– But Lustre has also become synonymous with large-scale HPC
– Experiments to use Lustre in other markets and for other applications have decreased rather than increased
Overall Storage Market Evolution
• "Software-defined storage" is replacing traditional storage architectures in large cloud deployments
• Many Lustre "alternatives" are now available
– Ceph, OpenStack/Swift, SwiftStack, Gluster, etc.
– Various Hadoop distributions
– Commercial object storage (DDN WOS, Scality, Cleversafe, etc.)
• Lustre remains "exotic" and is rarely considered even an option
[Figure: Japan storage market by segment — Business Data Analysis 6%, HPC "Work" Corporate 15%, Research Data Analysis 35%, HPC Archive 23%, HPC "Work" 24% — annotated with candidate Lustre features: project quota, small-file I/O, SSD acceleration, fine-grained monitoring, NFS/CIFS access, management, connectors, object/cloud links, data management, backup/replication, HSM, client performance, cluster integration, large I/O, I/O caching, RAS features.]
Nagoya University Acceptance Benchmark
• Large Cluster
– FPP (file per process) from each core in the cluster
– Most efficient configuration with 3 TB/4 TB drives
– 350-400 threads per Lustre OST
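The slide does not name the tool used for the acceptance test; as a rough illustration of the file-per-process write pattern it describes, here is a minimal mpi4py sketch. The mount point, transfer size, and per-rank file size are assumptions, not values from the benchmark.

    # Minimal file-per-process (FPP) write sketch using mpi4py.
    # Assumptions: /lustre/bench is a Lustre mount, 1 MiB transfers,
    # 1 GiB written per rank.
    import os
    from mpi4py import MPI

    MOUNT = "/lustre/bench"        # hypothetical Lustre mount point
    XFER = 1 << 20                 # 1 MiB per write() call
    COUNT = 1024                   # 1 GiB written by each rank

    comm = MPI.COMM_WORLD
    rank = comm.Get_rank()
    path = os.path.join(MOUNT, "fpp_rank%05d.dat" % rank)
    buf = b"\0" * XFER

    comm.Barrier()                 # start all ranks together
    t0 = MPI.Wtime()
    with open(path, "wb", buffering=0) as f:
        for _ in range(COUNT):
            f.write(buf)
        os.fsync(f.fileno())       # make sure data reaches the OSTs
    comm.Barrier()                 # wait for the slowest rank
    elapsed = MPI.Wtime() - t0

    total_gib = comm.allreduce(XFER * COUNT, op=MPI.SUM) / 2.0**30
    if rank == 0:
        print("%.1f GiB in %.1f s -> %.2f GiB/s aggregate"
              % (total_gib, elapsed, total_gib / elapsed))

Launched with one MPI rank per core (for example, mpirun -np 512 python fpp_write.py), every rank streams its own file, which approximates the per-OST thread counts quoted above.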
Nagoya University Initial Data: FPP with a Large Number of Threads
[Figure: write performance of a single OST (GB/sec, 0-1.4) versus the number of processes per OST (0-250).]
Large I/O Patches
[Figure: two panels showing Lustre backend performance degradation per OST, as a percentage of peak performance, versus the number of processes (0-700), for writes (left) and reads (right). Series: 7.2K SAS with 1 MB RPCs, 7.2K SAS with 4 MB RPCs, and SSD with 1 MB RPCs.]
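The 4 MB RPC series above comes from the large-I/O patches, which raise the maximum bulk RPC size from the default 1 MB. On a client, the bulk RPC size is governed by the osc.*.max_pages_per_rpc tunable (256 pages of 4 KiB = 1 MiB; 1024 pages = 4 MiB). A small sketch for reporting the effective RPC size per OSC follows; the 4 KiB page size is an assumption, and lctl must be available on the client.

    # Sketch: report the effective Lustre bulk RPC size for each OSC device.
    import subprocess

    PAGE_SIZE = 4096   # assumed client page size in bytes

    out = subprocess.run(
        ["lctl", "get_param", "osc.*.max_pages_per_rpc"],
        capture_output=True, text=True, check=True,
    ).stdout

    for line in out.splitlines():
        if "=" not in line:
            continue
        name, pages = line.split("=", 1)
        print("%s: %.0f MiB bulk RPCs"
              % (name.strip(), int(pages) * PAGE_SIZE / 2.0**20))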
Raw Device Performance: Write
[Figure: sgpdd-survey write throughput (MB/sec, 0-1200) versus total number of threads for Seagate ES3 7.2K NL-SAS (left) and Hitachi 7.2K NL-SAS (right), both RAID6, for concurrent-region counts crg=1 through crg=256.]
Raw Device Performance: Read
[Figure: sgpdd-survey read throughput (MB/sec, 0-1400) versus total number of threads for Seagate ES3 7.2K NL-SAS (left) and Hitachi 7.2K NL-SAS (right), both RAID6, for concurrent-region counts crg=1 through crg=256.]
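The crg=1..256 series in the two figures above are concurrent-region counts from sgpdd-survey (part of lustre-iokit), which measures raw block-device throughput underneath Lustre. A sketch of driving such a sweep from Python follows; the environment-variable names and device paths are assumptions based on common lustre-iokit usage and should be checked against the installed version.

    # Sketch: run an sgpdd-survey sweep over concurrent regions and threads.
    # The variable names (scsidevs, size, crglo/crghi, thrlo/thrhi, rslt)
    # are assumed from typical lustre-iokit documentation.
    import os
    import subprocess

    env = dict(
        os.environ,
        scsidevs="/dev/sg2 /dev/sg3",  # hypothetical sg devices for the RAID6 LUNs
        size="8192",                   # MB transferred per device
        crglo="1", crghi="256",        # sweep concurrent regions 1..256
        thrlo="1", thrhi="256",        # sweep thread counts 1..256
        rslt="/tmp/sgpdd_survey",      # prefix for result files
    )
    subprocess.run(["sgpdd-survey"], env=env, check=True)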
Output Data from a Simulation
• Requirements
– Random reads against multiple large files
– 2 million 4K random read IOPS
• Solution
– Lustre file system with 16 OSS servers
– Two SFA12K (or one SFA12KXi)
– 40 SSDs as Object Storage Devices
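As a rough illustration of the required access pattern (not the acceptance benchmark itself), the sketch below issues 4 KiB pread() calls at random offsets within one large file and reports single-thread IOPS. The file path and run time are assumptions; the real workload ran file-per-process and single-shared-file variants from many clients under POSIX and MPI-IO.

    # Sketch of a 4 KiB random-read pattern against one large file on Lustre.
    import os
    import random
    import time

    PATH = "/lustre/sim/output_000.dat"  # hypothetical large simulation output file
    BLOCK = 4096
    DURATION = 10.0                      # seconds to run

    fd = os.open(PATH, os.O_RDONLY)
    blocks = os.fstat(fd).st_size // BLOCK

    ops = 0
    t0 = time.time()
    while time.time() - t0 < DURATION:
        # note: without O_DIRECT the client page cache may serve repeated blocks
        os.pread(fd, BLOCK, random.randrange(blocks) * BLOCK)
        ops += 1
    os.close(fd)

    print("%.0f random 4 KiB read IOPS (single thread)" % (ops / (time.time() - t0)))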
Lustre 4K Random Read IOPS
Configuration: 4 OSSs, 10 SSDs, 16 clients
[Figure: Lustre 4K random read performance (0-600,000 IOPS) for client counts from 1 node/32 processes up to 16 nodes/512 processes, comparing FPP and SSF workloads under POSIX and MPI-IO.]
Genomics Workflows
• Mixed workflows
– Ingest
– Sequencing pipelines: large-file I/O
– Analytics workflows: mixed I/O
• Various I/O issues
– Random reads for reference data
– Small-file random reads
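As an illustration of the small-file side of such analytics workloads, the sketch below reads a directory of small reference files in random order and reports files per second. The directory path is an assumption for illustration only.

    # Sketch of a small-file random-read pattern: open, read, close many
    # small files in random order, as an analytics stage might.
    import os
    import random
    import time

    REF_DIR = "/lustre/genomics/refs"   # hypothetical directory of small reference files

    files = [os.path.join(REF_DIR, name) for name in os.listdir(REF_DIR)]
    random.shuffle(files)

    t0 = time.time()
    nbytes = 0
    for path in files:
        with open(path, "rb") as f:     # open/read/close cost dominates for small files
            nbytes += len(f.read())
    elapsed = time.time() - t0

    print("%d files, %.1f MiB in %.1f s (%.0f files/s)"
          % (len(files), nbytes / 2.0**20, elapsed, len(files) / elapsed))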
Random Reads with SSDs
Data Analysis “Workflows”
• Scientific data analysis
– Genomics workflows
– Seismic data analysis
– Various types of accelerators
– Large scientific instruments in astronomy
– Remote sensing and environmental monitoring
– Microscopy
Data Analysis “Workflows”
• Additional topics
– Data ingest
– Data management and data retention
– Data distribution and data sharing
Hyperscale Storage: HPC, Cloud, Data Analysis
High Performance Computing
– Mostly (very) large files (GBs)
– Mostly write I/O performance
– Mostly streaming performance
– 10s of petabytes of data
– Scratch data
– 100,000s of cores
– Mostly InfiniBand
– Single location
– Very limited replication factor
– High efficiency

Cloud Computing
– Small and medium-size files (MBs)
– Mostly read I/O performance
– Mostly transactional performance
– 10s of billions of files
– WORM & WORN
– 10s of millions of cores
– Almost exclusively Ethernet
– Highly distributed data
– High replication factor
– Low efficiency

Data analysis workflows sit between these two profiles.
A (DDN) Vision for Lustre
• Maximum sequential and transactional performance per storage sub-system CPU
• Caching at various layers within the data path
• Increased single node streaming and small file performance
• Millions of metadata operations in a single FS
• Millions of random (read) IOPS within a single FS
A (DDN) Vision for Lustre cont.
• Data management features, including (cloud) tiering, fast and efficient data back-up, and data lifecycle management
• Novel usability features such as cluster integration, QoS, directory-level quota, etc.
• Extremely high backend reliability for small and mid-sized systems
Futures for Lustre?
• Work Closely with Users
– User problems are the best source for future direction
– Translate user problems into roadmap priorities
• Work Closely with the Lustre Community
– Work very closely with OpenSFS and Intel HPDD on Lustre roadmap priorities and various other topics