Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear Security Administration under contract DE-AC04-94AL85000.
Failing in Place for Low-Serviceability Infrastructure Using High-Parity GPU-Based RAID

Matthew L. Curry, University of Alabama at Birmingham and Sandia National Laboratories, mlcurry@sandia.gov
H. Lee Ward, Sandia National Laboratories, lee@sandia.gov
Anthony Skjellum, University of Alabama at Birmingham, tony@cis.uab.edu
Introduction
• RAID increases the speed and reliability of disk arrays
• Mainstream technology limits fault tolerance to two arbitrary failed disks in a volume at a time (RAID 6)
• Always-on, busy systems that have no service periods are at increased risk of failure
  – Hot spares can decrease rebuild time, but leave installations vulnerable for days at a time
• High-parity GPU-based RAID can decrease the long-term risk of data loss
Motivation: Inaccurate Disk Failure Statistics
• Mean Time To Data Loss estimates for RAID often assume manufacturer statistics are accurate
  – Disks are 2-10x more likely to fail than manufacturers estimate*
  – Estimates assume low replacement latency and fast rebuilds… without load

*Bianca Schroeder and Garth A. Gibson. “Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you?” In Proceedings of the 5th USENIX Conference on File and Storage Technologies, Berkeley, CA, USA, 2007. USENIX Association.
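Not on the original slide, but to make the sensitivity concrete: under the common textbook model of independent, exponentially distributed failures, the MTTDL of an N-disk RAID 5 group is roughly

```latex
% Back-of-the-envelope MTTDL for an N-disk RAID 5 group, assuming
% independent exponential failures (data loss = a second disk failing
% while the first is still being rebuilt):
\[
  \mathrm{MTTDL}_{\mathrm{RAID5}} \;\approx\; \frac{\mathrm{MTTF}^2}{N\,(N-1)\,\mathrm{MTTR}}
\]
% MTTF enters squared and MTTR only linearly, so a 2-10x optimistic
% MTTF combined with a long, loaded rebuild (large MTTR) can overstate
% MTTDL by one to two orders of magnitude.
```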
Motivation: Unrecoverable Read Errors

[Figure: probability of success (no error) vs. terabytes read (0-99 TB), plotted for bit error rates of 10^-14, 10^-15, and 10^-16.]
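The shape of these curves follows from a simple independence model (my arithmetic, not the slide's): each bit read fails with probability equal to the BER, so

```latex
% Probability of reading B bytes without an unrecoverable read error:
\[
  P_{\mathrm{success}} \;=\; (1 - \mathrm{BER})^{8B} \;\approx\; e^{-8B \cdot \mathrm{BER}}
\]
% Example: at BER = 10^{-14}, reading 12.5 TB (= 10^{14} bits) gives
% P \approx e^{-1} \approx 0.37, so a rebuild that scans tens of
% terabytes is more likely than not to hit an unrecoverable sector.
```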
Result: Hot Spares Are Inadequate Under Load

[Figure: probability of data loss within ten years (log scale, 1E-04 to 1E+00) vs. data capacity (TB, using 2TB disks, 0-300) for RAID 5+0 (4 sets), RAID 6+0 (4 sets), and RAID 1+0 (3-way replication). Assumptions: MTTR = one week, MTTF = 100,000 hours, lifetime = 10 years.]
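As a rough sanity check on these curves (my own arithmetic, not the authors' model): with the stated parameters, a single one-week rebuild of a RAID 5 set already carries a meaningful chance of a second failure:

```latex
% Chance that one of the n-1 surviving disks in a RAID 5 set fails
% during a rebuild window of length MTTR (exponential failure model):
\[
  P_{\mathrm{2nd\ failure}} \;\approx\; 1 - e^{-(n-1)\,\mathrm{MTTR}/\mathrm{MTTF}}
\]
% Example (hypothetical 8-disk set): with MTTR = 168 h and
% MTTF = 100,000 h, P \approx 1 - e^{-7 \cdot 168/100000} \approx 1.2\%
% per rebuild, and rebuilds recur across all sets over the ten-year
% lifetime.
```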
Solution: High-Parity RAID
• Use Reed-Solomon coding to provide much higher fault tolerance
  – Configure the array to tolerate the failure of any m disks, where m can vary widely
  – Analogue: RAID 5 has m=1, RAID 6 has m=2, RAID 6+0 also has m=2
  – Spare disks, instead of remaining idle, participate actively in the array, eliminating the window of vulnerability
• This is very computationally expensive (see the sketch after this list)
  – k+m RAID, where k is the number of data blocks per stripe and m is the number of parity blocks per stripe, requires O(m) operations per byte written
  – The operations are table look-ups, so there is no vector-based parallelism on x86/x86-64
  – An array with failed disks operates in degraded mode, which imposes the same computational load on reads as on writes
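To make the cost model concrete, here is a minimal host-side sketch (my own illustration, not the authors' code) of table-based k+m Reed-Solomon encoding over GF(2^8); the inner loop performs one table look-up and one XOR per (parity row, data byte) pair, i.e. O(m) work per byte written:

```cuda
// Minimal sketch (not the authors' code) of k+m Reed-Solomon parity
// generation over GF(2^8) using log/antilog look-up tables.
#include <stdint.h>
#include <stddef.h>

static uint8_t gf_log[256], gf_exp[512];

// Build log/antilog tables for GF(2^8) with primitive polynomial 0x11d.
static void gf_init(void) {
    int x = 1;
    for (int i = 0; i < 255; i++) {
        gf_exp[i] = (uint8_t)x;
        gf_log[x] = (uint8_t)i;
        x <<= 1;
        if (x & 0x100) x ^= 0x11d;
    }
    for (int i = 255; i < 512; i++) gf_exp[i] = gf_exp[i - 255];
}

// Table-look-up multiply: the operation that dominates encoding time.
static inline uint8_t gf_mul(uint8_t a, uint8_t b) {
    if (a == 0 || b == 0) return 0;
    return gf_exp[gf_log[a] + gf_log[b]];
}

// Encode one stripe: parity[j] = sum_i C[j][i] * data[i] over GF(2^8),
// where C is an m x k coding matrix (e.g., derived from a Vandermonde
// matrix). Each of the len data byte positions costs m look-ups total.
void rs_encode(const uint8_t *C, int k, int m,
               uint8_t *const *data, uint8_t *const *parity, size_t len) {
    for (int j = 0; j < m; j++)
        for (size_t b = 0; b < len; b++) {
            uint8_t acc = 0;
            for (int i = 0; i < k; i++)
                acc ^= gf_mul(C[(size_t)j * k + i], data[i][b]);
            parity[j][b] = acc;
        }
}
```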
Use GPUs to Provide Fast RAID Computations
• Reed-Solomon coding accesses a look-up table
• The NVIDIA CUDA architecture supplies several banks of memory, accessible in parallel
• 1.82 look-ups per cycle per SM on average
• The GeForce GTX 285 has 30 SMs, so a single device can satisfy about 55 look-ups per cycle
  – Compare to one look-up per cycle per core on x86 processors

[Diagram: compute cores paired with shared memory banks within a streaming multiprocessor.]
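A minimal device-side sketch of this idea (my illustration under the GT200-era model described above, not the authors' kernel): stage the 256-entry multiply table for one coding coefficient in shared memory, whose banks can serve many threads' look-ups concurrently:

```cuda
// Minimal CUDA sketch (not the authors' kernel): XOR-accumulate one
// data buffer into one parity buffer using a 256-entry GF(2^8)
// multiply table staged in shared memory, where per-SM banks serve
// look-ups from many threads in parallel.
#include <stdint.h>
#include <stddef.h>

__global__ void rs_mul_xor(const uint8_t *data,    // one data disk's stripe chunk
                           uint8_t *parity,        // parity accumulator chunk
                           const uint8_t *mul_row, // global: mul_table[c][0..255]
                           size_t len)
{
    __shared__ uint8_t lut[256];   // one copy per thread block

    // Cooperatively load the 256-byte table for this coefficient c.
    for (int i = threadIdx.x; i < 256; i += blockDim.x)
        lut[i] = mul_row[i];
    __syncthreads();

    // Each thread handles a strided range of bytes: parity ^= c * data.
    for (size_t b = (size_t)blockIdx.x * blockDim.x + threadIdx.x;
         b < len;
         b += (size_t)gridDim.x * blockDim.x)
        parity[b] ^= lut[data[b]];
}
```

Launching one such pass per (parity row, data disk) pair reproduces the O(m) look-ups per byte noted on the previous slide; a production kernel would batch coefficients and process wider words, which this sketch omits.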
Performance: GeForce GTX 285 vs. Intel Extreme Edition 975
• The GPU is capable of providing high coding bandwidth, at 6x-10x CPU speeds
• A new memory layout and matrix generation algorithm for Reed-Solomon yields equivalent write and degraded-read performance

[Figure: coding throughput (MB/s, 0-5000) vs. number of data buffers (2-30), with series GPU m=2, GPU m=3, GPU m=4, CPU m=2, CPU m=3, CPU m=4.]
A User Space RAID Architecture and Data Flow
• GPU computing facilities are inaccessible from kernel space
• Provide I/O stack components in user space, accessible via iSCSI
  – Accessible via the loopback interface for direct-attached storage, as in this study
• Alternative: micro-driver
Performance Results
• 10 GB streaming read/write operations to a 16-disk array
• Volumes mounted over the loopback interface
  – The stgt version used is the bottleneck; later versions have higher performance

[Figure: read and write throughput (MB/s, 0-600) by RAID type, including GPU +2 and GPU +4 configurations.]
Advance: GPU-Based RAID
• Prototype for general-purpose RAID
  – Arbitrary parity
  – High-speed read verification
  – High-parity configuration for initial hardware deployments
    • Guards against batch-correlated failures
• Flexibility not available with hardware RAID
  – Lower-bandwidth, higher-reliability distributed data stores
Future Work
• UAB
  – Data-intensive “active storage” computation applications
  – Computation and compression within the storage stack
  – NSF-funded AS/RAID testbed (0.5 petabyte)
  – Multi-level RAID architecture, failover, trade studies
  – Actually use the storage (e.g., ~200TB of 500TB)
• Sandia
  – Active/active failover configurations
  – Multi-level RAID architectures
  – Encryption within the storage stack
• Commercialization
Conclusions
• RAID reliability is increasingly governed by the unrecoverable bit error rate (BER)
• The window of vulnerability during rebuild is dangerous and must be eliminated if the system stays busy 24x7
  – Hot spares are inadequate because of elongated rebuild times under load
• High-parity GPU RAID can eliminate the window of vulnerability without harming performance for streaming workloads
• Fast writes and degraded reads enabled by the GPU make this feasible