It's the End of Data Storage As We Know It (And I Feel Fine)

29
It's the End of Data Storage As We Know It (And I Feel Fine!) Stephen Foskett Community Organizer, Tech Field Day

description

Technological change is finally coming to storage, and it will wipe away the architecture we've come to know over the last few decades. Say goodbye to the "do it all" Fibre Channel SAN storage array and get ready for converged infrastructure, distributed storage, alternative attachments like PCIe, and top-of-rack flash! In this session, Stephen Foskett will explain why this change is inevitable and how it will shake out. You won't recognize what's coming, but it will be faster, cheaper, and more integrated than ever! Delivered at

Transcript of It's the End of Data Storage As We Know It (And I Feel Fine)

Page 1: It's the End of Data Storage As We Know It (And I Feel Fine)

It's the End of Data Storage As We Know It(And I Feel Fine!)

Stephen FoskettCommunity Organizer, Tech Field Day

Page 2: It's the End of Data Storage As We Know It (And I Feel Fine)

Outline

Technological change is finally coming to storage, and it will wipe away the architecture we've come to know over the last few decades. Say goodbye to the "do it all" Fibre Channel SAN storage array and get ready for converged infrastructure, distributed storage, alternative attachments like PCIe, and top-of-rack flash! In this session, Stephen Foskett will explain why this change is inevitable and how it will shake out. You won't recognize what's coming, but it will be faster, cheaper, and more integrated than ever!

Page 3: It's the End of Data Storage As We Know It (And I Feel Fine)

Hello! My Name Is Stephen!

I’m a storage guy…

…but I love virtualization too!

You may know me as “the Tech Field Day guy”

Or perhaps for some other crazy techie nonsense

Page 4: It's the End of Data Storage As We Know It (And I Feel Fine)

Chapter 1: Why Is Storage Like It Is?

Page 5: It's the End of Data Storage As We Know It (And I Feel Fine)

Storage Is…

Data storage – the act of saving information for later use

Storage is not (necessarily):– Disks– SCSI– Fibre Channel– RAID– Arrays

Page 6: It's the End of Data Storage As We Know It (And I Feel Fine)

Prehistoric Evolution of Storage

Tape• “We can store

stuff!”• Good sequential

throughput; non-existent random access

Disk• “Spinning rust”• Reasonable

compromise between sequential and random

Disk Array• “A bunch of disks

pretending to be one”

• Faster and redundant

Page 7: It's the End of Data Storage As We Know It (And I Feel Fine)

Three Things Storage Arrays Do Well…

Acceleration• Aggregation (wide-striping)• Caching (predictive write-

back cache)• Tiering (automated SSD

tiers)

Motion• Local copies (snapshots,

mirrors, and data movement)

• Remote copies (data replication)

Sharing• Multi-client (SAN, NAS)• Multi-protocol

(iSCSI/FC/FCoE, NFS/SMB)

Page 8: It's the End of Data Storage As We Know It (And I Feel Fine)

Ye Olde I/O Path

Server = HBA = LUNArrays can…

– Accelerate I/O by predicting and pre-filling the cache

– Move and copy data logically as a whole LUN/server

– Share data while knowing “who” is accessing it

Block-O-Matic

ProServer

ProServer

ProServer

Page 9: It's the End of Data Storage As We Know It (And I Feel Fine)

Today’s Storage Market

Capacity

Performance

SAN and NAS try to strike a balance between capacity and performance optimization

– The storage network slows performance but allows sharing– Because they are shared, arrays must offer lots of capacity

Networked StorageArrays

Page 10: It's the End of Data Storage As We Know It (And I Feel Fine)

Chapter 2: How Is Storage Changing?

Page 11: It's the End of Data Storage As We Know It (And I Feel Fine)

RAID Can’t Keep Up

RAID is inflexible

RAID is bad at math

RAID has no (data) integrity

Page 12: It's the End of Data Storage As We Know It (And I Feel Fine)

Block Storage Is Stupid*

Object = Data in Databases

File = Remote Directories

Block = Fake Disks

• Simple high-level protocols for Create, Read, Update, Delete

• Independent of data location, protection, hardware

• NAS and file servers handle file translation and organization

• Data access uses directory location, filename, offset

• Filesystem (driver in the computer) locates files

• Protocols: SATA, SCSI, Fibre Channel, iSCSI, FCoE, USB, FireWire, thumb drive, etc

*I’m being completely serious

Page 13: It's the End of Data Storage As We Know It (And I Feel Fine)

Moving Beyond Blocks

Operating systems already speak “file”

– LAN Manager, SMB/CIFS, NFSHypervisors now speak “file”

– VMware = NFSv3– Hyper-V = SMB3

Applications speak “file” or “object”

– File = POSIX, Windows APIs, etc– Object = Amazon S3, etc

Page 14: It's the End of Data Storage As We Know It (And I Feel Fine)

What Does Virtualization Do?

Server ≠ HBA ≠ LUNArrays see a random stream of data

– Acceleration is limited to write-back and “most-recently used” caching

– Moves and copies of whole LUNs are less useful

– Shared access leads to locking conflicts

Block-O-Matic

VM Guest

VM Guest

VM Guest

Hypervisor

Page 15: It's the End of Data Storage As We Know It (And I Feel Fine)

“The I/O Blender” Demands New Architectures

Server virtualization throws block I/O into a blender: All I/O is now random!

– Caching pre-fetch is confounded

– Granular movement is impossible

– Shared storage is stymied

Page 16: It's the End of Data Storage As We Know It (And I Feel Fine)

What About NFS and SMB?

File I/O to the ArrayArrays get better information but must be specially designed to act on it

– Thin provisioning and acceleration can work better

– No locking issues– VAAI and VSS may allow per-file

movement

File-O-Matic

VM Guest

VM Guest

VM Guest

Hypervisor

Page 17: It's the End of Data Storage As We Know It (And I Feel Fine)

Array Integration APIs

Hypervisor <-> ArrayAPIs are a partial solution

– VMware VAAI = vSphere offloaded copy, snapshots, thin provisioning

– Microsoft ODX = Windows Server 2012 and Hyper-V offloaded copy & thin provisioning

– VMware VASA enhances vSphere automation

Block-O-Matic

VM Guest

VM Guest

VM Guest

Hypervisor

VAAI

ODX

VASA

Page 18: It's the End of Data Storage As We Know It (And I Feel Fine)

The Solid-State Storage Fairy

Solid State Storage is appearing everywhere!

– SSDs in servers and arrays

– PCIe cards in servers and arrays

– Dedicated appliances and arrays

Solid state storage can be used in many different ways

– Read-only cache

– Read/write cache

– Tiered storage arrays

– All-solid state arrays

Some use NAND flash, others use DRAMStor-O-Matic

ProServer

SAN/LAN

SSD SSDSuper-SSD SSD

SSD

SSD

SSD

Page 19: It's the End of Data Storage As We Know It (And I Feel Fine)

How Fast Is It?

USB2 Drive

SATA HDD

SATA SSD

PCIe Drive

Memory

File Copy

Windows

Server

Rack

Datacenter

Page 20: It's the End of Data Storage As We Know It (And I Feel Fine)

Chapter 3: What Will Storage Look Like?

Page 21: It's the End of Data Storage As We Know It (And I Feel Fine)

“Software-Defined”?

“Computer”• People use computers• Hardware-oriented,

“hold it in your hands”• Complex, user-friendly

interfaces

“Server”• Computers use servers• Operating system and

hypervisor-focused• Standards-based

protocol interfaces

“Platform”• Applications use

platforms• Software-only, “bits

and bytes”• Application

programming interfaces (APIs)

Page 22: It's the End of Data Storage As We Know It (And I Feel Fine)

Virtualizing the Controller

The whole storage array can be a virtual machineStorage arrays can even run virtual machines

Block-O-Matic

Front-End I/O

ProServer

Back-End I/O

Hypervisor

Page 23: It's the End of Data Storage As We Know It (And I Feel Fine)

Distributed Storage

Use storage virtualization software to combine local storage resourcesDistribute data intelligently

– Across devices for reliability– Tiered flash + disk

Scale with clientsNo expensive SAN or storage network needed!

ProServer

ProServer

ProServer

Shared Storage:Distribution and

Protection

Page 24: It's the End of Data Storage As We Know It (And I Feel Fine)

The Return of Local Storage

Block-O-Matic

Front-End I/O

ProServer

Back-End I/O

Windowsor

HypervisorProServer

ProServer

ProServer

Shared Storage:Distribution and

Protection

Page 25: It's the End of Data Storage As We Know It (And I Feel Fine)

Where Should Features Live?

Software

Hardware

We need reliability in software or hardware, but not necessarily both

– If operating systems don’t provide reliability, scalability, and manageability, then server hardware must

– Server virtualization can add these features, so you don’t necessarily need them in hardware!

“The marginal cost of reliable hardware is linear while the marginal cost of reliable software is zero.” – Sam Johnston

Page 26: It's the End of Data Storage As We Know It (And I Feel Fine)

The Distributed Storage Future

Capacity

Performance

Storage is moving out of the network and closer to the servers– Software manages data sharing– Low-latency connections allow much greater performance– Low-cost JBOD is for bulk storage

Distributed Flash

Distributed JBOD

Page 27: It's the End of Data Storage As We Know It (And I Feel Fine)

Disaggregated Storage

ProServer

ProServer

ProServer

ProServer

ProServer

ProServer

ProServer

Top-of-rack performanceSpeedy high-performance storage:• InfiniBand• PCIe flashhigh cost,high performance

Bottom-of-rack capacityScaly capacity-oriented storage:• SAS JBOD• Object store• Cloud gatewaylow cost,low performance

Tied together with

software!

Flashy!

Scaly!

Page 28: It's the End of Data Storage As We Know It (And I Feel Fine)

Thank You!

Stephen [email protected]/sfoskett

blog.fosketts.netTechFieldDay.com

Page 29: It's the End of Data Storage As We Know It (And I Feel Fine)