Title of Presentation - SNIA · Title: Title of Presentation Author: khauser Created Date:...

27
2019 Storage Developer Conference. © NGD Systems, Inc. All Rights Reserved. 1 Computational Storage Architectural Discussion Scott Shadley NGD Systems Co-Chair Computational Storage TWG

Transcript of Title of Presentation - SNIA · Title: Title of Presentation Author: khauser Created Date:...

2019 Storage Developer Conference. © NGD Systems, Inc. All Rights Reserved.1

Computational Storage

Architectural Discussion

Scott Shadley

NGD Systems

Co-Chair Computational Storage TWG

2019 Storage Developer Conference. © NGD Systems, Inc. All Rights Reserved.2

Today’s Learning Opportunities (TLO).

▪ EDGE needs CSD in M.2, EDSFF

▪ Architecture Our Way – CSD with PCSS

▪ AI – ML – CSD – the Overlap

▪ Hadoop & DB – CSD – Growth, Scale

But Don’t Take Our Word For it.Data, Data, Data.

Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC193

What Are You Doing with Your Data Today?

It’s No Longer Black and White.Source: Gartner - Bittman

Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC194

What is Driving Our Data Analytics Issues?

Weeding through the Noise at the Edge

Source: Gartner - Bittman

Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC195

From Edge Sensors to Centralized Cloud

Cloud

Centralized CloudFarthest from the network edgeHigh density of compute, storageand network resources

InfrastructureSmall distributed data centersLow roundtrip latency 5 – 10ms

Edge DevicesReal-time data processing

Edge Sensors &ChipData Collection & Origination

Computational Storage opportunities exist throughout the distributed compute environment

Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC196

Innovative Computational Storage Uses.

FACIAL

RECOGNITION

MACHINE

LEARNING

HPC

CONTENT

DELIVERY

SMART CITIES

AUTONOMOUS

DRIVING

IN-SITU

PROCESSING

security

(de) compressionoff-load

objecttracking

Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC197

Highest Capacity, Lowest Power.

Industry leading W/TB

Industry’s Only 16-Channel M.2

Industry’s Largest Capacity U.2

Form Factor Capacity (TB) MAX Power (W)

M.2 22110 Up to 8 8

EDSFF E1.S Up to 16 12

EDSFF E1.L Up to 32 12

U.2 15mm Up to 32 12

AiC FHTQL Up to 64 15

8TB/8WM.2

32TB/12W

16TB/12WE1.S

Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC198

2019 Storage Developer Conference. © NGD Systems, Inc. All Rights Reserved.9

Today’s Learning Opportunities (TLO).

▪ EDGE needs CSD in Compact Form Factor

▪ Architecture One Way – CSD with PCSS

▪ AI – ML – CSD – the Overlap

▪ Hadoop & DB – CSD – Growth, Scale

Complete Solution and Disruptive Technology.1st FULLY INTEGRATED COMPUTATIONAL STORAGE SOLUTION

Industry’s First 14nm

Modular firmware

Efficient algorithm

Flash characterization

SoC Controller Management Optimized Hardware

“In-Situ Processing” Computational Storage Stack

• Full fledged on drive OS

• Quad-core 64-bit application processor

• Light virtualization

• Hardware acceleration

Linux

Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC1910

The Scalable, ASIC-based Computational Storage Drive.

Needed Key Attributes:

• Use standard protocols (NVMe)

• Minimize data movement (Faster Response, Lower W/TB)

• Improve (TB/in3) with maximize (Customer TCO)

Host

Platform

Standard NVMe Protocol

AI

Core Solution Stack:

Moving Compute to Data

An enterprise class device capable of processing workloads in storage at the source

Linux

Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC1911

Computational Storage DriveComputational Storage Drive

Computational Storage Drive

Why Not Work on it There?

Traditional SSD Solutions

The Data Lives on Storage.

Computational Storage Drive

NVMe

DRAM

host API

application

host OS

drive OS

app

Armquad-core shared

NAND media

media controller

Server Host Processor Complex

Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC1912

2019 Storage Developer Conference. © NGD Systems, Inc. All Rights Reserved.13

Today’s Learning Opportunities (TLO).

▪ EDGE needs CSD in Compact Form Factor

▪ Architecture One Way – CSD with PCSS

▪ AI – ML – CSD – the Overlap

▪ Hadoop & DB – CSD – Growth, Scale

The

Lead

er in

Co

mp

uta

tio

nal

Sto

rage

So

luti

on

s The

Lead

er in

Co

mp

utatio

nal Sto

rage Solu

tion

s

No Host Interaction Required

Using Computational Storage Drives for ML.

MobileNet Object Classification

TensorFlow & Keras API

Weightless Neural Networks Used for Object Tracking.

Moving Beyond Traditional Models.

• Parallel & distributed Training in Computational Storage

• Federated/Transfer Learning

• Reduce data transfers by sending sparse model updates

ML Training with Traditional Approach.

Load Data

ML Training withComputational Storage.

... ...

Load Data

...

Train

ML Training withTraditional Approach.

ML Training withComputational Storage.

• No data movement

• No host CPU needed

• Distributed training

...

Load Data

...

Train

Evaluate

Update

ML Training withTraditional Approach.

ML Training withComputational Storage.

• Host CPU still needed

• No Parallelism

...

...

Load Data

Train

Evaluate

Update

Deploy

Train

Evaluate

Update

Repeat Steps

ML Training withTraditional Storage.

ML Training withComputational Storage.

...

Federated/Transfer Learning.

60,000 samplesFrom the training set

94% accuracyWith only 4 partial model updates

61 updatesModel updates transferred

Precision&

Generalization

MNIST DATASET

The

Lead

er in

Co

mp

uta

tio

nal

Sto

rage

So

luti

on

s The

Lead

er in

Co

mp

utatio

nal Sto

rage Solu

tion

s

The Leader in Computational Storage Solutions

Constant Updates to Training Model

.

.

.

Computational Storage Cores Running WiSARD.

2019 Storage Developer Conference. © NGD Systems, Inc. All Rights Reserved.23

Today’s Learning Opportunities (TLO).

▪ EDGE needs CSD in Compact Form Factor

▪ Architecture One Way – CSD with PCSS

▪ AI – ML – CSD – the Overlap

▪ Hadoop & DB – CSD – Growth, Scale

Amplifying TCO for Hadoop

Datanode Config:Single E5-2620v4, 32GB DRAM, 12*8 TB SAS HDD

18U Total Density in 18U = 864TB

Datanode Config:Single E5-2620v4, 32GB DRAM, 36*8TB NVMe

3U Total Density in 3U = 864TB 432 Additional Drive Cores

@ ScaleSaves Power!Saves Space!Saves Time!

18

23

28

33

38

0 2 4 6 9 12

Ener

gy (

KJ)

Number of Computational Storage Devices

Terasort energy consumption

16-core host

4-core host

0.5

0.6

0.7

0.8

0.9

1

1.1

1.2

1.3

1.4

1.5

0 2 4 6 9 12

Perf

orm

ance

(n

orm

aliz

ed)

Number of Computational Storage Devices

Terasort performance

16-core host

4-core host

16-core host

4-core host

Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC1924

Using MongoDB within Computational Storage.

Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC1925

• The “New Cloud” needs the Distributed Edge

o There is no longer just a ‘central’ storage location

• Edge data growth challenges HW platforms

o Innovative form factors and high capacity for the Edge

• In-Situ Processing brings ML closer to data

o Exploit data locality and enable distributed processing

Scalable Computational Storage.

A New Storage Paradigm is Here

2019 Storage Developer Conference. © NGD Systems, Inc. All Rights Reserved.27

Computational [email protected]

www.NGDSystems.com

TW: @SMShadley

@NGDSystems