Title of Presentation - SNIA · Title: Title of Presentation Author: khauser Created Date:...
Transcript of Title of Presentation - SNIA · Title: Title of Presentation Author: khauser Created Date:...
2019 Storage Developer Conference. © NGD Systems, Inc. All Rights Reserved.1
Computational Storage
Architectural Discussion
Scott Shadley
NGD Systems
Co-Chair Computational Storage TWG
2019 Storage Developer Conference. © NGD Systems, Inc. All Rights Reserved.2
Today’s Learning Opportunities (TLO).
▪ EDGE needs CSD in M.2, EDSFF
▪ Architecture Our Way – CSD with PCSS
▪ AI – ML – CSD – the Overlap
▪ Hadoop & DB – CSD – Growth, Scale
But Don’t Take Our Word For it.Data, Data, Data.
Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC193
What Are You Doing with Your Data Today?
It’s No Longer Black and White.Source: Gartner - Bittman
Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC194
What is Driving Our Data Analytics Issues?
Weeding through the Noise at the Edge
Source: Gartner - Bittman
Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC195
From Edge Sensors to Centralized Cloud
Cloud
Centralized CloudFarthest from the network edgeHigh density of compute, storageand network resources
InfrastructureSmall distributed data centersLow roundtrip latency 5 – 10ms
Edge DevicesReal-time data processing
Edge Sensors &ChipData Collection & Origination
Computational Storage opportunities exist throughout the distributed compute environment
Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC196
Innovative Computational Storage Uses.
FACIAL
RECOGNITION
MACHINE
LEARNING
HPC
CONTENT
DELIVERY
SMART CITIES
AUTONOMOUS
DRIVING
IN-SITU
PROCESSING
security
(de) compressionoff-load
objecttracking
Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC197
Highest Capacity, Lowest Power.
Industry leading W/TB
Industry’s Only 16-Channel M.2
Industry’s Largest Capacity U.2
Form Factor Capacity (TB) MAX Power (W)
M.2 22110 Up to 8 8
EDSFF E1.S Up to 16 12
EDSFF E1.L Up to 32 12
U.2 15mm Up to 32 12
AiC FHTQL Up to 64 15
8TB/8WM.2
32TB/12W
16TB/12WE1.S
Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC198
2019 Storage Developer Conference. © NGD Systems, Inc. All Rights Reserved.9
Today’s Learning Opportunities (TLO).
▪ EDGE needs CSD in Compact Form Factor
▪ Architecture One Way – CSD with PCSS
▪ AI – ML – CSD – the Overlap
▪ Hadoop & DB – CSD – Growth, Scale
Complete Solution and Disruptive Technology.1st FULLY INTEGRATED COMPUTATIONAL STORAGE SOLUTION
Industry’s First 14nm
Modular firmware
Efficient algorithm
Flash characterization
SoC Controller Management Optimized Hardware
“In-Situ Processing” Computational Storage Stack
• Full fledged on drive OS
• Quad-core 64-bit application processor
• Light virtualization
• Hardware acceleration
Linux
Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC1910
The Scalable, ASIC-based Computational Storage Drive.
Needed Key Attributes:
• Use standard protocols (NVMe)
• Minimize data movement (Faster Response, Lower W/TB)
• Improve (TB/in3) with maximize (Customer TCO)
Host
Platform
Standard NVMe Protocol
AI
Core Solution Stack:
Moving Compute to Data
An enterprise class device capable of processing workloads in storage at the source
Linux
Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC1911
Computational Storage DriveComputational Storage Drive
Computational Storage Drive
Why Not Work on it There?
Traditional SSD Solutions
The Data Lives on Storage.
Computational Storage Drive
NVMe
DRAM
host API
application
host OS
drive OS
app
Armquad-core shared
NAND media
media controller
Server Host Processor Complex
Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC1912
2019 Storage Developer Conference. © NGD Systems, Inc. All Rights Reserved.13
Today’s Learning Opportunities (TLO).
▪ EDGE needs CSD in Compact Form Factor
▪ Architecture One Way – CSD with PCSS
▪ AI – ML – CSD – the Overlap
▪ Hadoop & DB – CSD – Growth, Scale
The
Lead
er in
Co
mp
uta
tio
nal
Sto
rage
So
luti
on
s The
Lead
er in
Co
mp
utatio
nal Sto
rage Solu
tion
s
No Host Interaction Required
Using Computational Storage Drives for ML.
MobileNet Object Classification
TensorFlow & Keras API
Moving Beyond Traditional Models.
• Parallel & distributed Training in Computational Storage
• Federated/Transfer Learning
• Reduce data transfers by sending sparse model updates
Load Data
...
Train
ML Training withTraditional Approach.
ML Training withComputational Storage.
• No data movement
• No host CPU needed
• Distributed training
...
Load Data
...
Train
Evaluate
Update
ML Training withTraditional Approach.
ML Training withComputational Storage.
• Host CPU still needed
• No Parallelism
...
...
Load Data
Train
Evaluate
Update
Deploy
Train
Evaluate
Update
Repeat Steps
ML Training withTraditional Storage.
ML Training withComputational Storage.
...
Federated/Transfer Learning.
60,000 samplesFrom the training set
94% accuracyWith only 4 partial model updates
61 updatesModel updates transferred
Precision&
Generalization
MNIST DATASET
The
Lead
er in
Co
mp
uta
tio
nal
Sto
rage
So
luti
on
s The
Lead
er in
Co
mp
utatio
nal Sto
rage Solu
tion
s
The Leader in Computational Storage Solutions
Constant Updates to Training Model
.
.
.
Computational Storage Cores Running WiSARD.
2019 Storage Developer Conference. © NGD Systems, Inc. All Rights Reserved.23
Today’s Learning Opportunities (TLO).
▪ EDGE needs CSD in Compact Form Factor
▪ Architecture One Way – CSD with PCSS
▪ AI – ML – CSD – the Overlap
▪ Hadoop & DB – CSD – Growth, Scale
Amplifying TCO for Hadoop
Datanode Config:Single E5-2620v4, 32GB DRAM, 12*8 TB SAS HDD
18U Total Density in 18U = 864TB
Datanode Config:Single E5-2620v4, 32GB DRAM, 36*8TB NVMe
3U Total Density in 3U = 864TB 432 Additional Drive Cores
@ ScaleSaves Power!Saves Space!Saves Time!
18
23
28
33
38
0 2 4 6 9 12
Ener
gy (
KJ)
Number of Computational Storage Devices
Terasort energy consumption
16-core host
4-core host
0.5
0.6
0.7
0.8
0.9
1
1.1
1.2
1.3
1.4
1.5
0 2 4 6 9 12
Perf
orm
ance
(n
orm
aliz
ed)
Number of Computational Storage Devices
Terasort performance
16-core host
4-core host
16-core host
4-core host
Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC1924
Using MongoDB within Computational Storage.
Sept 26th, 2019NGD Systems, Inc. - Presented at SNIA SDC1925
• The “New Cloud” needs the Distributed Edge
o There is no longer just a ‘central’ storage location
• Edge data growth challenges HW platforms
o Innovative form factors and high capacity for the Edge
• In-Situ Processing brings ML closer to data
o Exploit data locality and enable distributed processing
Scalable Computational Storage.
A New Storage Paradigm is Here
2019 Storage Developer Conference. © NGD Systems, Inc. All Rights Reserved.27
Computational [email protected]
www.NGDSystems.com
TW: @SMShadley
@NGDSystems