Post on 25-Jan-2017
New performance benchmarks over 40 billion rows
Bill Maimone, Head of Engineering, MapD
Mazhar Memon, CTO, Bitfusion
Jerry Gutierrez, Global HPC Sales Leader, IBM Cloud
© 2016 IBM Corporation
IBM Cloud/SoftLayer Key Differentiators for GPU Accelerated Computing
• Virtual/bare metal servers with hourly or monthly billing
• The latest Intel CPUs and NVIDIA GPUs
• On-demand provisioning
• Triple-network architecture
• Private-network-only server deployments, private VLAN
• Unmetered private network
• Flash-based NetApp Performance/Endurance storage
• Enterprise-grade encryption
IBM Cloud - SoftLayer: Global reach with local presence
Data centers near every major metro area, enabling low-latency connectivity to cloud infrastructure.
Hourly GPU Servers Now Available!
MapD
• Analyzing increasingly massive datasets is critical
• Ability to scale past a single node
• Need access to the latest GPUs
• Did not want to own or build infrastructure
• Worked with IBM Cloud and very quickly came up with a compelling solution
The data explosion is just beginning
Source: IDC and EMC Digital Universe Report
Confidential & Proprietary
MapD
MapD Analytic Database
SQL-based column store, written from the ground up for GPUs

MapD Immerse
React.js/d3 charts & dashboards, with GPU rendering where it matters
https://www.mapd.com/blog/2016/06/27/crushing-the-billion-row-taxi-data-benchmark/
The Dataset & Queries
1. Q001: 'select count(*) from flights2'
2. Q002: 'select carrier_name, count(*) from flights2 group by carrier_name'
3. Q003: 'select carrier_name, avg(arrdelay) from flights2 group by carrier_name'
US Flight Data from 1987 to 2008. Total dataset is 128M rows and was replicated 312 times.
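The scale claim checks out with simple arithmetic; a quick sketch using the base row count and replication factor from this slide:

```python
# Replicated US flight dataset: 128M base rows copied 312 times
base_rows = 128_000_000
copies = 312

total_rows = base_rows * copies
print(f"{total_rows:,} rows")  # prints "39,936,000,000 rows", roughly 40 billion
```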
Querying 40 billion rows in milliseconds.
[Diagram: nine separate nodes, each with four GPUs. GPU capacity is limited by the node.]

Existing methods add maintenance and development costs
Bitfusion Boost: GPU Remote Virtualization
[Diagram: four server stacks, each with hardware, VM hypervisor, drivers, operating system, and SDI layers; the user space on one node hosts the application, its libraries, core functions, and open/custom APIs]
• Binary-level API interception
• Distribute work across local and remote machines
[System view: application on a local server, with GPUs on remote servers]
• Data and compute pipelining
• Advanced caching and data directories
• Auto service discovery, metering
• Function redirection for advanced coprocessors
• Supports the latest CUDA features, including unified memory
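Boost performs this interception at the binary level against the CUDA APIs. As a loose analogy only (the names `intercept`, `local_exec`, and `remote_exec` are illustrative, not Bitfusion's API), the redirection idea can be sketched in Python by wrapping each call and routing it to a local or remote executor:

```python
import functools

# Hypothetical executors: a real system would marshal arguments over the
# network to a GPU server; here they just tag where the call ran.
def local_exec(fn, *args):
    return ("local", fn(*args))

def remote_exec(fn, *args):
    return ("remote", fn(*args))

def intercept(route):
    """Wrap a function so every call is redirected by a routing policy."""
    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(*args):
            executor = remote_exec if route(args) else local_exec
            return executor(fn, *args)
        return wrapper
    return decorator

# Route large workloads to the remote GPU pool, small ones locally.
@intercept(route=lambda args: args[0] > 1000)
def vector_add_size(n):
    return n  # stand-in for launching a kernel over n elements

print(vector_add_size(10))    # ('local', 10)
print(vector_add_size(5000))  # ('remote', 5000)
```

The application code never changes; only the dispatch layer decides where the work runs, which is the property that lets Boost pool remote GPUs transparently.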
[Diagram: a CPU-only node (48 cores, 3 TB memory, 72 TB SSD storage) with logically attached GPUs; Boost virtually attaches racks of physical GPUs to form a massive virtual node]
Creating the largest virtual GPU machines on demand
Unprecedented Speed at Scale
• 40 billion rows on 'select carrier_name, count(*) from flights2 group by carrier_name' in 271 ms
• 147 billion records scanned per second
• 8X the number of records scanned previously
Combining a GPU-accelerated database + GPU virtualization + an optimized cloud for the fastest database query times
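The throughput figure follows directly from the numbers on this slide (dataset size from the earlier slide, elapsed time for Q002 from this one):

```python
rows = 128_000_000 * 312     # replicated flight dataset, ~40 billion rows
elapsed_s = 0.271            # 271 ms for Q002 (group-by on carrier_name)

throughput = rows / elapsed_s
print(f"{throughput / 1e9:.0f} billion rows scanned per second")
# prints "147 billion rows scanned per second"
```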
App-Specific Instance Configurations as Machine Images
Resource Pooling:
• Consolidate use of compute resources
• Increase utilization
• Lower capital costs

Resource Provisioning:
• Enforce CPU, memory, and utilization quotas
• Effect QoS policies and guarantees
• Maximize utilization and reduce costs

High Availability:
• Detect failures at the app level
• Rollback, failover, error detection
• Events for higher-level reporting

Heterogeneous Offload:
• Leverage HPC hardware
• Interpose vendor libraries
• Retarget hot functions to efficient specialized devices

Scale-out:
• Distribute and load-balance work across systems
• Scale performance on demand
• Take advantage of runtime optimizations

Advanced Profiling:
• Understand application demands on the datacenter
• Fine-grained data provides unique insight
• Precise recommendations for capacity planning
Deep Learning: Caffe, Torch, TensorFlow
Media Transcoding
Rendering
Scientific Computing
Boost: Adding a broad set of GPU features to your application
In Summary
• Enable powerful GPU super nodes with Bitfusion Boost
• 60 days of collaboration with IBM and just a week to integrate
• Unprecedented database performance with MapD
Q & A

Jerry Gutierrez, Global HPC Sales Leader
jegutierrez@us.ibm.com
www.softlayer.com/gpu

Bill Maimone, MapD VP Engineering
bill@mapd.com
www.mapd.com

Mazhar Memon, CTO, Bitfusion
mazhar@bitfusion.io
www.bitfusion.io