POWER of EMC ISILON To Transform eResearch - … · ... Rapid introduction of the latest...

33
1 © Copyright 2013 EMC Corporation. Company Confidential, do not distribute without explicit permission The POWER of EMC ISILON To Transform eResearch Greg Rogers, RTM EMC ISILON Australia Charles Sevior, CTO EMC ISILON Asia-Pac ISILON – CAUDIT presentation to RDSI

Transcript of POWER of EMC ISILON To Transform eResearch - … · ... Rapid introduction of the latest...

Page 1: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

1 © Copyright 2013 EMC Corporation. Company Confidential, do not distribute without explicit permission

The POWER

of EMC ISILON To Transform eResearch

Greg Rogers, RTM EMC ISILON Australia

Charles Sevior, CTO EMC ISILON Asia-Pac

ISILON – CAUDIT presentation to RDSI

Page 2: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

3 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

Agenda – Topics to be covered

1. Isilon eResearch customer references - Charles

2. Isilon NAS brief overview - Charles

3. Aspera integration into Isilon OneFS – JC

4. Hadoop analytics integration with Isilon - Charles

5. Isilon OneFS software / hardware roadmap – Charles

6. Next steps and partnerships - Greg

Page 4: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

5 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

Columbia University Center for Computational Biology and Bioinformatics (“C2B2”)

Challenge Using a traditional NAS system that struggled to support the

huge amounts of input/output demands on the 400 CPUs in our computing infrastructure

Imminently outgrowing their infrastructure

Solution X-Series, NL-Series

SmartPools

SmartQuotas

Results Now supports some 4,000 CPUs, which can handle the

heavy I/O and data analysis demands of C2B2’s research

Supports the storage needs of three additional entities within the Columbia University network

Applications Genomic Research

“After switching to Isilon, we no longer had to worry that our system couldn’t handle our research demands. We knew that we could independently scale capacity and performance, so that we buy only what we need, when we need it.”

JOHN LOWELL WOFFORD

Director IT Services

Page 5: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

6 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

3TIER Turns Big Data Into Accurate Renewable Energy Intelligence

Challenge Traditional RAID array couldn’t scale to meet the demands

of its big data workflow

Complicated management

Escalating costs

Solution X-Series

NL-Series

SmartPools

Results Single file system and point of management for multiple

performance tiers

Unifying all operations on a single, shared pool of storage

Simplifying Big Data management

Applications energy assessment and

forecasting operations

“Before Isilon, our team had to do everything from manual data migrations to mapping directory paths, which simply wasn’t sustainable for a business growing as quickly as ours. With Isilon, everything is simple. We’ve eliminated data management and storage headaches, freeing up resources to focus on the services that deliver real value to our clients.”

PAUL ENGLISH Director of IT at 3TIER

Page 6: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

7 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

RENCI Renaissance Computing Institute of the University of North Carolina

• Turning Big Data into Insight in the lab and therapy in the clinic

• Knowledge based medicine programs for epilepsy & prostate cancer

• Secure medical workspace

• Informatics for Genetic Sequencing (IGS)

• Blending traditional cluster computing with iRODS and Hadoop analytics

http://www.emc.com/collateral/white-papers/h11692-life-sciences-renci-big-data-manage-info-wp.pdf

Page 7: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

8 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

eResearch Challenges

Page 8: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

9 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

User requirements for RDSI?

Growing data sets – mostly unstructured

Reliable storage integrity / preservation

Manage migration issues – active and cold data

Leverage MapReduce and Hadoop analytics

Ease of use and simple management

Trust / Security / Immutability

Sharing / Collaboration / Cloud – REST API

Page 9: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

10 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

EMC Isilon Unique Differentiators

Page 10: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

11 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

Challenges arising from standard NAS

RAID

Aggregate

Volume

File System

•Utilisation rates vary across islands •QOS headroom •Performance considerations

= Storage Efficiency

50% - 65%

Page 11: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

12 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

Clustered Storage Layer

Intra-cluster Communication Layer

Servers/Clients

Client/Application Layer Ethernet Layer

Servers/Clients

Servers/Clients

Compute Nodes

EMC Isilon – NODE-Based Architecture

Page 12: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

13 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

Focus on the data not the storage

Page 13: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

14 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

AutoBalance: Automated data balancing across nodes reduces costs, complexity and risks for scaling storage

• AutoBalance migrates content to new storage nodes while system is online and in production

• NO manual intervention

• NO reconfiguration

• NO server or client mount point or application changes

• Eliminates “Hot Spots”

EMPTY

EMPTY

EMPTY

EMPTY

EMPTY

FULL

FULL

FULL

FULL

BALANCED

BALANCED

BALANCED

BALANCED

BALANCED

EMC Isilon – On-The-Fly Expansion

Page 14: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

15 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

Data Layout - protection and throughput

Data is striped across the nodes – Not across disks

– FEC not RAID

Data breakdown:

– 8 KB blocks

– 16 blocks per stripe unit

– 128 KB stripe width per drive

Clients File

Write

Data Stripe Unit

Data Stripe Unit

File Data stripe unit

Data stripe unit

Parity stripe unit

Page 15: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

16 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

With N+2, N+3, and N+4 protection, data is 100% available if multiple drives or nodes fail together

With n+1 protection,

data is 100% available even if one drive or one node fails

Built-in high availability clustered architecture – No RAID!

100%

100%

100%

100%

100%

100%

100%

100%

FAILED

FAILED

Fastest rebuild time, and with Isilon, the more nodes in the cluster, the faster drive rebuild time

EMC Isilon – Leading Data Protection

Page 16: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

17 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

2014? 2008

2010

2012

“No Node Left Behind”

12

12

72

12

72

288

Generations of Isilon nodes can coexist

– Heterogeneous nodes mix and match

– Storage investment protected

Rapid introduction of the latest technology

– Faster, denser, greener storage each year

– New capabilities like Hadoop and MobileIQ

– No server, network or application changes

Push-button node retire – SmartFail

– Storage older than 5 years is a waste of space!

EMC Isilon – Storage Never Obsolete

No More Data Migration. Keep up with Moore’s Law

Page 17: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

18 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

Isilon+Aspera Store & Move Big Files

JC Diomard Arrazau Aspera

Page 18: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

19 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

EMC Isilon with integrated Aspera FASP

Page 19: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

20 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

Aspera and Isilon have partnered for many years to create a predictable, non-disruptive, high-performance wide area file and data delivery solution specifically for moving large data sets over long distances at the fastest possible speed.

Aspera’s cluster-aware high-speed transfer software coupled with EMC Isilon OneFS operating system and scale-out NAS architecture is a powerful combination for distributed storage environments

The Benefits of Isilon + Aspera

Page 20: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

21 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

The Benefits of Isilon + Aspera

Maximum cluster-to-cluster performance over campus and wide-area networks

Massive concurrency

Easy scalability of WAN transfer performance

No single point of failure

Single point of management

Web services and industry-standard API support

Page 21: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

22 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

EMC Isilon Hadoop Analytics

Page 22: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

23 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

ANALYTICS MOBILE

Key Features

Benefits

Hadoop 2.0

Native HDFS Support

Pivotal HD Support

Simultaneous HDFS 1.0 & HDFS 2.0 Support

Distributed NameNode

Support For Open Hadoop Applications

No Single Point Of Failure

Improved TCO

Next-Gen Analytics

Page 23: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

24 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

HDFS: Standard Hadoop Cluster

MAP Reduce

MAP Reduce

MAP Reduce

MAP Reduce

MAP Reduce

MAP Reduce

MAP Reduce

MAP Reduce

MAP Reduce

Data

Compute

MAP Reduce

MAP Reduce

MAP Reduce

MAP Reduce

MAP Reduce

MAP Reduce

MAP Reduce

MAP Reduce

MAP Reduce

Compute

Data

Name node

SMB, NFS, HTTP, FTP

3X

NFS

Name node

OLAP

HTTP

CIFS

FTP

NFS

Landing Zone Servers

Step 1: Data is copied into the Landing Zone

Step 3: Data is locally copied from a node onto the

Cluster (3 times)

Step 4: MapReduce Job is Run

Log Files

Decision Support Databases

User/WEB Click data

3X

3X

Step 2: Data is copied from landing zone onto a node in the Cluster

Page 24: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

25 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

SMB, NFS, HTTP, FTP,

HDFS

MAP Reduce

MAP Reduce

MAP Reduce

MAP Reduce

HDFS: Integrated Isilon and Hadoop

name node data

node

Isilon

name node

name node

name node

Step 1: Much or all of the Data

lives on the Isilon Cluster

Step 2: Jobs are run

Hadoop Cluster NFS

OLAP

Log Files

Decision Support Databases

User/WEB Click data

Page 25: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

26 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

ARCHITECTURE FOR HIGH PERFORMANCE COMPUTING WITH INTEGRATED ANALYTICS

Page 26: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

27 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

Future Directions

Page 27: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

28 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

OneFS 7.0 - Mavericks

Scalability Enterprise Management

Performance – Improved Throughput

– Reduced Latency

Capacity – SnapshotIQ File Clones

– 20 PB File System Ready

Extensibility – REST Platform API

Data Protection – SyncIQ: Failover/Back

– SnapshotIQ File Clones

Automated Provisioning – Improved Efficiency

– Higher Reliability

Availability - NDU – Upgrade - 2 Minutes

Multi-Tenancy – Authentication Zones

– Role Based Administration

VMware – Reduced I/O Latency

– VAAI, VASA, SRM

Security and Archive – SmartLock v2 (SEC17a-4)

2H 2012

Page 28: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

29 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

OneFS 7.1 - Waikiki

Efficiency Scale Deduplication

– Post process, block level deduplication

– Increased data efficiency

– Multi-level directory configuration

– Dry-run dedupe savings estimation tool

– SmartPools Enhancements

– CLI and API management

– RBAC integration

– User defined node-pools

Performance SyncIQ & Snapshots

– “Continuous Mode” Replication

– ~1 minute sync granularity

Job Engine

– Improved impact control

– Multiple simultaneous jobs

– Enhanced per-worker resource monitoring

Breaking boundaries

– 3 simultaneous background jobs

Enterprise Security and Archive

– CEE support. Varonis first partner for Audit

– Improved RBAC Coverage

– Encrypted Nodes

Backup

– Faster incrementals and incrementals forever

Manageability & Extensibility

– RBAC improvements to API

– Broaden pAPI coverage incl. SyncIQ AND JobEngine

– ESRS Gateway Integration

2H 2013

Page 29: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

30 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

Growth In Applications

Sources: IDC, Gartner, AWS Workload Estimates

Next Gen Cloud Applications

2016 48M

2012 6M 700%

Traditional Applications

2016 141M

2012 83M 70%

Page 30: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

31 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

Making Compute the Bottleneck

A Giant Leap in Capacity

Around-the-Clock, Around-the-Globe

Absolute Freedom and Control

Isilon’s OneFS Operating System

Enterprise @Scale

Strategic Vectors

Page 31: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

32 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

NEXT GEN APPLICATIONS

Hybrid

Private (Enterprise)

OneFS Storage Isilon, The Big Data Platform

VERTICAL DATA CENTRIC APPLICATIONS

Public

Page 32: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...

33 © Copyright 2013 EMC Corporation. All rights reserved.

ISILON – CAUDIT presentation to RDSI

Isilon strengths in eResearch Area Strength

Onsite data collection • Multi-protocol support for instruments & workstations • CIFS (SMB), NFS, FTP, HTTP

Repository • SmartPools (Automatic storage tiering) • GNS (Global Namespace, with metadata acceleration)

Workspaces • SmartQuotas (Policy-based storage management) • Home Directories

Collaboration • Aspera FASP protocol integrated into OneFS • Aspera Enterprise Server and Connect Server hosted within the cluster

HPC/Hadoop • HDFS (native support) • 10GbE (2/node) • SmartConnect • Concurrent File Access support (OneFS 7.0)

Archival • SnapshotIQ • WORM – immutable writes • NL Series

Page 33: POWER of EMC ISILON To Transform eResearch - … ·  ... Rapid introduction of the latest technology ... –VAAI, VASA, SRM Security and ...