PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine...

25

Transcript of PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine...

Page 1: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream
Page 2: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

Data Analytics

Nagesh Madhwal

Client Solutions Director, Consulting,

Southeast Asia, Dell EMC

Page 3: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

3 Dell - Internal Use - Confidential

Next 15 yearsBusiness-centric

Cloud-Native Applications

Prescriptive Analytics

Agile Infrastructure

Internet of Everything

Last 15 yearsIT-centric

Traditional Applications

Traditional Analytics

Rigid Infrastructure

Internet

Page 4: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

4 Dell - Internal Use - Confidential

Evolution of Workloads

Page 5: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

Your business must evolve.

Future success is achieved

by unlocking ALL DATA.

Page 6: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

6 Dell - Internal Use - Confidential

When it comes to Applications

and Data Analytics

TIME is everything.

Page 7: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

Dell - Internal Use - Confidential7 of Y

The Power of “Now”

Business Event “Moment of Impact”

Data Captured

Intelligence Delivered

Decision Taken

Valu

e

Time

Real time Batch

Big Data

Opportunity Missed Opportunity “Too Late To Take Action”

Page 8: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

Dell - Internal Use - Confidential8

Traditional Analytics

Limited to static data

Reactive and slow

Restricted sources and access

Page 9: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

Dell - Internal Use - Confidential9

Traditional ..

Data Warehouse

Reporting

Source Systems

Files

Sources 1 to x

Batch Data SourcesExisting DBs, ERP

Staging Transform

ation

ETL Data Marts

Analytics

Page 10: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

Dell - Internal Use - Confidential10

Traditional…

Data Warehouse

Reporting

Source Systems

Files

Sources 1 to x

Batch Data SourcesExisting DBs, ERP

Staging Transform

ation

ETL

Data Marts

Analytics

ETL

ETL

ODS

Page 11: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

Dell - Internal Use - Confidential11

Modern Analytics

Analyze ALL data

Deliver anywhere, anytime analytics

Empower your end users

Page 12: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

Dell - Internal Use - Confidential12

The Data Lake

12

ETL Offload | Analytics as a Service | Data Science | Decision Support |

Data Visualization | Executive Reporting | Predictive Modeling | Threat Analysis

Structured Unstructured

Managed using NoSQL

Static Schema

RDBMS/EDW

Dynamic Schema

Hadoop Eco Sys

file types: videos, pdf, ppt,

mp3, doc, email, pics

Managed using SQLdata types: numeric, currency,

alphabetic, name, date, address

Sources:

• ERP

• CRM

• SCM

• POS

Sources:

• Social

• IOT

• Text

• Geo

• Media

Use Cases /APPS

Kafka

BigSQLSqoop

Spark Streaming

Hive

FlumeImpala

HAWQ…

Application-Integrated Protocols

Page 13: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

Dell - Internal Use - Confidential13

Modern Data Lake Environment

All Data

Fed Into

The

Data

Lake

HADOOP

DATA LAKE

ETL

- Exploratory, Ad Hoc

- Unpredictable Load

- Experimentation

- Loosely Governed

- Best Tools

- Production

- Predictable Load

- SLA Drive

- Heavily Governed

- Standard Tools

DWH

MPP

Analytics

Sandbox

Analytics / Sandbox Environment

Data Prep & Enrichment

ISILON SCALE OUT

NAS STORAGE

Foundation of your Data Management and Analytics

Architecture

Active Archive

BI / DWH Environment

Page 14: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

Dell - Internal Use - Confidential14

Reference Architectures

DATA LAKE (HADOOP)RDBMS

MACHINE IOT

STATISTICAL MODELING/NLP EXPLORATION

TRANSFORM

BI

ORGANIZE MANAGE/CATALOG

DATA WAREHOUSESTREAMCEP

NEARREAL-TIME

MODELS MAY TAKE HOUR OR DAYSQUERIES MAY RETURN IN SECONDS OR MINUTES

SECONDS

SEARCH/INDEX

ENTERPRISE LOG ANALYSIS

APPLICATIONS

3rd PARTY

EMAIL

SOCIAL MEDIA SQL ON HADOOP

Page 15: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

Dell - Internal Use - Confidential15

Big Data Journey To BDaaS

BDaaSAgility

Valu

e

Control

Prototyping

Dev/Test and Pre-Production Lifecycle Management

< 20 Nodes

Template Libraries, Shared Data Across Clusters, Rapid Prototyping and Evaluation

Dev / Test Lab

Multiple Hadoop Distros, Spark, Transient Workloads

Departmental

Hadoop and Spark in a Secure Production Environment

20+ Nodes

Performance, Security, Capacity Prioritization, Compute/ETL Offload,

Separate Compute/Storage

Dev/Test/Stage/QA/UAT

Improve Utilization, Scaling, Consistent Data

Big-Data-as-a-Service

Multi-Tenant Hadoop and Spark Deployment On/Off Premise

50+ Nodes

Multi-Tenancy, Self-Service, Logs, APIs, Tenant/Admin

Controls, Shared Data Lake with Access Controls

Heterogeneous Production Environments, Diverse Tenants

and User Groups

Support Multiple LOBs, Dynamic Resource

Management/QoS, Automation

Page 16: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

Dell - Internal Use - Confidential16

Multi-Tenant Big-Data-as-a-Service

Multiple lines of business, multiple user

groups

Multiple use cases

Multiple ecosystem products

(including non-Hadoop, BI/ETL

tools)

Compute isolation between tenants

Multiple environments

per tenant

Multiple versions and/or

distributions

Data isolation by tenant (incl.

ability to physically isolate storage)

Data/Storage

Prod Dev/Test POC Prod Dev/Test

Data Isolation

Data Isolation

MARKETING R&D MANUFACTURING

360 Customer View Log Analysis Predictive Maintenance

MARKETING R&D MANUFACTURING

Shared, Centrally Managed Server Infrastructure

Compute Isolation

Compute Isolation

Page 17: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

Dell - Internal Use - Confidential17

Analytics

Infrastructure

Integration

Range of solutions

44% of organizations still struggle with how to approach Big Data

S e r v i c e s

Page 18: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

18 Dell - Internal Use - Confidential

Data Analytics Journey

Page 19: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

19 Dell - Internal Use - Confidential

Big Data Systems

• VxRail

• VxRack

• Vblock

• XC Series

Page 20: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

20 Dell - Internal Use - Confidential

Big Data Foundations

• Servers

• Storage

• Networking

• Software

Page 21: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

21 Dell - Internal Use - Confidential

Big Data Solutions

• Reference Architectures

• Engineered Solutions

• Customized Designs

Page 22: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

22 Dell - Internal Use - Confidential

BUSINESS

TECHNOLOGY

DEPLOYASSESS PROVE

Big Data

Proof of Value

Big Data

Proof of

Technology

Big Data

Applied Analytics

Implementation

Big Data

Technology

Implementation

Big Data Vision

Workshop

Big Data Technology

Advisory

DELL EMC Big Data

Portfolio Implementation

Global Services

Page 23: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

Dell - Internal Use - Confidential23 of Y

Subject Details

Workshop Objective To understand the key business initiatives and requirements for big data in order to understand where

and how to start the big data journey

Workshop Vision How to become data driven through the use of big data and analytics

Workshop Duration Half Day or 1 Day on customer site

Workshop Agenda 1) Business & IT Goals

2) Business Initiatives

3) Current Environment Review

4) Use Cases

5) Data Sources Review

6) Data Science / Analytics / BI Requirements

Recommended types of participants: IT and Business Users

Expected Outcomes 1) List of business opportunities and use cases

2) Business Value and Feasibility

3) Identify and prioritize data sources mapped with use cases

4) Prioritized used cases with potential business value and impediments

5) Document workshop results

6) Big Data Technology Roadmap with clear next steps

Next Step After Workshop Proof of Value (POV) or Proof of Technology (POT)

Dell EMC Workshop Team Head of Big Data Practice SEA, Head of Consulting SEA, Dell EMC Account / Sales Manager

Example of a Big Data Workshop

Page 24: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream

24 Dell - Internal Use - Confidential

Customer Success Stories

Page 25: PLEASE READ – INSTRUCTIONS FOR ADDING PAGE NUMBERS “X … · rdbms data lake (hadoop) machine iot statistical modeling/nlp exploration transform bi organize manage/ catalog stream