Big Data on Public Cloud

37
Big Data on Public Cloud Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 13 March 2015

Transcript of Big Data on Public Cloud

Page 1: Big Data on Public Cloud

Big Data on Public Cloud

Assoc. Prof. Dr. Thanachart NumnondaExecutive DirectorIMC Institute13 March 2015

Page 2: Big Data on Public Cloud

2

“B ัy 2015, 20% of Global 1000 organizationsWill have established a strategic focus on

information infrastructure ”

Gartner

Page 3: Big Data on Public Cloud

3

Big Data Landscape

Source: Big Data in the Enterprise. When to Use What?

Page 4: Big Data on Public Cloud

4

Big Data Landscape

Source : http://www.vitria.com/

Page 5: Big Data on Public Cloud

5

Page 6: Big Data on Public Cloud

6

NoSQL

Page 7: Big Data on Public Cloud

7

A scalable fault-tolerant distributed system for data storage and processing

Completely written in javaOpen source & distributed under Apache license

What is Hadoop?

Page 8: Big Data on Public Cloud

8

Hadoop Environment

Source: Hadoop in Practice; Alex Holmes

Page 9: Big Data on Public Cloud

9

Major Hadoop Components

Hadoop Distributed File System(HDFS)

Map/Reduce System

Page 10: Big Data on Public Cloud

10

Hadoop Distribution

Microsoft Azure

Page 11: Big Data on Public Cloud

11

Big Data Future Architecture

Sscial Media Images e-mails Crawlers ERP CRM LOB APPs

Unstructured and Structured Data

Parallel Data Warehouse

Hadoop OnCloud

Hadoop OnPrivateServer

Connectors

SSRS

BI Platform

Familiar End User ToolsSpreadsheet Predictive Analytics

Data Market Place

NoSQL

Petabytes of Data(Unstructured)

Hundreds of TB of Data(structured)

Page 12: Big Data on Public Cloud

12

Issue with Big Data Infrastructure

Large investment

Scalabilty

ROI

Business Cases

Page 13: Big Data on Public Cloud

13

Page 14: Big Data on Public Cloud

14Source : http://acloudyplace.com/

Page 15: Big Data on Public Cloud

15

Big Data on Cloud

Using IaaS to leverage Cloud Vms

Using Big Data as a Services

Page 16: Big Data on Public Cloud

16

Big Data Services on Cloud

Amazon Elastic Mapreduce

Microsoft Azure Hadoop

Page 17: Big Data on Public Cloud

17

Big Data as a Service

Page 18: Big Data on Public Cloud

18

Page 19: Big Data on Public Cloud

19

Database as a Service

Amazon RDS

IBM SQL Database for Bluemix

Microsoft SQL Database

Google CloudSQL

Page 20: Big Data on Public Cloud

20

NoSQL as a Service

Amazon DynomoDB

Google Cloud DataStore

Microsoft Azure DocumentDB

Cloudant on IBM Bluemix.

Mongo DB on Heroku

Page 21: Big Data on Public Cloud

21

Hadoop as a Service

Amazon Elastic Map Reduce

Rackspace Cloud Big Data Platform

Qubole

Google Cloud Platform

IBM Bluemix: Analytic on Hadoop

Microsoft Azure HDInsight

Page 22: Big Data on Public Cloud

22

Page 23: Big Data on Public Cloud

23

Page 24: Big Data on Public Cloud

24

Big Data on Amazon EMR

Page 25: Big Data on Public Cloud

25

Page 26: Big Data on Public Cloud

26

Page 27: Big Data on Public Cloud

27

Page 28: Big Data on Public Cloud

28

Big Data on Cloud Roadmap

Step 1: Build the business case

Step 2: Assess your Big Data applicationworkloads

Step 3: Develop a technical approach fordeploying and managing Big Data in the cloud

Step 4: Address governance, security, privacy,risk,

Step 5: Deploy, integrate, and operationalizeyour cloud-based Big Data infrastructure

Source : Deploying Big Data Analytics Applications to the Cloud: Roadmap for Success: CSCS

Page 29: Big Data on Public Cloud

29

Access your application workloads

Big-data storage

Big-data processing

Big-data development

Source : Deploying Big Data Analytics Applications to the Cloud: Roadmap for Success: CSCS

Page 30: Big Data on Public Cloud

30

Sample applications

Enterprise applications already hosted in thecloud

High-volume external data sources thatrequire considerable preprocessing

Tactical applications beyond your on-premises, Big Data capabilities

Elastic provisioning of very large but short-lived analytic sandboxes

Source : Deploying Big Data Analytics Applications to the Cloud: Roadmap for Success: CSCS

Page 31: Big Data on Public Cloud

31

Demo

Page 32: Big Data on Public Cloud

32

Amazon DynomoDB

Page 33: Big Data on Public Cloud

33

Google BigQuery

Page 34: Big Data on Public Cloud

34

Hadoop on Google

Page 35: Big Data on Public Cloud

35

Amazon EMR

Page 36: Big Data on Public Cloud

36

www.facebook.com/imcinstitute

Page 37: Big Data on Public Cloud

37

Thank you

[email protected]/imcinstitutewww.slideshare.net/imcinstitutewww.thanachart.org