IBM PureData System for Analytics N3001 Overview

51
© 2015 IBM Corporation IBM ® PureData System for Analytics N3001 Overview

Transcript of IBM PureData System for Analytics N3001 Overview

Page 1: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation

IBM® PureData™ System for Analytics

N3001 Overview

Page 2: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation2

INTRODUCTION TO NETEZZA

TECHNOLOGY

Page 3: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation3

IBM PureData System for AnalyticsThe Simple Data Warehouse Appliance for Serious Analytics

What makes it different?

Speed - 10-100x faster than traditional custom systems1

Simplicity - minimal administration and tuning

Scalability - petabyte+ scale user data capacity

Smart - high performance, advanced analytics

1 Based on IBM customers' reported results. "Traditional custom systems" refers to systems that are not professionally pre-built, pre-tested and optimized. Individual results may vary.

Purpose-built analytics appliance

Integrated database, server and storage

Standard interfaces

Low total cost of ownership

Page 4: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation4

▪ Too complex an infrastructure

▪ Too complicated to deploy

▪ Too much tuning required

▪ Too inefficient at analytics

▪ Too many people needed to maintain

▪ Too costly to operate

4

Traditional Data Warehouses

They do NOT meet the demands of advanced analytics on big data.

are just too complex

Too long to get answers

Page 5: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation5

Appliances Make It Simple

transforming the user experience.

▪ Dedicated device

▪ Optimized for purpose

▪ Complete solution

▪ Fast installation

▪ Very easy operation

▪ Standard interfaces

▪ Low cost

Page 6: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation6

Evolution of Netezza & PureData System for Analytics

World’s FirstData Warehouse

Appliance

World’s First100 TB DataWarehouse Appliance

World’s FirstPetabyte Data

Warehouse Appliance

World’s FirstAnalytic Data Warehouse Appliance

NPS®

8000 Series

TwinFin™ with i-Class™

Advanced Analytics

NPS®

10000 Series

TwinFin™

2003 2006 2009 2010 2012 2014

World’s Fastest and Greenest Analytical

Appliance

PureData System for AnalyticsN300x

PureData System for AnalyticsN200x

World’s First appliance with no cost encryption

Page 7: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation7

Targeted advertisingto promote products that customers

want at the price they want them

Understand what customers want,

when they walk into a Bon-Ton store

Freeing the time of Bon-Ton

buyers and plannersfrom the mundane task of gathering &

compiling customer data so they can spend

their time making informed decisions to

drive the business

“I need some way to understand what they're

thinking, what they're feeling, without having to

have contact with them. PureData for Analytics

is what's going to help us understand what the

customers want when they walk into my

stores”

- Paula Post, Vice President Merchandising Optimization.

Bon-Ton Optimizes Their Customer’s Experience Using

IBM PureData System for Analytics

Video: https:/www.youtube.com/watch?v=0gsWOL6gciw

Page 8: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation8

Carphone Warehouse Increases Profitability Through New Revenue

Streams & Reduced Costs

Case Study: http://www-03.ibm.com/software/businesscasestudies?synkey=M183113U13038J58

“The PureData System, powered by

Netezza technology, provided huge

technical advantages & big business

advantages. We can now insure devices on

behalf of a bank in the UK, which we

couldn’t have done before.”

- Paul Scullion, Head of Business Intelligence

Up to 1200Xfaster performance; reports that once

took an hour to run now take seconds

50% reductionin time to market for new business intelligence services

Page 9: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation9

96% decreasein query run times

(from 1 hour to 2 minutes)

100% increasein subscriber base

Reduced spendingOn low-return promotional activities

"Through the entire subscription lifecycle, the

company tracks everything members do on

the website. This process generates an

enormous amount of data, which would be

completely wasted without the ability to extract

hidden insights about how members behave.”

- eHarmony C-Level executive

eHarmony Attracts New Members by Understanding Behavior and

Fine-tuning Matching Algorithm

Video: https://www.youtube.com/watch?v=_0wffNyHn8s

Page 10: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation10

Canadian National Railway Company leverages the power of

predictive analytics to run trains on time

Reduction in time spent on running reports, some

reports that took 10-20 minutes

earlier now run in 5 seconds

Enhanced confidence in data driven decision-making

Accelerated analytics for faster insight, the company is

moving to near real time report

generation compared to monthly

reports earlier

“The performance of PureData is very good,

most reports we have are running in less than

5 seconds where as with other databases we

had reports running for 10-20 minutes”

- Philippe Chartier, BI Team Lead, Information Delivery,

Canadian National Railway Company

Video: https://www.youtube.com/watch?v=yyZu5seKbLI

Page 11: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation11

Promotes self-service

business intelligence & insights throughout the hospital

98% reduction in time spent on some queries

“We’re getting deeper into the data in

multiple ways . . . When we see new

commonalities in treatments for children,

we can design new protocols to provide

the best possible care”

- Wendy Soethe, Enterprise Data Warehouse Manager

More effective

diagnosis & treatment by enabling faster, more accurate insights,

on-demand

Seattle Children’s Optimizes Business Intelligence & Insight into

New Treatment Protocols to Enrich Patient Care

Video: https://www.youtube.com/watch?v=bjGWIectvkI

Page 12: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation12

THE NEW PUREDATA SYSTEM

FOR ANALYTICS N3001

Page 13: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation13

Announcing the PureData System for Analytics N3001

Big Data and Business Intelligence ready

with capabilities to unlock data’s true potential

Advanced security in an insecure world

at no extra cost

An even broader family of appliance models

to fit a broad range of data capacity needs

Changing the game for data warehouse appliances (again)

and yes, simple is STILL better!

Page 14: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation14

Big Data and Business Intelligence ReadyUnlocking Data’s True Potential

Data Warehouse Appliance

Built-in, In-Database analytic capability and integration with

a variety of 3rd party toolsReal-time AnalyticsInfoSphere Streams Developer Edition 2 users, non-production licenses

Business Intelligence Cognos software, 5 Analytics User licenses, plus 1 Analytics Administrator license

Hadoop Data ServicesInfoSphere BigInsights Software licenses to manage ~100 TB of Hadoop data

Exceptional value

provided

Included with the PureData System for Analytics N3001

Industry Process & Data ModelsModels for Banking, Financial Markets, Healthcare, Insurance, Retail, Telco

For additionalvalue

• Advanced security• New rack-mountable

appliance for midsize organizations

• New 8-rack system for Petabyte+ capacity

Data Integration & TransformationInfoSphere DataStage 280 PVUs, 2 concurrent Designer Client licenses and InfoSphere Data Click

IBM InfoSphere Data Privacy and Security for Data Warehousing

Page 15: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation15

IBM Netezza AnalyticsIn-database Analytics For Every Role in Your Enterprise

Bring the analytics to the data

not the data to the analytics

Included

Use cases

Features

▪ Built-in, in-database analytic functions

- Data mining, prediction, transformations, statistics, geospatial, data preparation

▪ Full integration with tools for BI & visualization

- IBM Cognos, Microstrategy, Business Objects, SAS, MS Excel, SSRS, Kognitio, Qlikview

▪ Full integration with tools for model building & scoring

- IBM SPSS, SAS, Open Source R, Fuzzy Logix

▪ Full integration for custom analytics

- Open Source R, Java, C, C++, Python, LUA

▪ Reduce hospital admissions or personalize disease treatments

▪ Achieve an order of magnitude improvement in manufacturing quality

▪ Better understand the risk of catastrophic events

▪ …and many more

Data

Preparation

Predictive

Analytics

Geospatial

Analytics

Advanced

Statistics

Page 16: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation16

Use cases

Features

Business IntelligenceThe Power of IBM Cognos with PureData System for Analytics

▪ Leading Business Intelligence

- Interactive analysis

- Compelling visualizations - web, mobile or email

- Enterprise scalability

▪ Optimized for PureData for Analytics

- Offers high performing OLAP over relational experience

- Cognos Dynamic Query Mode extends benefits of PureData by adding in-memory & caching on top of already fast appliance performance

- Exploits Netezza analytic in-database functions

Rapid deployment of answers

to key business questions

Included with PureData for Analytics:

IBM Cognos Business Intelligence 10.2.1

5 Analytics User licenses,

1 Analytics Administrator license1

Included

▪ Reporting, analysis, scorecards, dashboards

▪ Data visualization

▪ Mobile business intelligence

▪ … and many others

1PureData System for Analytics N3001 must be the data source for Cognos.

Page 17: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation17

Data Integration & TransformationInfoSphere DataStage, Designer Client and Data Click

Rich capabilities for

data integration

Included

Use cases

Features

▪ Ease of Use

- Provides an easy-to-use, top-down, work-as-you-think design interface that enables users to design once and deploy anywhere—batch or real time; extract, transform, load (ETL); or extract, load, transform (ELT)

- Self-service data integration to enhance business agility

▪ Accelerate time to value

- Includes a comprehensive library of transformation components for easily defining common integration processes

▪ Integration, transform and deliver trustworthy information to your data warehouse

▪ Analysts, data scientists or even line-of-business users can easily retrieve data and populate the PureData System for Analytics

▪ Move data from the data warehouse into a subject area data mart

Included with PureData for Analytics:

IBM InfoSphere DataStage 11.3 (280 PVU

Information Server Engine Tier)1,

Designer Client (2 concurrent users),

InfoSphere Data Click1

1PureData System for Analytics N3001 must be the source or target database.

Page 18: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation18

Hadoop Data ServicesIncluded Capability with IBM InfoSphere BigInsights

▪ Big data analytical platform

- Best of open source + IBM technologies

- Big SQL

- High performance SQL access of Hadoop

- Federation across many data sources -

combine information from Hadoop and

PureData for Analytics

- BigSheets visualization tool

▪ Built-in analytics

- Text analytics, Big R

Bringing the power of

Hadoop to your enterprise

Included with PureData for Analytics:

InfoSphere BigInsights 3.0 software

licenses for 5 enterprise nodes to

manage up to ~100 TB of Hadoop data1

Included

▪ Federated SQL access across Hadoop and

your PureData System for Analytics

▪ Pre-processing and landing zone for all data

types prior to loading to data warehouse

▪ Queryable backup for cold data

Use cases

Features

1Based on 4 data nodes + 1 master node. 12 TB uncompressed per data node with 4 TB drives. 12 TB x 4 nodes = 48 TB uncompressed.

Using 2-2.5x compression yields 96-120 TB compressed data. Capacity will depend on hardware configuration selected.

Page 19: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation19

Use cases

Features

Real-Time AnalyticsIncluded Capability from IBM InfoSphere Streams

▪ Analyze data in motion

- Provides sub-millisecond response times,

allowing you to view information and events as

they unfold

- Analyze all kinds of data: simple & advanced text,

geospatial, acoustics, images, video, sensors

- Eclipse-based development environment

Deploy analytic models on

data-in-motion to enable real-time

decisions and land data in the

warehouse to build the analytic models

Included with PureData for Analytics:

InfoSphere Streams Developer Edition 3.2.1

2 developer users, non-production licenses

Included

▪ Fraud detection

▪ Predict customer churn

▪ Telco real-time mediation and analysis

▪ Real-time monitoring of medical sensors to improve

healthcare outcomes

▪ Defect detection in manufacturing

▪ Traffic pattern analysis and management

Page 20: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation20

FinancialMarkets

Use cases Features

Accelerating Industry Specific Business Analysis Accelerate Time to Value with IBM Industry Models

▪ Risk Management

▪ Wealth and Investment

Management

▪ Customer Intelligence

▪ Regulatory Reporting

▪ Health Care Analytics

Available

▪ Comprehensive with Built-in Expertise

- Data Warehouse design models, business terminology models and analysis templates

- Experience from >500 client engagements

▪ Solution bundles including data model, data appliance, ETL & Business Intelligence

− Banking, Healthcare, Insurance

Banking Healthcare Insurance Retail Telco

Industry Vertical Models

Customer Insight

Market & Campaign Insight

Supply Chain Insight

Horizontal

Model Packs

Page 21: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation21

IBM InfoSphere Data Privacy and Security for

Data Warehousing

InfoSphere Data Privacy and Security for Data

WarehousingInfoSphere Data Security and Privacy

Define and ShareDiscover and Classify

Mask and RedactMonitor Data Activity

Purpose-Built Capabilities • Achieve and enforce

compliance• Secure and Protect sensitive

data in appliances• Reduce costs of attaining

enterprise security

Define and Share

• Define a warehousing glossary• Share sensitive data definitions and

policies• Create project blueprints

Discover and Classify

• Discover / profile data• Explore lineage and relationships• Classify sensitive data

Monitor Data Activity

• Monitor data warehouses• Real-time alerts• Centralized reporting of audit data

Mask and Redact

• De-identify sensitive data within the warehouse

• Apply obfuscation techniques to both structured and unstructured data

Available

Page 22: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation22

What’s New in PureData System for Analytics N3001

Performance

▪ Faster performance with upgraded CPUs with more

cores

New appliance models

▪ New rack mountable, ultra lite/mini appliance for

midsize businesses

▪ New 8-rack, Petabyte capacity appliance

Security

▪ Improved security with Self Encrypting Drives

▪ Kerberos support

New Netezza Platform Software (NPS) 7.2

▪ Faster load rates

▪ Performance Portal enhancements

▪ and more

Page 23: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation23

Introducing PureData System for Analytics NPS 7.2New Database Features, Improved Performance and Resiliency

Database

features

Better performance

and reliability

Improved resiliency

and fault tolerance

▪ Faster load rates up to

10 TB/hr

▪ Faster restore rates

▪ WLM throughput and

latency optimization

▪ Enhanced security

enables single sign-on

and centralized

management

▪ New built-in functions

and SQL updates

▪ Portal enhancements

▪ Enhanced Health

Check capabilities

▪ Enhanced storage

topology and

communication fabric

▪ Call Home via https

and SOAP

Page 24: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation24

Introducing PureData System for Analytics N3001-001:

The Mini-Appliance

Bringing speed and simplicity to midsize organizations for big outcomes

• Rack mountable

• Production ready

• Full function appliance

• User data capacity 16 TB*

• High availability - All redundant

hardware, 4 disk spares, hot swap

power supply

• Self encrypting drives, Kerberos

support, LDAP/Active directory

Solution Highlights

*Assumes 4x compression

▪ Simple

Same user experience as all PureData System for

Analytics appliances

• Full function Netezza Platform Software with IBM

Netezza Analytics

• Support tools and Netezza Performance Portal

• ODBC/JDBC/OLE-DB/SQL Driver integration

Load and go with no tuning or administration

▪ Speed

10-100x faster than traditional custom systems1

▪ Smart

Rich set of in database analytic functions

Protection of all data from unauthorized access

Includes starter kits for Big Data and Business Intelligence

▪ Agile

Easily incorporated into the data center with simplified

installation into an existing rack

▪ Affordable

Purchase or lease

1Based on IBM customers’ reported results. “Traditional custom systems” refers to systems that are not professionally pre-built, pre-

tested and optimized. Individual results may vary.

Page 25: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation25

Introducing PureData System for Analytics N3001-0808-rack System

▪ 1.5 PB of user data capacity1

▪ Hosts: 2x x3750M4 and 600 GB Self Encrypting Drives

▪ Blades: 56x HS23 with 20 core IvyBridge processors

▪ Storage: 96 EXP2524 disk enclosures with 24x 600 GB Self Encrypting Drives

1Assumes 4x compression

Page 26: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation26

IBM DB2 Analytics AcceleratorEnhanced with PureData System for Analytics N3001

Benefits

▪ Extreme performance for complex queries

- Up to 2000x performance improvements

▪ Cost Savings

- Offload complex queries and eliminate

costly query tuning

- No need to create or maintain indices

- Improves access to and lowers the cost of

storing, managing and processing historical

data

▪ Integrated with DB2 z/OS and inherits

mission critical features such as security and

recoverability

▪ Access to DB2 Analytics Accelerator is

transparent to applications and users

▪ Fast deployment and time to value

- Installation is non-disruptive

- Plug it in, load data and go in 1-2 days

A high performance

appliance that integrates

Netezza technology with

zEnterprise technology to

deliver dramatically faster

business analysis

Highlights – What’s New?

The PureData System for Analytics N3001

provides additional benefits to DB2 Analytics

Accelerator customers:

▪ Advanced security through encryption of data

at rest with self-encrypting disks

▪ Performance improvements for analytic

workloads

▪ Improved serviceability with the recently

introduced automatic call home capability

▪ Broader range of SQL compatibility

Available

Page 27: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation27

PureData System for Analytics Family

▪ 10-100x faster than

custom systems1

▪ 3.3x faster I/O scan

rate2

▪ Load and go, no tuning

▪ Designed to run

complex analytics in

minutes, not hours

▪ Rich set of in-database

analytics

N2002 N3001-xxx

N3001-001

DB2 Analytics Accelerator for z/OS

(now with N3001)

1Based on IBM customers' reported results. "Traditional custom systems" refers to systems that are not professionally pre-built, pre-tested and optimized.

Individual results may vary.2Comparing N1001 scan rate of 145 TB/hour to N2002 scan rate of 478 TB/hour

…plus

▪ Rack mountable

appliance

▪ Ideal for small and

medium business with

up to 16 TB of user data

...plus

▪ Entitled software capability for

real-time analytics, Hadoop

data services, data movement

and business intelligence

▪ Advanced security

▪ Partial rack to 8-rack

configurations

▪ The hybrid computing platform

integrating Netezza technology with

zEnterprise technology

▪ Supports transaction processing and

analytic workloads concurrently,

efficiently & cost effectively

▪ Accelerates complex queries, up to

2000x faster

▪ Required security compliance with

Data-at-Rest Encryption

Page 28: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation28

The PureData System for Analytics N3001 Family

Specification N3001-001 N3001-002 N3001-005 N3001-010 N3001-020 N3001-040 N3001-080

Racks n/a, 2 x 2U 1 (1/4 full) 1 (1/2 full) 1 2 4 8

Active S-

Blades

n/a 2 4 7 14 28 56

CPU cores 40 40 80 140 280 560 1,120

User data

(TB) *

16 32 96 192 384 768 1,536

* Assuming 4x compression

Single rack systems Multiple rack systems

Linear Scalability!

Page 29: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation29

WHAT MAKES THE N3001

BETTER?

Page 30: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation30

What are the Demands of a Modern Data Warehouse?

Faster Insight

▪ Fast response times are

expected

▪ People are used to an

experience as easy as

Google

▪ Users do not want to wait

for query results

Insight

Cost

Agility

Lower cost

▪ Initial acquisition

▪ Ongoing operation and

administration

▪ Total cost of ownership

Added Agility

▪ Ability to respond quickly to

the needs of the business

▪ By simplifying operations,

more time is provided for

innovation

▪ Better business outcomes

by utilizing more data

sources

Page 31: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation31

PureData System for Analytics Delivers on the Demands of a

Modern Data Warehouse

Faster Insight

InsightCost

Agility

Lower cost Added Agility

“BCBSMA has combined IBM Cognos

Business Intelligence with an IBM

Netezza data warehouse appliance to

provide lightning-fast analysis of

medical and financial data. The

solution creates sophisticated reports

on clinical and financial risk and

operational efficiency.”

- Shashikanth Vangala, Manager & Chief

Solutions Architect of Business Intelligence,

Blue Cross Blue Shield of Massachusetts

“That simplicity cannot be

underrated. It is just

amazingly simple to do very,

very large scale things that in

any other environment takes

engineering, just to pull off.”

- David Birmingham, Senior

Consultant, Brightlight Consulting

“we tested PureData Systems late

last year on a set of very complex

use cases and we found,

compared to earlier architectures, it

was performing 2-3 times faster

on batch processes and anywhere

from 3-10 times better on our

concurrent workload”

- John Naduvathusseril, Chief Data Architect,

Nielsen Company

Page 32: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation32

What do the Best-in-class Data Warehouses Deliver1?

1Source: Aberdeen Group, The Best-in-Class Data Warehouse: Fast, Simple, Impactful, May 2014.

99%of users are satisfied with speed

of information delivery(46% industry average)

97%are satisfied with ease-of-use

analytical tools(44% industry average)

97%of users are satisfied with access

to data needed to support

decisions(51% industry average)

Faster information delivery

Easy access to required data

Analytical tools that are easy to use

Page 33: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation33

PureData System for Analytics Delivers

Faster information delivery

Easy access to required data

Analytical tools that are easy to use

“Making decisions based on data instead of intuition or gut feeling is better. There is

already a greater demand from users for data to support day-to-day operations –

solutions such as the InfoSphere Business Glossary empower them with this

information so that they can work more autonomously and efficiently.”

- Philippe Chartier, BI Team Lead, Information Delivery, Canadian National Railway Company

“With the IBM PureData System for Analytics, we can reduce the time to analyze

complex GIS data from days to minutes—a more than 98 percent improvement.”

- Steve Trammell, Strategic Alliances Marketing Manager, Esri

“We knew that our IBM SPSS Modeler software could scale to meet our needs;

the limitation was on the hardware and data warehousing side. Instead of having

separate databases and servers for each client, we wanted to build a single,

multi-tenant platform that could support a cloud-based service for the entire

business. In the IBM PureData System for Analytics, we found the answer.”

- Patrick Ritto, CTO, FleetRisk Advisors

Page 34: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation34

Comparing PureData System for Analytics with Teradata

1ITG: Comparing Costs and Time to Value with Teradata Data Warehouse Appliance, May 2014.

2.6x higherpersonnel costs1

3.4x moreDBAs required1

33% higher3-year TCO1

3.8x higherdeployment costs1

Teradata has …

…than the IBM PureData System for Analytics

Page 35: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation35

Comparing PureData System for Analytics with Oracle

1ITG: Comparing Costs and Time to Value with Oracle Exadata Database Machine X3, June 2014.

3x more

DBAs required1

45% higher

3-year TCO1

3.5x higher

deployment costs1

Oracle has …

…than the IBM PureData System for Analytics

Page 36: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation36

Synergy with Data Integration and Reporting & Analysis Tools

SQ

L O

DB

C J

DB

C O

LE

-DB

SQ

L O

DB

C J

DB

C O

LE

-DB

Data In Data Out

Data IntegrationReporting &

Analysis

▪ IBM

▪ BigInsights

▪ Information Server

▪ InfoSphere Streams

▪ Ab Initio

▪ Hadoop

▪ Informatica

▪ Microsoft

▪ Oracle

▪ SAP

▪ SAS▪ Others using standard

ODBC/JDBC/OLE-DB/SQL

▪ IBM

▪ Cognos

▪ SPSS

▪ Campaign

▪ Hadoop

▪ Information Builders

▪ Microsoft

▪ MicroStrategy

▪ Oracle

▪ SAP

▪ SAS

▪ Tableau▪ Others using standard

ODBC/JDBC/OLE-

DB/SQL

Note: Sample list, not all inclusive

Page 37: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation37

PureData System for Analytics Overview: Model N3001

▪ User Data Capacity: 192 TB1

▪ Data Scan Speed: 478 TB/hr*▪ Load Speed (per system): 10+ TB/hr

▪ Power Requirements: 7.5 kW▪ Cooling Requirements: 27,000 BTU/hr

1Assuming 4X compression

Scales up to 8 full Racks

Terabyte to Petabyte+ Capacity

2 Hosts (Active-Passive)▪ 2 Intel Ivy Bridge CPUs▪ 5X600 GB SAS Self Encrypting Drives▪ Red Hat Linux 6 64-bit

7 PureData for Analytics S-Blades™▪ 2 Intel 10 Core Ivy Bridge CPUs▪ 2 8-Engine Xilinx Virtex-6 FPGAs▪ 128 GB RAM + 8 GB slice buffer▪ Linux 64-bit Kernel

12 Disk Enclosures▪ 288 600 GB SAS2 Self Encrypting Drives

• 240 for User Data• 14 for S-Blades• 34 Spare

▪ RAID 1 Mirroring

Page 38: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation38

Simplify …

Move Analytics into the Data Warehouse

▪ Integrate the

server, storage

and database

into one

optimized

package

▪ Move complex

analytics into

the database

▪ Leverage proven

technology that

accelerates

analytics with no

tuning or storage

administration

Database AnalyticsStorageServer

Server

Storage

Database

Analytics

Page 39: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation39

INDUSTRY SPECIFIC

BENEFITS

Page 40: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation40

Speed the analysis of customer

data for improved insight

Optimizing Offers and Cross Sell Use Case Goals

Capabilities Provided by PureData for Analytics

• Speed the analysis of customer data for improved insight

• Improve the cycle time for predictive models to continuously improve offer prediction accuracy

• Encrypt data at rest with self-encrypting disks for improved customer data security

Data

Preparation

Predictive

Analytics

Geospatial

Analytics

Advanced

Statistics

Optimizing Offers and Cross Sell in Banking with…The New PureData System for Analytics N3001

Business Outcomes

• Asian bank increased credit card marketing response rate more than 300%

• European bank increased key client interaction performance metrics by 98 %

• Improve response rates to offers for increased revenue

• Improve cross-selling for increased wallet share

• Improve customer advocacy through improved offer targeting

Page 41: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation41

Speed the analysis of fraud data for

better fraud detection and prevention

Fraud Detection and Mitigation Use Case Goals

Capabilities Provided by PureData for Analytics

• Speed the analysis of fraud data for improved fraud detection

• Improve the cycle time for predictive fraud models to continuously improve prediction accuracy

• Encrypt data at rest with self-encrypting disks to protect against security threats

Data

Preparation

Predictive

Analytics

Geospatial

Analytics

Advanced

Statistics

Fraud Detection and Mitigation in Banking with…The New PureData System for Analytics N3001

Business Outcomes

• Global securities exchange reduced the time to run market surveillance by 99%

• Japanese bank improved analysis speed by 90% for improved money laundering detection

• Reduce fraud losses and lower costs to fight fraud

• Improve fraud detection to stop fraud before significant impact

• Improve customer satisfaction with reduction in fraud false positives

Page 42: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation42

WHERE THE N3001 FITS IN THE

LOGICAL DATA WAREHOUSE

Page 43: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation43

Data Sources

Transactional

Social

Application

User Generated

Journal

Video and Audio

Machine / Sensor

Documents

Third Party

Enterprise Data Warehouses have evolved into Logical Data

Warehouses which optimize access and reduce costs

Internal Insight

Reporting

Enterprise

Content

Discovery

Exploration

Decision

Management

Predictive

Analytics

Visualization

External-Facing

Applications

Web or Mobile

Systems of

Engagement

Information Governance

Real-time Analytics

NoSQL Doc

Store

Data Warehouse Deep Analytics,

Modeling

Transactional

Systems

Landing,

Exploration,

Archive

Reporting,

Analytics

Logical Data Warehouse

Page 44: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation44

On Premise Cloud

Flu

id Q

ue

ry

IBM Fluid Query – Powering the Logical Data Warehouse

▪ In the world of big data, can you

really afford to move all your data

to the analytics?

▪ Intelligently route queries to the

correct data store

▪ Simplify and unify information

access for end users and

applications

▪ Access all data within the logical

data warehouse for analytics and

business insight

Move the query to the data, not the data to the query

Question

Answer

Hadoop

Data Warehouse

Data Mart

Operational

Other

Page 45: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation45

IBM Fluid Query – Powering the Logical Data WarehouseWithin both PureData and BigInsights

SQL access to data across any

system from Hadoop, including

relational data via IBM Big SQL

Data Warehouse

PureData System

for Analytics

Hadoop

IBM BigInsights for

Apache Hadoop

Run Hadoop queries from

your data warehouse and

move data to/from Hadoop

via IBM Fluid Query 1.0

Other SourcesIBM Fluid Query

Page 46: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation46

IBM Fluid Query 1.0

Cross platform query & data movement

between PureData System for Analytics and Hadoop

Question

Answer

Unifying PureData System for Analytics with Hadoop

Hadoop Queries

Data Movement

Page 47: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation47

IBM Big SQL adds to the capability of Fluid Query

Cross platform query & data movement

from Hadoop to PureData System for Analytics

Answer

Question

Unifying PureData System for Analytics and Hadoop

PDA Queries

Data Movement

Page 48: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation48

Cloudera and Hortonworks can access PureData but

only offer “Fluid Data” not Fluid Query

This is inefficient

Big Data is about moving “Little Data”

Answer

Question

Queries move all data back to Hadoop and do the filtering there

Always Data

Movement

Page 49: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation49

IBM Fluid Query Use Cases

Discovery, Exploration and Archive

Land all data in Hadoop for discovery, exploration & “day 0” archive

Queries originate on Hadoop can explore data stored on PureData (Big SQL)

Queries originate from PureData to Hadoop to combine Hadoop data with data in the

data warehouse

Multi-temperate data management

Run queries combining hot data from PureData with colder data from Hadoop

Utilize Hadoop as part of the logical data warehouse

Overall database can be split between PureData and Hadoop based upon frequency of

access, with hot tables or hot data in tables on PureData and colder, less frequently

accessed data residing on the Hadoop distribution

Data Warehouse Capacity Relief and Disaster Recovery

Offload colder data from PureData to Hadoop to relieve resources on the data warehouse

Copy data to Hadoop as a disaster recovery solution (Can be queried in an emergency)

Backup your database to Hadoop, in an immutable format

Queryable Archive

Query archived data on Hadoop with Big SQL or from PureData

Utilize IBM Big SQL to combine Hadoop data with other data sources

Page 50: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation50

How do I get Fluid Query 1.0?

Data Warehouse Appliance Requirements

Machine Models Minimum Software requirements

TwinFin N100x systems NPS 7.0.2 and IBM Netezza Analytics 2.5

StriperN2001 NPS 7.0.4 and IBM Netezza Analytics 2.5.4

N2002 NPS 7.1 and IBM Netezza Analytics 3.0

Mako N3001 NPS 7.2 and IBM Netezza Analytics 3.02

Included free as a feature in Netezza Platform Software (NPS)

for PureData System for Analytics appliances

Specifications

IBM Fluid Query download

Supported Hadoop Providers

IBM BigInsights 2.1, 3.0 Cloudera 4.7 and 5.3 Hortonworks 2.1, 2.2

Page 51: IBM PureData System for Analytics N3001 Overview

© 2015 IBM Corporation51

WAP12710-USEN-04