4. Big data & analytics HP

16
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. Big Data & Analytics IoT advanced analytics powered by HAVEn Alberto de Obeso Orendain Business Intelligence Solutions Architect Hewlett-Packard

Transcript of 4. Big data & analytics HP

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Big Data & AnalyticsIoT advanced analytics powered by HAVEn

Alberto de Obeso Orendain

Business Intelligence Solutions Architect

Hewlett-Packard

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.2

Pick a box…

CBA CBA

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.3

Datification

Key enablers of BigData

Information Sources

MobileTransactional Data SearchTextsCRM, SCM, ERP

$ € ¥

ImagesEmail Social MediaIT Ops AudioVideo

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.4

Connectivity

Key enablers of BigData

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.5

The data scientist

Key enablers of BigData

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.6

The tools

Key enablers of BigData

HAVEn

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.7

HAVEn – Big Data platform

HAVEn

Catalog massive

volumes of

distributed data

Hadoop/

HDFS

Process and

index all

information

Autonomy

IDOL

Analyze at

extreme scale

in real-time

Vertica

Collect & unify

machine data

Enterprise

Security

Powering

HP Software

+ your apps

nApps

hp.com/haven

Social media IT/OT ImagesAudioVideoTransactional

dataMobile Search engineEmail Texts Documents

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.8

HAVEn Hadoop

Stores and mines any type

of data

• Structured, semi-structured,

unstructured

Excels at processing

complex data

• Workloads divided among

multiple nodes

Scales

economically

• Scale-out architecture deploys

on commodity hardware

Open source Linux-based platform for

data storage and processing that is…

Scalable

Fault tolerant

Distributed

Based on HP Gen8 ProLiant Servers

Hadoop

Distributed File

System (HDFS)

Self-healing,

high bandwidth

clustered storage

MapReduce

Distributed

Computing

Framework

Core system components

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.9

Human Information is made up of ideas, is diverse, and has context.

HAVEn Autonomy

Ideas don’t exactly match like data does; they have distance.

Human Information is not static – it’s dynamic and lives everywhere.

Only IDOL can handle the continuum of Human Information

• Single processing layer for all data

• Continuous learning ability

• Built in security & compliance functionality

• 400+ seamless data connectors & Supporting 1,000+ file types

• Language independent

• Process data in-memory, in-time, and in-place

Ability to understand meaning makes us totally unique in the market

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.10

Gain insights into your data in near-real time by running queries 50x-1,000x faster than legacy products

Blazing fast analytics

Infinitely and easily scale your solution by adding an unlimited number of industry-standard servers

Massive scalability

Get to market quickly with your analytics initiatives at low cost of administration and maintenance

Easy set-up and administration

Protect your investments, with built-in support for Hadoop, R, BI/Visualization, ETL

Open architecture

Store 10x-30x more data per server than row databases with patented columnar compression

Optimized data storage

Speed, scalability, simplicity, and openness at lower TCO

HP Vertica Analytics Platform

High-performance data analytics platform purpose built for big data

HAVEn Vertica

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.11

SmartTerritoryCity

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.12

Water Consumption

H1: Empowerment & Engagement Water Consumption

Data:

• Gallons

• pounds of Carbon Dioxide

• Temperature

• Humidity

• Precipitation

• Number inhabitants

• Age

• Household size, style,

• Household year built

• GIS data

Methods:

Identify Consumption Patterns time series

Insights

Leaks (outliers)

Establish Consumption Baseline Clustering

Demand planning Predictive analytics

regression , neural networks.

Consumption metrics:

d/d, w/w, m/m, lift, peaks

SQL vs NoSQL• Columnar

• Compression

Data Modeling• Entities vs queries

• Different roles

Inhabitant/

Government

SELECT * FROM WaterMeter;

ts | meterid | gallons

---------------------+--------+------

2014-01-01 03:00:00 | m001 | 10

2014-01-01 03:00:05 | m001 | 10.5

(2 rows)

slice_time | gallons

---------------------+------

2014-01-01 03:00:00 | 10

2014-01-01 03:00:02 | 10.2

2014-01-01 03:00:04 | 10.4

(3 rows)

=> SELECT slice_time, TS_FIRST_VALUE(gallons, 'LINEAR') gallons

FROM WaterMeter

TIMESERIES slice_time AS '2 seconds' OVER(PARTITION BY meterid

ORDER BY ts);

fit lm(gallons ~ temperature + NoMembers +

HouseSize…)

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.13

Security Information and Event Manager (SIEM)

HAVEN Enterprise Security (HP ArcSight)

Predictive

Analyzing unlimited data

interactively in real-time

Vertica

Real-time

analytics

Machine data –– event

stream and logs

Logger / CORR

Machine

generated data

and security

Tools to implement ITIL

best practices

BSM

IT Operations

Proactive

Conceptual and

contextual understanding of

all content

IDOL

Human generated

data and security

Reactive

400+ Rules for real-time, cross-device

correlation

Detect, Respond, and Prevent

Threats

Big Data Security

Expandable to entire IT

High-Performance security analytics to combat cyber security issues

Analyze machine data in real-time, across-devices to protect your IT from cyber threats

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.14

HAVEn Powering HP Software + your appsDetails on applications

HAVEn is integrated to costumers architecture through other n Apps

HP has started modifying our existing application portfolio to use HAVEn

And HP is building new applications that leverage power of HAVEn

Many customers are already building applications that use multiple HAVEn

OPERATIONS ANALYTICS.- Optimize performance by collecting ops data from diverse sources

HP in collaboration with our customers has developed 5 Apps Lines

SERVICE ANYWHERE.- Generate collective insight, and proactive action, from history, trends, structured,

unstructured to create business advantage

PROPEL.- Catalog of catalogs, all in one place for all users

DIGITAL MARKETING HUB.- Get a complete picture of the customer using an intuitive dashboard

HEALTHCARE ANALYTICS.- Leverages curated taxonomies at query time to provide advanced search functionality

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.15

Innovative analytic use cases are cutting across structured, unstructured and semi structured data

Big Data opportunities across industries and use cases

Government Telecom Manufacturing Healthcare

• Sentiment analysis

• Social CRM / network analysis

• Churn mitigation

• Brand monitoring

• Cross and Up sell

• Loyalty & promotion analysis

• Web application optimization

• Marketing campaign optimization

• Brand management

• Social media analytics

• Pricing optimization

• Internal risk assessment

• Customer behavior analysis

• Revenue assurance

• Logistics optimization

• Clickstream analysis

• Influencer analysis

• IT infrastructure analysis

• Legal discovery

• Equipment monitoring

• Enterprise search

• Drug

development

• Scientific research

• Evidence based

medicine

• Healthcare

outcomes

analysis

• Supply chain

optimization

• Defect tracking

• RFID Correlation

• Warranty

management

• Analysis and

customer retention

• Analysis of network

usage

• Monitoring hearings

and ad optimization

• Law enforcement

• Video surveillance

and security

• Traffic flow

optimization

Horizontal use cases

Sources: IDC: 2012 “Worldwide Big Data Technology and Services Forecast: 2011-2015, Gartner: 2012 “Big Data Drives Rapid Changes in Infrastructure and $232 Billion in IT Spending

Through 2016

Finance

• Customer

knowledge

• Event marketing

• Risk management

Energy

• Weather

forecasting

• Natural resource

exploration

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.

Thank you