Big Data Analytics and Predictive Analytics - _ Predictive Analytics Today
4. Big data & analytics HP
-
Upload
mitef-mexico -
Category
Business
-
view
181 -
download
0
Transcript of 4. Big data & analytics HP
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Big Data & AnalyticsIoT advanced analytics powered by HAVEn
Alberto de Obeso Orendain
Business Intelligence Solutions Architect
Hewlett-Packard
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.2
Pick a box…
CBA CBA
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.3
Datification
Key enablers of BigData
Information Sources
MobileTransactional Data SearchTextsCRM, SCM, ERP
$ € ¥
ImagesEmail Social MediaIT Ops AudioVideo
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.4
Connectivity
Key enablers of BigData
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.5
The data scientist
Key enablers of BigData
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.6
The tools
Key enablers of BigData
HAVEn
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.7
HAVEn – Big Data platform
HAVEn
Catalog massive
volumes of
distributed data
Hadoop/
HDFS
Process and
index all
information
Autonomy
IDOL
Analyze at
extreme scale
in real-time
Vertica
Collect & unify
machine data
Enterprise
Security
Powering
HP Software
+ your apps
nApps
hp.com/haven
Social media IT/OT ImagesAudioVideoTransactional
dataMobile Search engineEmail Texts Documents
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.8
HAVEn Hadoop
Stores and mines any type
of data
• Structured, semi-structured,
unstructured
Excels at processing
complex data
• Workloads divided among
multiple nodes
Scales
economically
• Scale-out architecture deploys
on commodity hardware
Open source Linux-based platform for
data storage and processing that is…
Scalable
Fault tolerant
Distributed
Based on HP Gen8 ProLiant Servers
Hadoop
Distributed File
System (HDFS)
Self-healing,
high bandwidth
clustered storage
MapReduce
Distributed
Computing
Framework
Core system components
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.9
Human Information is made up of ideas, is diverse, and has context.
HAVEn Autonomy
Ideas don’t exactly match like data does; they have distance.
Human Information is not static – it’s dynamic and lives everywhere.
Only IDOL can handle the continuum of Human Information
• Single processing layer for all data
• Continuous learning ability
• Built in security & compliance functionality
• 400+ seamless data connectors & Supporting 1,000+ file types
• Language independent
• Process data in-memory, in-time, and in-place
Ability to understand meaning makes us totally unique in the market
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.10
Gain insights into your data in near-real time by running queries 50x-1,000x faster than legacy products
Blazing fast analytics
Infinitely and easily scale your solution by adding an unlimited number of industry-standard servers
Massive scalability
Get to market quickly with your analytics initiatives at low cost of administration and maintenance
Easy set-up and administration
Protect your investments, with built-in support for Hadoop, R, BI/Visualization, ETL
Open architecture
Store 10x-30x more data per server than row databases with patented columnar compression
Optimized data storage
Speed, scalability, simplicity, and openness at lower TCO
HP Vertica Analytics Platform
High-performance data analytics platform purpose built for big data
HAVEn Vertica
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.11
SmartTerritoryCity
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.12
Water Consumption
H1: Empowerment & Engagement Water Consumption
Data:
• Gallons
• pounds of Carbon Dioxide
• Temperature
• Humidity
• Precipitation
• Number inhabitants
• Age
• Household size, style,
• Household year built
• GIS data
Methods:
Identify Consumption Patterns time series
Insights
Leaks (outliers)
Establish Consumption Baseline Clustering
Demand planning Predictive analytics
regression , neural networks.
Consumption metrics:
d/d, w/w, m/m, lift, peaks
SQL vs NoSQL• Columnar
• Compression
Data Modeling• Entities vs queries
• Different roles
Inhabitant/
Government
SELECT * FROM WaterMeter;
ts | meterid | gallons
---------------------+--------+------
2014-01-01 03:00:00 | m001 | 10
2014-01-01 03:00:05 | m001 | 10.5
(2 rows)
slice_time | gallons
---------------------+------
2014-01-01 03:00:00 | 10
2014-01-01 03:00:02 | 10.2
2014-01-01 03:00:04 | 10.4
(3 rows)
=> SELECT slice_time, TS_FIRST_VALUE(gallons, 'LINEAR') gallons
FROM WaterMeter
TIMESERIES slice_time AS '2 seconds' OVER(PARTITION BY meterid
ORDER BY ts);
fit lm(gallons ~ temperature + NoMembers +
HouseSize…)
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.13
Security Information and Event Manager (SIEM)
HAVEN Enterprise Security (HP ArcSight)
Predictive
Analyzing unlimited data
interactively in real-time
Vertica
Real-time
analytics
Machine data –– event
stream and logs
Logger / CORR
Machine
generated data
and security
Tools to implement ITIL
best practices
BSM
IT Operations
Proactive
Conceptual and
contextual understanding of
all content
IDOL
Human generated
data and security
Reactive
400+ Rules for real-time, cross-device
correlation
Detect, Respond, and Prevent
Threats
Big Data Security
Expandable to entire IT
High-Performance security analytics to combat cyber security issues
Analyze machine data in real-time, across-devices to protect your IT from cyber threats
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.14
HAVEn Powering HP Software + your appsDetails on applications
HAVEn is integrated to costumers architecture through other n Apps
HP has started modifying our existing application portfolio to use HAVEn
And HP is building new applications that leverage power of HAVEn
Many customers are already building applications that use multiple HAVEn
OPERATIONS ANALYTICS.- Optimize performance by collecting ops data from diverse sources
HP in collaboration with our customers has developed 5 Apps Lines
SERVICE ANYWHERE.- Generate collective insight, and proactive action, from history, trends, structured,
unstructured to create business advantage
PROPEL.- Catalog of catalogs, all in one place for all users
DIGITAL MARKETING HUB.- Get a complete picture of the customer using an intuitive dashboard
HEALTHCARE ANALYTICS.- Leverages curated taxonomies at query time to provide advanced search functionality
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.15
Innovative analytic use cases are cutting across structured, unstructured and semi structured data
Big Data opportunities across industries and use cases
Government Telecom Manufacturing Healthcare
• Sentiment analysis
• Social CRM / network analysis
• Churn mitigation
• Brand monitoring
• Cross and Up sell
• Loyalty & promotion analysis
• Web application optimization
• Marketing campaign optimization
• Brand management
• Social media analytics
• Pricing optimization
• Internal risk assessment
• Customer behavior analysis
• Revenue assurance
• Logistics optimization
• Clickstream analysis
• Influencer analysis
• IT infrastructure analysis
• Legal discovery
• Equipment monitoring
• Enterprise search
• Drug
development
• Scientific research
• Evidence based
medicine
• Healthcare
outcomes
analysis
• Supply chain
optimization
• Defect tracking
• RFID Correlation
• Warranty
management
• Analysis and
customer retention
• Analysis of network
usage
• Monitoring hearings
and ad optimization
• Law enforcement
• Video surveillance
and security
• Traffic flow
optimization
Horizontal use cases
Sources: IDC: 2012 “Worldwide Big Data Technology and Services Forecast: 2011-2015, Gartner: 2012 “Big Data Drives Rapid Changes in Infrastructure and $232 Billion in IT Spending
Through 2016
Finance
• Customer
knowledge
• Event marketing
• Risk management
Energy
• Weather
forecasting
• Natural resource
exploration