Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… ·...

69
Stream Processing as Game Changer for the Internet of Things Kai Wähner [email protected] @KaiWaehner www.kai-waehner.de LinkedIn / Xing Please connect!

Transcript of Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… ·...

Page 1: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Stream Processing as Game Changer for the Internet of Things

Kai Wähner

[email protected]

@KaiWaehner

www.kai-waehner.de

LinkedIn / Xing Please connect!

Page 2: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Key Messages

– Streaming Analytics processes Data while it is in Motion!

– Automation and Proactive Human Interaction are BOTH needed!

– Time to Market is the Key Requirement for most Use Cases!

Page 3: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Agenda

– Real World Use Cases

– Introduction to Stream Processing

– Market Overview

– Relation to other Big Data Components

– Live Demo

Page 4: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Agenda

– Real World Use Cases

– Introduction to Stream Processing

– Market Overview

– Relation to other Big Data Components

– Live Demo

Page 5: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

© Copyright 2015 TIBCO Software Inc.

Find and Act on “Critical Business Moments”

“Business Moments” occur in Every Facet of Enterprise Operations, they drive competitive differentiation, customer satisfaction and business success!

Optimize Pricing Identify

fraud

Make cross-sell offers

Restock inventory

Reroute trucks

Deliver proactive customer service

Predict equipment failure & fix proactively Anticipate

and handle disruptions

Page 6: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

6

Success Story

© Copyright 2000-2015 TIBCO Software Inc.

Predictive Fault Management

Page 7: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

© Copyright 2000-2013 TIBCO Software Inc.

“An outage on one well can cost $10M per

hour. We have 20-100 outages per year.“

- Drilling operations VP, major oil company

Page 8: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Data Monitoring

• Motor temperature • Motor vibration • Current • Intake pressure • Intake temperature

Flow

Electrical power cable

Pump

Intake

Protector

ESP motor

Pump monitoring unit

Electric Submersible Pumps (ESP)

Predictive Analytics (Fault Management)

Page 9: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Voltage

Temperature

Vibration

Device

history

Temporal analytic: “If vibration spike is followed by temp spike then

voltage spike [within 12 minutes] then flag high severity alert.”

Predictive Analytics (Fault Management)

Page 10: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Live Surveillance of Equipment

© Copyright 2000-2014 TIBCO Software Inc.

Continuous, live geospatial display of pump health and predictive signal breeches

Alerts based on predictive signals

Compare live readings and signals to historical average and means

Continuous, live visualization of stats per 100’s of wells

Page 11: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

11

Success Story

© Copyright 2000-2015 TIBCO Software Inc.

Smart Manufacturing

Page 12: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

© Copyright 2000-2013 TIBCO Software Inc.

““For every 1% increase in shipped product,

we make $11MM in profit. The demand is

there, we just need to fulfill it.“

- Head of Quality, Solar Panel Manufacturer

Page 13: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

IoT for High Tech Manufacturing Yield Optimization

© Copyright 2000-2014 TIBCO Software Inc.

• Before: Solar Panel Manufacturer with No Unified View of

Manufacturing Process

– Multiple manufacturing facilities, multiple processes – no way to compare

production to yield expectations

• Negative Consequences: Sub-Optimal Production

– Operations are sub-optimal: high tolerance leads to better yield but less

output; tight tolerance means high throughput but lower yield

• Business Outcome: Higher Yield and More Runs

– Process Manufacturing can run tighter tolerances and adjust them mid-run,

predicting yield and adjusting to changing variables

– Systems proactively re-route high-value customers around affected network

areas in real-time

• How We Do It: The TIBCO Fast Data Platform

– IoT, Spotfire, StreamBase, and TERR for predictive modeling, high-speed

network by TIBCO

“For every 1% increase in shipped product, we make $11MM in profit. The demand is there, we just need to fulfill it.”

- Head of Quality, Solar Panel Manufacturer

Page 14: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

High Tech Manufacturing Yield Optimization

© Copyright 2000-2014 TIBCO Software Inc.

Continuously computed real-time analytics on streams (thresholds, min / max, average)

Analysis, alerts and triggers are based on streaming analytics

Page 15: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

High Tech Manufacturing Yield Optimization

© Copyright 2000-2014 TIBCO Software Inc.

Manufacturing operations staff drill down on any machine, any

time, to inspect and fix problems before they impact yield

Page 16: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

16

Success Story

© Copyright 2000-2015 TIBCO Software Inc.

Crowd Management

Page 17: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

© Copyright 2000-2013 TIBCO Software Inc.

“Turn the customer into a fan and increase

revenue significantly.“

Page 18: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Sacramento Kings World’s Smartest Building

© Copyright 2000-2015 TIBCO Software Inc.

Page 19: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

19

© Copyright 2000-2014 TIBCO Software Inc.

All Customers are different… Treat them that way…

Capture – Engage – Expand - Monetize

Patterns – Real time

MO

RE

PER

SON

AL

MORE CONTEXT

social

CRM

POS mobile web

e-mails

Page 20: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

20

Great success stories, but …

© Copyright 2000-2015 TIBCO Software Inc.

… how to realize these use cases?

Page 21: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Agenda

– Real World Use Cases

– Introduction to Stream Processing

– Market Overview

– Relation to other Big Data Components

– Live Demo

Page 22: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Streaming Analytics

© Copyright 2000-2015 TIBCO Software Inc.

time 1 2 3 4 5 6 7 8 9

Event Streams

• Continuous Queries • Sliding Windows • Filter • Aggregation • Correlation • …

Page 23: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Operational Intelligence in Action

© Copyright 2000-2015 TIBCO Software Inc.

Actions by Operations

Human decisions in real time informed by up to date information

The Challenge: Empower operations staff to see and seize key business moments 23

Automated action based on models of history combined with live context and business rules

The Challenge: Create, understand, and deploy algorithms & rules that automate key business reactions

Machine-to-Machine Automation

Page 24: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Operational Analytics

Operations Live UI

SENSOR DATA

TRANSACTIONS

MESSAGE BUS

MACHINE DATA

SOCIAL DATA

Streaming Analytics Action

Aggregate

Rules

Stream Processing

Analytics

Correlate

Live Datamart

Continuous query processing

Alerts

Manual action, escalation

HISTORICAL ANALYSIS

MS Excel

SAS

Data Scientists

Cleansed Data

History

Data Discovery

R

Enterprise Service Bus

ERP MDM DB WMS

SOA

BIG DATA

Data Warehouse, Hadoop

Intern

al Data

Integratio

n B

us

API

Event Server

Streaming Analytics Reference Architecture

Spark

Page 25: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Agenda

– Real World Use Cases

– Introduction to Stream Processing

– Market Overview

– Relation to other Big Data Components

– Live Demo

Page 26: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Operational Analytics

Operations Live UI

SENSOR DATA

TRANSACTIONS

MESSAGE BUS

MACHINE DATA

SOCIAL DATA

Streaming Analytics Action

Aggregate

Rules

Stream Processing

Analytics

Correlate

Live Datamart

Continuous query processing

Alerts

Manual action, escalation

HISTORICAL ANALYSIS

MS Excel

SAS

Data Scientists

Cleansed Data

History

Data Discovery

R

Enterprise Service Bus

ERP MDM DB WMS

SOA

BIG DATA

Data Warehouse, Hadoop

Intern

al Data

Integratio

n B

us

API

Event Server

Streaming Analytics Reference Architecture

Spark

Page 27: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

27

Alternatives for Stream Processing

Time

to

Market

Streaming Frameworks

Streaming Products

Slow Fast

Streaming

Concepts

Includes Includes

© Copyright 2000-2015 TIBCO Software Inc.

Page 28: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Concepts (Continuous Queries, Sliding Windows)

Patterns (Counting, Sequencing, Tracking, Trends)

Build everything by yourself!

28

What Streaming Alternative do you need?

Time

to

Market

Streaming Frameworks

Streaming Products

Slow Fast

Streaming

Concepts

© Copyright 2000-2015 TIBCO Software Inc.

Page 29: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

29

Usually not an option ...

© Copyright 2000-2015 TIBCO Software Inc.

… as there are a lot of

Frameworks and

Products available!

Page 30: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

30

Alternatives

© Copyright 2000-2015 TIBCO Software Inc.

OPEN SOURCE CLOSED SOURCE

PRODUCT

FRAMEWORK

(no complete list!)

Page 31: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Library (Java, .NET, Python)

Query Language (often similar to SQL)

Scalability (horizontal and vertical, fail over)

Connectivity (technologies, markets, products)

Operators (Filter, Sort, Aggregate)

31

What Streaming Alternative do you need?

Time

to

Market

Streaming Frameworks

Streaming Products

Slow Fast

Streaming

Concepts

© Copyright 2000-2015 TIBCO Software Inc.

Page 32: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

32

Apache Storm

© Copyright 2000-2015 TIBCO Software Inc.

Spout Bolt

Page 33: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

33

Apache Storm – Hello World

© Copyright 2000-2015 TIBCO Software Inc.

http://wpcertification.blogspot.ch/2014/02/helloworld-apache-storm-word-counter.html

Page 34: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

34

Amazon Kinesis

© Copyright 2000-2015 TIBCO Software Inc.

https://aws.amazon.com/kinesis/

AWS S3 RedShift DynamoDB

Page 35: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

35

Amazon Kinesis – Hello World

© Copyright 2000-2015 TIBCO Software Inc.

Page 36: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

36

Amazon Kinesis – The Cloud ...

© Copyright 2000-2015 TIBCO Software Inc.

… is easy to setup and scale!

But you do not have full control

• Any data that is older than 24 hours is automatically deleted • Every Kinesis application consists of just one procedure, so you can’t use Kinesis to

perform complex stream processing unless you connect multiple applications • Kinesis can only support a maximum size of 50KB for each data item

http://diamondstream.com/amazon-kinesis-big-real-time-data-processing-solution/ (blog post from 2014, might be outdated, but shows that you do not have full control over a cloud service)

Page 37: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

37

Apache Spark

© Copyright 2000-2015 TIBCO Software Inc.

General Data-processing Framework However, focus is especially on Analytics (these days)

http://fortune.com/2015/09/09/cloudera-spark-mapreduce/

Page 38: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

38

Apache Spark – Focus on Analytics

© Copyright 2000-2015 TIBCO Software Inc.

http://aptuz.com/blog/is-apache-spark-going-to-replace-hadoop/

http://fortune.com/2015/09/09/cloudera-spark-mapreduce/

http://www.ebaytechblog.com/2014/05/28/using-spark-to-ignite-data-analytics/

http://www.forbes.com/sites/paulmiller/2015/06/15/ibm-backs-apache-spark-for-big-data-analytics/

“[IBM’s initiatives] include: • deepening the integration between Apache Spark and

existing IBM products like the Watson Health Cloud; • open sourcing IBM’s existing SystemML machine

learning technology;

Page 39: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

39

Spark Streaming

© Copyright 2000-2015 TIBCO Software Inc.

Spark Streaming • is no real streaming solution • uses micro-batches • cannot process data in real-time (i.e. no ultra-low latency) • allows easy combination with other Spark components (SQL, Machine Learning, etc.)

Page 40: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

40

Apache Spark – Hello World

© Copyright 2000-2015 TIBCO Software Inc.

Spark Streaming API

Spark Core API

Page 41: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

41

Apache Flink

© Copyright 2000-2015 TIBCO Software Inc.

Spark Streaming • „Newcomer“ • Looks very similar to Spark • But „Streaming First“ concept

Page 42: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

42

Apache Flink vs. MapReduce, Tez, Spark

© Copyright 2000-2015 TIBCO Software Inc.

http://www.slideshare.net/sbaltagi/overview-of-apacheflinkbyslimbaltagi/12

Page 43: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

43

Alternatives

© Copyright 2000-2015 TIBCO Software Inc.

OPEN SOURCE CLOSED SOURCE

PRODUCT

FRAMEWORK

(no complete list!)

Page 44: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Visual IDE (Dev, Test, Debug)

Simulation (Feed Testing, Test Generation)

Live UI (monitoring, proactive interaction)

Maturity (24h support, consulting)

Integration (out-of-the-box: ESB, MDM, etc.)

Library (Java, .NET, Python)

Query Language (often similar to SQL)

Scalability (horizontal and vertical, fail over)

Connectivity (technologies, markets, products)

Operators (Filter, Sort, Aggregate)

What Streaming Alternative do you need?

Time

to

Market

Streaming Frameworks

Streaming Products

Slow Fast

Streaming

Concepts

Page 45: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

45

IBM InfoSphere Streams

© Copyright 2000-2015 TIBCO Software Inc.

Page 46: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

46

IBM InfoSphere Streams

© Copyright 2000-2015 TIBCO Software Inc.

https://developer.ibm.com/streamsdev/wp-content/uploads/sites/15/2014/04/Streams-and-Storm-April-2014-Final.pdf

Page 47: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

TIBCO StreamBase

• Performance: Latency, Throughput, Scalability – Multi-threaded and clustered server from version 1

– High throughput: Millions of messages, 100,000s of quotes, 10,000s of orders

– Low-latency: microsecond latency for algo trading, pre-trade risk, market data

• Take Advantage of High Performance Hardware – Multicore (12, 24, 32 core) large memory (10s of gigabytes)

– 64-bit Linux, Windows, Solaris deployment

– Hardware acceleration (GPU, Solace, Tervela)

• Enterprise Deployment – High availability and fault tolerance

– Distributed state management for large data sets

– Management and monitoring tools

– Security and entitlements Integration

– Continuous deployment and QA Process Support

StreamSQL compiler and static optimizer

In process, in thread adapter architecture

Visual parallelism and scaling

ActiveSpaces integration for

distributed shared state

Data parallelism and dispatch

StreamBase Server Innovations

“The StreamBase engine is for real. We couldn’t break it, and believe me, I tried” SVP Development, Top 5 Broker Dealer

Page 48: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

48

© Copyright 2000-2013 TIBCO Software Inc.

StreamBase: The Power of Visual Programming

Aggregate

Capture card activations per location

Sales too high Fraud

Log to any database

No Fraud

Sales too high?

Page 49: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Visual Debugger

Feed Simulation

Unit Testing

“StreamBase’s modeling tools are easy to use and will enable the exchange to quickly react to the ever changing needs of our customers.” Steve Goldman, Director of Enterprise Architecture

StreamBase Development Studio

Page 50: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Live Datamart

Continuous Query Processor Alerts

BusinessEvents

FTL

EMS

ActiveSpaces

Live Datamart

BusinessWorks

Social Media Data

Market Data

Sensor Data

Historical Data

ActiveSpaces Datagrid

Enterprise data

Market Data

IoT

Mobile

Social

LiveView Desktop Command & Control

ACTION

Continuous Query

Page 51: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

51

Dynamic aggregation

Live visualization

Ad-hoc continuous query

Alerts

Action

LiveView Desktop

Page 52: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

52

Spoilt for Choice – Framework or Product?

© Copyright 2000-2015 TIBCO Software Inc.

Does it make sense to combine both?

Page 53: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Example: Apache Storm + TIBCO Live Datamart

External Data

Snapshot Results

Continuous Query Processor

Query

TIBCO Live Datamart

Continuous Alerting

Active Tables Active Tables

Continuous Updates

Clients

Message Bus

Public Data

Customer Data

StreamBase Bolt StreamBase Spout

Operational Data

StreamBase Bolt and Spout connect Apache Storm to StreamBase to provide real-time analytics on operational data

Page 54: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Agenda

– Real World Use Cases

– Introduction to Stream Processing

– Market Overview

– Relation to other Big Data Components

– Live Demo

Page 55: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Operational Analytics

Operations Live UI

SENSOR DATA

TRANSACTIONS

MESSAGE BUS

MACHINE DATA

SOCIAL DATA

Streaming Analytics Action

Aggregate

Rules

Stream Processing

Analytics

Correlate

Live Datamart

Continuous query processing

Alerts

Manual action, escalation

HISTORICAL ANALYSIS

MS Excel

SAS

Data Scientists

Cleansed Data

History

Data Discovery

R

Enterprise Service Bus

ERP MDM DB WMS

SOA

BIG DATA

Data Warehouse, Hadoop

Intern

al Data

Integratio

n B

us

API

Event Server

Streaming Analytics Reference Architecture

Spark

Page 56: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

56

© Copyright 2000-2014 TIBCO Software Inc.

Real Time Close Loop

Model

Develop model

Deploy into Stream

Processing flow

Act

Automatically monitor

real-time transactions

Automatically trigger

action

Analyze

Analyze data via Data

Discovery

Uncover patterns,

trends, correlations

Page 57: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Real Time Close Loop: Understand – Anticipate – Act

Big Data

store everything in Hadoop, DWH, NoSQL, etc.

even without structure

even if you do not need it today

http://blogs.teradata.com/international/tag/hadoop/

Page 58: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Real Time Close Loop: Understand – Anticipate – Act

Data Discovery + Statistics + Machine Learning

to find insights and patterns in historical data

Page 59: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Real Time Close Loop: Understand – Anticipate – Act

Streaming Analytics

to operationalize insights

and patterns in real time

Stream

Processing Hadoop

Open Source

R

TERR

SAS

MATLAB

In-database analytics

Spark

Page 60: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

© Copyright 2000-2013 TIBCO Software Inc.

80% of betting happens

AFTER the game begins

TODAY

Page 61: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Case Study: Streaming Analytics for Betting

• Situation: Today, 80% of Betting is Done After the

Game Starts • It’s not your father’s bookie anymore!

• Problem: How to Analyze Big Betting Data? • Thousands of concurrent games, constantly adjusting odds, dozens of

betting networks – firms must correlate millions of events a day to find

the best betting opportunities in real-time

• Solution: TIBCO for Fast Data Architecture

• TXOdds uses TIBCO to correlate, aggregate, and analyze large

volumes of streaming betting data in real-time and publish innovative

predictive betting analytics to their customers

• Result: TXOdds First to Market with Innovative Zero

Latency Betting Analytics • Innovative real-time analytics help players who can process electronic

data in real-time the edge

“With StreamBase, in two

months we had our first betting

analytics feed live, and we

continually deploy new ideas

and evolve our old ones.” - Alex Kozlenkov, VP of technology, TXOdds

Page 62: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Reference Architecture: Streaming Betting Analytics

Event Processing

MONITOR

REAL-TIME ANALYTICS

AGGREGATE

HISTORICAL COMPARISON

Predictive odds

analytics

Zero Latency

Betting Analytics

GLOBAL, DISTRIBUTED INFRASTRUCTURE

Historical odds

deviations

B

U

S

BETTING LINES

SCORES

NEWS

HADOOP

Context:

Historical

Betting Data,

Odds, Outcomes

B

U

S

CACHE CACHE CACHE

Real-Time Analytics

CORRELATE

Live Datamart

SOCIAL

Page 63: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Twitter

(#TomBradyBrokenLeg)

Twitter (#Boston)

Brady’s Stats

Actionable

Insights

Real-Time Social Media Analytics

Twitter (#NFL) Something relevant happening?

Every second counts!

Change Odds (automated or manually triggered):

• Stop live-betting for the currently running game?

• How many interceptions will the Quarterback throw?

• Will the Patriots win the Super Bowl?

• …

Page 64: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

64

Real-Time Social Sentiment Analysis

Page 65: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Agenda

• Real World Use Cases

• Introduction to Stream Processing

• Market Overview

• Relation to other Big Data Components

• Live Demo

Page 66: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Predictive Sensor Analytics

Live Demo

Page 67: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Did you get the Key Message?

Page 68: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

– Streaming Analytics processes Data while it is in Motion!

– Automation and Proactive Human Interaction are BOTH needed!

– Time to Market is the Key Requirement for most Use Cases!

Key Messages

Page 69: Stream Processing - Huihoodocs.huihoo.com/javaone/2015/CON1534-Stream-Processing-Framew… · existing IBM products like the Watson Health Cloud; ... Stream Processing Analytics Correlate

Questions?

Kai Wähner [email protected]

@KaiWaehner

www.kai-waehner.de

LinkedIn / Xing Please connect!