Managing your Assets with Big Data Tools

Post on 15-Jul-2015

642 views 4 download

Tags:

Transcript of Managing your Assets with Big Data Tools

Managing your

Assets with Big Data Tools

Karthigai Muthu, MachinePulse

Big Data value proposition

Big Data Technology Stack

Agenda

Source: Wikipedia

Hype Cycle for Emerging Technologies

Sources of data

25+ TBs oflog data

every day 2+

billion

people

on the

Web

by end

2011

30 billion

RFID tags

today

(1.3B in

2005)

4.6

billion

camera

phones

world

wide

100s of

millions of

GPS

enabled

devices

sold

annually

76 million smart

meters in 2009…

200M by 2014

12+ TBsof tweet data

every day

? T

Bs o

fd

ata

eve

ry d

ay

What makes Data Big

Characteristics Description Attributes Drivers

Volume The amount of data generated or

intensify that must be ingested,

analyzed and managed to make

decision based on complete data

analysis

Exabyte (EB)

Zettabyte (ZB)

Yottabyte (YB)

Increase in data sources

Higher resolution sensors

Scalable infrastructure

Velocity How fast the data is being

produced and changed and the

speed at which is transformed into

insight

Batch

Near real time

Real time and Streams

Rapid feedback loop

Improved throughput connectivity

Competitive advantage

Pre-computed information

Variety The degree of diversity of data from

sources both inside and outside an

organization

Degree of structure

Complexity

M2M/IoT

Social Media

Genomics

Video and Mobile

Veracity The quality and provenance of data Consistency

Completeness

Ambiguity

Integrity

Cost

Need of traceability and justification

Big Data’s Greatest Power: Predictive Analytics

What’s driving Big Data

- Ad-hoc querying and reporting

- Data mining techniques

- Structured data, typical sources

- Small to mid-size datasets

- Optimizations and predictive analytics

- Complex statistical analysis

- All types of data, and many sources

- Very large datasets

- More of a real-time

Big Data:

Batch Processing &

Distributed Data StoreHadoop/Spark;

HBase/Cassandra/MongoDB

BI Reporting

OLAP &

Data warehouse

Business Objects, SAS,

Informatica, Cognos

other SQL Reporting

Tools

Interactive

Business

Intelligence &

In-memory

RDBMSQlikView, Tableau,HANA

Big Data:

Real Time &

Single View

Graph Databases

The Evolution of Business

Intelligence

1990’s 2000’s

2010’s

Speed

Scale

Scale

Speed

Solving business problem with big data

Formulation of big data strategy

People31%

intent20%

Data16%

Tools33%

Companies Market share in Big Data

Big Data Investments

Priority for big data across industry

Are you aware the risk of not implementing Big Data in

your company

Big data changed connected things to Internet

of Everything(IoE)

How the industry can leverage from big data

Challenges in implementing the big data

Returns of Investment(ROI)

Merge

Optimize

Respond

Empower

How do companies get MORE from big data

Are you planning to launch your new product.

Customer 360`

Customer

Social

Media

Gaming

Entertain

Banking

Finance

Our

Known

History

Purchase

Real-Time Analytics/Decision Requirement

Customer

Influence

Behavior

Product

Recommendations

that are Relevant

& Compelling

Friend Invitations

to join a

Game or Activity

that expands

business

Preventing Fraud

as it is Occurring

& preventing more

proactively

Learning why Customers

Switch to competitors

and their offers; in

time to Counter

Improving the

Marketing

Effectiveness of a

Promotion while it

is still in Play

IoT+Big Data = IoE(Internet-of-Everything)

Big Data is a factor that will, to a large extent, determine the future

growth rate in the M2M industry

M2M will connect increasingly more nodes that will provide data from

endpoints.

Data will be more granular, more frequent, and more accurate, with

bigger data sets or even live data streams

Large volume of endpoint connections IPv4 addressing scheme can’t

accommodate everything(sensors, smart phones, smart factories, smart

grids, smart vehicles, controllers, meters ) that it requires IPv6

IoE= Convergence of IoT, Big Data Analytics ,Cloud Computing and

other technologies is collectively called as Internet of Everything

Role of Big Data in M2M/IoT

Meeting the need for speed

Data understanding

Maintaining data quality

Displaying the meaningful result

Challenges of Big Data in M2M/IoT

IoT/M2M Applications..

Personal IoT: the scope is a single person, such as a smartphone

equipped with GPS sensor or a fitness device that measures the heart

rate. This is one of the fastest growing, consumer-oriented areas of IoT.

Group IoT: the scope is a fairly small group of people, such as a family

in a smart house, co-workers in a van or a group of tourists. This is one

of the most challenging areas and is still in its early phase.

Community IoT: the scope is a large group of people, potentially

thousands and more; usually this is in a public infrastructure context,

such as smart cities or smart roads. This is a young and potentially

promising IoT area.

Industrial IoT: the scope can be within an organization (smart factory)

or between organizations (retailer supply chain). This is arguably the

most established and mature part of IoT.

Big Data Use Cases – IoT/M2M

Agriculture - sensors can be deployed on farm machinery in order to provide data about

the equipment, soil temperature, moisture, etc.

Buildings/Smart Homes - Building sensors be used to help facility managers become

more proactive about ensuring that their buildings operate at peak efficiency.

Communities – Smart cities make use of parking space availability systems, intelligent

traffic monitoring systems, intelligent highways, weather-adaptive street lighting, and

more.

Healthcare – Infant monitors, smart diapers, pills with ingestible sensors are just some of

the IOT-based devices.

Manufacturing – factories with sensors can improve operations, product quality, and

decrease safety hazards.

Smartphones – can control everything from door locks, thermostats, light bulbs, vacuum

cleaners, and more.

Utilities – smart water meters can be used to reduce water leaks. Smart electric grids

can adjust rates depending on usage.

Wearables – Smart watches, fitness trackers and health monitors may become primary

source for human-related data, and can also be used in sports, retail, travel and

manufacturing.

Big Data Use cases – IoT/M2M

1. Device Maintenance:

a. Time for next patch upgrade

b. Energy management

c. Inventory management and track replacement

2. Proactive Healthcare:

Capture and analyze real time data from medical monitors to predict

potential health problems before patients manifest clinical signs of

infection.

3. Monetize Machine Data:

a. Monitor performance, usage and capacity details to uncover up-sell

and cross-sell opportunities

b. Maximize the lifespan and performance of high value medical assets

Benefits of Big Data Analytics in M2M/IoT

4. Optimize Support Operations:

a. Reduce MTTR and support escalations

b. Preempt failures with proactive support

c. Troubleshoot with accurate information

d. Proactive consultation to customers on approaching

expiry dates

Benefits of Big Data Analytics cont..

Big Data Analytics Stack

Lamda Architecture

Batch processing

- Gathering of data and processing as a group at one time.

- Jobs run to completion

- Data might be out of date

Real-time processing

- Processing of data that takes place as the information is being

entered.

- Run for ever

Batch vs. Real-Time processing

Apache Storm is a free and

open source distributed real-time

computation system.

Storm makes it easy to reliably

process unbounded streams of data,

doing for real-time processing what

Hadoop did for batch processing

Storm

Stream Processing

Fast

Scalable

Fault Tolerant

Reliable

Storm Is

Tuple

Streams

Spouts

Bolts

Topologies

Reliable Processing

Reliable Processing

Groupings are used to decide to which task in the

subscribing bolt (group) a tuple is sent.

Possible Groupings:

- Shuffle

- Fields

- All

- Global

- None

- Direct

- Local or Shuffle

Stream Grouping

Storm Cluster View

Fault Tolerance

Fault Tolerance

Fault Tolerance

Fault Tolerance

Fault Tolerance

Parallelism

Parallelism

Apache Storm Real-time -Use cases

Segment Prevent Use Cases Optimize Use Cases

Financial Services Securities fraud

Operational risks & compliance

violations

Order routing

Pricing

Telecom Security breaches

Network outages

Bandwidth allocation

Customer service

Retails Shrinkage

Stock outs

Offers

Pricing

Manufacturing Preventative maintenance

Quality assurance

Supply chain optimization

Reduced plant downtime

Transportation Driver monitoring

Predictive maintenance

Routes

Pricing

Web Application failures

Operational Issues

Personalized content

The End