5 Big Data Use Cases for 2013

Post on 22-Apr-2015

7.839 views 1 download

description

Jump start into 2013 by exploring how Big Data can transform your business. Listen to Infochimps Director of Product, Tim Gasper, cover the leading use cases for 2013, sharing where the data comes from, how the systems are architected and most importantly, how they drive business insights for data-driven decisions.

Transcript of 5 Big Data Use Cases for 2013

5 Big Data Use Cases

for 2013Tim GasperDirector of ProductInfochimps, Inc.

#5usecases

#1 enterprise cloud for big datasome of our customers our partners

#5usecases

BI & DataVisualization

Big DataApplications

Reporting &Ad-hoc Business

Questions

#5usecases

poll

#5usecases

poll

#5usecases

#5usecases

81% of companiessay Big Data is a top 5IT priority in 2013

#5usecases

#5usecases

CRM/customer supportPOS/purchasesERP/accountingemail/documents/collab.BI & data warehousesystem & network logsweb logs/clickstreamgoogle analytics/omniturefacebook/twitteryelp/foursquare/googleexperian/epsilon/acxiommobile devicessensorsproduct reviewsgoogle search results+ more

?many terabytes of data,

sometimes many petabytes

more data than ever before

#5usecases

BIG• volume• velocity• variety• variability

DATA• scalable• intelligent• agnostic• holistic

#5usecases

poll

#5usecases

#5usecases

#5usecases

14

#5usecases

1risk analysis and fraud detection

#5usecases

comprehensive data picture• build comprehensive data picture of customer-

side risk• publish a consolidated set of attributes for

analysis• add additional context, both internal and

externalparse and aggregate data from different sources

• credit and debit cards, product payments, deposits and savings

• banking activity, browsing behavior, call logs, e-mails and chats

merge data into a single view• a “fuzzy join” among data sources• structure and normalize attributes• sentiment analysis, pattern recognition

#5usecases

customer risk analysis

#5usecases

activity records in a central repository• centralized logging across all execution

platforms• structured and raw log data from multiple

applicationspattern recognition to detect anomalies/harmful behavior

• feature set and timeline vector are very dynamic

• “schema on read” provides flexibility for analysis

data is primarily served and processed in HDFS with MapReduce

• data filtering and projection in Pig and Hive• statistical modeling of data sets in R or SAS

#5usecases

surveillance & fraud detection

#5usecases

regulatorycompliance

Source:http://virtualization.sys-con.com/node/101598

ingest datasearch &

legal discovery

intraday analysis& historical analysis

(production reports +exploratory risk modeling)

TradingData

CustomerData

global investment bank#5usecases

trade risk

#5usecases

brand and sentiment analysis

2

the internet generates a lot of chatter about brands

• understanding what’s said is key to protecting brand value

• facebook & twitter generate a flood of data for large brands

capturing and processing direct feedback• better engagement and alerting via sentiment

analysis• integration with other customer service

systemshadoop handles the diverse data types and processing

• sources of data changing and semantics continuously evolving

• sophistication of algorithms is iteratively improving

#5usecases

brand & sentiment analysis

ingest datasearch &

application

trend analysis

SocialMedia

TraditionalMedia

large media conglomerate#5usecases

News, Blogs, etc.

real-time sentiment,influence, gender,

topic extraction, etc.

customer insights/behavior

3#5usecases

#5usecases

understanding customer behavior and preferences

• rapidly test and build behavioral model of customer

• combine disparate data sources (transactional, social, etc.)

structure and analyze with Hadoop• traversing usage and social graphs• pattern identification and recognition to find

indicatorsfeature extraction to find root causes

• defining attributes and modeling statistical significance

• combinations and sequence of attributes + actions factor in

customer churn analysis

#5usecases

comparison shopping is making retail hyper-competitive

• discount programs, e-mail correspondence entice shoppers

• brand loyalty means attention to detail and service

customer lifecycle is more than purchases• browsing and online data used to capture

customer attention• loyalty programs bridge the gap between

purchasesreach into online channels

• online engagement is personalized just as in store

• connecting online and in store shows customer awareness

customer loyalty

#5usecases

ingest datacustomer insight

reports

shopping patternrecognition

Demographics,Geography,

Web Data, etc.

Point Of SalePurchase Data

#5usecases

customer segmentation

targeted marketing and personalization

4#5usecases

the checkout lane is everywhere• cookies track users through ad impressions• purchasing behavior is time sensitive

logs collected online and offline• data is ingested incrementally• process happens at a variety of time scales

data logged into HBase and primary store• some events naturally associate, others require

deeper analysis• insights implemented via application logic

targeted offers

#5usecases

collect and serve personalization information• wide variety of constantly changing data

sources• data guaranteed to be messy

data ingestion includes collection of raw data• filtering and fixing of poorly formatted data• normalization and matching across data

sourcesanalysis looks for reliable attributes and groupings

• interpretation (e.g. gender by name)• aggregation across likely matching identifiers• identify possible predicted attributes or

preferences

recommendations & forecasting

#5usecases

ingest data

pre-definedweb content

and deals

behavioralcluster analysis

ClickstreamData from Online

Storefront

#5usecases

major apparel brandtargeted discounts

big data business intelligence

5#5usecases

poll

#5usecases

#5usecases

#5usecases

#5usecases

traditional datawarehousing

#5usecases

big datawarehousing

#5usecases

big datawarehousingThe Infochimps Approach

#5usecases

big data exploration & visualization

ingest data

SQL analysiswith Hive & Hue

Retail SiteWeb Logs

BI dashboarding

#5usecases

popular online deal sitebusiness command center

learn more >>

sales@infochimps.com1-855-328-2386

Request a Demo:http://infochimps.com/demo

#5usecases