Pulsar - Real Time Analytics at Scale - eMetrics SF 2016

Pulsar - Real Time Analytics At Scale Dror Engel Product Lead - eBay http://www.linkedin.com/in/drorengel @drorengel eMetrics Summit, SF, April 5, 2016


Page 2

Global Connected Commerce

$32B GMV VIA MOBILE (2015)

304M MOBILE DOWNLOADS GLOBALLY

1.4B LISTINGS CREATED VIA MOBILE

162M ACTIVE BUYERS

25M ACTIVE SELLERS

800M ACTIVE LISTINGS

9.2M MOBILE LISTINGS EVERY WEEK

Page 3

Global Commerce Velocity

Every 7 seconds (U.S.A.)

Every 2 mins (UK)

Every 2 hours (Korea)

Page 4

COMMERCE IS AT AN INFLECTION POINT

Offline / online lines are collapsing

Customer expectations are changing

Page 5

CHECKOUT

WATCH LIST SHARE

BIDDING

SERVE AD

ZOOM

100s of PB DATA

10B EVENTS/DAY

36TB X-PLATFORM DATA TRANSFERS PER MONTH

CLICK

IMPRESSIONS

DATA
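For a sense of scale, the 10B events/day figure on this slide can be converted to an average per-second rate. The short calculation below is only a back-of-the-envelope check and assumes traffic is spread evenly over the day, which real traffic is not, so peak rates would be higher.

```python
# Figure taken from the slide: "10B events/day".
EVENTS_PER_DAY = 10_000_000_000
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400

# Assumption: events are spread uniformly across the day.
avg_events_per_second = EVENTS_PER_DAY / SECONDS_PER_DAY

print(f"{avg_events_per_second:,.0f} events/second on average")
# prints "115,741 events/second on average"
```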

Page 6

TECHNOLOGY TRENDS

•  Customer-centric continuous intelligence

•  Faster analysis (Daily -> Hourly -> Minutes -> Seconds)

•  Bigger data volume and processing

•  Big data technologies shift from POC to production use cases

•  More data points: IoT services; link data quickly and conveniently

•  More data sources

•  Fast data exploration capabilities - OLAP

Page 7

CONNECTING WITH USER BEHAVIOR DATA

User Behavior Data

Real-time reporting

Business activity monitoring

Personalization

Advertising

Marketing

Fraud & Bot Detection

Page 8

ENABLING DATA INSIGHTS AT SCALE

Pulsar is a real-time analytics platform, open-sourced in 2015, that includes stream processing, a metrics store, and reporting frameworks. Pulsar is used to collect and process user and business events in real time, provide key insights through custom dashboards, and enable systems to react to user activities within seconds.
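To make the idea of turning an event stream into dashboard-ready metrics more concrete, here is a minimal sketch of per-second (tumbling-window) event counting. It is not Pulsar's API or implementation; the function name `tumbling_window_counts`, the `(timestamp, event_type)` tuple shape, and the sample events are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def tumbling_window_counts(events, window_seconds=1):
    """Count events per (window start, event type).

    `events` is an iterable of (timestamp_in_seconds, event_type) tuples;
    this shape is a hypothetical stand-in for a real event schema.
    """
    counts = defaultdict(Counter)
    for timestamp, event_type in events:
        # Align each event to the start of its fixed-size window.
        window_start = int(timestamp // window_seconds) * window_seconds
        counts[window_start][event_type] += 1
    # Return plain dicts, ordered by window start.
    return {w: dict(c) for w, c in sorted(counts.items())}


# Hypothetical sample stream.
events = [
    (0.2, "click"),
    (0.7, "click"),
    (0.9, "impression"),
    (1.1, "checkout"),
    (1.8, "click"),
]
metrics = tumbling_window_counts(events)
print(metrics)
```

A real deployment would run this kind of aggregation continuously over an unbounded stream and write the results to a metrics store; the sketch processes a small in-memory list only to show the windowing logic.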

Page 9

Key Customer Demands

SCALABILITY: Millions of events per second

LATENCY: < 1 second from source to end-user

PROCESSING: Enrichments (1st- and 3rd-party data sources), filtering (bots), grouping (customer level), ordering

AVAILABILITY: 99.99% uptime, no downtime during upgrades, self-healing and distributed

FLEXIBILITY: Integrations with other sources & channels
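The processing demands on this slide (enrichment, bot filtering, customer-level grouping, ordering) can be sketched as a small batch pipeline. This is an illustrative sketch only, not how Pulsar implements these steps: the `Event` fields, the `CUSTOMER_COUNTRY` lookup table, and the user-agent heuristic in `BOT_MARKERS` are all hypothetical.

```python
from dataclasses import dataclass, replace
from itertools import groupby
from typing import Optional


@dataclass(frozen=True)
class Event:
    # Hypothetical event schema, for illustration only.
    customer_id: str
    timestamp: float
    event_type: str
    user_agent: str
    country: Optional[str] = None  # filled in by enrichment


# Hypothetical lookup standing in for a 1st- or 3rd-party data source.
CUSTOMER_COUNTRY = {"c1": "US", "c2": "UK"}
# Assumed, deliberately naive bot heuristic based on the user agent.
BOT_MARKERS = ("bot", "crawler", "spider")


def enrich(event):
    """Attach the customer's country, if known."""
    return replace(event, country=CUSTOMER_COUNTRY.get(event.customer_id))


def is_bot(event):
    user_agent = event.user_agent.lower()
    return any(marker in user_agent for marker in BOT_MARKERS)


def process(events):
    """Enrich, drop bot traffic, then group by customer in time order."""
    enriched = (enrich(e) for e in events)
    humans = [e for e in enriched if not is_bot(e)]
    # Sorting by (customer, time) gives both the grouping key order
    # that groupby needs and the per-customer event ordering.
    humans.sort(key=lambda e: (e.customer_id, e.timestamp))
    return {
        customer_id: list(group)
        for customer_id, group in groupby(humans, key=lambda e: e.customer_id)
    }


raw = [
    Event("c1", 3.0, "checkout", "Mozilla/5.0"),
    Event("c2", 1.0, "click", "Googlebot/2.1"),
    Event("c1", 1.0, "click", "Mozilla/5.0"),
    Event("c2", 2.0, "bid", "Mozilla/5.0"),
]
by_customer = process(raw)
for customer_id, customer_events in by_customer.items():
    print(customer_id, [e.event_type for e in customer_events])
```

A streaming system would apply the same steps incrementally per event instead of sorting a complete batch; the batch form is used here only because it keeps the four steps easy to see.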

Page 10

Pulsar Lessons Learned

–  Connecting data points is the present, not the future

–  By connecting new data points, you bring many new insights. Always seek to add new dots to draw the full picture

–  A complete view of the customer journey is essential, but leveraging real-time signals is the future

–  Real-time insights must be aggregated at the customer level to deliver actionable insights

Page 11

More Information

GitHub: http://www.github.com/PulsarIO

Website: http://www.gopulsar.io

Page 12

“In God we trust, all others must bring data”

W. Edwards Deming

Page 13

THANK YOU!

http://www.linkedin.com/in/drorengel @drorengel