Post on 21-Apr-2017
®© 2015 MapR Technologies 1
Taking Your Spark To Production Scale
Anil Gadre, SVP Product Management, MapR Technologies
June 15, 2015
The Journey To Production Scale
Trials, science projects
Large, mission-critical operational deployments
Companies with Spark & MapR in Production
GLOBAL TELECOM
HEALTHCARE
GLOBAL FINANCIAL SERVICES
Key Issues To Plan For
1. Spark stack support?
2. Real-time?
3. Enterprise reliability & security?
4. Open-ended agility?
Global Managed Security Services
delivered on Hadoop
Spark Stream processing used to first check for known threats
Data next processed on Hadoop using MLlib and GraphX
Additional SQL querying done via Spark SQL
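As a rough illustration of the first stage on this slide (checking incoming events against known threats before deeper analysis), the flow might be sketched as below. This is plain Python rather than the Spark Streaming API, and every name in it (`LogEvent`, `split_known_threats`, `micro_batches`, the field names) is hypothetical; the slides do not describe the actual implementation.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, List, Set, Tuple


@dataclass(frozen=True)
class LogEvent:
    """Hypothetical security log record; field names are illustrative only."""
    source_ip: str
    message: str


def micro_batches(events: Iterable[LogEvent], size: int) -> Iterator[List[LogEvent]]:
    """Group an event stream into small batches, mimicking micro-batch streaming."""
    batch: List[LogEvent] = []
    for event in events:
        batch.append(event)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:
        yield batch  # flush the final partial batch


def split_known_threats(
    events: Iterable[LogEvent], known_bad_ips: Set[str]
) -> Tuple[List[LogEvent], List[LogEvent]]:
    """Stage 1: flag events whose source matches a known threat indicator.

    Returns (alerts, remainder). The remainder is what later stages
    (machine learning, graph analysis, SQL queries) would examine.
    """
    alerts: List[LogEvent] = []
    remainder: List[LogEvent] = []
    for event in events:
        if event.source_ip in known_bad_ips:
            alerts.append(event)
        else:
            remainder.append(event)
    return alerts, remainder
```

Each batch from `micro_batches` would pass through `split_known_threats`; alerts go out immediately, and the remainder is kept for the later MLlib, GraphX, and Spark SQL stages.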
Security Intelligence Operations
Delivers Lightning Fast Analytics for Clients
• Building the largest Hadoop cluster in Australia
• Real-time analytics using Spark on MapR, reducing data loading time from hours to minutes
• Leverage multi-tenancy, high performance and reliability of MapR
Next-Gen Genomics
• Develop flexible platform to keep up with fast-changing research techniques
• POSIX file access lets bioinformaticians use existing tools with open source tools (Spark)
• Graph manipulations can be done reliably and at scale using Spark
Real-Time Customer Analytics
• MapR Data Lake stores both online and archive data
• Spark on MapR reduced ETL processing
• NFS moved data into the cluster seamlessly
• 1/10th Total Cost of Ownership vs. old way
• New customer onboarding cut from months to weeks
Databricks & MapR Strategic Partnership (since April 2014)
Support for the complete Spark stack
Engineering & roadmap collaboration
Back-end support
The Most Complete Spark Environment
Spark SQL (SQL)
Spark Streaming (Streaming)
MLlib (Machine learning)
GraphX (Graph computation)
Foundation For Enterprise-Grade Spark
Operations + Analytics on One Hadoop Platform with SQL Access
DB Operations
• Mobile application server
• Web application server
Real-Time and Actionable Analytics
• Customer 360 dashboard
• Churn analysis
• Product/service optimization and personalization
• Real-time ad targeting
• Data exploration (SQL)
Data
• User profiles and state
• User interactions
• Real-time location data
• Web and mobile session state
• Comments/rankings
Spark + MapR = Ready For Production Success
High Performance: World-record performance on disk
Reliability for Production: SLA-driven applications
• High availability
• Data protection
• Disaster recovery
24/7 Best-in-class Global Support: Strategic partnership with Databricks to ensure enterprise support for the entire stack
Operational Data Store: MapR-DB + Spark = real-time analytics
MapR Introduces 3 New Spark-Based Quick Start Solutions
Real-Time Security Log Analytics
Time Series Analytics
Genome Sequencing
Self-Service Data Exploration
Data Agility with Less IT Required
Single SQL Interface for Structured and Semi-Structured Data
Get Your Tattoo In The MapR Booth!
Show off your Kickstart My Heart skills and enter to win an Xbox 360 & Guitar Hero