1 ©MapR Technologies - Confidential Super-Fast Clustering Report from MapR workshop.
MapR 6.0 Powers DataOps
-
Upload
mapr-data-technologies -
Category
Data & Analytics
-
view
162 -
download
1
Transcript of MapR 6.0 Powers DataOps
© 2017 MapR TechnologiesMapR Confidential 1
MapR 6.0 Powers DataOps:Unleash the Value of Your Data with New
Features in the MapR Converged Data Platform
Mitesh Shah, Director Product Marketing, MapR
Prashant Rathi, Sr. Product Manager, MapR
December 5, 2017
© 2017 MapR TechnologiesMapR Confidential 2
No Friction
Turning Data into Value is Easy When There is No Friction
(there’s always friction)
Data Value
Systems that are inflexible, hard to manage, insecure, …
People Friction
Process Friction
Technology Friction
Waterfall not agile.
Cumbersome audit and compliance requirements.
Organizational silos.
© 2017 MapR TechnologiesMapR Confidential 3
What is DataOps?
+
DevOps Data EngineersData Scientists
=
DataOps helps an organization rapidly deliver value from data by supporting agility and accelerating and enabling the integration of operations and analytics.
Day Zero Operations
Embrace Data Flows
Always On
All Data
Secure the Data Not Access Method
Self-service Not Dependency
Convergence Not Orchestration
Distributed
DataOpsPrinciples
DataOps
© 2017 MapR TechnologiesMapR Confidential 4
MapR Powers DataOps to Unleash Greater Value
from All Data
Now available, MapR Converged Data Platform 6.0 adds innovations for
security, database, and automated administration, across clouds
• Real-time Data Integration with Innovations in MapR-DB• Self-service Data Science with Data Science Refinery• Secure Data with Single-Click Security Enhancements• Cloud-scale Multi-Tenancy and Edge to Cloud File Migrate• Automatic Platform Health and Security with the New MCS
© 2017 MapR TechnologiesMapR Confidential 5
Real-time Data Integration with Innovations in MapR-DB
© 2017 MapR TechnologiesMapR Confidential 6
MapR-DB Innovations in 6.0
• Integrated Operational DB for Mission Critical Apps
• Horizontal Scalability
• Extreme Performance
• 24X7 Reliability
• HBase API compatible
HIGH PERFORMANCE WIDE COLUMN
DATABASE
• Cross Data Center active/active Replication
• Fine Grained Security controls
GLOBAL DATABASE
• Native JSON Support
• Comprehensive Datatypes
• Granular & Efficient operations
• Trillions of documents, Millions of tables
• Open & Intuititve OJAI APIs
MULTI-MODEL DATABASE
W/DOCUMENT DATA MODEL
• Native Secondary indexes
• Rich OJAI 2.0 Query APIs
• Optimized Drill/SQL analytics & BI
• Advanced Analytics w/Native Spark and Hive connectivity
• Global Real-time Change Data Capture
DATABASE FOR GLOBAL DATA-
INTENSIVE APPS
MapR-DB 6.0 release
© 2017 MapR TechnologiesMapR Confidential 7
Real-Time Data Integration and Micro-Services w/Global
Change Data Capture
Allows arbitrary external systems to consume changes in MapR-DB tables globally
Build Scalable real time data hubs for fast ingesting and fast ingesting big data
Enables Real-time event driven micro-services app fabrics to create rich experiences
Machine Learning
Models
Microservices
Elastic Search
Change Data CaptureRemote MapR-DB
© 2017 MapR TechnologiesMapR Confidential 8
Self-service Data Science with Data Science Refinery
© 2017 MapR TechnologiesMapR Confidential 9
The MapR Data Science VisionA Holistic Approach To Self-Service Data Science
MAPR DATA SCIENCE REFINERY REFINERY DATA SCIENTISTS
Data Scientist led product-and-
services offerings including Quick
Start Solutions (QSS) & Training
REFINERY PARTNERSHIPS
Expand on what we offer in-
product to meet the needs of all
data science teams
An easy-to-deploy, secure, and
extensible data science offering
that leverages all existing platform
assets
MAPR CONVERGED DATA PLATFORM
© 2017 MapR TechnologiesMapR Confidential 10
Secure Data with Single Click Security Enhancements
© 2017 MapR TechnologiesMapR Confidential 11
Criminals accessed, copied, and deleted data from unpatched or badly configured databases, and then held the data as ransom.
Cloud computing misconfiguration resulted in vulnerabilities that exposed information about 200 million voters, including names, dates of birth, home addresses, phone numbers, and voter registration details.
Recent Security Issues Caused by Misconfiguration
Over 35,000 servers were found open to the internet on AWS. Hundreds of instances were compromised and the data was held for ransom.
Major NoSQL DB
Major Cloud Storage Provider
Open Source Search Engine
© 2017 MapR TechnologiesMapR Confidential 12
High Availability Real Time Security & Governance Multi-tenancy Disaster Recovery Global Namespace
Converge-X™ Engine
HDFS APIPOSIX, NFS HBase API JSON API Kafka API
MapR Introduces Single-Click Security Enhancements
Event Data Streams
Analytics &Machine Learning Engines
Operational Database
Cloud-scale Data Store
* Some exceptions apply.
Encryption on the Wire*Authentication
Enforcement
CLEA
RTEX
T A
ZDD
SAD
UX
© 2017 MapR TechnologiesMapR Confidential 13
Cloud-scale Multi-Tenancy and Edge to Cloud File Migrate
© 2017 MapR TechnologiesMapR Confidential 14
PRIVATE CLOUD
Cloud-scale Multi-tenancy
OpenStack Manila Plugin
PUBLIC CLOUD
Cloud-native Operations
Cloud Storage Integrations
Object Tiering
REST APIs
MULTI CLOUD
Mirroring
Replication
EDGE
Small Footprint
Edge to Cloud File Migrate
Data Queueing
Bandwidth Optimization
MapR Orbit Cloud Suite
© 2017 MapR TechnologiesMapR Confidential 15
TenantSan Francisco Giants
TenantOakland Athletics
VMs Users & Groups
Analytical and Machine Learning Engines
Event Data StreamsCloud Scale Data Store
High Availability Real Time Security & Governance Multi-tenancy Disaster Recovery Global Namespace
Converge-X Data Fabric
Operational Database
Manila Plugin
MapR Volumes
VMs Users & Groups
MapR Volumes
• Hosting multiple organizations (users, groups) on the same data platform
• Security: Ensuring intra-organization privacy as well as intra-organization policies
Competitive Note: Capability not found in Hadoop competitors (CDH, HDP), NoSQL competitors (MongoDB, Couchbase), or scale-out storage competitors.
The Challenge:
• Tenant concept built-in to data services – all users identified by (tenant, user, [groups])
• Tenant volumes hidden from other tenants
• Enforce intra-tenant access control within volumes
• Integration with OpenStack Manila for tenant self-service provisioning of data shares (volumes)
How MapR Solves:
Cloud-Scale Multi-Tenancy & OpenStack Manila Plugin
(for native file access)
© 2017 MapR TechnologiesMapR Confidential 16
Edge to Cloud File Migrate
The Challenge:
• Insufficient local compute for local analytics, need cloud.
• Existing ETL tools don’t meet reliability or time sensitivity requirements.
How MapR Solves:
• Edge to Cloud File Migrate service deploys to each edge site, watches MapR-XD for new files, immediately transfers to the cloud.
• Intelligent use of MapR metadata services to ensure performance and reliability.
Real-time, automatic movement of files from edge to the cloud
Ideal for mixed-processing workloads– some edge, some cloud
© 2017 MapR TechnologiesMapR Confidential 17
The New MapR Control System
© 2017 MapR TechnologiesMapR Confidential 18
Benefits of the New MCS
Greater Administrative Overhead
Higher OpEx
Increased Probability of Failures
Reduced Administrative Overhead through Unified Data Management
Lower OpEx
Unparalleled Cluster Stability and Health
MapR Converged Data Platform and The New MCSThe Other Guys (Crisis of Complexity)
© 2017 MapR TechnologiesMapR Confidential 19
MCS Demo
© 2017 MapR TechnologiesMapR Confidential 20
Summary
© 2017 MapR TechnologiesMapR Confidential 21
MapR Converged Data Platform 6.0
• Architected to power DataOps
• Available Now
• Cloud provider marketplaces, such as Microsoft Azure, Amazon Web Services, and Oracle Cloud will have version 6.0 available before end of year
MapR 6.0 delivers:• Automatic Platform Health and Security• Real-time Data Integration• Secure, Discoverable Data• Self-Service Machine Learning / Artificial Intelligence
© 2017 MapR TechnologiesMapR Confidential 22
Q&A
ENGAGE WITH US
@mapr
© 2017 MapR TechnologiesMapR Confidential 23
Appendix
© 2017 MapR TechnologiesMapR Confidential 24
The Trend from Data Warehousing to Data Science
Data Warehouses Data Lakes Limited Machine Learning Machine Learning Everywhere
Analysts Data Scientists
Need to Allow for Access to All DataFlexibility and Choice in Tools is Critical
Security
SecurityTargeted Offers
Fraud DetectionPredictive
Maintenance
Smart Cars
Targeted Offers
© 2017 MapR TechnologiesMapR Confidential 25
ENTRY POINTS IN THE CUSTOMER JOURNEY
McKinsey calls these companies “Adopters”. Gartner estimates they solve between 10-100 business problems in three to five years.
McKinsey calls these companies “Partial Adopters & Experimenters”. Gartner estimates they solve between 3-20 business problems in three to five years.
McKinsey calls these companies “Contemplators”
Data Science Curious
Adjacent Data Science Teams
Corporate Data Science Teams
20%41%40%