Post on 28-May-2015
Open-source software development frameworks
Open-source software dev frameworks
(and many more...)
• Avoid repeat work
• Remain 100% customizable
• Escape vendor lock-in
Every platform needs open-source, code-based dev frameworks
But for data? Nothing.
Data work today
• Proprietary / GUI / No framework
• Not reusable
• Little collaboration
• Too many errors, too slow to fix
an open source dev framework for data
“Mortar takes something complex and makes it simple and intuitive.”
—Jon Coveney, Twitter
What big data system should I use? Hadoop, HPCC, Disco, Storm…
What Hadoop distro should I use?
How much will this cost? Should I do it some other way?
How long is this going to take to learn? Should I do it some other way?
How many machines should I buy?
Can I run in the cloud?
How should I interact with it? Pig, Hive, Cascading, Scalding, Cascalog
Can I use libraries I need?
How do I install everything?
How can my team share code?
How can I reuse code?
Is my workload typical?
How can I safely deploy?
How do I know if what I've written is correct?
Are there any libraries I could start with?
Can I connect to my key-value store?
Can I do machine learning in Hadoop?
Is it secure?
What if I need help?
Weeks
More Weeks
Even More Weeks
“Our focus in designing Pig has always been to make Hadoop easy...
Mortar's approach is right on—they extend our quick start and ease of use focuses with pre-built Hadoop clusters, clear examples, code organization templates, and github for social sharing of the code.”
—Alan Gates, Hortonworks co-founder, open sourced Pig
Can Mortar help you?
Mortar is for analyzing lots of data in AWS.
Who is Mortar for?
Mortar serves companies of all sizes from any industry.
“...Mortar fits right in with our vision of the future... With this exciting launch, MongoDB users can now also seamlessly use Mortar.”
—Dwight Merriman, founder of 10gen (maker of MongoDB), DoubleClick (acq. Google), ShopWiki, Business Insider, Gilt Groupe
By and for engineers and data scientists
> gem install mortar
> mortar new my_project
> git clone your_project
> mortar run your_project
Pig is easy to learn (and we’ve made it easier)
Illustrate is awesome
• Find your mistakes
• Understand code before collaborating
• Automated tests: a way to test every condition
Hadoop & Python are powerful data science tools
...but they haven’t worked together before.
Now you can use Hadoop & real Python on Mortar
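To make "real Python on Hadoop" concrete, here is a minimal sketch of a Python function written in the style of a Pig UDF. It is an illustration only, not code from this deck: the function name extract_domain and its schema string are hypothetical, and the fallback decorator exists only so the file also runs outside a Pig runtime. It assumes Pig's Python UDF convention of an outputSchema decorator imported from pig_util.

```python
# Sketch of a Pig-style Python UDF (hypothetical example, not from the deck).
from urllib.parse import urlparse

try:
    # Assumption: available when the script is run by Pig's Python UDF runtime.
    from pig_util import outputSchema
except ImportError:
    # Local stand-in so the function can be run and tested without Pig.
    def outputSchema(schema):
        def wrap(func):
            func.output_schema = schema
            return func
        return wrap


@outputSchema("domain:chararray")
def extract_domain(url):
    """Return the lowercase host name of a URL, or None if it has none."""
    if not url:
        return None
    return urlparse(url).hostname


if __name__ == "__main__":
    print(extract_domain("http://www.Example.com:8080/path?q=1"))
```

Because the function is plain Python, it can be checked locally before it is registered in a Pig script and run on a cluster.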
What you just saw
• Installed Mortar
• Made a new project
• Cloned a project
• Ran the project
• Illustrated the project
• Used Python and other libraries on Hadoop
2 options for using Mortar:
- Web Projects: zero install, in the browser
- Git Projects: modularity, testability, code sharing, local dev, and revision control.
One-hour challenge
• Use your browser
• Minutes to connect data
• Productive in one hour
As a good citizen, Mortar has a rich API
How does Mortar fit with other
How about speed?
Full speed, directly on Hadoop
Mortar revolutionizes your data pipeline.
• Easy start
• Keeps you productive
• Collaborate with data
• No lock-in
• Easy to budget
Tiers
• Free | Service use unlimited | 10 node-hours
• Pay as you Go | $0.89/node-hour | support
• Enterprise | $3,000/month | $0.69/node-hour | live support
mortardata.com / @mortardata