Cloud Austin Hadoop Automation Lighting Talk 2014.11.18



Hadoop ETL Automation - How to get to the fun part of big data in the shortest amount of time.

Transcript of Cloud Austin Hadoop Automation Lighting Talk 2014.11.18

Page 1: Cloud Austin Hadoop Automation Lighting Talk 2014.11.18

dataFundamentals

Hadoop Automation in 15 Minutes

Or how to get to the fun stuff before your boss pulls the plug.

Page 2: Cloud Austin Hadoop Automation Lighting Talk 2014.11.18

ETL is not the Fun Stuff in Big Data

❖ Analytics

❖ Machine Learning

❖ Spark

❖ [even just Building APIs]

But you can’t do the fun stuff until your corporate data is in place to work against. Chicken and egg problem.

Page 3: Cloud Austin Hadoop Automation Lighting Talk 2014.11.18

Quick! Before your boss turns off the spigot!

❖ Automate your ETL processes.

❖ Automate your server instances.

Page 11: Cloud Austin Hadoop Automation Lighting Talk 2014.11.18

What kind of code to Automate?

❖ Clean code. Super clean.

❖ Well designed code.

Page 12: Cloud Austin Hadoop Automation Lighting Talk 2014.11.18

Other pitfalls?

❖ NIH, Not Invented Here

Page 13: Cloud Austin Hadoop Automation Lighting Talk 2014.11.18

How to get the fun tasks?

❖ 2 week P.O.C.

❖ Your sample data

Page 14: Cloud Austin Hadoop Automation Lighting Talk 2014.11.18

Code, Content, Contacts

❖ This Slide Deck: http://www.slideshare.net/petecarapetyan/cloud-austin-hadoop-automationlightingtalk141118

❖ or just remember slideshare.net/datafundamentals

❖ YouTube: 11-minute slide-less version of the code demo: https://www.youtube.com/playlist?list=PLO_T9AjxEaYeByfqBqHVCmg4GbLFkYCJe

❖ Dev Code

❖ Carrie (ruby UI and generator) https://github.com/datafundamentals/df_ui_carrie

❖ Avro from delimited https://bitbucket.org/datafundamentals/avro_from_delimited

❖ Camel-Avro https://bitbucket.org/datafundamentals/camel-avro-etl

❖ Ops Code - cookbook recipes

❖ https://github.com/datafundamentals

❖ Contact

Jeff: Twitter @devopsjeff

Pete: Twitter @appwritercom

Site: datafundamentals.com

Be careful! It’s a competitive world out there!