OpenStack Data Processing ("Sahara") project update - December 2014

Post on 14-Jul-2015

170 views 0 download

Transcript of OpenStack Data Processing ("Sahara") project update - December 2014

PTLSergey Lukjanov

Data Processing UpdateOpenStack Sahara

To provide a scalable data processing stack

and associated management interfaces Sahara

● provisions & operates data proc. clusters● schedules & operates data proc. jobs /

workloads

Elastic Data Processing (EDP) is Sahara’s take on data processing workflow management.

Juno release overview

● 540+ code commits from 50 people● 32 blueprints implemented

● ~5000 code reviews● ~168 bugs fixed

details: https://launchpad.net/sahara/juno

Moved to specs for new features

Sahara UI completely merged into Horizon

Pluggable framework-agnostic EDP

Vanilla Apache Hadoop 2.4.1 support

Cloudera Distribution of Apache Hadoop 5.X support

Apache Spark 0.9.1 and 1.0.0 support

Ceilometer integration

Heat resources for Sahara available

Auto security groups creation

Kilo plans

● New versions for CDH, HDP, etc.● Dashboard UX improvements

● Better Heat integration● Ironic support