Scala: the unpredicted lingua franca for data science

Post on 09-Jan-2017

1.045 views 1 download

Transcript of Scala: the unpredicted lingua franca for data science

Scala: the Unpredicted lingua franca

for Data ScienceDean Wampler@deanwampler

lightbend

Andy Petrella@noootsab

Data Fellas

Distributed Data Science

Distributed Data Science is the “new” interpretation of “big data”

Big Data

Why Distributed Computing became Big Data?

Big Data was the visible part of the Iceberg

Business

Thanks @Google (for All the fish)

Enterprise ready Open Source Implementation

Hadoop (JVM -- Enterprise)

Big Data made easy → it becomes popular

Spark (Scala -- Functional)

After the How, the what

Distributed Data Science

WhyScala.snb

https://github.com/data-fellas/scala-for-data-science

Scala features for data science

Tooling, port models AND invent new models!

What’s missing in Scala/JVM?

Why Spark Notebook.snb

https://github.com/data-fellas/scala-for-data-science

Tooling for data science

Scala: the Unpredicted lingua franca

for Data ScienceDean Wampler@deanwampler

lightbend

Andy Petrella@noootsab

Data Fellas