Azure Spark - Big Data - Coresic 2016
Uploaded by nnakasone
![Page 2: Azure Spark - Big Data - Coresic 2016](https://reader036.fdocuments.in/reader036/viewer/2022070518/58e7395c1a28ab49038b4e97/html5/thumbnails/2.jpg)
Big Data is synonymous with large amounts of data
BIG DATA
![Page 3: Azure Spark - Big Data - Coresic 2016](https://reader036.fdocuments.in/reader036/viewer/2022070518/58e7395c1a28ab49038b4e97/html5/thumbnails/3.jpg)
BIG DATA
• How much data do the electronic devices on a commercial flight between London and New York generate?
• 640 TB
![Page 4: Azure Spark - Big Data - Coresic 2016](https://reader036.fdocuments.in/reader036/viewer/2022070518/58e7395c1a28ab49038b4e97/html5/thumbnails/4.jpg)
• How many Angry Birds USB drives would we need to store Big Data?
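The question reduces to quick arithmetic. The sketch below is only an illustration: it takes the 640 TB figure from the previous slide as the data volume, assumes a hypothetical drive capacity of 16 GB (the slide does not state one), and uses decimal units (1 TB = 1000 GB). The function name `drives_needed` is made up for this example.

```python
import math


def drives_needed(data_tb: float, drive_gb: float) -> int:
    """Return how many drives of drive_gb gigabytes are needed to hold data_tb terabytes."""
    data_gb = data_tb * 1000  # decimal units: 1 TB = 1000 GB
    return math.ceil(data_gb / drive_gb)


if __name__ == "__main__":
    # 640 TB comes from the slide; 16 GB is an assumed drive size.
    print(drives_needed(640, 16))
```

Under those assumptions the answer is in the tens of thousands of drives, which is the point of the slide: this volume does not fit on consumer storage.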
![Page 5: Azure Spark - Big Data - Coresic 2016](https://reader036.fdocuments.in/reader036/viewer/2022070518/58e7395c1a28ab49038b4e97/html5/thumbnails/5.jpg)
Social Network
![Page 6: Azure Spark - Big Data - Coresic 2016](https://reader036.fdocuments.in/reader036/viewer/2022070518/58e7395c1a28ab49038b4e97/html5/thumbnails/6.jpg)
The 4 Vs of Big Data
![Page 7: Azure Spark - Big Data - Coresic 2016](https://reader036.fdocuments.in/reader036/viewer/2022070518/58e7395c1a28ab49038b4e97/html5/thumbnails/7.jpg)
Technology for Handling Big Data
![Page 8: Azure Spark - Big Data - Coresic 2016](https://reader036.fdocuments.in/reader036/viewer/2022070518/58e7395c1a28ab49038b4e97/html5/thumbnails/8.jpg)
What is Hadoop?
• Hadoop consists of two main services:
• Data storage using the Hadoop Distributed File System (HDFS)
• High-performance parallel data processing using a technique called MapReduce
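To make the MapReduce idea concrete, here is a minimal single-process sketch of a word count split into map, shuffle, and reduce phases. It is not the Hadoop API: the function names are hypothetical, and everything runs in one Python process, whereas Hadoop distributes each phase across the nodes of a cluster and reads its input from HDFS.

```python
from collections import defaultdict


def map_phase(lines):
    """Map: emit a (word, 1) pair for every word in every line."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by key, between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: sum the list of values collected for each key."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(lines):
    """Run the three phases in sequence on an iterable of text lines."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


if __name__ == "__main__":
    sample = ["big data", "big clusters process big data"]
    print(word_count(sample))
```

Because each map call looks at one line and each reduce call looks at one key, both phases can be run in parallel on different machines, which is what gives the technique its scalability.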
![Page 9: Azure Spark - Big Data - Coresic 2016](https://reader036.fdocuments.in/reader036/viewer/2022070518/58e7395c1a28ab49038b4e97/html5/thumbnails/9.jpg)
Spark
![Page 10: Azure Spark - Big Data - Coresic 2016](https://reader036.fdocuments.in/reader036/viewer/2022070518/58e7395c1a28ab49038b4e97/html5/thumbnails/10.jpg)
Spark
• Works in memory
• Up to 100x faster than MapReduce
• Supports fault tolerance
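A short PySpark sketch of what working in memory looks like in practice. It assumes the `pyspark` package is installed and runs Spark in local mode rather than on a cluster; the application name, the helper names, and the sample data are hypothetical. The call to `cache()` asks Spark to keep the dataset in memory after the first action so that later actions reuse it instead of recomputing it.

```python
from operator import add


def split_words(line):
    """Lowercase a line and split it on whitespace."""
    return line.lower().split()


def count_words_with_spark(lines):
    """Count words in a local Spark session, caching the intermediate RDD in memory."""
    # Imported here so split_words can be used without Spark installed.
    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder
        .master("local[*]")             # assumption: local mode, no cluster
        .appName("word-count-sketch")   # hypothetical application name
        .getOrCreate()
    )
    try:
        words = spark.sparkContext.parallelize(lines).flatMap(split_words)
        words.cache()  # keep the RDD in memory so both actions below reuse it
        total = words.count()
        counts = dict(words.map(lambda w: (w, 1)).reduceByKey(add).collect())
        return total, counts
    finally:
        spark.stop()


if __name__ == "__main__":
    print(count_words_with_spark(["big data", "spark keeps big data in memory"]))
```

On fault tolerance: Spark records how each dataset was derived, so if a cached partition is lost it can be recomputed from its source rather than restored from a replica.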
![Page 11: Azure Spark - Big Data - Coresic 2016](https://reader036.fdocuments.in/reader036/viewer/2022070518/58e7395c1a28ab49038b4e97/html5/thumbnails/11.jpg)
Spark
• Spark SQL
• Spark Streaming
• MLlib (Machine Learning)
• GraphX
![Page 12: Azure Spark - Big Data - Coresic 2016](https://reader036.fdocuments.in/reader036/viewer/2022070518/58e7395c1a28ab49038b4e97/html5/thumbnails/12.jpg)
Jupyter
• A tool used by data scientists
• Supports several programming languages (Python, R, Julia, Scala)
• Integrates with Big Data through Spark
![Page 13: Azure Spark - Big Data - Coresic 2016](https://reader036.fdocuments.in/reader036/viewer/2022070518/58e7395c1a28ab49038b4e97/html5/thumbnails/13.jpg)
Demo: Spark in Action
https://www.youtube.com/watch?v=fUmgd58Xe58