1cailbbtbilgqjw00cff 140424042232-phpapp01

Hadoop Edit by Cassell Hsu 2013.04.19


Page 1: 1cailbbtbilgqjw00cff 140424042232-phpapp01

Hadoop

Edit by Cassell Hsu

2013.04.19

Page 2: 1cailbbtbilgqjw00cff 140424042232-phpapp01

Hadoop

Master

Slave

Slave

Slave

Page 3: 1cailbbtbilgqjw00cff 140424042232-phpapp01

Hadoop

•Master

•NameNode

•JobTracker

•SecondaryNameNode

Page 4: 1cailbbtbilgqjw00cff 140424042232-phpapp01

NameNode

•Where is NameNode?

•Master

•HDFS(Hadoop Distributed File System)

•What is NameNode?

•Location information of the data

•Attributes of the data

??

Page 5: 1cailbbtbilgqjw00cff 140424042232-phpapp01

NameNode

•Location information?

•All data is stored on the DataNodes
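The split of responsibilities above (the NameNode holds location information and attributes, the DataNodes hold the data itself) can be sketched as a small in-memory model. This is only an illustration and not Hadoop's API: `ToyNameNode`, its methods, the path, the owner, and the DataNode names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class ToyNameNode:
    """Toy stand-in for the NameNode: it holds metadata only, never file contents."""

    # file path -> ordered list of block ids
    file_blocks: dict = field(default_factory=dict)
    # block id -> names of the DataNodes that hold a copy of the block
    block_locations: dict = field(default_factory=dict)
    # file path -> attributes such as size and owner
    attributes: dict = field(default_factory=dict)

    def register_file(self, path, size_bytes, owner, blocks):
        """Record a file's attributes and where each of its blocks lives.

        `blocks` maps block id -> list of DataNode names, in file order.
        """
        self.attributes[path] = {"size": size_bytes, "owner": owner}
        self.file_blocks[path] = list(blocks)
        for block_id, datanodes in blocks.items():
            self.block_locations[block_id] = list(datanodes)

    def locate(self, path):
        """Return [(block id, [DataNode names])] in file order."""
        return [(b, self.block_locations[b]) for b in self.file_blocks[path]]


namenode = ToyNameNode()
namenode.register_file(
    "/user/demo/input.txt",            # hypothetical path
    size_bytes=128 * 1024 * 1024,
    owner="demo",                      # hypothetical owner
    blocks={"A": ["datanode1", "datanode2"], "B": ["datanode2", "datanode3"]},
)
print(namenode.locate("/user/demo/input.txt"))
```

A client in this model asks the toy NameNode where the blocks are and would then read the bytes from the listed DataNodes; the NameNode itself never stores file contents.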

Page 6: 1cailbbtbilgqjw00cff 140424042232-phpapp01

DataNode

•What is DataNode?

•Stores the data

•Where is DataNode?

•HDFS

•Slaves (and Master)

Page 7: 1cailbbtbilgqjw00cff 140424042232-phpapp01

User

DataNode

NameNode

DataNode

DataNode

128 MB

B (64 MB)

A (64 MB)

Check hdfs-site.xml

B (64 MB)

A (64 MB)
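The diagram's arithmetic (a 128 MB file stored as two 64 MB blocks, A and B) can be reproduced with a short sketch. The function name `split_into_blocks` is hypothetical, and the 64 MB default is taken from the slide; on a real cluster the block size comes from hdfs-site.xml (in Hadoop 1.x the property is `dfs.block.size`), so the actual value should be checked there.

```python
MB = 1024 * 1024


def split_into_blocks(file_size, block_size=64 * MB):
    """Return (offset, length) pairs that cover `file_size` bytes.

    Every block is `block_size` bytes long except possibly the last one,
    which only holds the remaining bytes.
    """
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    blocks = []
    offset = 0
    while offset < file_size:
        length = min(block_size, file_size - offset)
        blocks.append((offset, length))
        offset += length
    return blocks


# The slide's example: 128 MB with 64 MB blocks gives two blocks (A and B).
for name, (offset, length) in zip("AB", split_into_blocks(128 * MB)):
    print(name, offset // MB, length // MB)
```

With a 100 MB file the same function returns one full 64 MB block and a final 36 MB block, since the last block is not padded.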

Page 8: 1cailbbtbilgqjw00cff 140424042232-phpapp01

Hadoop

Master: NameNode, DataNode

Slaves: DataNode, DataNode, DataNode

Page 9: 1cailbbtbilgqjw00cff 140424042232-phpapp01

Hadoop

•Master

•NameNode

•JobTracker

•SecondaryNameNode

Page 10: 1cailbbtbilgqjw00cff 140424042232-phpapp01

JobTracker

•What is JobTracker?

•Schedules jobs

•Where is JobTracker?

•Master

Who does the work?

Page 11: 1cailbbtbilgqjw00cff 140424042232-phpapp01

JobTracker & TaskTracker

        JobTracker        TaskTracker

Where   Master            Slaves

What    Schedules jobs    Executes tasks
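The division of labour in the table (the JobTracker on the Master schedules, the TaskTrackers on the Slaves execute) can be sketched as follows. This is a simplified model, not Hadoop's scheduler: the round-robin policy is an assumption made for brevity, and `assign_tasks`, `run_tasks`, the task names, and the slave names are hypothetical.

```python
from itertools import cycle


def assign_tasks(tasks, tasktrackers):
    """JobTracker role: decide which TaskTracker gets which task.

    Round-robin is an assumption of this sketch; a real scheduler also
    considers free slots and where the data blocks are stored.
    """
    if not tasktrackers:
        raise ValueError("at least one TaskTracker is required")
    assignment = {tracker: [] for tracker in tasktrackers}
    for task, tracker in zip(tasks, cycle(tasktrackers)):
        assignment[tracker].append(task)
    return assignment


def run_tasks(assignment, work):
    """TaskTracker role: execute each assigned task and collect its result."""
    return {
        task: work(task)
        for tracker, tasks in assignment.items()
        for task in tasks
    }


plan = assign_tasks(["t1", "t2", "t3"], ["slave1", "slave2"])
results = run_tasks(plan, work=str.upper)
print(plan)
print(results)
```

Keeping the two functions separate mirrors the table: one component only decides, the other only executes.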

Page 12: 1cailbbtbilgqjw00cff 140424042232-phpapp01

Hadoop

Master: NameNode, DataNode, JobTracker

Slaves: DataNode, TaskTracker

Page 13: 1cailbbtbilgqjw00cff 140424042232-phpapp01

Hadoop

•Master

•NameNode

•JobTracker

•SecondaryNameNode

Page 14: 1cailbbtbilgqjw00cff 140424042232-phpapp01

SecondaryNameNode

•What is SecondaryNameNode?

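•Aids recovery when the NameNode fails (it keeps periodic checkpoints of the NameNode's metadata; it is not a hot standby)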

•Where is SecondaryNameNode?

•Master
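What makes the SecondaryNameNode useful for recovery is that it periodically merges the NameNode's saved namespace image with the log of changes made since that image was written, producing a fresh checkpoint. A minimal sketch of that merge follows; the dictionary layout, the `(operation, path, value)` tuple format, and the name `apply_edits` are assumptions for illustration and do not reflect Hadoop's on-disk formats.

```python
def apply_edits(fsimage, edits):
    """Return a new namespace image with the logged edits applied in order.

    `fsimage` maps file path -> list of block ids.
    `edits` is a list of (operation, path, value) tuples.
    The input image is left untouched, so the old checkpoint stays valid.
    """
    merged = dict(fsimage)
    for op, path, value in edits:
        if op == "create":
            merged[path] = value
        elif op == "delete":
            merged.pop(path, None)
        else:
            raise ValueError(f"unknown operation: {op}")
    return merged


fsimage = {"/a.txt": ["A", "B"]}                      # last saved image
edits = [("create", "/b.txt", ["C"]),                 # changes logged since then
         ("delete", "/a.txt", None)]
checkpoint = apply_edits(fsimage, edits)
print(checkpoint)
```

If the NameNode's metadata were lost, a checkpoint like this is what could be restored; changes logged after the last checkpoint would be missing, which is why this is a remedy rather than a live backup.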

Page 15: 1cailbbtbilgqjw00cff 140424042232-phpapp01

Hadoop

Master: NameNode, DataNode, JobTracker, SecondaryNameNode

Slaves: DataNode, TaskTracker

Page 16: 1cailbbtbilgqjw00cff 140424042232-phpapp01

MapReduce

User

Master

Slave2

Slave1

A

A1

A2

Task

NameNode

Task

Result1

Result2

Reduce

Final Result

HDFS
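The flow in this diagram (input A is split into A1 and A2, each slave runs a task on its piece, and a reduce step combines Result1 and Result2 into the final result) can be imitated in a single process. Word counting is chosen here only as a familiar example; the slides do not say what the job computes. The names `map_task` and `reduce_task` and the sample strings are hypothetical, and the sketch omits the key/value shuffle that a real MapReduce job performs between the two phases.

```python
from collections import Counter


def map_task(chunk):
    """Task run on one slave: count the words in its piece of the input."""
    return Counter(chunk.split())


def reduce_task(partial_results):
    """Reduce step: add the per-slave counts into one final result."""
    total = Counter()
    for partial in partial_results:
        total.update(partial)
    return total


a1 = "hadoop hdfs hadoop"   # piece handled by Slave1
a2 = "hdfs mapreduce"       # piece handled by Slave2

result1 = map_task(a1)
result2 = map_task(a2)
final_result = reduce_task([result1, result2])
print(dict(final_result))
```

Because each piece is processed independently, the two `map_task` calls could run on different machines; only the reduce step needs to see both partial results.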

Page 17: 1cailbbtbilgqjw00cff 140424042232-phpapp01

MapReduce

•File splitting

•Block splitting on Hadoop

•Specified by the program
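The two options above (take the split size from the HDFS block size, or let the program specify it) can be combined in one rule: start from the block size, then clamp it between a program-supplied minimum and maximum. The sketch below follows the max(min, min(max, block)) form that Hadoop's `FileInputFormat` is generally described as using; the Python function and parameter names are hypothetical, and the behaviour of a specific Hadoop version should be checked against its documentation.

```python
MB = 1024 * 1024


def compute_split_size(block_size, min_size=1, max_size=None):
    """Pick the size of each input split handed to one map task.

    With no limits from the program, the split size equals the block size.
    A program-supplied `max_size` smaller than the block makes splits smaller;
    a `min_size` larger than the block makes them larger.
    """
    if max_size is None:
        return max(min_size, block_size)
    return max(min_size, min(max_size, block_size))


print(compute_split_size(64 * MB) // MB)                      # follows the block size
print(compute_split_size(64 * MB, max_size=32 * MB) // MB)    # program asks for smaller splits
print(compute_split_size(64 * MB, min_size=128 * MB) // MB)   # program asks for larger splits
```

Smaller splits mean more map tasks per file, and larger splits mean fewer, so this choice directly affects how much work is done in parallel.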