HDFS: Hadoop Distributed FS

14
© Hortonworks Inc. 2013 HDFS: Hadoop Distributed FS Steve Loughran, Hortonworks [email protected] @steveloughran ATLAS workshop, June 2013

description

HDFS: Hadoop Distributed FS. Steve Loughran, Hortonworks [email protected] @steveloughran ATLAS workshop, June 2013. What is a Filesystem?. Persistent store of data: write, read, probe, delete Metadata for organisation: locate, change A conceptual model for humans - PowerPoint PPT Presentation

Transcript of HDFS: Hadoop Distributed FS

Page 1: HDFS: Hadoop Distributed FS

© Hortonworks Inc. 2013

HDFS: Hadoop Distributed FSSteve Loughran, [email protected]@steveloughran

ATLAS workshop, June 2013

Page 2: HDFS: Hadoop Distributed FS

© Hortonworks Inc.

What is a Filesystem?

• Persistent store of data:write, read, probe, delete

• Metadata for organisation:locate, change

• A conceptual model for humans• API for programmatic access to data & metadata

Page 2

Unix is the model & POSIX its API

Page 3: HDFS: Hadoop Distributed FS

© Hortonworks Inc.

Unix is the model & POSIX its API

• directories and files:directories have children, files have data

• API: open, read, seek, write, stat, rename, unlink, flock

• Consistency: all sync()'d changes are globally visible• Atomic metadata operations: mv, rm, mkdir

Page 3

Features are also constraints

Page 4: HDFS: Hadoop Distributed FS

© Hortonworks Inc

Relax constraints scale and availability

Page 4

Sca

le a

nd a

vaila

bilit

y

Distance from Unix Filesystem model & APIext4

NFS+cross hostlocks, sync

HDFS+data locality(seek+write)

locks

S3+cross-site

appendmetadata ops consistency

Page 5: HDFS: Hadoop Distributed FS

© Hortonworks Inc.

HDFS: what

• Java code on Linux, Unix, Windows• Open Source: hadoop.apache.org• Replication rather than RAID

–break file into blocks–store across servers and racks–delivers bandwith and more locations for work

• Background work handles failures–replication of under-replicated blocks–rebalancing of unbalanced servers–checksum verification of stored files

Location data for the Job SchedulerPage 5

Page 6: HDFS: Hadoop Distributed FS

© Hortonworks Inc.

HDFS: why?

• Store Petabytes of web data: logs, web snapshots• Keep per-node costs down to afford more nodes• Commodity x86 servers, storage (SAS), GbE LAN• Accept failure as a background noise• Support computation in each server

Written for location aware applications -MapReduce, Pregel/Giraph & others that can tolerate partial failures

Page 6

Page 7: HDFS: Hadoop Distributed FS

Some of largest filesystems ever

An emergent software stack

Page 8: HDFS: Hadoop Distributed FS

© Hortonworks Inc.

HDFS: what next?

• Exabytes in a single cluster• Cross cluster, cross-sitewhat constraints can be relaxed here?

• Data Provenance, tainting• Evolving application needs. • Power budgets

Page 8

Page 9: HDFS: Hadoop Distributed FS

© Hortonworks Inc.

HDD HDD+ SSD SSD

• New solid state storage technologies emerging• When will HDDs go away?• How to take advantage of mixed storage• SSD retains the HDD metaphor, hides the details (access bus, wear levelling)

We need to give the OS and DFS control of the storage, work with the application

Page 9

Page 10: HDFS: Hadoop Distributed FS

© Hortonworks Inc

Download and Play!

http://hadoop.apache.orghttp://hortonworks.com

Page 10

Page 11: HDFS: Hadoop Distributed FS

© Hortonworks Inc

http://hortonworks.com/careers/Page 11

P.S: we are hiring

Page 12: HDFS: Hadoop Distributed FS

© Hortonworks Inc. Page 12

DataNode

DataNode

DataNode

DataNode

ToR Switch

DataNode

DataNode

DataNode

DataNode

ToR Switch

Switch

(Job Tracker)

ToR Switch

2ary Name Node

Name Node

file

block1block2block3…

Hadoop HDFS: replication is the key

Page 13: HDFS: Hadoop Distributed FS

© Hortonworks Inc.

Replication handles data integrity

• CRC32 checksum per 512 bytes• Verified across datanodes on write• Verified on all reads• Background verification of all blocks (~weekly)• Corrupt blocks re-replicated• All replicas corrupt operations team intervention

2009: Yahoo! lost 19 out of 329M blocks on 20K servers –bugs now fixed

Page 13

Page 14: HDFS: Hadoop Distributed FS

© Hortonworks Inc. Page 14

DataNode

DataNode

DataNode

DataNode

ToR Switch

DataNode

DataNode

DataNode

DataNode

ToR Switch

Switch

(Job Tracker)

ToR Switch

2ary Name Node

Name Node

file

block1block2block3…

Rack/Switch failure