Given Enough Monkeys Some Thoughts on Randomness Jesse Anderson | CLOUDERA, INSTRUCTOR.


Transcript

Page 1:

Given Enough Monkeys

Some Thoughts on Randomness

Jesse Anderson | CLOUDERA, INSTRUCTOR

Page 2:

Infinite Monkey Theorem

“A million monkeys on a million typewriters will eventually recreate Shakespeare”

Page 3:

Million Monkeys Algorithm

Randomly generate a 9 character group

TOBEORNOT

Does it exist in Shakespeare?

To be, or not to be- that is the question
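
The steps on this slide can be sketched in a few lines of Python. This is only an illustration, not the code behind the original project: the function names `normalize`, `random_group`, and `exists_in` are hypothetical, and the normalization (uppercasing and dropping everything except the 26 letters) is an assumption inferred from the TOBEORNOT example.

```python
import random
import re
import string


def normalize(text):
    """Uppercase the text and keep only the letters A-Z (assumed normalization)."""
    return re.sub(r"[^A-Z]", "", text.upper())


def random_group(length=9, rng=random):
    """Randomly generate a group of `length` uppercase letters."""
    return "".join(rng.choice(string.ascii_uppercase) for _ in range(length))


def exists_in(group, corpus):
    """Return True if the character group appears contiguously in the corpus."""
    return group in corpus


# A one-line stand-in for the full works of Shakespeare.
corpus = normalize("To be, or not to be- that is the question")

print(exists_in("TOBEORNOT", corpus))     # the slide's example group
print(exists_in(random_group(), corpus))  # a random 9 character group
```

A real run would load the complete text as the corpus and repeat the generate-and-check step many times, recording which groups have been found.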

Page 4:

Exponential Growth (aka Big Data)

The odds of finding a group of characters are 1 in 26 raised to the power of the number of contiguous characters (n):

1 in 26^n

Contiguous Characters    Combinations
8                        208,827,064,576
9                        5,429,503,678,976
10                       141,167,095,653,376
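
The figures above follow directly from the 26^n formula and can be reproduced with a short Python check. The names `ALPHABET_SIZE` and `combinations` are chosen here for illustration only.

```python
ALPHABET_SIZE = 26  # letters A-Z, as on the slide


def combinations(n):
    """Number of distinct groups of n contiguous characters."""
    return ALPHABET_SIZE ** n


for n in (8, 9, 10):
    print(f"{n:>2}  {combinations(n):,}")
```

Each extra character multiplies the number of combinations by 26, which is why the work grows so quickly with group length.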

Page 5:

Data Bias?

Page 6:

Hadoop Scalability

[Chart: Percent of Linear Scalability (y-axis, 0 to 100) against Nodes (x-axis, 1 to 20), comparing RDBMS and Hadoop. RDBMS = Relational Database]

Page 7:

Business Value of Scalability

Adding more computers to the cluster gets a predictable increase in computational power and storage

Scaling does not require massive re-engineering or complete rewrites of code

SAVE $$$

SAVE TIME

Page 8:

Going Viral (and taking over the world)

26,000 unique visits from 119 countries in one day

Covered internationally by the BBC, the Wall Street Journal, Wired, and Slashdot

Page 9:

@jessetanderson