Operational Insight: Concepts and Examples (w/o Presenter Notes)

Operational Insight June 15, 2015 Roy Rapoport @royrapoport / linkedin.com/in/royrapoport / [email protected]

Page 1: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Operational Insight

June 15, 2015

Roy Rapoport

@royrapoport / linkedin.com/in/royrapoport / [email protected]

Page 2: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Oh, The Places We’ll Go!

Page 3: Operational Insight: Concepts and Examples (w/o Presenter Notes)
Page 4: Operational Insight: Concepts and Examples (w/o Presenter Notes)

John Boyd

Page 5: Operational Insight: Concepts and Examples (w/o Presenter Notes)
Page 6: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Observe

Page 7: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Observe

Orient

Page 8: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Observe

Orient

Decide

Page 9: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Observe

Orient

Decide

Act

Page 10: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Observe

Orient

Decide

Act

OODA

Page 11: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Observe

Orient

Decide

Act

OODA

“This approach favors agility over raw power in dealing with human opponents in any endeavor” - Wikipedia

Page 12: Operational Insight: Concepts and Examples (w/o Presenter Notes)

This Is What We Do

Page 13: Operational Insight: Concepts and Examples (w/o Presenter Notes)
Page 14: Operational Insight: Concepts and Examples (w/o Presenter Notes)
Page 15: Operational Insight: Concepts and Examples (w/o Presenter Notes)
Page 16: Operational Insight: Concepts and Examples (w/o Presenter Notes)

OODA KPI

Page 17: Operational Insight: Concepts and Examples (w/o Presenter Notes)

OODA KPI

Speed

Page 18: Operational Insight: Concepts and Examples (w/o Presenter Notes)

OODA KPI

Speed Effort

Page 19: Operational Insight: Concepts and Examples (w/o Presenter Notes)

OODA KPI

Speed Effort Reliability

Page 20: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Winning

Speed Effort Reliability

Page 21: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Winning

Speed

Effort Reliability

Page 22: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Winning

Speed

Effort

Reliability

Page 23: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Winning

Speed

Effort

Reliability

Page 24: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Observation (aka measurement, telemetry, metrics)

Page 25: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Observation (aka measurement, telemetry, metrics)

• Make It Easy

Page 26: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Observation (aka measurement, telemetry, metrics)

• Make It Easy

• Make It Scalable

Page 27: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Observation (aka measurement, telemetry, metrics)

• Make It Easy

• Make It Scalable

• Make it pluggable

Page 28: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Observation (aka measurement, telemetry, metrics)

• Make It Easy

• Make It Scalable

• Make it pluggable

• (Eventually) Ruthlessly Cull

Page 29: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Observation (aka measurement, telemetry, metrics)

• Make It Easy

• Make It Scalable

• Make it pluggable

• (Eventually) Ruthlessly Cull

“What decision will this help me make?”

Page 30: Operational Insight: Concepts and Examples (w/o Presenter Notes)

A Joke

Page 31: Operational Insight: Concepts and Examples (w/o Presenter Notes)
Page 32: Operational Insight: Concepts and Examples (w/o Presenter Notes)

52

48

Page 33: Operational Insight: Concepts and Examples (w/o Presenter Notes)

% of servers in major region with an even IP address

Page 34: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Orientation (aka graphing, visualization)

Page 35: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Orientation (aka graphing, visualization)

• First-class product

Page 36: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Orientation (aka graphing, visualization)

• First-class product

• Different decisions require different viz

Page 37: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Orientation (aka graphing, visualization)

• First-class product

• Different decisions require different viz

• Low cognitive load better than

Page 38: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Orientation (aka graphing, visualization)

• First-class product

• Different decisions require different viz

• Low cognitive load better than

  • High refresh rates

Page 39: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Orientation (aka graphing, visualization)

• First-class product

• Different decisions require different viz

• Low cognitive load better than

  • High refresh rates

  • Deep data density

Page 40: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Better Like This …

Page 41: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Or Better Like That …

Page 42: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Decisions (aka alerting, real-time analytics, etc)

Page 43: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Decisions (aka alerting, real-time analytics, etc)

• You already have (some of) this

Page 44: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Decisions (aka alerting, real-time analytics, etc)

• You already have (some of) this

• Incremental improvement

Page 45: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Decisions (aka alerting, real-time analytics, etc)

• You already have (some of) this

• Incremental improvement

• Sky’s the limit

Page 46: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Decisions (aka alerting, real-time analytics, etc)

• You already have (some of) this

• Incremental improvement

• Sky’s the limit

  • For benefits

Page 47: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Decisions (aka alerting, real-time analytics, etc)

• You already have (some of) this

• Incremental improvement

• Sky’s the limit

  • For benefits

  • For cost

Page 48: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Action

Page 49: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Action

1. Humans beat bureaucracy

Page 50: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Action

1. Humans beat bureaucracy

2. Machines beat humans

Page 51: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Action

1. Humans beat bureaucracy

2. Machines beat humans

3. Repeatability beats one-offs

Page 52: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Action

1. Humans beat bureaucracy

2. Machines beat humans

3. Repeatability beats one-offs

Repeatable machine processes TROUNCE one-off human bureaucracy

Page 53: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Action

1. Humans beat bureaucracy

2. Machines beat humans

3. Repeatability beats one-offs

4. Start with humans

Repeatable machine processes TROUNCE one-off human bureaucracy

Page 54: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Implications … for Action

1. Humans beat bureaucracy

2. Machines beat humans

3. Repeatability beats one-offs

4. Start with humans

5. If IFTTT, deprecate humans

Repeatable machine processes TROUNCE one-off human bureaucracy

Page 55: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Decision: Do I Have Enough Instances?

Page 56: Operational Insight: Concepts and Examples (w/o Presenter Notes)
Page 57: Operational Insight: Concepts and Examples (w/o Presenter Notes)
Page 58: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Decision: Is My Canary Good?

Page 59: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Page 60: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Been there. Done that. Manually. Artisanally.

Page 61: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Been there. Done that. Manually. Artisanally.

• Started in the Data Center

Page 62: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Been there. Done that. Manually. Artisanally.

• Started in the Data Center

• Manual, dashboard-driven

Page 63: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Been there. Done that. Manually.

CPU

Requests

Errors

Page 64: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Been there. Done that. Manually.

Page 65: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Been there. Done that. Manually.

• Context vs Precision

Page 66: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Been there. Done that. Manually.

• Context vs Precision

• No …

Page 67: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Been there. Done that. Manually.

• Context vs Precision

• No …

• Repeatability

Page 68: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Been there. Done that. Manually.

• Context vs Precision

• No …

• Repeatability

• Trending

Page 69: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Been there. Done that. Manually.

• Context vs Precision

• No …

• Repeatability

• Trending

• Manual effort is manual

Page 70: Operational Insight: Concepts and Examples (w/o Presenter Notes)

So Now What?

Page 71: Operational Insight: Concepts and Examples (w/o Presenter Notes)

So Now What?

• Automate Analysis

Page 72: Operational Insight: Concepts and Examples (w/o Presenter Notes)

So Now What?

• Automate Analysis

• Took Some Effort

Page 73: Operational Insight: Concepts and Examples (w/o Presenter Notes)

So Now What?

• Automate Analysis

• Took Some Effort

• Approach and analytics

Page 74: Operational Insight: Concepts and Examples (w/o Presenter Notes)

So Now What?

• Automate Analysis

• Took Some Effort

• Approach and analytics

• Presentation matters

Page 75: Operational Insight: Concepts and Examples (w/o Presenter Notes)

[Diagram labels: Version Control System; Build & Deployment System; 1000 servers @ 1.0.1; Customers; Automated Canary Analysis; Pretty Pictures]

Page 76: Operational Insight: Concepts and Examples (w/o Presenter Notes)

[Diagram labels: Version Control System; Build & Deployment System; 1000 servers @ 1.0.1; 1 server @ 1.0.2; Customers; Automated Canary Analysis; Pretty Pictures]

Page 77: Operational Insight: Concepts and Examples (w/o Presenter Notes)

[Diagram labels: Version Control System; Build & Deployment System; 1000 servers @ 1.0.1; 10 servers @ 1.0.2; Customers; Automated Canary Analysis; Pretty Pictures]

Page 78: Operational Insight: Concepts and Examples (w/o Presenter Notes)

[Diagram labels: Version Control System; Build & Deployment System; 1000 servers @ 1.0.1; 1000 servers @ 1.0.2; Customers; Automated Canary Analysis; Pretty Pictures]

Page 79: Operational Insight: Concepts and Examples (w/o Presenter Notes)

[Diagram labels: Version Control System; Build & Deployment System; 1000 servers @ 1.0.1; 1000 servers @ 1.0.2; Customers; Automated Canary Analysis; Pretty Pictures]

Page 80: Operational Insight: Concepts and Examples (w/o Presenter Notes)

[Diagram labels: Version Control System; Build & Deployment System; 1000 servers @ 1.0.2; Customers; Automated Canary Analysis; Pretty Pictures]

Page 81: Operational Insight: Concepts and Examples (w/o Presenter Notes)

[Diagram labels: Version Control System; Build & Deployment System; 1000 servers @ 1.0.1; 1000 servers @ 1.0.2; Customers; Automated Canary Analysis; Pretty Pictures]

Page 82: Operational Insight: Concepts and Examples (w/o Presenter Notes)

[Diagram labels: Version Control System; Build & Deployment System; 1000 servers @ 1.0.1; 1000 servers @ 1.0.2; Customers; Automated Canary Analysis; Pretty Pictures]
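
The diagram sequence above shows the new version growing from 1 server to 10 servers to 1000 servers. One way to read that sequence is as a staged rollout in which each stage must pass a check before the next one starts. The sketch below is only an illustration under that assumption: `staged_rollout` and the `canary_is_good` callback are hypothetical names, and the slides do not state how the real build and deployment system gates each stage.

```python
from typing import Callable, Sequence


def staged_rollout(stages: Sequence[int],
                   canary_is_good: Callable[[int], bool]) -> int:
    """Grow the new version through `stages` (server counts).

    `canary_is_good(size)` is a hypothetical callback that analyzes the
    new version while it runs on `size` servers. The first failing stage
    aborts the rollout. Returns the number of servers left on the new
    version: the last stage size on success, 0 after a rollback.
    """
    for size in stages:
        if not canary_is_good(size):
            return 0  # roll back: no servers stay on the new version
    return stages[-1] if stages else 0


# Stage sizes taken from the slides; the checks are stand-ins.
print(staged_rollout([1, 10, 1000], lambda size: True))       # every stage passes
print(staged_rollout([1, 10, 1000], lambda size: size < 10))  # fails at 10 servers
```

Keeping the check behind a callback means the same loop works whether a human or an automated analysis makes the call at each stage.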

Page 83: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Just The Stats 4-Week View

Page 84: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Just The Stats 4-Week View

6309 canary analysis cycles

Page 85: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Just The Stats 4-Week View

6309 canary analysis cycles

16% canaries failed
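
To make "Is my canary good?" concrete, here is a minimal sketch of one possible automated comparison. It is not the analysis the talk describes, whose details are not in the slides. It assumes the three metrics named on the manual dashboard slide (CPU, Requests, Errors), compares the canary's mean to the baseline's mean for each, and scores the canary by the share of metrics that stay within a tolerance. The function names, the 20% tolerance, the 95% pass mark, and all sample numbers are assumptions made for illustration.

```python
from statistics import mean


def canary_score(baseline, canary, tolerance=0.2):
    """Percentage of metrics whose canary mean is within `tolerance`
    (relative difference) of the baseline mean. Both arguments map a
    metric name to a list of samples."""
    passed = 0
    for name, base_values in baseline.items():
        base_avg = mean(base_values)
        canary_avg = mean(canary[name])
        if base_avg == 0:
            ok = canary_avg == 0  # avoid dividing by zero
        else:
            ok = abs(canary_avg - base_avg) / abs(base_avg) <= tolerance
        passed += ok
    return 100.0 * passed / len(baseline)


def canary_is_good(baseline, canary, min_score=95.0, tolerance=0.2):
    """Pass the canary only if enough metrics look like the baseline."""
    return canary_score(baseline, canary, tolerance) >= min_score


# Made-up samples: CPU and requests look alike, errors roughly triple.
baseline = {"cpu": [40, 42, 41], "requests": [1000, 1010, 990], "errors": [5, 6, 5]}
canary = {"cpu": [43, 44, 42], "requests": [980, 1005, 995], "errors": [15, 14, 16]}

print(round(canary_score(baseline, canary), 1))  # 2 of 3 metrics pass
print(canary_is_good(baseline, canary))
```

A single score turns a dashboard-reading exercise into a repeatable yes/no decision, which is the property the "Automate Analysis" slide is after.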

Page 86: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Decision: Do I Have an Outlier?

Page 87: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Outlier Detection

Page 88: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Would You Like to Play a Game?

Page 89: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Spot the Outlier

Page 90: Operational Insight: Concepts and Examples (w/o Presenter Notes)
Page 91: Operational Insight: Concepts and Examples (w/o Presenter Notes)
Page 92: Operational Insight: Concepts and Examples (w/o Presenter Notes)
Page 93: Operational Insight: Concepts and Examples (w/o Presenter Notes)

The Outlier Is

“A”

Page 94: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Just The Stats 4-Week View

Page 95: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Just The Stats 4-Week View

739 Server Terminations
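
The slides do not say which algorithm finds the outlier server. As a hedged illustration only, the sketch below flags servers whose metric sits far from the cluster median, using the median absolute deviation (MAD) and the commonly used modified z-score cutoff of 3.5. The function name `find_outliers`, the cutoff, and the sample error rates are all assumptions.

```python
from statistics import median


def find_outliers(values_by_server, threshold=3.5):
    """Return the servers whose value is far from the cluster median.

    Uses the modified z-score 0.6745 * |x - median| / MAD. If MAD is 0
    (more than half the servers report the same value), any server that
    differs from the median is flagged.
    """
    values = list(values_by_server.values())
    med = median(values)
    mad = median([abs(v - med) for v in values])
    if mad == 0:
        return [s for s, v in values_by_server.items() if v != med]
    return [s for s, v in values_by_server.items()
            if 0.6745 * abs(v - med) / mad > threshold]


# Hypothetical per-server error rates (percent); server "A" misbehaves.
error_rates = {"A": 9.5, "B": 1.0, "C": 1.2, "D": 0.9, "E": 1.1}
print(find_outliers(error_rates))
```

Median-based statistics are used here because a single bad server can drag a mean and standard deviation far enough to hide itself.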

Page 96: Operational Insight: Concepts and Examples (w/o Presenter Notes)

In a Nutshell

Observe

Orient

Decide

Act

Page 97: Operational Insight: Concepts and Examples (w/o Presenter Notes)

In a Nutshell

Observe

Orient

Decide

Act

Need This First http://bit.ly/nflx-atlas-2013

http://metrics20.org

Page 98: Operational Insight: Concepts and Examples (w/o Presenter Notes)

In a Nutshell

Observe

Orient

Decide

Act

Need This First http://bit.ly/nflx-atlas-2013

http://metrics20.org

Understand the decision http://bit.ly/nflx-qcon-aca-2014

Page 99: Operational Insight: Concepts and Examples (w/o Presenter Notes)

In a Nutshell

Observe

Orient

Decide

Act

Need This First http://bit.ly/nflx-atlas-2013

http://metrics20.org

Understand the decision http://bit.ly/nflx-qcon-aca-2014

Make it easier for humans

Page 100: Operational Insight: Concepts and Examples (w/o Presenter Notes)

In a Nutshell

Observe

Orient

Decide

Act

Need This First http://bit.ly/nflx-atlas-2013

http://metrics20.org

Understand the decision http://bit.ly/nflx-qcon-aca-2014

Make it easier for humans

Make machines do it

Page 101: Operational Insight: Concepts and Examples (w/o Presenter Notes)

In a Nutshell

Observe

Orient

Decide

Act

Need This First http://bit.ly/nflx-atlas-2013

http://metrics20.org

Understand the decision http://bit.ly/nflx-qcon-aca-2014

Make it easier for humans

Make machines do it

Higher speed Lower effort

Higher reliability

Page 102: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Questions, Attributions, Feedback

Page 103: Operational Insight: Concepts and Examples (w/o Presenter Notes)

Questions, Attributions, Feedback

@royrapoport / linkedin.com/in/royrapoport / [email protected]