Latency: The Silent Monitoring System Killer

45
LATENCY THE SILENT MONITORING SYSTEM KILLER Saturday, 21 January 2012

description

Slides from a lighting talk presented at the January Sydney DevOps Meetup.

Transcript of Latency: The Silent Monitoring System Killer

Page 1: Latency: The Silent Monitoring System Killer

LATENCYTHE SILENT

MONITORING SYSTEM KILLER

Saturday, 21 January 2012

Page 2: Latency: The Silent Monitoring System Killer

#MONITORINGSUCKS

Saturday, 21 January 2012

Page 3: Latency: The Silent Monitoring System Killer

SCALABILITY

Saturday, 21 January 2012

Page 4: Latency: The Silent Monitoring System Killer

execute large volumes of monitoring checks under a variety of conditions

(good + bad) with a consistent throughput

Saturday, 21 January 2012

Page 5: Latency: The Silent Monitoring System Killer

CONSISTENT THROUGHPUT

Saturday, 21 January 2012

Page 6: Latency: The Silent Monitoring System Killer

SHORT CHECK EXECUTION TIME

Saturday, 21 January 2012

Page 7: Latency: The Silent Monitoring System Killer

WHAT INTRODUCES VARIABILITY?

Saturday, 21 January 2012

Page 8: Latency: The Silent Monitoring System Killer

LATENCY INTRODUCED c SYNCHRONOUS CALLS

Saturday, 21 January 2012

Page 9: Latency: The Silent Monitoring System Killer

Saturday, 21 January 2012

Page 10: Latency: The Silent Monitoring System Killer

= 1 monitoring check

Saturday, 21 January 2012

Page 11: Latency: The Silent Monitoring System Killer

Saturday, 21 January 2012

Page 12: Latency: The Silent Monitoring System Killer

150 monitoring checkseach executed every 300 secondseach takes 1 secondchecks are executed serially

Saturday, 21 January 2012

Page 13: Latency: The Silent Monitoring System Killer

all checks executed in 150 secondsmonitoring system at 50% capacity

Saturday, 21 January 2012

Page 14: Latency: The Silent Monitoring System Killer

DOUBLE THE CHECKS

Saturday, 21 January 2012

Page 15: Latency: The Silent Monitoring System Killer

Saturday, 21 January 2012

Page 16: Latency: The Silent Monitoring System Killer

all checks executed in 300 secondsmonitoring system at 100% capacity

Saturday, 21 January 2012

Page 17: Latency: The Silent Monitoring System Killer

DOUBLE THE EXECUTION TIME

Saturday, 21 January 2012

Page 18: Latency: The Silent Monitoring System Killer

Saturday, 21 January 2012

Page 19: Latency: The Silent Monitoring System Killer

all checks executed in 600 secondsmonitoring system at 200% capacityonly 50% of the checks are “on time”

Saturday, 21 January 2012

Page 20: Latency: The Silent Monitoring System Killer

CHECK LATENCY

Saturday, 21 January 2012

Page 21: Latency: The Silent Monitoring System Killer

Saturday, 21 January 2012

Page 22: Latency: The Silent Monitoring System Killer

HOW DO WE FIX THIS!?

Saturday, 21 January 2012

Page 23: Latency: The Silent Monitoring System Killer

PERFORMANCE ANALYSIS!

Saturday, 21 January 2012

Page 24: Latency: The Silent Monitoring System Killer

AN ANALOGY

Saturday, 21 January 2012

Page 25: Latency: The Silent Monitoring System Killer

MONITORING CHECK ==

“ACTION” ON MVC WEB APP

Saturday, 21 January 2012

Page 26: Latency: The Silent Monitoring System Killer

Saturday, 21 January 2012

Page 27: Latency: The Silent Monitoring System Killer

Saturday, 21 January 2012

Page 28: Latency: The Silent Monitoring System Killer

SEPARATE DATA COLLECTION

FROM

THRESHOLDINGf NOTIFICATIONS

Saturday, 21 January 2012

Page 29: Latency: The Silent Monitoring System Killer

Saturday, 21 January 2012

Page 30: Latency: The Silent Monitoring System Killer

THIS SHIFTS LATENCY

Saturday, 21 January 2012

Page 31: Latency: The Silent Monitoring System Killer

IT DOES NOT ELIMINATE IT!

Saturday, 21 January 2012

Page 32: Latency: The Silent Monitoring System Killer

Saturday, 21 January 2012

Page 33: Latency: The Silent Monitoring System Killer

RRDTOOL IS EVIL

Saturday, 21 January 2012

Page 34: Latency: The Silent Monitoring System Killer

USE SOMETHING BETTER!

Saturday, 21 January 2012

Page 35: Latency: The Silent Monitoring System Killer

USE

OPENTSDBOR

GANGLIAWITH

CHECK_TSDBOR

CHECK_GMOND

Saturday, 21 January 2012

Page 36: Latency: The Silent Monitoring System Killer

Saturday, 21 January 2012

Page 37: Latency: The Silent Monitoring System Killer

A DIFFERENT SET a PROBLEMS

Saturday, 21 January 2012

Page 38: Latency: The Silent Monitoring System Killer

STORAGE WILL GO AWAY

Saturday, 21 January 2012

Page 39: Latency: The Silent Monitoring System Killer

CHAOS WILL ENSUE

Saturday, 21 January 2012

Page 40: Latency: The Silent Monitoring System Killer

PAGERS WILL MELT

Saturday, 21 January 2012

Page 41: Latency: The Silent Monitoring System Killer

SET UP “META-

PARENTING”

Saturday, 21 January 2012

Page 42: Latency: The Silent Monitoring System Killer

BUILD A KILL SWITCH

Saturday, 21 January 2012

Page 43: Latency: The Silent Monitoring System Killer

Saturday, 21 January 2012

Page 44: Latency: The Silent Monitoring System Killer

READ MORE ABOUT THIS:

Saturday, 21 January 2012

Page 45: Latency: The Silent Monitoring System Killer

bit.ly/yN4mdy

Saturday, 21 January 2012