Graph everything
-
Upload
oliver-hankeln -
Category
Technology
-
view
479 -
download
0
description
Transcript of Graph everything
![Page 1: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/1.jpg)
Graph everything!Oliver Hankeln / gutefrage.net
Samstag, 21. September 13
![Page 2: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/2.jpg)
Who am I?
Senior Engineer - Data and Infrastructure at gutefrage.net GmbH
Was doing software development before
DevOps advocate
Samstag, 21. September 13
![Page 3: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/3.jpg)
Who is Gutefrage.net?
Germany‘s biggest Q&A platform
#1 German site (mobile) about 5M Unique Users
#3 German site (desktop) about 17M Unique Users
> 4 Mio PI/day
Part of the Holtzbrinck group
Running several platforms (Gutefrage.net, Helpster.de, Cosmiq, Comprano, ...)
Samstag, 21. September 13
![Page 4: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/4.jpg)
Flight AB6188Samstag, 21. September 13
![Page 5: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/5.jpg)
What you will get
How do we store our metrics?
Our experiences with that setup
Why the hell are we doing that?
Some thoughts on metrics
Samstag, 21. September 13
![Page 6: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/6.jpg)
How we store our metrics
Samstag, 21. September 13
![Page 7: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/7.jpg)
Our requirements
Creating new metrics has to be simple
no compaction (bye bye RRDTool)
System has to scale
Samstag, 21. September 13
![Page 8: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/8.jpg)
openTSDB
Written at StumbleUpon but OpenSource
Uses HBase as a storage
Distributed system (multiple TSDs)
Samstag, 21. September 13
![Page 9: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/9.jpg)
The ecosystem
App feeds metrics in via RabbitMQ
We base Icinga checks on the metrics
We evaluate etsy Skyline for anomaly detection
We deploy sensors via chef
Samstag, 21. September 13
![Page 10: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/10.jpg)
Our experiences
Samstag, 21. September 13
![Page 11: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/11.jpg)
What works well
We store about 200M data points in several thousand time series with no issues
tcollector is decoupling measurement from storage
Creating new metrics is really easy
Samstag, 21. September 13
![Page 12: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/12.jpg)
Challenges
The UI is seriously lacking
no annotation support out of the box
Only 1s time resolution (and only 1 value/s/time series)
Samstag, 21. September 13
![Page 13: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/13.jpg)
salvation is coming
OpenTSDB 2 is around the corner
millisecond precision
annotations and meta data
decent API
Samstag, 21. September 13
![Page 14: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/14.jpg)
Why the hell are we doing this?
Samstag, 21. September 13
![Page 15: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/15.jpg)
Communication
Replace gut feeling with real data
Helps to avoid the blame game
Brains prefer graphs to numbers
Samstag, 21. September 13
![Page 16: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/16.jpg)
Getting insights
We move towards Continuous Deployment
Complex systems show emergent behaviour
Graphs are the correct flight level
Samstag, 21. September 13
![Page 17: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/17.jpg)
Lean Startup
Build - Measure - Learn cycle
You have to define measureable goals
No. It‘s measure not guessing
Samstag, 21. September 13
![Page 18: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/18.jpg)
Perspectives
Operations (Server load, traffic, disk space,...)
Developers (DB Queries/PageView, JS errors,...)
Product Owners (Content creation, Content Quality, ...)
...
Samstag, 21. September 13
![Page 19: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/19.jpg)
Some random thoughts
Samstag, 21. September 13
![Page 20: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/20.jpg)
Public display
Helps that everyone feels involved
n+1 eyes see more than n eyes
Needs a culture of trust
Samstag, 21. September 13
![Page 21: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/21.jpg)
Alerting
Fixed values for alerts are not good enough
Drawing Attention vs. Alerting
False positives are bugs
Don‘t call the on-call-guy for nothing
Samstag, 21. September 13
![Page 22: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/22.jpg)
Metrics != boring
You can (and should) get creative with what you measure.
Have some brainstorming sessions
Insights may come from surprising places
Samstag, 21. September 13
![Page 23: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/23.jpg)
Track team happiness
There is no fixed scale
It forces you to communicate
If you listen you can find problems in the team
Samstag, 21. September 13
![Page 24: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/24.jpg)
Track ops confidence
create a platform where you can buy or sell your on-call shifts.
The price for a shift tells you how confident the team is.
This has not been tested - yet.
Samstag, 21. September 13
![Page 25: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/25.jpg)
Track recruiting efforts
Helps to get a feeling about the job market
Reminds everyone to keep looking for new colleagues
BTW: we are hiring ;-)
Samstag, 21. September 13
![Page 26: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/26.jpg)
Questions?
Please contact me:
@mydalon
I‘ll upload the slides and tweet about it
Samstag, 21. September 13
![Page 27: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/27.jpg)
one more thing
Samstag, 21. September 13
![Page 29: Graph everything](https://reader038.fdocuments.in/reader038/viewer/2022103114/554f73e4b4c905bb178b535d/html5/thumbnails/29.jpg)
Image Sources:
Plane: Felix Gottwald - www.felixgottwald.net (Creative Commons Attribution Share Alike 3.0German)
Talking men: Deutsche Fotothek - Peter, Richard sen.
Money: Wikimedia contributor Avij
Other images: Oliver Hankeln
This presentation is licenced under Creative Commons Attribution Share Alike 3.0
Samstag, 21. September 13