Application Monitoring using Datadog

@muktaa We are hiring! Send resume to: [email protected]
Application Monitoring using Datadog
Mukta Aphale (DevOps Practice Head, WhiteHedge Technologies)
12 Dec 2015, DevOps Meetup, Pune

Transcript of Application Monitoring using Datadog

Page 1: Application Monitoring using Datadog

Application Monitoring using Datadog

Mukta Aphale (DevOps Practice Head, WhiteHedge Technologies)

12 Dec 2015, DevOps Meetup, Pune

Page 2: Application Monitoring using Datadog

WhiteHedge Technologies

Page 3: Application Monitoring using Datadog

Application Monitoring
• Ensure that a software application processes and performs in an expected manner and scope
• APM
• Runtime metrics
• Health of the application
• Health of infrastructure
• Provide system/application feedback
• Continuous improvement

Page 4: Application Monitoring using Datadog

Datadog
• Monitoring as a Service
• Agent Based
• Python
• Integrations
• Dashboards
• Tagging
• Alerts
• Checks

Page 5: Application Monitoring using Datadog

Environment

Page 6: Application Monitoring using Datadog

Challenge
• Monitor system health
• Monitor Redis, MongoDB
• Application (API server) runs as a Docker container
• Monitor application performance
• Logging slowed down performance
• Rapid development, no feedback mechanism
• Insight into the application needed by management
• Insight needed by support, DevOps and developers

Page 7: Application Monitoring using Datadog

Integration Dashboards

Page 8: Application Monitoring using Datadog

AWS

Page 9: Application Monitoring using Datadog

Amazon - ElastiCache

Page 10: Application Monitoring using Datadog

System Overview

Page 11: Application Monitoring using Datadog

But…

• How to impress the management?! :)
• How to deal with X environments and Y versions?
• How to measure performance, without affecting performance?

Page 12: Application Monitoring using Datadog

Overview Dashboard

[Dashboard screenshot with dummy labels]

Page 13: Application Monitoring using Datadog

Debug: API Error Rate

• Which endpoint has more errors?
• Which error codes are being thrown?
• Search logs in Loggly if needed.

Page 14: Application Monitoring using Datadog

Datadog Tags
• Inherited from integrations
• Custom tags

Page 15: Application Monitoring using Datadog

Templated Dashboards
• Dashboard variables
• Dynamically explore metrics

Page 16: Application Monitoring using Datadog

Datadog API

• We can code:
  • Instance configuration
  • Infrastructure
  • Deployments
• Why not monitoring?!
• Datadog has a great API
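
To illustrate the "monitoring as code" idea, here is a minimal Python sketch that builds a monitor definition as data, so it can be versioned alongside infrastructure code. The helper `build_monitor`, the metric name `api.response_time`, the threshold and the notification handle are all hypothetical. The field names and query shape follow what Datadog's monitor API is understood to accept; check them against the current API reference before sending anything.

```python
import json
from typing import Dict, List


def build_monitor(name: str, metric: str, env: str,
                  threshold: int, notify: str) -> Dict[str, object]:
    """Return a monitor definition as a plain dict (hypothetical helper)."""
    tags: List[str] = [f"env:{env}"]
    return {
        "name": name,
        "type": "metric alert",
        # Assumed query shape: 5-minute average of the metric, scoped by tag.
        "query": f"avg(last_5m):avg:{metric}{{env:{env}}} > {threshold}",
        "message": f"{name} is above {threshold}. Notify: {notify}",
        "tags": tags,
    }


# Hypothetical values, for illustration only.
monitor = build_monitor(
    name="API response time (prod)",
    metric="api.response_time",
    env="prod",
    threshold=500,
    notify="@devops-team",
)

# The JSON below is what would be posted to the API by a deployment script.
print(json.dumps(monitor, indent=2))
```

Because the definition is plain data, the same function can generate one monitor per environment, which fits the "X environments and Y versions" problem raised earlier.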

Page 17: Application Monitoring using Datadog

Datadog module for nodejs

Page 18: Application Monitoring using Datadog

Measuring HTTP response times from application code
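
The slide's code is not preserved in this transcript, and the talk used a Node.js module. As a language-neutral illustration, here is a minimal Python sketch of the general technique: wrap a request handler in a timer and hand the elapsed time to a metrics sink. The names `timed`, `handle_request`, `record` and the metric `api.response_time` are hypothetical.

```python
import time
from contextlib import contextmanager
from typing import Callable, Iterator, List, Tuple

# Hypothetical sink signature: (metric_name, elapsed_ms, tags).
Sink = Callable[[str, float, List[str]], None]


@contextmanager
def timed(metric: str, tags: List[str], sink: Sink) -> Iterator[None]:
    """Measure wall-clock time of the enclosed block and report it in ms."""
    start = time.perf_counter()
    try:
        yield
    finally:
        # Report even if the handler raised, so failures are timed too.
        elapsed_ms = (time.perf_counter() - start) * 1000.0
        sink(metric, elapsed_ms, tags)


def handle_request(path: str) -> int:
    """Stand-in for a real API handler; returns an HTTP status code."""
    time.sleep(0.01)  # simulate work
    return 200


# In-memory sink for demonstration; a real one would forward to the agent.
recorded: List[Tuple[str, float, List[str]]] = []


def record(metric: str, value: float, tags: List[str]) -> None:
    recorded.append((metric, value, tags))


with timed("api.response_time", ["endpoint:/users"], record):
    status = handle_request("/users")
```

Tagging each measurement with the endpoint is what later allows a dashboard to break response times down per endpoint.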

Page 19: Application Monitoring using Datadog

Plot it!

[Graph plotted with dummy data; series Dummy1 to Dummy4]

Page 20: Application Monitoring using Datadog

What about performance while tracking performance?

• Metrics from code are sent to the local Datadog agent using UDP
• The local Datadog agent syncs the metrics to the Datadog server
• The Datadog dashboard reflects the metrics with some delay
• Application performance is not affected
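
The points above can be sketched with nothing but the Python standard library. The example assumes the local agent listens for StatsD-style datagrams on UDP port 8125 (the usual default) and uses the `|#tag` extension for tags. The metric name and tag values are hypothetical, and a real application would normally use a client library instead.

```python
import socket
from typing import List, Optional


def format_metric(name: str, value: float, metric_type: str,
                  tags: Optional[List[str]] = None) -> str:
    """Build a StatsD-style datagram, with '#'-prefixed comma-separated tags."""
    payload = f"{name}:{value}|{metric_type}"
    if tags:
        payload += "|#" + ",".join(tags)
    return payload


def send_metric(payload: str, host: str = "127.0.0.1", port: int = 8125) -> None:
    """Fire-and-forget send: no handshake, no reply, no retry."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    try:
        sock.sendto(payload.encode("utf-8"), (host, port))
    except OSError:
        # Monitoring must never break the application, so errors are dropped.
        pass
    finally:
        sock.close()


# Hypothetical metric: a 12.5 ms response time, tagged by environment and version.
datagram = format_metric("api.response_time", 12.5, "ms",
                         ["env:staging", "version:1.4"])
send_metric(datagram)
```

Since UDP needs no connection and no acknowledgement, the send returns immediately whether or not an agent is listening, which is why the measurement adds almost no latency to the request.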

Page 21: Application Monitoring using Datadog

Alerts

Page 22: Application Monitoring using Datadog

Datadog Checks

• Collect metrics from Datadog agent checks
• Out of the box agent checks
• Custom agent checks in Python
• Interesting use cases:
  • Keep alive check (Service is up)
  • Network Check (HTTP, TCP)
  • Validate response for expected data
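
As an illustration of the "validate response for expected data" use case, here is a minimal Python sketch of the decision logic only. The function `evaluate_response` and the sample payloads are hypothetical. In a real custom check this logic would sit inside the check class that the agent runs; the agent-specific wiring and the HTTP fetch are omitted here.

```python
from typing import Tuple

# Nagios-style status codes, used here as an assumed convention.
OK, CRITICAL = 0, 2


def evaluate_response(status_code: int, body: str,
                      expected_text: str) -> Tuple[int, str]:
    """Decide whether an HTTP response counts as healthy.

    Healthy means: status 200 and the body contains the expected text.
    """
    if status_code != 200:
        return CRITICAL, f"unexpected HTTP status {status_code}"
    if expected_text not in body:
        return CRITICAL, f"response did not contain {expected_text!r}"
    return OK, "response contained expected data"


# Hypothetical responses from a health endpoint.
healthy = evaluate_response(200, '{"status": "ok"}', '"status": "ok"')
wrong_body = evaluate_response(200, '{"status": "degraded"}', '"status": "ok"')
server_error = evaluate_response(500, "", '"status": "ok"')
```

Checking the body as well as the status code catches the case where the service answers 200 but returns the wrong data, which a plain keep-alive check would miss.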

Page 23: Application Monitoring using Datadog

Using Datadog we could…
• Monitor dynamic infrastructure
• Monitor system health
• Monitor application availability
• Monitor application performance
• Show application & infra health graphically
• Provide feedback about health of system

Page 24: Application Monitoring using Datadog

Thank You! Questions?

We are Authorized Datadog Partners!

You can write to me at: [email protected]

Twitter: @muktaa