How Netskope Mastered DevOps with Sumo Logic

32
How Netskope Mastered DevOps with Sumo Logic Kumar Saurabh, Co-Founder & VP Engineering 12/3/2014 Sumo Logic Confidential

description

This webinar discusses how the leader in cloud app analytics and policy enforcement uses Sumo Logic to ensure optimal performance, availability and security of their cloud platform. Sumo Logic Co-Founder & VP of Engineering, Kumar Saurabh, joins Netskope VP of Engineering, Abhay Kulkarni, to run a LIVE demo and discusses how Netskope: - Was able to set up the Sumo Logic service within a single day in various data centers across the world - Rapidly identifies and troubleshoots issues across 100’s of servers and virtual machines - Leverages real-time alerts to fix issues to deliver a reliable service - Makes informed business decisions by analyzing core user behaviors - Uses out-of-the box applications such as Ngnix and Apache

Transcript of How Netskope Mastered DevOps with Sumo Logic

Page 1: How Netskope Mastered DevOps with Sumo Logic

How Netskope Mastered DevOps with Sumo Logic

Kumar Saurabh,

Co-Founder & VP Engineering

12/3/2014

Sumo Logic Confidential

Page 2: How Netskope Mastered DevOps with Sumo Logic

Agenda

• Sumo Logic Overview

• Running Sumo on Sumo

• How Netskope mastered DevOps with Sumo Logic

• Q&A

Sumo Logic Confidential2

Page 3: How Netskope Mastered DevOps with Sumo Logic

Search

Visualize

Predict

Applications

Mobile

Internet of Things

Network and Server

Transforming Machine Data Into Meaningful Insights

Sumo Logic Confidential3

Page 4: How Netskope Mastered DevOps with Sumo Logic

On-Prem Data Centers

Cloud Sources

Collector

Collector

Powerful Architecture, Effortless Deployment

Sumo Logic Confidential4

Hybrid Data Sources

Private Public

PaaSIaaS

SaaS

Hosted Collector

Page 5: How Netskope Mastered DevOps with Sumo Logic

The Analytics Engine

Sumo Logic Confidential5

Core

Analytics

Page 6: How Netskope Mastered DevOps with Sumo Logic

Sumo Logic Confidential

LogReduce™- Transform Logs Into Meaningful Patterns

Reduce 100,000+ log messages into 20 relevant patterns

Search across multiple sources and timeframes

Discover “the needle in the haystack”

Drill down into results for additional context

Annotate results, influence future ranking with human context

6

Page 7: How Netskope Mastered DevOps with Sumo Logic

02/15/2014 10:03:16 UTC Health status check: zim-5 is OK

02/15/2014 10:03:11 UTC Health status check: gir-3 is OK

02/15/2014 10:03:07 UTC Health status check: gir-2 is TIMED OUT

02/15/2014 10:02:45 UTC Health status check: dib-1 is OK

A log line and is its LogReduce Signature

Sumo Logic Confidential7

$DATETIME Health status check: **** is ****

printf(“%s Health status check: %s is %s”, timestamp, hostid, hoststatus)

LogReduce

02/15/2014 10:03:16 UTC Health status check: zim-5 is OK

Page 8: How Netskope Mastered DevOps with Sumo Logic

A Baseline and its Underlying Signatures

Sumo Logic Confidential8

02/15/2014 10:39:16 UTC Health status check: asm-5 is OK02/15/2014 10:38:16 UTC module=xyz reported a TimeoutException02/15/2014 10:37:16 UTC Health status check: gar-3 is OK02/15/2014 10:37:16 UTC Health status check: gar-7 is OK02/15/2014 10:35:16 UTC database connection established: ora3302/15/2014 10:29:16 UTC Health status check: abd-3 is TIMED OUT02/15/2014 10:37:16 UTC Health status check: gar-3 is OK02/15/2014 10:27:16 UTC pix firewall: tcp denied: 10.12.2.1002/15/2014 10:22:16 UTC Health status check: dim-5 is OK02/15/2014 10:21:16 UTC database connection established: ora123302/15/2014 10:20:16 UTC Health status check: fim-5 is SLOW02/15/2014 10:19:16 UTC pix firewall: tcp accept: 111.12.2.1002/15/2014 10:18:16 UTC Health status check: zim-5 is OK02/15/2014 10:15:16 UTC database connection established: ora124302/15/2014 10:13:16 UTC module=adyz reported a NullPointer02/15/2014 10:12:16 UTC Health status check: z21-5 is OK02/15/2014 10:09:16 UTC Health status check: rsm-5 is OK02/15/2014 10:06:16 UTC database connection established: ora121302/15/2014 10:05:16 UTC pix firewall: tcp accept: 111.12.22.1002/15/2014 10:04:16 UTC Health status check: 5gm-5 is OK

$DATETIME Health status check: **** is ****

$DATETIME Module=**** reported a ****

$DATETIME pix firewall: tcp ****: ****

$DATETIME database connection established: ****

S1:S2:S3:S4:

SIGNATURE %(COUNT)

S1 55%(11)

S2 10%(2)

S3 20%(4)

S4 15%(3)

Baseline

Page 9: How Netskope Mastered DevOps with Sumo Logic

How does AD find events using the baseline?

Sumo Logic Confidential9

Baseline

SIG 12:20 12:25 12:30 12:35 12:40 12:45 12:50 12:55 13:00 13:05 13:10 13:15 13:20 13:25 13:30 13:35 13:40 13:45 13:50 13:55 14:00 14:05 14:10 14:15

55% S1 54% 52% 20% 52% 58% 58% 57% 56% 54% 48% 57% 57% 54% 55% 58% 56% 56% 52% 40% 54% 55% 54% 53% 57%

10% S2 11% 9% 14% 11% 10% 9% 30% 11% 10% 8% 9% 11% 10% 0% 10% 9% 10% 11% 10% 10% 11% 10% 9% 11%

20% S3 18% 23% 38% 20% 17% 19% 5% 19% 23% 18% 20% 19% 23% 26% 15% 20% 18% 23% 35% 21% 20% 20% 23% 19%

15% S4 17% 16% 28% 17% 15% 14% 8% 14% 13% 11% 14% 13% 13% 19% 17% 15% 16% 14% 15% 15% 14% 16% 15% 13%

S5 15%

S1

S2

S3

S4

Event 2

S1

S2

S3

S4

Event 1

S1

S2

S3

S4

S5 ✪

Event 3

S1

S2 ✖

S3

S4

Event 4

S1

S2

S3

S4

Event 5

Up

Down

✖ Gone

✪ New

Each event is a UNIQUE combination of changes from baseline

0%

50%

100%

S1

S2

S3

S4

S5

Page 10: How Netskope Mastered DevOps with Sumo Logic

Sumo Logic Confidential

Anomaly Detection: Expose Unknown Events

10

Page 11: How Netskope Mastered DevOps with Sumo Logic

Powerful Visualization of Transactional Relationships

Page 12: How Netskope Mastered DevOps with Sumo Logic

APPLICATIONSFOR ENTERPRISE SYSTEMS

APP MANAGEMENT SECURITY AND COMPLIANCE IT OPERATIONS

Page 13: How Netskope Mastered DevOps with Sumo Logic

How We Use Sumo Logic

Page 14: How Netskope Mastered DevOps with Sumo Logic

IN ONE DAY,

SUMO LOGIC

Analyzes

5.8petabytes

of data

Scans

14trillion records

Examines

247years

of data

Page 15: How Netskope Mastered DevOps with Sumo Logic

Overview of Sumo Logic Deployment

Sumo Logic Confidential15

Production(2000+ nodes)

Pre-Production(For Internal Use Only)

Logs2TB+ a day

Logs

https://service.sumologic.com

Dev QAPerformance

TestsBuild

SystemsIT

Logs LogsLogs

Page 16: How Netskope Mastered DevOps with Sumo Logic

DevOps Philosophy

Instrument Everything, Monitor KPIs

RCA: Flexible query language to ask any question

Turn incidents into actionable knowledge

– Alerts and Monitors

Peace for the paranoid: Anomaly Detection

– Proactive response better than reactive

Key to Running a Successful DevOps Shop

Sumo Logic Confidential16

Page 17: How Netskope Mastered DevOps with Sumo Logic

Monitoring Production KPIs

Sumo Logic Confidential17

Scheduled Queries in last 24h by user

Page 18: How Netskope Mastered DevOps with Sumo Logic

Drill downs from KPI

Sumo Logic Confidential18

Page 19: How Netskope Mastered DevOps with Sumo Logic

How We Use Sumo Logic

Abhay Kulkarni – Vice President of Engineering

Page 20: How Netskope Mastered DevOps with Sumo Logic

© 2014 Netskope. All Rights Reserved. 20

Discovery, Visibility, and

Granular Control for Safe

Cloud Enablement

• Real-time, granular control of any

cloud app, sanctioned or not

• Deep contextual visibility

• Comprehensive end-point

coverage and flexible deployment

The Netskope Active Platform™

Page 21: How Netskope Mastered DevOps with Sumo Logic

© 2014 Netskope. All Rights Reserved.

Discover Apps and Mitigate Risk

21

• Discover enterprise cloud apps

• Get their enterprise-readiness

score and details

• Understand your risk based on

your usage of those apps

Page 22: How Netskope Mastered DevOps with Sumo Logic

© 2014 Netskope. All Rights Reserved.

Visibility About Usage, Data, and Anomalies

22

• Drill into usage details

• See “Who’s sharing content?” or

“Who’s uploading PCI?”

• Detect anomalies like excessive

logins or downloads

Page 23: How Netskope Mastered DevOps with Sumo Logic

© 2014 Netskope. All Rights Reserved.

Granular Control for Data Protection and Compliance

23

• Enforce activity-level policies

like “no sharing outside of the

company”

• Prevent loss of sensitive data

with DLP policies like “no

downloading of PII to mobile”

• Coach users with automated

messages to build awareness

Page 24: How Netskope Mastered DevOps with Sumo Logic

How Netskope uses Sumo Logic

© 2014 Netskope. All Rights Reserved. 24

Page 25: How Netskope Mastered DevOps with Sumo Logic

© 2014 Netskope. All Rights Reserved.

Key Use Cases

25

• Track usage of Netskope Platform – gain insights into customer behavior

• Track application performance and throughput – measure for growth

• Track application errors – identify issues before customers do

Page 26: How Netskope Mastered DevOps with Sumo Logic

© 2014 Netskope. All Rights Reserved.

Netskope Infrastructure

26

• 1000s of servers across the globe

• Each server has Sumo Logic agent deployed

• Automated bootstrapping of new servers using Ansible

• Use Sumo Logic partitions and field extraction rules to automatically

detect regions and data centers along with categories

• Take time to design your collectors – source, source category etc.

Page 27: How Netskope Mastered DevOps with Sumo Logic

© 2014 Netskope. All Rights Reserved.

Track User Behavior

27

• Netskope instrumented its platform to emit key anonymous statistics

• Queries in Sumo Logic aggregate these stats

• Run daily reports to gain customer insights

• Use Sumo Logic built-in apps to create automatic dashboards – we use

Nginx and Apache

Page 28: How Netskope Mastered DevOps with Sumo Logic

© 2014 Netskope. All Rights Reserved.

Sumo Logic Nginx Dashboard

28

Page 29: How Netskope Mastered DevOps with Sumo Logic

© 2014 Netskope. All Rights Reserved.

Sumo Logic Nginx Visitor Access Dashboard

29

Page 30: How Netskope Mastered DevOps with Sumo Logic

© 2014 Netskope. All Rights Reserved.

Understanding Application Performance and Throughput

30

• Netskope platform is built to analyze large amounts of data every day

• Tracking analysis pipeline is key to great customer experience

• Instrument code for measuring time taken

• Sumo Logic aggregates logs

Page 31: How Netskope Mastered DevOps with Sumo Logic

© 2014 Netskope. All Rights Reserved.

Tracking Application Errors and Exceptions

31

• Instrument code to emit exceptions and errors in a standardized format

• Set-up real time alerts to send email notifications to operations team

• Track error volume for continuous improvement

Page 32: How Netskope Mastered DevOps with Sumo Logic

Thank you

© 2014 Netskope. All Rights Reserved. 32