Common Sense Performance Indicators in the Cloud


Nick Gerner speaks about performance indicators and measurement tools at Velocity 2010.

Transcript of Common Sense Performance Indicators in the Cloud

Common Sense Performance Indicators

Nick Gerner
June 24, 2010

Goals

Common Sense in the Cloud: same as outside the cloud

1. Tune performance
2. Investigate issues
3. Visualize architecture

Nick Gerner

www.nickgerner.com
@gerner

• Formerly senior engineer at SEOmoz
• Linkscape: index of the web for SEO
• Lead, data services
• Developer
• Back-end ops guy

SEOmoz

• Seattle-based startup (~7 engineers)
• SEO blog and community
• Toolset and platform

OpenSiteExplorer.org

• 300 TB/month processing pipeline
• 5 million API requests/day

SEOmoz Engineering

• 50 < nodes < 500
• AWS-based since 2008

– EC2 – Linux root access to a bare VM
– S3 – networked storage
– EBS – network-attached volumes that look like local disk I/O
– ELB – load balancing as a service

SEOmoz Architecture: Processing

(diagram) Data pipeline: the web → crawlers → raw storage → process → prepare

SEOmoz Architecture: API

(diagram) Partners and SEOmoz apps → ELB → Lighttpd (×3) → app servers (×3) → Memcache pool (×6) → S3

End-to-End Performance Indicators

• Latency
• Time to on-load
• Conversion rate
• DNS
• Web object count

Great... but not the focus of this talk

Performance Indicators

(diagram) System characteristics: CPU, memory, disk, network. The app stack – front-end, middleware, caching, back-end (database, WS-API) – drives them and competes for them.

/proc

• System stats
• Per-process stats
• It all comes from here

...but use tools to see it
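For example (not from the slides, just a few of the raw files the tools below read; see man proc for the full list):

$ cat /proc/loadavg                                    # 1-, 5- and 15-minute load averages
$ grep -E '^(MemTotal|MemFree|Cached):' /proc/meminfo  # system memory, in kB
$ grep -E '^Vm(RSS|Size)' /proc/self/status            # memory of a single process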

System Characteristics

• Load Average
• CPU
• Memory
• Disk
• Network

Load Average

• Combines a few things
• Good place to start
• Explains nothing


CPU

• Break out by process
• Break out user vs. system
• User, system, I/O wait, idle


Why watch it?

• Who's doing work?
• Is CPU maxed?
• Blocked on I/O?
• Compare to load average
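Two stock ways to see this split (illustrative commands, not from the deck):

$ vmstat 3       # us/sy/wa/id columns = user / system / I/O wait / idle, every 3 seconds
$ top -b -n 1    # one batch-mode snapshot, CPU broken out per process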


Memory

• Break out by process
• Free, cached, used


Why watch it?

• Cached + Free = Available
• Do you have spare memory?

– App uses
– Memcache
– DB cache
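For instance, on a stock Linux box:

$ free -m                     # used, free and cached memory, in MB
$ ps aux --sort=-rss | head   # processes ranked by resident memory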


Disk

• Read bytes/sec
• Write bytes/sec
• Disk utilization


Why watch it?

• Is the disk busy?
• When?
• Who's using it?
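For example, with the sysstat and iotop packages installed:

$ iostat -dxk 3    # per-device read/write kB/s and %util, every 3 seconds
$ iotop -o         # only the processes currently doing I/O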


Network

• Read bytes/sec
• Write bytes/sec
• Established connections


Why watch it?

• Max connections (~1024 is the magic number)

• Bandwidth is $$$
• When are you busy?
• SOA considerations
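A few quick checks (illustrative, not from the deck):

$ netstat -tn | grep -c ESTABLISHED   # established TCP connections right now
$ ulimit -n                           # per-process file-descriptor limit; 1024 is the usual default
$ dstat -n 3                          # receive/send bytes per second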


Perf Monitoring Solution

1. Data collection (collectd)
2. Data storage (rrdtool)
3. Dashboard management (drraw)

FREE, in apt
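On Debian/Ubuntu, collectd and rrdtool at least install straight from apt (drraw is a small Perl CGI dropped onto the monitoring web server):

$ sudo apt-get install collectd rrdtool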

Perf Monitoring Architecture

(diagram: several clusters)

Multiple clusters
Multiple applications
Nodes come up and go down

Perf Monitoring Architecture

Each cluster runs collectd agents
New nodes get a generic config
Node names follow a convention according to role
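A minimal sketch of what that generic agent config might look like; the hostname and server address are made-up examples, not from the talk:

# /etc/collectd/collectd.conf on every node
Hostname   "api-web-01"       # hypothetical name following the role convention
LoadPlugin cpu
LoadPlugin memory
LoadPlugin disk
LoadPlugin interface
LoadPlugin network
<Plugin network>
  Server "10.0.0.10"          # assumed address of the central perf monitoring server
</Plugin>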

Perf Monitoring Architecture

Perf monitoring server, on its own box:
collectd server
Web server running drraw.cgi
Allows connections from new nodes
Perf data backed up daily
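A matching server-side sketch (directory path assumed); drraw.cgi is then pointed at the same RRD directory:

# /etc/collectd/collectd.conf on the perf monitoring server
LoadPlugin network
LoadPlugin rrdtool
<Plugin network>
  Listen "0.0.0.0"                   # accept samples from any node that comes up
</Plugin>
<Plugin rrdtool>
  DataDir "/var/lib/collectd/rrd"    # the files drraw graphs; back this directory up daily
</Plugin>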

Perf Monitoring Architecture

Happy sysadmin: visibility into the system and a history of its performance.

Perf Dashboard Features

1. Summarize nodes/systems
2. Visualize data over time
3. Stack measurements
– per process
– per node
4. Handle new nodes

Batch Mode Dashboard

(dashboard graphs: CPU, memory, disk, network)

Web Server Dashboard

(dashboard graph: web requests, via mod_status)
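For lighttpd (the web server in the API stack above), enabling mod_status is a two-line sketch:

# lighttpd.conf
server.modules += ( "mod_status" )
status.status-url = "/server-status"   # poll /server-status?auto for machine-readable counters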

System-Wide Dashboard

(dashboard graphs: per-request stats across the system)

Graph Summary

• CPU, mem, disk, net
• over time
• per node
• per process
• Throw in relevant app measures

e.g. per-request stats:
• req/sec
• median latency/req
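As a rough illustration, assuming an access log whose last field is the request latency (file name and format are made up):

$ awk '{ print $NF }' access.log | sort -n \
    | awk '{ a[NR] = $1 } END { print "median:", a[int((NR + 1) / 2)] }'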

Ad-hoc Tools

• $ dstat -cdnml
  system characteristics

• $ iotop
  per-process disk I/O

• $ iostat -x 3
  detailed disk stats

• $ netstat -tnp
  fast, per-process TCP connection stats

Resources

• Perf Testing: What, How, Why
  http://www.nickgerner.com/2010/02/performance-testing-what-andhow-why/

• Perf Testing Case Study: OSE
  http://www.nickgerner.com/2010/01/performance-testing-case-study-ose/

• S3 Benchmarks
  http://twopieceset.blogspot.com/2009/06/s3-performance-benchmarks.html

• Perf Measurement
  http://twopieceset.blogspot.com/2009/03/performance-measurement-for-small-and.html

More Resources

• http://www.collectd.org
• http://oss.oetiker.ch/rrdtool/
• http://web.taranis.org/drraw/
• http://dag.wieers.com/home-made/dstat/

• $ man proc

Q: Why? A: Perf Tuning

(cycle) Test → Measure → Interpret → Improve → Validate → repeat

Q: Why? A: System Arch

• Better devs/ops
• Identify bottlenecks
• Scaling considerations

Q: Why? A: Issue Investigation

• Machine specific?
• System wide?
• Which component?
• Timeline?
• Cascading failures?