OpenStack Log Mining

Post on 21-Apr-2017

Accelerating adoption of Open Infrastructure

May 2014

Log Management and Mining

Copyright 2014 Solinea, Inc.

Logging has a Long History…

Photo credit: The Forest History Society (http://www.flickr.com/photos/foresthistory/3662397221/), via photopin (http://photopin.com), CC BY-NC 2.0 (http://creativecommons.org/licenses/by-nc/2.0/)

In Multiple Domains

Like Many Things, It Has Evolved…

Photo credit: Richard Hurd (http://www.flickr.com/photos/rahimageworks/9196119199/), via photopin (http://photopin.com), CC BY 2.0 (http://creativecommons.org/licenses/by/2.0/)

Photo credit: Richard Hurd (http://www.flickr.com/photos/rahimageworks/9179873919/), via photopin (http://photopin.com), CC BY 2.0 (http://creativecommons.org/licenses/by/2.0/)

Here Too…

Complexity Reigns in Cloud

BEEF

Nova, Cinder, etc. -> rsyslog -> tcp:5514 -> logstash -> tcp:9200 -> elasticsearch

OpenStack service config:
verbose = True
use_syslog = True
syslog_log_facility = LOG_LOCAL{n}

rsyslog forwarding rule:
local{n}.* @@logstash:5514

Standards are Elusive

§  We have a couple standards that might apply:
–  RFC5424 (The Syslog Protocol)
–  NCSA/Apache CLF (Web servers)
§  Project adoption varies, but right trajectory
§  Some duplication of fields with rsyslog
–  When shipping remotely
§  Don’t get me started on timestamps!

Anatomy of an OpenStack Message

§  Most projects use a similar format
–  Date: 2014-05-02 14:10:57.278
–  PID: 3609
–  Level: INFO
–  Prog: oslo.messaging._drivers.impl_qpid
–  ID: [-]
–  Msg: Connected to AMQP …
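The field layout on this slide can be checked with a small parser. The following is a minimal Python sketch, assuming the space-separated layout shown above. The regular expression and the function name `parse_openstack_line` are illustrative and are not part of any OpenStack library. The sample line is assembled from the field values on the slide, with the truncated message text cut short.

```python
import re

# Assumed layout: "<date> <time> <pid> <LEVEL> <program> [<request id>] <message>"
OPENSTACK_LINE = re.compile(
    r"^(?P<timestamp>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{3}) "
    r"(?P<pid>\d+) "
    r"(?P<level>[A-Z]+) "
    r"(?P<program>\S+) "
    r"\[(?P<request_id>[^\]]*)\] "
    r"(?P<message>.*)$"
)


def parse_openstack_line(line):
    """Return a dict of fields, or None if the line does not match."""
    match = OPENSTACK_LINE.match(line)
    if match is None:
        return None
    fields = match.groupdict()
    fields["pid"] = int(fields["pid"])
    return fields


# Sample built from the slide's field values (message text shortened).
sample = ("2014-05-02 14:10:57.278 3609 INFO "
          "oslo.messaging._drivers.impl_qpid [-] Connected to AMQP")
print(parse_openstack_line(sample))
```

Lines that do not start with a timestamp, such as traceback continuation lines, return None here and would need separate handling.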

use_syslog = True

§  Existing syslog format is DEPRECATED during I (Icehouse), and will be changed in J (Juno) to honor RFC5424
–  <132>
–  May 15 12:28:57
–  compute-01
–  2014-05-15 12:28:57.767
–  20739 WARNING nova.openstack.common.loopingcall
–  [-]
–  task run outlasted interval by 110.003069 sec

Note1: standard rsyslog config on CentOS 6.5 with remote shipping to central server

use_syslog_rfc_format = True

§  Adds APP-NAME before message
§  Nice idea, but…
§  Appears incompatible with use_syslog = True
–  Nova-compute fails to launch when both set
§  With use_syslog = False
–  Messages in /var/log/nova/compute.log look the same
§  Could be environmental, needs more exploration

Shipping via rsyslog

§  rsyslog.conf global settings change:
–  $ActionFileDefaultTemplate RSYSLOG_FileFormat
–  $ActionForwardDefaultTemplate RSYSLOG_ForwardFormat
§  Effect:
–  <134>
–  2014-05-15T13:37:11.138121+00:00
–  controller-01
–  2014-05-15 13:37:11.137 3412 INFO nova.openstack.common.service [-] Caught SIGTERM, stopping children
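With the forward template in place, the rsyslog envelope carries a timestamp with an explicit UTC offset. The following minimal Python sketch splits such a line into its envelope fields. It assumes the layout shown on this slide (priority, timestamp, host, then the OpenStack message). The function name `split_forwarded_line` is hypothetical, and the sample line is assembled from the slide's values.

```python
import re
from datetime import datetime

# Assumed envelope: "<PRI>timestamp host rest-of-message"
FORWARDED = re.compile(
    r"^<(?P<pri>\d{1,3})>(?P<ts>\S+) (?P<host>\S+) (?P<rest>.*)$"
)


def split_forwarded_line(line):
    """Split the rsyslog envelope from the OpenStack payload."""
    match = FORWARDED.match(line)
    if match is None:
        return None
    return {
        "pri": int(match.group("pri")),
        # fromisoformat keeps the +00:00 offset, so no timezone guessing
        "received": datetime.fromisoformat(match.group("ts")),
        "host": match.group("host"),
        "rest": match.group("rest"),
    }


sample = ("<134>2014-05-15T13:37:11.138121+00:00 controller-01 "
          "2014-05-15 13:37:11.137 3412 INFO nova.openstack.common.service "
          "[-] Caught SIGTERM, stopping children")
event = split_forwarded_line(sample)
print(event["received"].isoformat(), event["host"])
```

The inner OpenStack timestamp carries no offset, which is why the envelope timestamp is the safer one to trust when hosts run in different time zones.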

Shipping via rsyslog (conf.d)

§  rsyslog.d/10-goldstone.conf file:

$WorkDirectory /var/lib/rsyslog       # where to place spool files
$ActionQueueFileName fwdGoldstone     # unique name prefix for spool files
$ActionQueueMaxDiskSpace 1g           # 1gb space limit (use as much as possible)
$ActionQueueSaveOnShutdown on         # save messages to disk on shutdown
$ActionQueueType LinkedList           # run asynchronously
$ActionResumeRetryCount -1            # infinite retries if host is down
local0.* @@10.10.11.122:5514          # nova
local1.* @@10.10.11.122:5514          # glance
local2.* @@10.10.11.122:5514          # neutron
local3.* @@10.10.11.122:5514          # ceilometer
local4.* @@10.10.11.122:5514          # swift
local5.* @@10.10.11.122:5514          # cinder
local6.* @@10.10.11.122:5514          # keystone

Receiving via Logstash (Input)

input {
  tcp {
    port => 5514    # matches port that rsyslog ships to
    type => syslog  # insert a type field to identify this as an incoming message from syslog
  }
}

Receiving via Logstash (Output)

output {
  elasticsearch {
    host => localhost
    port => 9200
    protocol => http
  }
}

Receiving via Logstash (Patterns)

OPENSTACK_PROG (?:[ a-zA-Z0-9_\-]+\.)+[ A-Za-z0-9_\-$]+
OPENSTACK_PROG_SINGLE [A-Za-z0-9_\-$]+
OPENSTACK_SOURCE %{OPENSTACK_PROG}|%{OPENSTACK_PROG_SINGLE}
OPENSTACK_REQ_LIST (\[(?:(req-%{UUID}|%{UUID}|%{BASE16NUM}|None|-|%{SPACE}))+\])?
OPENSTACK_PID ( %{POSINT:pid:int})?
OPENSTACK_LOGLEVEL ([D|d]ebug|DEBUG|[N|n]otice|NOTICE|[I|i]nfo|INFO|[W|w]arn?(?:ing)?|WARN?(?:ING)?|[E|e]rr?(?:or)?|ERR?(?:OR)?|[C|c]rit?(?:ical)?|CRIT?(?:ICAL)?|[F|f]atal|FATAL|[S|s]evere|SEVERE|[A|a]udit|AUDIT)
OPENSTACK_NORMAL %{TIMESTAMP_ISO8601:timestamp}%{OPENSTACK_PID} %{OPENSTACK_LOGLEVEL:loglevel} %{OPENSTACK_SOURCE:program} %{OPENSTACK_REQ_LIST:request_id_list} %{GREEDYDATA:msg}
RAW_TRACE (?:^[^0-9].*$|^$)
OPENSTACK_TRACE %{TIMESTAMP_ISO8601:timestamp} %{POSINT:pid:int} ([T|t]race|TRACE) %{OPENSTACK_SOURCE:program} %{GREEDYDATA:msg}|%{RAW_TRACE:msg}
OPENSTACK_MESSAGE %{OPENSTACK_NORMAL}|%{OPENSTACK_TRACE}
OPENSTACK_SYSLOGLINE %{SYSLOG5424PRINUM}%{CISCOTIMESTAMP:syslog_ts} %{HOSTNAME:syslog5424_host} %{OPENSTACK_MESSAGE:os_message}

Receiving via Logstash (Filter Fun)

filter {
  if ([type] == "syslog") {
    grok {
      patterns_dir => "/opt/logstash/patterns"
      match => { "message" => "%{OPENSTACK_SYSLOGLINE}" }
      add_field => { "received_at" => "%{@timestamp}" }
      add_field => { "_message" => "%{syslog5424_host} %{message}" }
    }
    if ("_grokparsefailure" not in [tags]) {
      … see following slides …
    }
  }
}

Receiving via Logstash (Filter Fun)

syslog_pri {
  severity_labels => ["EMERGENCY", "ALERT", "CRITICAL", "ERROR", "WARNING", "NOTICE", "INFO", "DEBUG"]
  syslog_pri_field_name => "syslog5424_pri"
}
date {
  match => [ "timestamp", "yyyy-MM-dd HH:mm:ss.SSS" ]
  remove_field => "timestamp"
  timezone => "Etc/UTC"
}
…

NOTE1: syslog_pri parses up that ugly number at the front of the incoming message (i.e. <132>)
NOTE2: This date processing is based on the timestamp in the OpenStack generated message, not the rsyslog message. With enhanced rsyslog template, or better OpenStack message format, we can avoid inferring timezone.
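The number that syslog_pri decodes follows the RFC5424 rule PRI = facility * 8 + severity. The following minimal Python sketch shows that arithmetic, using the severity labels from the filter above. The function name `decode_pri` is illustrative, and facilities outside local0 to local7 are returned as their numeric code to keep the example short.

```python
SEVERITY_LABELS = ["EMERGENCY", "ALERT", "CRITICAL", "ERROR",
                   "WARNING", "NOTICE", "INFO", "DEBUG"]


def decode_pri(pri):
    """Split a syslog PRI value into (facility, severity label)."""
    if not 0 <= pri <= 191:
        raise ValueError("PRI must be between 0 and 191")
    facility_code, severity_code = divmod(pri, 8)
    if 16 <= facility_code <= 23:
        # Facility codes 16..23 are local0..local7
        facility = "local%d" % (facility_code - 16)
    else:
        facility = str(facility_code)
    return facility, SEVERITY_LABELS[severity_code]


print(decode_pri(132))  # ('local0', 'WARNING')
print(decode_pri(134))  # ('local0', 'INFO')
```

Both sample values from the earlier slides resolve to local0, which matches the facility assigned to nova in the rsyslog forwarding rules.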

Receiving via Logstash (Filter Fun)

translate {
  field => "syslog_facility"
  dictionary => [ "local0", "nova",
                  "local1", "glance",
                  "local2", "neutron",
                  "local3", "ceilometer",
                  "local4", "swift",
                  "local5", "cinder",
                  "local6", "keystone" ]
  fallback => "unknown"
  destination => "component"
}
…

NOTE1: syslog_facility generated by syslog_pri earlier. Adds a new component field so we can figure out who generated these messages.
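The translate step amounts to a dictionary lookup with a default. The following minimal Python sketch shows the same mapping, assuming an event is a plain dict that already has a syslog_facility field. The function name `add_component` is hypothetical.

```python
# Same facility-to-service mapping as the translate dictionary above.
FACILITY_TO_COMPONENT = {
    "local0": "nova",
    "local1": "glance",
    "local2": "neutron",
    "local3": "ceilometer",
    "local4": "swift",
    "local5": "cinder",
    "local6": "keystone",
}


def add_component(event):
    """Return a copy of the event with a 'component' field added."""
    enriched = dict(event)
    facility = event.get("syslog_facility")
    # Unmapped or missing facilities fall back to "unknown"
    enriched["component"] = FACILITY_TO_COMPONENT.get(facility, "unknown")
    return enriched


print(add_component({"syslog_facility": "local5"}))
print(add_component({"syslog_facility": "local7"}))
```

The mapping only works if each service's syslog_log_facility setting matches the table, so the two need to be maintained together.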

Receiving via Logstash (Filter Fun)

mutate {
  rename => [ "msg", "message" ]
  rename => [ "syslog5424_host", "host" ]
  remove_field => "syslog_ts"
  remove_field => "syslog5424_pri"
  remove_field => "os_message"
  add_tag => ["processed", "openstack_syslog", "filter_34"]
}

Note1: We made it to the end of the filter successfully, so let’s clean up a little and add some tags to indicate how we navigated the filter space.

Result in ES:

Photo credit: Robbert van der Steeg (http://www.flickr.com/photos/robbie73/4346732208/), via photopin (http://photopin.com), CC BY-SA 2.0 (http://creativecommons.org/licenses/by-sa/2.0/)

Interpreting Specific Messages (Patterns)

NOVA_API_CALL %{IP:ip} "(?:GET|PUT|POST|DELETE) %{URIPATH:uri} %{NOTSPACE:protocol}" status: %{NUMBER:response_status:int} len: %{NUMBER:response_length:int} time: %{NUMBER:response_time:float}
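To see what this pattern extracts, here is a rough Python equivalent. It is a sketch only: it assumes IPv4 addresses, simplifies grok's IP, URIPATH and NUMBER patterns, and uses a made-up sample message in the shape the pattern expects. The name `parse_nova_api_call` is hypothetical.

```python
import re

# Simplified stand-in for the NOVA_API_CALL grok pattern (IPv4 only).
NOVA_API_CALL = re.compile(
    r'(?P<ip>\d{1,3}(?:\.\d{1,3}){3}) '
    r'"(?:GET|PUT|POST|DELETE) (?P<uri>/\S*) (?P<protocol>\S+)" '
    r'status: (?P<response_status>\d+) '
    r'len: (?P<response_length>\d+) '
    r'time: (?P<response_time>\d+(?:\.\d+)?)'
)


def parse_nova_api_call(message):
    """Return typed API call fields, or None if the message is not an API call."""
    match = NOVA_API_CALL.search(message)
    if match is None:
        return None
    fields = match.groupdict()
    # Same type conversions as the :int and :float suffixes in the grok pattern
    fields["response_status"] = int(fields["response_status"])
    fields["response_length"] = int(fields["response_length"])
    fields["response_time"] = float(fields["response_time"])
    return fields


# Hypothetical message, not taken from a real deployment.
sample = ('10.10.11.20 "GET /v2/servers/detail HTTP/1.1" '
          'status: 200 len: 1893 time: 0.0721')
print(parse_nova_api_call(sample))
```

As in the grok pattern, the HTTP method is matched but not captured, so it is not available as a field afterwards.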

Interpreting Specific Messages

if ("_grokparsefailure" not in [tags]) {
  # clean up extra fields and tag us
  mutate {
    replace => [ "type", "openstack_api_stats" ]
    remove_field => "pid"
    remove_field => "hostname"
    remove_field => "message"
    remove_field => "_message"
    remove_field => "loglevel"
    remove_field => "syslog_severity_code"
    remove_field => "syslog_facility_code"
    remove_field => "syslog_facility"
    remove_field => "syslog_severity"
    add_tag => ["metric", "filter_37"]
  }
}

Note1: Processed after successful openstack message filtering. We know the lineage, so we don’t need to keep a bunch of redundant information.

Result in ES:

Photo credit: Www.CourtneyCarmody.com/ (http://www.flickr.com/photos/calamity_photography/4778766879/), via photopin (http://photopin.com), CC BY 2.0 (http://creativecommons.org/licenses/by/2.0/)

Querying ES for Logs

{
  "query": {
    "bool": {
      "must": [
        {"range": {"@timestamp": {"gte": "2014-05-08T16:31:07+00:00", "lte": "2014-05-15T16:31:07+00:00"}}},
        {"terms": {"type": ["openstack_log"]}}
      ]
    }
  },
  "aggs": {
    "events_by_time": {
      "date_histogram": {"field": "@timestamp", "interval": "5448.648648648648s", "min_doc_count": 0},
      "aggs": {
        "events_by_loglevel": {"terms": {"field": "loglevel"}}
      }
    }
  }
}

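The fractional interval in the log query looks like the seven-day range divided into 111 equal buckets (604800 s / 111). That is an inference from the numbers, not something the slide states. Under that assumption, a query of the same shape could be built as in this minimal Python sketch. The function name `build_log_query` and its parameters are hypothetical, and the sketch only builds the request body without sending it.

```python
import json
from datetime import datetime, timedelta, timezone


def build_log_query(end, span, buckets):
    """Build a log-count query body with `buckets` equal time slices."""
    start = end - span
    interval_seconds = span.total_seconds() / buckets
    return {
        "query": {"bool": {"must": [
            {"range": {"@timestamp": {"gte": start.isoformat(),
                                      "lte": end.isoformat()}}},
            {"terms": {"type": ["openstack_log"]}},
        ]}},
        "aggs": {"events_by_time": {
            "date_histogram": {"field": "@timestamp",
                               "interval": "%ss" % interval_seconds,
                               "min_doc_count": 0},
            # Nested aggregation: count events per log level in each slice
            "aggs": {"events_by_loglevel": {"terms": {"field": "loglevel"}}},
        }},
    }


end = datetime(2014, 5, 15, 16, 31, 7, tzinfo=timezone.utc)
query = build_log_query(end, timedelta(days=7), 111)
print(json.dumps(query, indent=2))
```

Deriving the interval from the bucket count keeps the number of points in the resulting chart constant, whatever time range is selected.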

Querying Nova API Stats

{
  "query": {
    "filtered": {
      "filter": {"match_all": {}},
      "query": {
        "bool": {
          "must": [
            {"range": {"@timestamp": {"gte": "2014-04-15T16:45:53+00:00", "lte": "2014-05-15T16:45:53+00:00"}}},
            {"term": {"component": "nova"}}
          ]
        }
      }
    }
  },
  "aggs": {
    "events_by_date": {
      "date_histogram": {"field": "@timestamp", "interval": "32400s", "min_doc_count": 0},
      "aggs": {
        "range": {
          "range": {
            "ranges": [{"to": 299, "from": 200}, {"to": 399, "from": 300}, {"to": 499, "from": 400}, {"to": 599, "from": 500}],
            "field": "response_status",
            "keyed": true
          }
        },
        "stats": {"extended_stats": {"field": "response_time"}}
      }
    }
  }
}
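One detail worth knowing about the range buckets: Elasticsearch's range aggregation includes the "from" value and excludes the "to" value, so the ranges in this query leave out statuses 299, 399, 499 and 599. Those codes are rare in practice, so the effect is likely harmless here. The following minimal Python sketch mirrors that bucket rule on the client side. The function name `bucket_counts` and the sample status list are illustrative.

```python
# (from, to) pairs as written in the query on this slide.
SLIDE_RANGES = [(200, 299), (300, 399), (400, 499), (500, 599)]


def bucket_counts(statuses, ranges=SLIDE_RANGES):
    """Count statuses per range, with 'from' inclusive and 'to' exclusive."""
    counts = {bounds: 0 for bounds in ranges}
    for status in statuses:
        for low, high in ranges:
            if low <= status < high:
                counts[(low, high)] += 1
    return counts


statuses = [200, 204, 299, 404, 500]  # made-up sample values
print(bucket_counts(statuses))
# Using 300, 400, 500 and 600 as the upper bounds covers each class fully.
print(bucket_counts(statuses, [(200, 300), (300, 400), (400, 500), (500, 600)]))
```

With the slide's bounds the 299 in the sample is not counted, while the second call places it in the first bucket.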

Manage and Monitor OpenStack

http://gssr.jpl.nasa.gov/index.html

The Goldstone Deep Space Communications Complex (GDSCC), commonly called the Goldstone Observatory, is located in the U.S. state of California's Mojave Desert. Operated for the Jet Propulsion Laboratory, its main purpose is to track and communicate with space missions. It is named after Goldstone, California, a nearby gold-mining ghost town.

(Because everyone asks…)

Log Viewing, Filtering, and Searching

Key Metric Reporting

John Stanford, VP Development; john@solinea.com

Thank You

Solinea at a Glance

OVERVIEW

Industry Focus: Open Infrastructure, OpenStack
Headquarters: San Francisco, CA
Founded: January 2013
Geographies: Asia, USA

CUSTOMERS

PAST DEPLOYMENTS

TEAM

Major US Telco

US Infrastructure Service Provider

North Asian Telco

Global Electronics Manufacturer

Global Gaming Company

Top 5 Global Automotive Firm

Tier 1 Network Provider

Risk Management Analytics

Eastern Europe CSP

OpenStack Distro Provider

Security Analytics Firm

Leading SDN Provider

Solinea Services

Conceive, Architect, Integrate, Adopt

Conceive the cloud strategy for existing and new cloud services to drive customer adoption

Architect the cloud platforms based on market demand and internal capabilities

Implement the cloud offerings and integrate them into the existing infrastructure & processes

Operate the cloud, transfer knowledge, train the team and enable rapid adoption