HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we...

37
© 2009 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. HP Operations Manager i 8.10 Jon Haworth – Product Marketing Manager Dave Trout – Senior Consultant

Transcript of HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we...

Page 1: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

© 2009 Hewlett-Packard Development Company, L.P.The information contained herein is subject to change without notice.

HP Operations Manager i 8.10Jon Haworth – Product Marketing ManagerDave Trout – Senior Consultant

Page 2: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Agenda• What’s the problem?

• Introducing OMi

• OMi Event Management Foundation

• OMi Health Perspectives

• OMi Topology Based Event Correlation

• OMi demo

• Review and Q&A

2

Page 3: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

The pressures facing the VP of Ops

VP Ops

Optimize IT Support Efficiency73-80% of IT budgets are spent “keeping the lights on” i

Reduce Downtime Downtime means lost business: $1.6m per hour across all industries ii

Downtime costs user productivity: application downtime causes an average14% loss in worker productivity iii

Network downtime is the most expensive at $69,000 per minute iv

Maximize “this generation”:No-one will give funding to invest in next-generation: “maximize what you already have”

i Economist (2008), Yankee (2007)ii Emerging Strategies for IT Mgmt (2004)

iii Yankee (2007)iv Aberdeen (2007)

Page 4: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Why Consolidate from multiple SilosApplications Infrastructure

DNS server issue

$225User

Experience Monitors

App. Monitor

Server Monitors

Network Monitors

Storage Monitors

Transaction Monitors

Business Process Monitors

App. Internals

Native Server Tools

3rd party and open

source

• Level 1 server support is working on fix

• Level 2 duplicates the level 1 response

User issue Connectivity issue

Application support Server support Network support

• Wasted time due to multiple people working on the same issue

• Other Level 1 operators will also start work - and will escalate when they cannot solve the problem

$75 $75$75

Page 5: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Working smart: Operations bridgeApplications Infrastructure

App. Monitor

Server Monitors

Network Monitors

Transaction Monitors

Business Process Monitors

App. Internals

Native Server Tools

3rd party and open source

Consolidated event consoleCross domain event consolidation & resolution

Application support

Serversupport

Networksupport

Storagesupport

Open ticket in service desk routed to SMEUnsolved incidents

Incidents that require a SMEAny remaining unsolved incidents

Tier 1Support

Tier 2Administrators

Tier 3Architects

Operations bridge

DNS server issueUser issue Connectivity issue

User Experience Monitors

Storage Monitors

Investigate related events

Investigate service impacting event

$150

$75 $75

Page 6: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Agenda• What’s the problem?

Introducing OMi

• OMi Event Management Foundation

• OMi Health Perspectives

• OMi Topology Based Event Correlation

• OMi demo

• Review and Q&A

Page 7: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

7

Operations Manager - released fifteen years ago - is successfully used by tens of thousands of customers throughout the world.

With the brand new Operations Manager i (OMi) we provide a Consolidated Event and Performance Management solution that can handle events from anything from business process, user experience, SOA, and application down to infrastructure and network.

It’s based on Operations Manager and leverages the “360 degree” CMDB used throughout HP’s BTO products to understand full business impact and to quickly determine the cause of problems.

Of course, with all this help from OMi, first level support can solve a lot more problems than they do at present, freeing up our experts to do things that move the business forward.

More operations efficiency with HP Operations Management OMi

Page 8: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

8

Introducing Operations Manager i 8.0 (OMi)

New product set built using BSM technologies

New Console: Common L&F with BAC 8.0 user experience

Tightly integrated with other BSM

applications such as BAC, EUM, BPM

UCMDB is common across many BSM products. It manages configuration items (=CIs) such as systems, applications, business services and their dependencies

Page 9: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

9

3 Components that make OMi 8.0

•High level value on top of HP OM technologies

•No rip and replace of existing OM installations

HP Operations Manager

Agents Agents

Agents

Agents

Agents

Agents

3 different OMi products: Topology Based Event Correlation and Health Perspective Views *require* Event Management Foundation

OMi

Event Management Foundation

Health Perspective ViewsTopology Based Event Correlation

New HP Software and Services Operational Management solution

Page 10: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Investment protection of existing OMs

• All events sent to OMi go via a single OM.• All tools / action requests initiated from the

OMi server are channelled thru the OM server to OM agents.

• Graphs require the agent infrastructure.

OMi depends on OM

OM is OMi’s admin proxy

• All deployment / configuration of OM agents is initiated and performed from the OM server.

• External notification interfaces (TT, paging etc.) are still maintained on the OM server.

• OM server is the report generator.• OM Smart Plug-Ins can populate the

UCMDB• You do not have to do a rip and replace

upgrade to enable OMi to be used.

HP Operations Manager

AgentsAgents

OM SiteScopeNNM3rd party

AgentsAgentsalerts

Discovery

OM SPIs

HP BSM Foundation UCMDB

Event

Health TBEC

events

External Notification

MoM scenarios are supported

OMi

Reporting

No massive changes required

10

Page 11: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

BSM Foundation & OMi

11

Core services

Business Service Dashboard

Service Model (UCMDB) 360º view

OMi Event Management Foundation OMi Health

Perspectives

OMi Topology Based Event Correlation

BAC Service Level

Management

BAC Business Transaction

Management

Operations Views

Service Level Managers

Views

Application Specialist

Views

Modules of business logic plug into the

BSM Foundation to provide

different facilities and views for

varied personnel – all based on a common set of data (UCMDB)

Multi-source discovery• DDM• Federation

HP Operations Manager

AgentsAgents

SiteScope

events

NNMi

Smart PlugIns

Page 12: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Agenda• What’s the problem?

• Introducing OMi

OMi Event Management Foundation

• OMi Health Perspectives

• OMi Topology Based Event Correlation

• OMi demo

• Review and Q&A

Page 13: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

13

OMi Event Management FoundationProvides the core event management functionality in the BSM platform

Events assigned per user or

group

Events Filtered by UCMDB CI

views

Related CI

Dynamically updated

UCMDB CI views

Event lifecycle actions

Event related actions

Assignment can be automatic

Event details

History browser

Page 14: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Event Mapping to CIs

14

Relationship of available events to dynamically updated CIs

BSM Platform

Event Consolidation through OM UCMDB maintains auto-discovered CIs

• Events and discovery data are brought together• End-to-end visibility of infrastructure and alerts by

showing relationships of events to CIs and business services that are impacted.

• Shows CIs in context.• 1 stop shop of discovery data, all data

at a single place• CIs in UCMDB are dynamically updated

impact analysis is vastly improved.

Page 15: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

OMi

TopologySynchroni-

zation

15

OMi CI View

Discovery information is not limited to DDM – it also includes discovery from OM SPIs and NNMi (8.10)

OM Service Maps

Dynamically updated CIs

DDM

NNMi

Comprehensive, automatic discovery

UCMDB OM Node

Map

OM SPI Discovery

UCMDB is the single source of truth / single authority

EUM

Timely: New infrastructure is included as soon as discovered

Page 16: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

OMi

16

Consolidating events from a variety of sources, heterogeneous, cross-domain, both HP and 3rd party

Storage Essentials

Event Consolidation through a dedicated OM server

SIMNNMiSiteScopeOM Agents

BACKPIsBPMRUM

Event Consolidation

EUM Alerts

KPI change alerts

OM messages

OM Server

Bi-directional event/message synchronization

CI resolutionbased on CMA,

service ID, application, object and host

3rd

Partytool

Page 17: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Performance Graphing

OM agent

UCMDBEvent

Event data

PA agent

OM agent

PA agent

OM Server

OM Server

PerformanceData

Architecture • Event communication only through OM servers.

• In contrast, performance data directly accessed from performance agents.

• Both the OM agent embedded performance component and performance agent (PA) metrics are available to the OMi user.

OMi

Highly scalable as performance data is distributed / kept locally on the agents. Allows for very

short sampling intervals.

17

Page 18: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

OMi tools to further investigate or remediate problems

OM agent

UCMDBEvent

OM agent

OM Server

OM Server

Tool Execution Architecture

Event data

Action Requests

Tools of type Executable and Script

Tools of type “URL”

• Tools of type “URL” will be directly launched from the OMi operator console.

Exe and Scripts

“URL” Tools • Tool communication through dedicated top level OM server and direct interaction with OM agents.

• All nodes that execute actions need to be configured on top level OM server.

• Flexible Management (MoM) environments are supported.

OMi

External Notification

No OM GUI Tools

18

Page 19: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Agenda• What’s the problem?

• Introducing OMi

• OMi Event Management Foundation

OMi Health Perspectives

• OMi Topology Based Event Correlation

• OMi demo

• Review and Q&A

Page 20: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Health Perspective Views

20

Consolidated event management • KPI degradations• Event details• Root event analysis

• Accurate Real-time IT infrastructure health views

• One model for whole BSM stack • True health status on multiple KPIs

• Accurate Infrastructure CI Health views based on ALL available events• Presented in a context consistent with Business Service views in OMi and BAC

Page 21: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Health Perspective Views and Navigation

1 2

Based on CI type

Health indicators are shown for mapped CI

Health indicator view is updated if another CI is selected in topology view

3

Event is mapped to CI.CI Type (& event category) defines which view is shown below.

21

Page 22: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

What are Health Indicators (= HIs)?

• More precise data – easy to understand for operators.

• Shows various aspects of a CI, not a single consolidated status only.

• More fine grained than KPIs.• Glue between events and health

KPIs.• CI state across domain managers.• HIs are generic and independent of

the monitoring solution

• Independent of the event life cycle− Operator can close events, but

the true health is still shown− Operators can see the detailed

health of a CI without having access to events.

Health Indicators expose detailed state of Configuration Items (CIs),

easy to understand for the operator

New with OMi: Detailed Health

22

Page 23: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

23

Health Perspectives KPIs

Health Indicators are used to calculate Operations Performance and Operations Availability KPIs

• KPIs represent the current, actual status of IT infrastructure elements

• KPI status is propagated up through a dependency graph

• KPI status expressed through severities

•Unresolved & Unassigned Events summarize events for the CI

• Present health and event summary of the CI in one view

Page 24: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Agenda• What’s the problem?

• Introducing OMi

• OMi Event Management Foundation

• OMi Health Perspectives

OMi Topology Based Event Correlation

• OMi demo

• Review and Q&A

Page 25: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

25

Topology Based Event Correlation (TBEC)Changes the way the operator interacts with the event stream

Focus the right team on dealing with the event Eliminate duplication of effort and chasing false leads.

Cross domain issues manifest themselves with multiple events. If the system had the logic to separate out causes from symptoms, the event is immediately assigned to the right team

that solves the problem and lets other groups focus on what matters to the business.

Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation

Page 26: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

26

TBEC: Causes and Symptoms

Cause

Cause and Symptom

Symptom

Pinpoint the cause - indication of causes and symptoms in browser

TBEC is unique and very effective as it operates on infrastructure and impacted business services in the

UCMDB that are dynamically updated.

Page 27: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

27

TBEC: Do proper assignments of problems

Cause

Cause and Symptom

Symptom

Storage team needs to fix this, NOT the database or Employee Self Service

Application specialists

As soon as Storage experts work on his message, DB and app specialists see that someone is working on “their” events”

Page 28: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

28

TBEC: Reduce event noise

+ working on/closing the cause, changes Lifecycle state of symptoms as well

Reduction of events in browser: From three …

… to one: filtered browser just shows the causes

Simplified event view enables focus on what matters to the business

Minimizes distraction for operations experts

Page 29: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

TBEC: A simple correlation rule

29

Event 1: Node state is down

Event 2: Web app state is slow

Host and J2EE app are some-how connected (topology)

Events 1 and 2 occur atroughly the same time

System will mark the “Node state down” event as cause and

the “Web application state slow” as symptom

Page 30: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Cross-domain correlation demystified

Database DomainJ2EE JDBC Data Source

Database

Host

Depends

Container Link

WebSphere Domain

Member

J2EE JDBC Data Source

J2EE /App Server

J2EE Cluster

Deployed

Container Link

J2EE Domain

Rules in multiple domains have been defined… the domains overlap

Carol teaches the system how problems with the datasource will affect the IBM

WebSphere J2EE cluster

Bill, the database domain expert, knows all about databases and its storage and how related problems are connected.

30

Depends

File System

Page 31: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Database Domain

Automatic cross-domain correlationSingle-domain correlation rules create rule-chains across domains

OMi will combine the domain knowledge and

identifies the file system CI event as the root cause for the event monitored at the J2EE

cluster.

31

J2EE JDBC Data Source

Database

Host

Depends

Container Link

Depends

File System

WebSphereDomain

Member

J2EE /App Server

J2EE Cluster

Deployed

Container Link

J2EE Domain

Page 32: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

With OMiApplications Infrastructure

App. Monitor

Server Monitors

Network Monitors

Transaction Monitors

Business Process Monitors

App. Internals

Native Server Tools

3rd party and open source

Consolidated event consoleOMi Cross domain event correlation (TBEC)

Application support

Serversupport

Networksupport

Storagesupport

Open ticket in service desk routed to SMEUnsolved incidents

Incidents that require a SMEAny remaining unsolved incidents

Tier 1Support

Tier 2Administrators

Tier 3Architects

Operations bridge

DNS server issueUser issue Connectivity issue

User Experience Monitors

Storage Monitors

$75

Investigate single causal event (with related symptoms)$75

For a SUBSETof incidents

32

Page 33: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

TBEC: Summary• Topology based event correlation allows you to pinpoint

cause and symptom events and to reduce the number of events in the browser views

• The correlation rules refer to Event Type Indicators, and are therefore only loosely coupled to the actual source events. Therefore correlation rules don’t have to change when the events or the underlying monitoring changes.

• Correlation rules can be easily added and modified using the Administration UI (Correlation Manager)

• Topology based correlation rules make use of the existing topology in the UCMDB

33

Page 34: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Agenda• What’s the problem?

• Introducing OMi

• OMi Event Management Foundation

• OMi Health Perspectives

• OMi Topology Based Event Correlation

OMi demo Dave Trout

• Review and Q&A

Page 35: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Agenda• What’s the problem?

• Introducing OMi

• OMi Event Management Foundation

• OMi Health Perspectives

• OMi Topology Based Event Correlation

• OMi demo

Review and Q&A

Page 36: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

Operations Manager i 8 : OMi• Bring together all event and discovery information from the

IT infrastructure and• Apply dynamic logic to these two sets of data to

– Links the events to the most up to date definition of the business services to enable prioritization of operations activities within the OMi Event Management Foundation

– Accurately display infrastructure health across multiple ‘domains’ to reduce duplication of effort and ‘finger pointing’ using OMi Health Perspective Views

– Simplify event streams – focus operators on what matters using OMi Topology Based Event Correlation

– Utilizes correlation and health calculation logic which ‘dynamically’ ties to discovery so administration effort is minimal

36 7/24/2009

Value to the business OpEx

Page 37: HP Operations Manager i8 · • One model for whole BSM stack ... Note, in this presentation, we use TBEC to abbreviate Topology Based Event Correlation. 26. TBEC: Causes and Symptoms.

© 2009 Hewlett-Packard Development Company, L.P.The information contained herein is subject to change without notice.

www.hp.com/go/OMi

Q&A