Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc....

54
Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. [email protected] www.perfman.com 610-865-0300 NCACMG Vienna, Virginia 5 June 2002

Transcript of Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc....

Page 1: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Analysis of Virtual Tape Subsystems Ned Diehl

The Information Systems Manager, [email protected]

www.perfman.com610-865-0300

NCACMGVienna, Virginia

5 June 2002

Page 2: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Trademarks (Omissions are unintentional)

• ISM• PerfMan• IBM• Parallel Sysplex• RMF• OS/390

• z/OS• Magstar• VSM• StorageTek• HSC• Nearline

Page 3: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Objectives

• Discuss sources of performance analysis and capacity planning data for virtual tape subsystems (VT) in an OS/390 environment

• Primary focus on hardware implementations– IBM VTS– StorageTek VSM

• Present graphical examples • System rather than application focus

Page 4: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Contents

• Introduction• Virtual Tape Structure• Data Sources• Key Performance Metrics• Recommendations• Summary• References

Page 5: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Introduction

• Solves many traditional tape problems– Small data sets– Low activity open data sets– Allocation wait for tape drives

• Weakest with “good” tape activity– Full volumes– High transfer rates

• Exploits high capacity tapes– VT helps remove implementation impediments

Page 6: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Introduction

• Consider disaster recovery– VTS Peer-to-Peer– Clustered VTSS Configuration

• Mount management considerations– Deferred mount should generally be avoided

– Premount can be desirable • Not yet everything for everyone – but getting

there– Fewer problem datasets with current

implementations

Page 7: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

VT StructureGeneral

• Looks (almost) like 3490E to host• Combination of hardware and software

– RISC processor– Library of high capacity tapes– RAID DASD buffer or CACHE – Tape drives

• Amount of host software varies with implementation

Page 8: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

VT StructureLogical Components

Page 9: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

VT StructureVTS - Physical Components

Page 10: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

VT StructureVSM - Physical Architecture

HSCHSC

Control PathControl Path

CDSCDS

VTCSVTCS

VSM Control & VSM Control & Data Path Data Path

Migrate/ Migrate/ Recall Recall VTSSVTSS

MVCMVCMVCMVC

MVCMVC

VTVVTVVTVVTV

VTVVTVVTVVTV

VTVVTV

• Physical tape drives• Physical tape volumes• Physical Libraries & slots

• Virtual tape drives• Virtual tape volumes

Page 11: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Data Sources

• Application– SMF 30, 72, 14, & 15

• Hardware– SMF 94 - tape library statistics – STK user SMF record– SMF 14 & 15 – data set activity– SMF 21 – error statistics by volume– RMF 73 - channel path activity– RMF 74-1 - device activity– RMF 78-3 - I/O queuing– Real time controller interface

Page 12: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Data SourcesSMF 94

• Easy to work with though not flexible• Hourly summary from VTS or Library

– Difficult to synchronize (do not use SMF94HHI)– Identical data to all attached recording images

• Base segments reflect all library statistics– ATL, mount, dismount, eject, & insert– Native and VTS activity

• Identical data in multiple records with a different serial number if VTS and native drives attached

• VTS segments reflect single VTS– VTS, import / export, enhanced statistics

Page 13: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Data SourcesSMF 94

• One record produced for each– Library with native drives, identified by ATL serial

(SMF94SNO). Contains base segments.– VTS, identified by VTS serial (SMF94SNO &

SMF94VLS). Contains base and VTS segments.

• With peer-to-peer, three (local) or four (remote) VTSs involved– Only two have physical tape– No easy way to associate them

• Generally consistent with RMF– RMF allows finer detail of comparable metrics

Page 14: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Data SourcesSMF 94 Data Issues

• Hour index (SMF94HHI) is interval end• Drives available (SMF94VTA) not always reset

after service– Drives used (SMF94VTV, VTN, & VTX) can be greater

than available

• Alignment problem with early Import/Export statistics (SMF94ACA & SMF94ACB)

• Duplicate library data with a different serial number if VTS and native drives attached

• Average age in cache (SMF94VCA) has had changed definitions

Page 15: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Data SourcesSMF 94 Data Issues

• Tape data transfer (SMF94VTR & SMF94VTW) reported as zero in some recent samples

• Max cache volume age (S94MTVCA) recorded as seconds, documented as minutes

• Backstore compression ratio (S94BSRAT) sometimes reported as less than one

• Recall throttle percent (S94RCPRT) sometimes greater than 100

Page 16: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Data SourcesSMF 94 Data Issues

• Reference Flash 10124 dated 20 November 2001. Recalls might be recorded as hits. Effected metrics:– S94MAXCH Maximum Cache Hit Mount Time– S94AVGCH Average Cache Hit Mount Time– SMF94VMH Number of Cache Hit Mounts– S94MAXRM Maximum Recall Mount Time– S94AVGRM Average Recall Mount Time– SMF94VMS Number of Recall Mounts– SMF94VPS Number of Physical Mounts for Recall

Page 17: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Data SourcesSTK User Record

• Complex to work with– Many subtypes– Repeating segments

• Allows detail analysis– Configuration info– CU and device busy– Volume level data– VTSS interval of 15 minutes– Grouping and summarization required– Some logical metrics require multiple subtypes

Page 18: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Data SourcesSTK User Record

• Interval metrics by VTSS (not image) to each recording OS/390– Subsystem, channel interface, and RTD

performance – Values vary with OS/390 recording time– Logically identically but physically different

• Event metrics recorded once– VTV mount, dismount, delete, migrate, recall,

movement, replicate– RTD mount, dismount, vary– MVC status

Page 19: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Data SourcesSTK Data Issues

• Must process event records from all recording images

• Mount events not necessarily recorded to original requesting image

• Dismount events not necessarily recorded to same image as mount

• Might be extra mounts or dismounts• Calculated times might be very large or

negative• Some time values in complex formats

Page 20: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Data SourcesSTK Data Issues

• RMF consistency varies– Most metrics close across multiple samples

• I/O Rates• Connect time• Allocated time• Total mount time

– Significant variation with mount counts and thus average mount time

• Probably a code level issue

Page 21: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics

• Application – Service levels

• Hardware – Data transfer– Device usage– Virtual mount time– Mount miss (recall) rate– Storage usage– Tape volume age (in DASD buffer or cache)

Page 22: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Application Performance

• Should have reasonable and measurable performance objectives

• Objectives will vary with– Installation– Application– VT– Time & day

Page 23: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Data Transfer

• Good for reporting work performed– Mounts and I/O rates also options

• Saturation varies with environment• SMF 14, 15, and 21 provide read and write• RMF 73 provides read and write for FICON.• VTS provides data transfer

– Host read (SMF94VBR) and write (SMF94VBW) are good throughput indicators

– Read (SMF94VTR) and write (SMF94VTW) between cache and real tapes

Page 24: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Data Transfer

• VTSS provides many data transfer metrics but assumptions are required– RTD bytes read (SMF20BTR) and written

(SMF20BTW) – RTD connect (SMF20DCT) and utilization

(SMF20DUT)– Host and RTD channel interface busy

(SMF11CUB)– VTV dismount has VTV size (SMF14VSZ)

• Probably best throughput indicator

Page 25: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Data Transfer

Page 26: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Data Transfer

Page 27: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Device Usage

• RMF 74-1 provides virtual by device or group.– Allows separation of mount and allocation

components (e.g. connect, pend, wait).

• VTS provides minimum, maximum, average, and configured for both virtual and physical devices. All values are integers.

• VTS average (SMF94VTV) and maximum (SMF94VTX) physical are good for trending

• While not normally a problem, virtual device use should not be ignored.

Page 28: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Device Usage

• Difficult to calculate VTSS minimum, maximum, average, and available for either virtual or physical devices– Mount, dismount, and vary subtypes must be

grouped and summarized– RTDs can be statically shared (operator

command) by multiple VTSSs and OS/390

• VTSS RTD connect and utilization, which were previously discussed, are useful

Page 29: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Device Usage

Page 30: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Device Usage

Page 31: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Device Usage

Page 32: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Device Usage (RMF)

Page 33: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Virtual Mount Time

• RMF 74-1 provides average mount time by device or group

• VTS provides minimum, maximum, and average virtual and physical mount times

• Average virtual (SMF94VRA) is good for trending

• High maximum virtual (SMF94VRX) often correlates with problems

Page 34: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Virtual Mount Time

• VTSS mount time calculated from mount event records (SMF13MET - SMF13MST)– Min, max, average and counts require grouping

and summarization– Average might be distorted by extra mounts

• VTSS provides several potentially interesting identifiers– Job, step, DSN, VTV management class– Scratch, existing

Page 35: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Virtual Mount Time

• Mount times tend to increase with:– Reduced tape volume age on DASD (SMF94VCA)– Increased mount miss rate (SMF94VMS)– Increased average real drives mounted

(SMF94VTV)

• Published targets:– Daily average less than 30 seconds– Hourly average less than 300 seconds– Maximum less than 900 seconds

Page 36: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Virtual Mount Time (RMF)

Page 37: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Virtual Mount Time

Page 38: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Max Virtual Mount Time

Page 39: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Average Virtual Mount Time

Page 40: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Mount Miss Rate

• VTS provides mounts by type– Fast ready (SMF94VFR)– Specific mount hits (SMF94VMH)– Specific mount misses (SMF94VMS)

• Related to average time on DASD (SMF94VCA) and application cycles

• Published target of miss percentage less than 20%– Miss % = 100 * Misses / Total Virtual Mounts– Miss % = 100 * VMS / (VFR + VMH + VMS)

Page 41: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Mount Miss Rate

• VTSS mount hit indicator added in recent update (SMF13RCI)– Scratch mounts indicated (SMF13VMT)– If older data, could assume a hit if calculated

mount time less than a threshold

Page 42: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Mount Miss Rate

Page 43: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Specific Mount Miss Rate

Page 44: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Mount Miss Rate

Page 45: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Storage Usage

• VTS provides estimate of data on stacked volumes (SMF94VBA) and available capacity on empty cartridges (SMF94VEC)– Reconciliation will probably free space

• High recalls (SMF94VMS) and low tape volume age on DASD (SMF94VCA) indicate need for more DASD

• More DASD can relieve saturated physical tape drives

Page 46: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Storage Usage

Page 47: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Storage Usage

• VTSS provides cache and DASD metrics– Configuration info

• DASD capacity (SMF10TCP), cache & NVS size, channels

– Used (SMF10FCP) and contiguous (SMF10CFP) free DASD space

• No easy way to report VTSS MVC metrics– Must produce & process MVC status records

Page 48: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Storage Usage

Page 49: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Tape Volume Age

• VTSS delete reference age can sometimes be related to application cycles– SMF delete time stamp – SMF15LTR– Consider selective summarization– Separate migrate immediate delete

• Subtype 18 with SMF18MTI set, closely followed by related Subtype 15 (same virtual VOLSER)

• Occasionally very high values

Page 50: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Key Performance Metrics Tape Volume Age

Page 51: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Recommendations

• Report and track application service levels• Host data transfer is good workload measure • Virtual mount time (average and maximum) is

good response measure • Keep history• Vary period and interval size with need

– Longer (e.g monthly) for trends– Shorter (e.g. daily) for glitches– Hour is good interval for most purposes

Page 52: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

Summary

• Many existing analysis tools useful• Some valuable data only available in unique

VT records• Packages probably needed for VT metrics• VT will become more common and important

Page 53: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

References

• WP100146 - IBM Magstar Model B16 Virtual Tape Server Elements of Performance, IBM

• WP100157 - VTS and Logical Paths, IBM• WP100199 - IBM Magstar 3494 Peer-to-peer VTS Performance White Paper, IBM• WP100187 - IBM Magstar 3494 Virtual Tape Server Performance White Paper, IBM• WP100239 - IBM TotalStorage Virtual Tape Server Performance, IBM• WP100243 - IBM TotalStorage Peer-to-peer Virtual Tape Server Performance, IBM• SG24-4917 - IBM Virtual Tape Server & Enhancements to Magstar, IBM• SG24-2229 - IBM Magstar VTS: Planning, Implementing, and Monitoring, IBM• SG24-4409 - Guide to Sharing and Partitioning IBM Tape Library Data Servers, IBM• SC28-1950 - OS/390 RMF Report Analysis, IBM• GC28-1783 - OS/390 System Management Facilities (SMF), IBM

Page 54: Analysis of Virtual Tape Subsystems Ned Diehl The Information Systems Manager, Inc. ned.diehl@perfman.com  610-865-0300 NCACMG Vienna, Virginia.

References

• Cheryl Watson’s Tuning Letter 2000, N0.5 - Focus: VTS Measurements

• Cheryl Watson’s Tuning Letter 2001, N0.6 - Focus: StorageTek VSM

• Introduction to Virtual Storage Manager, StorageTek

• StorageTek Virtual Tape Control System - Installation, Configuration & Administration Guide, StorageTek

• PerfMan for Tape Libraries, ISM

• PerfMan for OS/390, ISM

• CMG Proceedings

• Share Proceedings

• Vendor WEB sites