7 Habits for High Effective Disaster Recovery Administrators

36
7 Habits For Highly Effective DR Admins Henry Baltazar, Senior Analyst December 2, 2014

Transcript of 7 Habits for High Effective Disaster Recovery Administrators

7 Habits For Highly Effective DR Admins Henry Baltazar, Senior Analyst December 2, 2014

Today’s Presenter

Henry Baltazar- Senior Analyst, Forrester Research SENIOR ANALYST SERVING INFRASTRUCTURE & OPERATIONS PROFESSIONALS

Henry Baltazar is a senior analyst serving Infrastructure & Operations Professionals. He has evaluated and tested storage hardware and software offerings for more than 15 years as an industry analyst and as a journalist. Henry advises Forrester clients of data center infrastructure technologies including storage virtualization, cloud storage, solid-state storage, and primary storage arrays.

How much enterprise storage grew from 2010-2012

of companies have a RPO of

less than 1 hour

of organizations that say IT complexity is one of their top risks

A new world… by the numbers

4

Organizations struggle with many of the same backup and recovery challenges

Between 2010 and 2013, enterprise data stores grew by 60%

Business owners have less and less tolerance for any

data loss

More and more companies operate

close to 24x7 Data explosion

Increasing recovery demands

More complexity

and heterogeneity

Need to protect mobile

users

Limited backup

windows

44% of companies are now using Hyper-V

About one fifth of information workers

telecommute regularly

© 2014 Forrester Research, Inc. Reproduction Prohibited

Top Causes Of Business Downtime And Impact Of Disruption May 2014 “The State Of Business Technology Resiliency, Q2 2014”

© 2014 Forrester Research, Inc. Reproduction Prohibited

Top Causes Of Business Downtime And Impact Of Disruption (Cont.) May 2014 “The State Of Business Technology Resiliency, Q2 2014”

Step 1: Build And Refine A Clear DR Policy To Match Business Needs And Requirements

7

© 2013 Forrester Research, Inc. Reproduction Prohibited 8

Step 1: Build And Refine A Clear DR Policy To Match Business Needs And Requirements

§  RPO (Recovery Point Objective). Is defined by business continuity planning. It is the maximum tolerable period in which data might be lost from an IT service due to a major incident. The RPO gives systems designers a limit to work to.

§  RTO (Recovery Time Objective). Is the targeted duration of time and a service level within which a business process must be restored after a disaster (or disruption) in order to avoid unacceptable consequences associated with a break in business continuity.

© 2014 Forrester Research, Inc. Reproduction Prohibited

More And More Systems Are Considered Critical May 2014 “The State Of Business Technology Resiliency, Q2 2014”

© 2014 Forrester Research, Inc. Reproduction Prohibited

More And More Systems Are Considered Critical (Cont.) May 2014 “The State Of Business Technology Resiliency, Q2 2014”

11

Why cloud for backup and DR? R

ecov

ery

obje

ctiv

es

Services cost

Synchronous Replication

Asynchronous Replication

Data Loss

Recovery from tape

Seconds

Minutes

Hours

Days

$$$$ $$ $

Hot Sites, Warm Sites

Dedicated IT equipment

Cold Sites

Shared IT equipment

Gap

Recovery from disk This gap can be filled with virtualized and

cloud solutions

Base: 456 North American and European IT decision makers at enterprises who have implemented or have plans to implement IaaS

Source: Forrsights Hardware Survey, Q3 2013

“How important were the following in your firm’s decision to adopt public/hosted private/internal private cloud computing IaaS?”

Speed and improved BC/DR are the biggest drivers for cloud adoption

Step 2: Implement And Enforce DR Testing Policies

© 2013 Forrester Research, Inc. Reproduction Prohibited 14

Step 2: Implement And Enforce DR Testing Policies

§  Test after every change/upgrade/backup

§  Test servers at a minimum

§  Test things in combination to ensure you will have a successful recovery when needed

© 2014 Forrester Research, Inc. Reproduction Prohibited

DR Testing Is Trending Upward, But Updates Still Need Improvement May 2014 “The State Of Business Technology Resiliency, Q2 2014”

© 2014 Forrester Research, Inc. Reproduction Prohibited

DR Testing Is Trending Upward, But Updates Still Need Improvement (Cont.) May 2014 “The State Of Business Technology Resiliency, Q2 2014”

Step 3: Geography Matters! Leverage Cloud And Colocation Failover Sites

© 2013 Forrester Research, Inc. Reproduction Prohibited 18

Step 2: Geography Matters! Leverage Cloud And Colocation Failover Sites

§  Utilize multiple sites when possible to maximize protection

§  Make sure sites are a safe distance apart to avoid the impact of a major disaster

§  Clouds and collocation sites are becoming more popular for DR for increasing accessibility and to handle peak loads

© 2013 Forrester Research, Inc. Reproduction Prohibited 19

In-house sourcing recovery is still favored

Base: 94 global disaster recovery decision makers and influencers

How To Achieve Higher SLAs On Cloud Platforms November 2012 “Don’t Move Your Apps To The Cloud ”

© 2013 Forrester Research, Inc. Reproduction Prohibited 21

Cloud and colocation adoption for DR sites increase

Base: 180 global disaster recovery decision makers and influencers *Base: 85 global disaster recovery decision makers and influencers

There has been  a 9%inc rease in c loud-­‐based

provis ioning of rec overy s itess inc e 2010.

“How do you provision your recovery sites?”

© 2013 Forrester Research, Inc. Reproduction Prohibited 22

Average distance between DR sites is about 600 miles

Base: 85 global disaster recovery decision makers and influencers *Base: 180 global disaster recovery decision makers and influencers

Step 4: Warm Up Recovery Sites For Key Apps

23

© 2013 Forrester Research, Inc. Reproduction Prohibited 24

Step 4: Warm Up Recovery Sites For Key Apps

§  Make sure failovers can happen with minimal intervention

§  Negotiate readiness SLAs with your service provider to make sure mission critical workloads have low RPO and RTO.

© 2014 Forrester Research, Inc. Reproduction Prohibited

Recovery Time And Recovery Point Actuals Lengthen In 2013 May 2014 “The State Of Business Technology Resiliency, Q2 2014”

Like on-premise DR, recovery sites vary in temperature

› Hot cloud site: Recovery cloud is running replica VMs to production site using real-time replication.

•  Recovery time objective (RTO) : 0-2 hours •  Recovery point objective (RPO): 0-24 hours

› Warm cloud site: Recovery cloud contains offline copies of virtual machines that can be spun up during disasters or tests.

•  RTO: 2-6 hours •  RPO: 0-24 hours

› Cold cloud site: Recovery cloud contains backups of production systems that must be first rehydrated and turned into VMs before recovery can occur.

•  RTO: 4-24 hours •  RPO: 24-48 hours

$$$ $

27 © 2014 Forrester Research, Inc. Reproduction Prohibited

Think resiliency, not recovery

Recovery Implies downtime and that systems must first suffer an outage before they can resume normal operations

Resiliency Refers to the ability of a business to spring back from a disruption to its operations without an outage

Step 5: Analyze Your Networking Needs

© 2013 Forrester Research, Inc. Reproduction Prohibited 29

Step 5: Analyze Your Networking Needs

§  Configuration. Make sure primary and DR site configurations are synced including key services like DNS.

§  Connectivity. Test connectivity at your DR failover sites to ensure the replacement servers can communicate

§  Bandwidth. Adequate bandwidth must be provisioned at the failover sites.

Step 6: Fortify Your DR Environment

© 2013 Forrester Research, Inc. Reproduction Prohibited 31

Step 6: Fortify Your DR Environment

§  Security. Secondary sites need to be secured just as much as primary infrastructure.

§  Backup up your DR sites. The secondary site becomes a single point of failure without a backup.

© 2014 Forrester Research, Inc. Reproduction Prohibited

Regulatory And Legal Pressures Are Driving Improvement In DR Capabilities May 2014 “The State Of Business Technology Resiliency, Q2 2014”

© 2014 Forrester Research, Inc. Reproduction Prohibited

Regulatory And Legal Pressures Are Driving Improvement In DR Capabilities (Cont.) May 2014 “The State Of Business Technology Resiliency, Q2 2014”

Step 7: Expand The Value Of Your DR

34

© 2013 Forrester Research, Inc. Reproduction Prohibited 35

Step 7: Expand The Value Of Your DR

DR is more than an insurance policy! §  Workload migration. In the future the technology in DRaaS will be used to

facilitate workload migration and cloudbursting.

§  Test/Dev. Is a good use case for DR infrastructure.

Thank you Henry Baltazar [email protected] @StorageZar