7 Habits for Highly Effective Disaster Recovery Administrators
quorumlabs
Today’s Presenter
Henry Baltazar, Senior Analyst, Forrester Research
Henry Baltazar is a senior analyst serving Infrastructure & Operations Professionals. He has evaluated and tested storage hardware and software offerings for more than 15 years as an industry analyst and as a journalist. Henry advises Forrester clients on data center infrastructure technologies, including storage virtualization, cloud storage, solid-state storage, and primary storage arrays.
A new world… by the numbers
§ How much enterprise storage grew from 2010 to 2012
§ … of companies have an RPO of less than 1 hour
§ … of organizations say IT complexity is one of their top risks
Organizations struggle with many of the same backup and recovery challenges
§ Data explosion. Between 2010 and 2013, enterprise data stores grew by 60%.
§ Increasing recovery demands. Business owners have less and less tolerance for any data loss.
§ Limited backup windows. More and more companies operate close to 24x7.
§ More complexity and heterogeneity. 44% of companies are now using Hyper-V.
§ Need to protect mobile users. About one fifth of information workers telecommute regularly.
© 2014 Forrester Research, Inc. Reproduction Prohibited
Top Causes Of Business Downtime And Impact Of Disruption
Source: May 2014, “The State Of Business Technology Resiliency, Q2 2014”
© 2013 Forrester Research, Inc. Reproduction Prohibited 8
Step 1: Build And Refine A Clear DR Policy To Match Business Needs And Requirements
§ RPO (Recovery Point Objective). Defined by business continuity planning, the RPO is the maximum tolerable period in which data might be lost from an IT service due to a major incident. It gives systems designers a limit to work to.
§ RTO (Recovery Time Objective). The targeted duration of time and service level within which a business process must be restored after a disaster (or disruption) to avoid unacceptable consequences associated with a break in business continuity.
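To make the two objectives concrete, here is a minimal Python sketch that is not part of the presentation. It assumes that worst-case data loss equals the interval between recovery points plus any replication lag, and that recovery time is a figure measured during a DR test. The names `DrPolicy`, `worst_case_data_loss`, and `meets_policy` are hypothetical.

```python
from dataclasses import dataclass
from datetime import timedelta


@dataclass(frozen=True)
class DrPolicy:
    """Business-defined objectives for one service (hypothetical structure)."""
    rpo: timedelta  # maximum tolerable period of data loss
    rto: timedelta  # maximum tolerable time to restore the service


def worst_case_data_loss(backup_interval: timedelta,
                         replication_lag: timedelta = timedelta(0)) -> timedelta:
    """Assumption: the incident hits just before the next recovery point,
    so everything written since the last one (plus any lag) is lost."""
    return backup_interval + replication_lag


def meets_policy(policy: DrPolicy,
                 backup_interval: timedelta,
                 measured_recovery_time: timedelta,
                 replication_lag: timedelta = timedelta(0)) -> bool:
    """True only if both the RPO and the RTO are satisfied."""
    loss = worst_case_data_loss(backup_interval, replication_lag)
    return loss <= policy.rpo and measured_recovery_time <= policy.rto


# Example policy: 1-hour RPO and 4-hour RTO, checked against two setups.
policy = DrPolicy(rpo=timedelta(hours=1), rto=timedelta(hours=4))
nightly = meets_policy(policy, timedelta(hours=24), timedelta(hours=3))
frequent = meets_policy(policy, timedelta(minutes=15), timedelta(hours=3),
                        replication_lag=timedelta(minutes=5))
print(nightly, frequent)
```

Under these assumptions a nightly backup cannot satisfy a 1-hour RPO no matter how fast the restore is, while 15-minute recovery points with 5 minutes of lag can.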
More And More Systems Are Considered Critical
Source: May 2014, “The State Of Business Technology Resiliency, Q2 2014”
Why cloud for backup and DR?
[Figure: recovery objectives and data loss (seconds, minutes, hours, days) plotted against services cost ($$$$ to $). Recovery techniques shown: synchronous replication, asynchronous replication, recovery from disk, recovery from tape. Site labels shown: hot sites, warm sites, dedicated IT equipment, cold sites, shared IT equipment. A gap is marked between the options.]
This gap can be filled with virtualized and cloud solutions
Speed and improved BC/DR are the biggest drivers for cloud adoption
“How important were the following in your firm’s decision to adopt public/hosted private/internal private cloud computing IaaS?”
Base: 456 North American and European IT decision makers at enterprises who have implemented or have plans to implement IaaS
Source: Forrsights Hardware Survey, Q3 2013
Step 2: Implement And Enforce DR Testing Policies
§ Test after every change/upgrade/backup
§ Test servers at a minimum
§ Test things in combination to ensure you will have a successful recovery when needed
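One way to automate part of a recovery test is to restore a backup into a scratch location and compare it with the source. The sketch below is an illustration under that assumption, not a method described in the presentation; `sha256_of` and `verify_restore` are hypothetical names, and the demo files are made up.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Hash a file in chunks so large backups do not need to fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1024 * 1024), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_restore(source_dir: Path, restored_dir: Path) -> list[str]:
    """Return the files that are missing or differ after a test restore."""
    problems = []
    for source_file in sorted(p for p in source_dir.rglob("*") if p.is_file()):
        relative = source_file.relative_to(source_dir)
        restored_file = restored_dir / relative
        if not restored_file.is_file():
            problems.append(f"missing: {relative.as_posix()}")
        elif sha256_of(source_file) != sha256_of(restored_file):
            problems.append(f"mismatch: {relative.as_posix()}")
    return problems


# Demo with made-up files: one file is never restored, one is truncated.
with tempfile.TemporaryDirectory() as tmp:
    source, restored = Path(tmp, "source"), Path(tmp, "restored")
    source.mkdir()
    restored.mkdir()
    (source / "db.dump").write_bytes(b"orders table")
    (source / "app.conf").write_bytes(b"port=8080")
    (restored / "db.dump").write_bytes(b"orders tab")
    demo_problems = verify_restore(source, restored)

print(demo_problems)
```

A check like this covers data integrity only. It does not show that the restored servers boot or that applications work together, which is why the slide also calls for testing things in combination.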
DR Testing Is Trending Upward, But Updates Still Need Improvement
Source: May 2014, “The State Of Business Technology Resiliency, Q2 2014”
Step 3: Geography Matters! Leverage Cloud And Colocation Failover Sites
§ Utilize multiple sites when possible to maximize protection
§ Make sure sites are a safe distance apart so that a single major disaster cannot affect both
§ Cloud and colocation sites are becoming more popular for DR because they increase accessibility and can handle peak loads
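Checking site separation is simple arithmetic once the coordinates are known. The following sketch uses the standard haversine formula for great-circle distance; the site coordinates and the 100-mile threshold are placeholders chosen for illustration, since the presentation does not define what a "safe distance" is.

```python
import math

EARTH_RADIUS_MILES = 3958.8  # mean Earth radius


def distance_miles(lat1, lon1, lat2, lon2):
    """Great-circle (haversine) distance between two points in decimal degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    d_phi = phi2 - phi1
    d_lambda = math.radians(lon2 - lon1)
    a = (math.sin(d_phi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(d_lambda / 2) ** 2)
    return 2 * EARTH_RADIUS_MILES * math.asin(math.sqrt(a))


def far_enough(primary, recovery, minimum_miles):
    """True if the two (lat, lon) sites are at least minimum_miles apart."""
    return distance_miles(*primary, *recovery) >= minimum_miles


# Hypothetical sites; the threshold is an assumption, not a recommendation.
primary_site = (40.7128, -74.0060)   # New York City
recovery_site = (41.8781, -87.6298)  # Chicago
separation = distance_miles(*primary_site, *recovery_site)
print(round(separation), far_enough(primary_site, recovery_site, minimum_miles=100))
```

Straight-line distance is only a first filter. Two sites far apart can still share a power grid, a carrier, or a hurricane track, so the threshold should come from the risks the business actually faces.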
In-house sourcing recovery is still favored
Base: 94 global disaster recovery decision makers and influencers
Cloud and colocation adoption for DR sites increases
Base: 180 global disaster recovery decision makers and influencers *Base: 85 global disaster recovery decision makers and influencers
There has been a 9% increase in cloud-based provisioning of recovery sites since 2010.
“How do you provision your recovery sites?”
Average distance between DR sites is about 600 miles
Base: 85 global disaster recovery decision makers and influencers *Base: 180 global disaster recovery decision makers and influencers
Step 4: Warm Up Recovery Sites For Key Apps
§ Make sure failovers can happen with minimal intervention
§ Negotiate readiness SLAs with your service provider to make sure mission-critical workloads have low RPO and RTO.
Recovery Time And Recovery Point Actuals Lengthen In 2013
Source: May 2014, “The State Of Business Technology Resiliency, Q2 2014”
Like on-premises DR, cloud recovery sites vary in temperature
› Hot cloud site: The recovery cloud runs replicas of production-site VMs, kept current with real-time replication.
• Recovery time objective (RTO): 0-2 hours • Recovery point objective (RPO): 0-24 hours
› Warm cloud site: The recovery cloud contains offline copies of virtual machines that can be spun up during disasters or tests.
• RTO: 2-6 hours • RPO: 0-24 hours
› Cold cloud site: The recovery cloud contains backups of production systems that must first be rehydrated and turned into VMs before recovery can occur.
• RTO: 4-24 hours • RPO: 24-48 hours
Relative cost: $$$ (hot) to $ (cold)
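The ranges above can be turned into a simple selection rule. This sketch is an illustration, not Forrester's method: it assumes you plan against the upper end of each RTO and RPO range and that cost rises from cold to warm to hot. `SiteTier` and `cheapest_tier` are hypothetical names.

```python
from typing import NamedTuple, Optional


class SiteTier(NamedTuple):
    name: str
    max_rto_hours: float  # upper end of the RTO range on the slide
    max_rpo_hours: float  # upper end of the RPO range on the slide


# Ordered cheapest first (assumption: cold < warm < hot in cost).
TIERS = [
    SiteTier("cold", 24, 48),
    SiteTier("warm", 6, 24),
    SiteTier("hot", 2, 24),
]


def cheapest_tier(required_rto_hours: float,
                  required_rpo_hours: float) -> Optional[SiteTier]:
    """Return the cheapest tier whose worst case still meets both objectives."""
    for tier in TIERS:
        if (tier.max_rto_hours <= required_rto_hours
                and tier.max_rpo_hours <= required_rpo_hours):
            return tier
    return None  # no tier guarantees this; a tighter SLA must be negotiated


for rto, rpo in [(24, 48), (8, 24), (2, 24), (1, 1)]:
    choice = cheapest_tier(rto, rpo)
    print(rto, rpo, choice.name if choice else "none")
```

A result of `None` means the published ranges alone do not guarantee the requirement, which is where a negotiated readiness SLA with the provider comes in.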
Think resiliency, not recovery
Recovery: Implies downtime, and that systems must first suffer an outage before they can resume normal operations.
Resiliency: Refers to the ability of a business to spring back from a disruption to its operations without an outage.
Step 5: Analyze Your Networking Needs
§ Configuration. Make sure primary and DR site configurations are synced, including key services like DNS.
§ Connectivity. Test connectivity at your DR failover sites to ensure the replacement servers can communicate.
§ Bandwidth. Adequate bandwidth must be provisioned at the failover sites.
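Bandwidth sizing comes down to dividing the data to be moved by the usable link rate. The sketch below shows that arithmetic; the 80% link efficiency and the example volumes are assumptions for illustration only, and the function names are hypothetical.

```python
def replication_hours(changed_gb: float, link_mbps: float,
                      efficiency: float = 0.8) -> float:
    """Hours needed to copy changed_gb (decimal gigabytes) over a link.

    efficiency is the assumed fraction of the nominal rate that is usable
    after protocol overhead and competing traffic.
    """
    bits_to_send = changed_gb * 8e9
    usable_bits_per_second = link_mbps * 1e6 * efficiency
    return bits_to_send / usable_bits_per_second / 3600


def required_mbps(changed_gb: float, window_hours: float,
                  efficiency: float = 0.8) -> float:
    """Nominal link rate needed to copy changed_gb within window_hours."""
    bits_to_send = changed_gb * 8e9
    return bits_to_send / (window_hours * 3600) / efficiency / 1e6


# Example: 500 GB of daily change, a 100 Mbps link, an 8-hour window.
hours_needed = replication_hours(500, 100)
rate_needed = required_mbps(500, 8)
print(f"{hours_needed:.1f} hours at 100 Mbps; "
      f"{rate_needed:.0f} Mbps to finish in 8 hours")
```

The same calculation applies in the other direction: failing back to the primary site after an event needs bandwidth too.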
Step 6: Fortify Your DR Environment
§ Security. Secondary sites need to be secured just as much as primary infrastructure.
§ Back up your DR sites. The secondary site becomes a single point of failure without a backup.
Regulatory And Legal Pressures Are Driving Improvement In DR Capabilities
Source: May 2014, “The State Of Business Technology Resiliency, Q2 2014”
Step 7: Expand The Value Of Your DR
DR is more than an insurance policy!
§ Workload migration. In the future, the technology in DRaaS will be used to facilitate workload migration and cloudbursting.
§ Test/Dev. Testing and development are a good use case for DR infrastructure.
Thank you
Henry Baltazar [email protected] @StorageZar