COMPARISON OF VMWARE VSPHERE HA/FT AND … · Comparison of VMware vSphere A/FT ... multiple hosts...

6
www.stratus.com COMPARISON OF VMWARE VSPHERE ® HA/FT AND STRATUS ® ftSERVER ® SYSTEMS

Transcript of COMPARISON OF VMWARE VSPHERE HA/FT AND … · Comparison of VMware vSphere A/FT ... multiple hosts...

www.stratus.com

COMPARISON OF VMWARE VSPHERE® HA/FT AND STRATUS® ftSERVER® SYSTEMS

www.stratus.com

White Paper | 2Comparison of VMware vSphere HA/FT and Stratus ftServer Systems

Ensuring Availability of Virtualized Business-Critical Applications in an Always-On World

This paper presents two distinct approaches to ensuring availability for virtualized business-critical applications: VMware vSphere HA or FT and Stratus ftServer systems. The paper compares each solution in terms of installation, maintenance, and costs. Most important, it highlights the fundamental differences regarding downtime recovery versus downtime prevention.

Solution Implementation: Complexity vs. SimplicityEstablishing a high-availability environment based on clustering is often a complex and costly endeavor. VMware vSphere HA requires, at minimum, a two-node cluster to enable failover protection for VMware environments (Figure 1). Similarly, vSphere FT creates a shadow instance of a VM, which requires multiple hosts and additional storage configuration. vSphere FT also requires a dedicated network for VMware vMotion to enable failover.

In a VMware environment comprised of many hosts, each node belonging to the HA/FT cluster must be identified and appropriate network connectivity between them established. The solution also requires an external storage array. It is then necessary to configure a number of settings to determine how the HA or FT cluster will behave and ensure that adequate resources are available in the event of a failover.

IntroductionVirtualization has become standard practice in organizations of all types and sizes around the world, due to the advantages it offers such as agility, efficiency, and scalability compared

to traditional physical infrastructures. Initial concerns about virtualizing applications, including performance, support, and licensing, have disappeared thanks to advances provided by technologies such as VMware vSphere. In fact, organizations by the thousands have gained the confidence to virtualize even business-critical applications such as Microsoft® Exchange®, Oracle® Database, and SAP® Enterprise Resource Planning.

As more and more business-critical applications are virtualized, however, a new concern has emerged: availability. Outages are costing companies more and more each year with the average cost of an hour of downtime for large companies now at a staggering $686,2501. While some applications can tolerate brief outages, downtime of business-critical workloads, with the associated risk of data or transaction losses, is simply unacceptable. And with multiple virtual machines (VMs) running on a single physical host, one hardware failure can have widespread business impact.

In today’s always-on world, ensuring availability of virtualized business-critical applications is absolutely essential. Most approaches to minimizing downtime, such as VMware vSphere High Availability (HA), employ server clusters and failover mechanisms that restart VMs on another host in the event of a hardware or operating system fault. However, the process of recovery not only takes time but also implies that damage has already been incurred. Ideally, the systems on which virtualized business-critical applications run should prevent downtime in the first place by working through system faults, thus avoiding business disruption entirely.

External LAN

LAN 1 LAN 2

Heartbeat 1

Heartbeat 2

Switch 1

Switch 2

SAN

VMWare Host 1

LOGICAL SERVER

VMWare Host 2

Figure 1: Two-node vSphere HA cluster.

1 “Preventing Virtual Application Downtime,” Jim Rapoza, Aberdeen

www.stratus.com

White Paper | 3Comparison of VMware vSphere HA/FT and Stratus ftServer Systems

By contrast, Stratus ftServer systems provide out-of-the-box fault tolerance for VMware environments with no additional hardware, network, or software requirements (Figure 2).

Figure 2. Stratus ftServer Fault-Tolerant System

Stratus ftServer systems are built on cost-effective industry-standard hardware powered by Intel® multi-core processors. Installation is fast and easy, offering full support for all standard vSphere products — including Distributed Resource Scheduler (DRS), vMotion®, Storage vMotion and HA — with no additional configuration work or system modifications required.

A typical vSphere HA or FT cluster could take several days to install, configure, and validate considering the requirements to provide multiple servers, dedicated cluster networking, and shared storage. In contrast, a Stratus ftServer system can typically be installed in a few hours.

Total Cost EvaluationWhen evaluating the cost of ensuring availability using vSphere HA/FT clusters versus Stratus ftServer, it is important to look at the whole picture.

A vSphere HA/FT cluster requires at least one additional server, as well as a management console, a high-availability network, and external storage. With the average life of a standard server just three to five years, recurring hardware expenses can also be expected along with downtime to perform migrations and upgrades. In addition, the cluster may require multiple software licenses for each server and the management console.

Stratus ftServer systems are a complete, integrated solution for ensuring continuous application availability. They require no additional servers, networking, or storage. Moreover, only one license is needed for the operating system and application compared to multiple operating system licenses in a cluster scenario, lowering total costs substantially. In addition, ftServer systems have historically averaged eight years in ser-vice, enabling IT organizations to stretch their budgets and minimize upgrade cycles.

Another consideration is the ongoing cost of “renting” VMs off a virtualized cluster infrastructure compared to having a dedicated standalone solution like ftServer. Many times, IT departments will set up these virtual machines and cross-charge to each department using them. The increased costs and IT skills needed to run a VMware cluster can be passed to each individual department. The cost-efficient design and simple skills needed to run an ftServer system make it a more economical choice.

Management and support are also important cost considerations. Clusters require extensive hands-on care, often necessitating the expense of full-time administrative staff. In fact, cluster capacity, policies, resources, and software changes must all be regularly managed and tested to validate proper operation.

VirtualMachine 1

VirtualMachine 2

VirtualMachine 3

Virtualization Layer

APP

OS

DB

OS

APP

OS

www.stratus.com

White Paper | 4Comparison of VMware vSphere HA/FT and Stratus ftServer Systems

With resilient hardware and built-in service technology, administration of ftServer systems is minimal. In fact, the Stratus automated service technology makes ftServer systems essentially self-managing, requiring little to no hands-on atten-tion and, consequently, very little cost. When service is required, Stratus also performs remote software updates and proactively delivers hardware components on-site that can simply be plugged into the ftServer system while it continues to run. As a result, organizations can seamlessly upgrade the system without disrupting operations and avoid the normal IT lifecycle of rip and replace every few years. Over the long term, this ease of servicing avoids hundreds of thousands of dollars in upgrade expenses typically associated with clustered infrastructures.

Figures 3 and 4 highlight typical cost differences using output from the Stratus Total Cost of Ownership Analyzer.

Availability Report CardIn terms of maximizing availability of business-critical virtualized applications, vSphere HA and ftServer systems offer two fundamentally different approaches: vSphere HA clusters rely on system failover while ftServer systems provide downtime prevention. Because of these differences, there are several important factors to consider when evaluating the two solutions.

While vSphere HA can reduce application downtime by automatically restarting virtual machines upon VM failure, it does not protect the host server or hypervisor against downtime or performance degradation. However, this restart may take many minutes or hours for large systems. Moreover, any data uncommitted to disk is lost during application or server crashes and restarts (see Figure 5).

Figure 5: VMware vSphere HA restart times.

Stratus ftServer systems deliver fault tolerance and seamlessly integrate with standard vSphere products to avoid downtime completely instead of recovering from it. They provide 99.999+ percent availability to protect business-critical applications from host faults that would otherwise result in VM failures. And in-flight data for process and batch applications is fully protected from loss.

• Restart time varies depending on the application. Below are some observations: • ABAP Central / Dialog Instances - ~4-5 mins for OS restart and SAP to come online. • Java Central / Dialog Instances - ~15-17 mins for OS restart and SAP to come online. • DB Instance - ~4-5 mins for OS restart and SQL Server to recover. • Web Dispatcher Instance - ~3-4 mins.

Source: SAP Saber — Carving SAP into separate landscapes for company split.Van Vi and Rick Jones

Stratus ftServer systems deliver fault tolerance and seamlessly integrate with standard vSphere products to avoid downtime completely instead of recovering from it.

Figure 3: TCO three year comparison for manufacturing application.

Figure 4: TCO five year comparison for manufacturing application.

www.stratus.com

White Paper | 5Comparison of VMware vSphere HA/FT and Stratus ftServer Systems

Figure 6 provides a side-by-side comparison of the key availability capabilities offered by each solution.

VMware HA Stratus ftServerAvailability Philosophy Fast recovery after failure Failure preventionTypical Availability Level 99.9% to 99.99% 99.999% +

Points of Failure CPU’s & memory; storage array & initiators; switches None

Susceptibility to Lost Data Any in-flight data is lost NoneAbility to Detect and Isolate Transient Errors No Yes

Dependency on IT Staff Moderate to high Low

Configuration Requirements

2 or more servers & OS; shared storage; may require multiple copies of software

1 server & OS

Other Requirements Requires vCenter Server NoneSingle System View Yes Yes

Figure 6: VMware vSphere HA and Stratus ftServer availability matrix.

In addition, VMware FT clusters continue to have limitations. While the previous single processor core per virtual machine restriction has been eliminated, the company’s second generation product is still limited to a maximum of four vCPUs per protected VM and this capability is restricted to the most expensive version of vSphere (Enterprise Plus). On all other versions, the VMware FT feature is limited to two vCPUs per VM. Whichever version is elected, no more than four protected VMs and no more than eight vCPUs are supported per host.

Even with the move to support multiprocessors, VMware FT’s applicability to protect most business-critical applications may come into question. Typically, Tier 1 applications are performance intensive and require numerous processors to execute tasks effectively. For example, Microsoft recommends four to 12 dual-socket cores for Exchange and four to eight cores for SharePoint and SQL Server. Oracle recommends six to 12 cores. VMware is also discouraging the use of VMware FT for workloads that are latency sensitive or that have real time or near real time requirements.

In addition to the licensing and configuration restrictions, the cost associated with a VMware FT implementation may be prohibitively high for some customer environments. For example, VMware recommends a minimum of three host servers per FT cluster, two different networking switches,

and three or more NICs to ensure availability. Compounding the server and networking expenses is the requirement for an external shared storage device.

Stratus ftServer has no equivalent performance or configuration limitations, making it a more user friendly and cost-effective solution capable of deployments from the data center to branch and edge environments.

Stratus imposes no hard limits on the number of vCPUs or amount of virtual memory per virtual machine. Furthermore, the number of VMs that can be hosted on an ftServer is limited only by its configuration and an organization’s policies on system over-provisioning.

ConclusionWith organizations of all types and sizes having greater reliance on virtualized business-critical applications, ensuring availability of the VMware environment is a top priority. VMware vSphere HA and FT provide the availability needed for certain applications but still put organizations at risk for periods of downtime and potential data loss. To strengthen availability of VMware environments, Stratus ftServer offers organizations a fault-tolerant solution that prevents downtime and protects against data loss.

While vSphere HA reacts to an application or system failure by invoking recovery processes, ftServer systems continuously take proactive measures to prevent downtime from occurring in the first place. With vSphere HA, VMs must be restarted on another host meaning that in-flight data is lost. In contrast, ftServer systems ensure that no data is lost.

Availability clusters, as used with vSphere HA and FT, are inherently more complex. Even when performing perfectly, dependencies among third-party application components can result in extended restart times. Stratus ftServer is a simple, easy-to-use, turnkey solution that allows organizations to run VMware environments out of the box and automatically gain reliable downtime-prevention with no external dependencies and no special IT skills needed to operate.

When comparing vSphere HA/FT and ftServer systems the differences are clear: vSphere HA clusters require complex implementation and administration, multiple software licenses, and still, data loss can occur during failover with the vSphere HA product. Stratus ftServer systems offer plug-and-play simplicity, single software image, and zero data loss. Ultimately, ftServer systems provide the highest possible availability for VMware environments for the lowest total investment in time and money.

www.stratus.com

White Paper | 6Comparison of VMware vSphere HA/FT and Stratus ftServer Systems

Specifications and descriptions are summary in nature and subject to change without notice.

Stratus and the Stratus Technologies logo are trademarks or registered trademarks of Stratus Technologies Bermuda Ltd. All other marks are the property of their respective owners. ©2014 Stratus Technologies Bermuda Ltd. All rights reserved. 140326

About StratusStratus Technologies is the leading provider of infrastructure-based solutions that keep applications running continuously in today’s always-on world. Stratus enables rapid deployment of always-on infrastructures — from enterprise servers to clouds — without any changes to applications. Stratus’ flexible solutions — software, platform and services — prevent downtime before it occurs and ensure uninterrupted performance of essential business operations.

To learn more, visit stratus.com

Find Out More About Stratus ftServer Systems For complete details on how Stratus ftServer systems prevent downtime and ensure continuous availability of business-critical virtualized applications, visit http://www.stratus.com/Products/Platforms/ftServerSystems

Stratus ftServer systems provide the highest possible availability for VMware environments for the lowest total investment of time and money.

Stratus ftServer offers organizations a fault-tolerant solution that prevents downtime and protects against data loss.