File Data Virtualization with UniFS – A True Global File ... · File Data Virtualization with...

8
File Data Virtualization with UniFS ® – A True Global File System White Paper: File Data Virtualization with UniFS ® – A True Global File System

Transcript of File Data Virtualization with UniFS – A True Global File ... · File Data Virtualization with...

Page 1: File Data Virtualization with UniFS – A True Global File ... · File Data Virtualization with UniFS® – A True Global File System White Paper: File Data Virtualization with UniFS®

File Data Virtualization with UniFS® – A True Global File System

White Paper: File Data Virtualization with UniFS® – A True Global File System

Page 2: File Data Virtualization with UniFS – A True Global File ... · File Data Virtualization with UniFS® – A True Global File System White Paper: File Data Virtualization with UniFS®

[email protected]

White Paper: File Data Virtualization with UniFS® – A True Global File System

United States: 1.800.208.3418International: 1.508.433.62002

Introduction The traditional means to protect file data by making copies, combined with the need to provide access to shared data from multiple locations, has created an unwanted explosion in copies in an infrastructure already strained by a decade of relentless data growth. Organizations struggling under the combined pressure of file data growth and an expanding global footprint require a new generation of global file systems that enable IT to consolidate all copies of data required for protection and access into a single master copy.

File Data Virtualization completely separates data from any hardware dependencies and in doing so elevates the IT conversation from the nuts and bolts of storage (provisioning, backup, DR, etc.) to a strategic discussion about data management: Who needs access to this data? Where do they need it? What level of performance is required? File Data Virtualization is a storage architecture that combines ubiquitous cloud infrastructure with traditionally data center technology. Files are no longer tied to a piece of hardware or location. Data is stored globally in the cloud but available at the local levels of performance dictated by the hardware resources available at each site.

The cloud infrastructure providers are adept at dispersed geographic data replication across multiple availability zones which not only protects the data but has the significant benefit of increasing the fluidity of data and the speed at which data can be access from anywhere in the world. Data can move around and be where it needs to be, without the IT being involved in arduous and costly replication processes for replicating physical copies and without conflict since the system ensures that there is always only one source of the data.

While system virtualization enables compute needs and not server hardware to be at the center of application deployment, File Data Virtualization enables data and not hardware to be at the center of storage. The hardware becomes an end-point that can be adjusted to deliver the required level of performance. Central management moves to a global core service that can monitor and manage every component regardless of its location. File Data Virtualization maintains security with locally controlled encryption so that organizations can establish a secure data perimeter, every other infrastructure function that needs stability at scale shifts to the core cloud infrastructure.

To think global infrastructure, one must think beyond the confines of any one physical appliance or data center. Only then can organizations harness the power of File Data Virtualization: one copy of data protected in many ways, accessible everywhere.

File Data Virtualization Benefits:

• Simplified Central Management

• Unlimited and Elastic File System Scalability

• Unlimited File Versioning (replaces backup and archive)

• Single File Audit Trail Stream

• Global and Mobile File Access

• Right-Size Hardware Performance in Every Location

• Customer Controlled Security Model

• Dramatic Cost Reductions

This new generation of global file systems provides the abstraction layer that is necessary to separate file data from any hardware or location dependencies.

Page 3: File Data Virtualization with UniFS – A True Global File ... · File Data Virtualization with UniFS® – A True Global File System White Paper: File Data Virtualization with UniFS®

[email protected]

White Paper: File Data Virtualization with UniFS® – A True Global File System

United States: 1.800.208.3418International: 1.508.433.62003

Global File SystemInfinite Scalability and Global Access with a Global File SystemA global file system provides a hypervisor-like layer that separates the files from the file storage resources, managing one master copy of the data in a public or private cloud while distributing data access to where it’s needed. The global file system manages all meta-data required such as versioning, access control, audit records, and locking and provides access to the files via the necessary protocols such as CIFS/SMB and NFS.

A global file system is different from a distributed file system or a global name space. In a distributed file system or global name space model, files are tied to a specific piece of hardware, and the distributed file system or global name space finds the file and directs access to it. Sometimes replication is added on top of this to make some access closer to the user. With a global file system, the data is unbound from the hardware, stored in a public or private cloud, and cached access is provided locally to the user with a caching algorithm that keeps the data in use immediately and locally accessible. All of the data, including that in use in multiple local caches,

is stored as a single master copy in the public or private cloud, accessible globally from any access point. In this manner the global file system provides the infinite scalability of the cloud while maintaining the local performance of a traditional NAS device.

A global file system is also different from a simple cloud storage gateway or cloud-integrated storage. In the gateway model the cloud simply acts as a tier of storage tied to the single gateway device. A global file system can manage any number of caching devices at the edge and provide the data through any of them, including multiple devices simultaneously. A global file system also provides global locking to prevent any chance of contention. In the event a file is left open due to someone leaving a file open or a local system hang a global file system allows IT to break a lock. Even in the case of a manually broken lock, any file contention is handled by the file system with conflict resolution procedures and no data is lost.

The global file system allows for infinite scalability of file data storage, global access and mobility, as well as shared use of data between remote offices that feels to the user as if all the data is local.

Figure 1: File Data Virtualization: Infinite Scalability, Global Access, Local Performance

Page 4: File Data Virtualization with UniFS – A True Global File ... · File Data Virtualization with UniFS® – A True Global File System White Paper: File Data Virtualization with UniFS®

[email protected]

White Paper: File Data Virtualization with UniFS® – A True Global File System

United States: 1.800.208.3418International: 1.508.433.62004

Right Sized Performance with Caching At The EdgeWhile the master copy is stored in the cloud with a global file system, it is accessed at the edge in the local office through a caching system known as a filer. The filer appears to the user the same way any local NAS device or file server would. Files are accessed via CIFS/SMB, NFS or even FTP. Filers can also provide block storage via iSCSI. To a windows user this is another ‘letter’ drive, to a Mac user it is another server in the finder, and to a mobile device user, simple apps can be used to browse the files and access them. Desktop synchronization can also be provided.

Since the device appears as a standard file system, applications using the files function normally as they always have. The filer can be scaled to the workload in the office proving both the processing power and cache storage space required for the file data whether users access it from an application or directly as files. Filers can be provided as high availability appliances as well to ensure continuous access to the data.

With caching provided locally to the user, scaled to the performance requirement, user service levels and speed of access are maintained.

Infinite Versioning with File Data and Snapshots in the CloudAnother function provided by a global file system is versioning. As users change file data in the filer, snapshots can be taken as frequently as once a minute with changes sent to the cloud. With the data stored in the cloud, and snapshots sent there as well, infinite versions can be kept forever (unless requirements call for purging data after a certain time).

By using the cloud’s inherent reliability and stability combined with infinite versioning, separate, costly backup and archive systems become unnecessary saving massive amounts of time, money and frustration.

Central Management - Software Defined Today, the term ‘software defined’ is being applied to everything. The reason for this is the power of using software based policy and configuration to control things that use to require reconfiguring hardware. It is yet another benefit from virtualization. Flexibility is increased allowing efficient use of resources, and management can be central so all resources are coordinated in an efficient and secure manner.

This is also true of a file data virtualization using a global file system. Policies can be defined and management can take place through a single management system for a global file system populated with terabytes, even petabytes of data consisting of millions, even billions of discreet files. Through one console, accessibility, access control, protocol access, locking, quotas, audit policy, etc. can all be defined, and the state of the system in any location can be monitored. New storage can be deployed easily.

This allows IT to efficiently manage and control the system and provide user access as needed with a much lower staff burden.

Security with a Global File SystemThe global file system also increases the security of data, even while using public cloud as the means of storage. The global file system can encrypt all data stored in the cloud. Using the filers, keys are kept locally and owned by the filer administrator. No data in the cloud is ever in the clear, either as it is sent or retrieved (in motion) or while it is stored (at rest). In addition, no keys are kept with the encrypted data.

For access control the global file system can be integrated with and joined to Active Directory. This allows the existing enterprise permission system to be applied to all data, and for any data stored in the cloud to be completely secured from hackers, the cloud provider, and anyone else trying to access the data without authorization.

Page 5: File Data Virtualization with UniFS – A True Global File ... · File Data Virtualization with UniFS® – A True Global File System White Paper: File Data Virtualization with UniFS®

[email protected]

White Paper: File Data Virtualization with UniFS® – A True Global File System

United States: 1.800.208.3418International: 1.508.433.62005

The Only True Global File System - UniFSUniFS is Nasuni’s patented global file system that bridges NAS and SAN access to the scalable object storage of the cloud while providing global access to the data. It is the only file system of its kind. UniFS can operate at unlimited scale without any degradation of performance. This patented capability allows Nasuni to support unlimited scalability with unlimited snapshots and eliminate separate backup.

One Master Copy In the Cloud – Accessible EverywhereThe Nasuni Service’s UniFS keeps the master copy of the data in the cloud. This means that data storage is infinitely scalable and protected in the cloud. Availability and scalability of access are maintained since access to the data is not dependent on any given device.

UniFS is organized into Volumes that contain the structure of directories and files, as well as meta-data such as access control lists, lock state, etc. When a Volume is instantiated on a Filer, its structure is built in the cloud. As files and directories are created, data is chunked, deduplicated, compressed, and encrypted, and stored as objects in the cloud service.

UniFS keeps the mapping between the volume structure of files and directories, associated meta-data, and objects in the cloud. If given access any other Filer in the network can map to that same Volume and read and write the files and directories using the same process. In this way, access to the master data in the cloud is local to any user anywhere via The Nasuni Filer.

UniFS operates using WORM (write-once, read-many) principles. Once an object is written in the cloud, it is permanent. It makes UniFS stable and resilient. As data is changed on Filers, the changes are sent to the cloud as snapshots, allowing the current or any past version of the data to be retrieved instantly.

Infinite Versions, Instant RestoreUsing the scalability of cloud storage, snapshots can be taken as frequently as once a minute and kept forever providing built-in backup and archive at no additional cost and with no additional hardware or software – a complete version history of the system is always available at any time. UniFS allows data restoration to any point in time at the file, folder, or volume level instantly by restoring metadata first.

VSS Integration

End-users can navigate previous snapshots and perform self-service file restore (based on access control permissions)

Local and Global LockingUniFS provides file locking at the file system level as applications use it today on any CIFS/SMB or NFS share. When an application opens a file with a lock, it will appear to the application exactly as it does on any other NAS device or Windows File Server. Therefore the application will behave the same way it always does, and there is no change to the user’s experience whether collaborating with someone in the next office or across the globe. This also ensures that file locking with UniFS is compatible with every application and requires no special integration or management.

When users who write data are distributed at multiple sites, UniFS Global Locking Service allows them to collaborate using the data wherever they are without risk of losing or corrupting any data while preserving all data changes. The Global Locking Service runs in the cloud so locking is always available and scalable to the global enterprise without dependency on any device to maintain lock states.

Active Directory IntegrationAccess to can be controlled using Microsoft’s Active Directory, the standard for access control in the enterprise.

Page 6: File Data Virtualization with UniFS – A True Global File ... · File Data Virtualization with UniFS® – A True Global File System White Paper: File Data Virtualization with UniFS®

[email protected]

White Paper: File Data Virtualization with UniFS® – A True Global File System

United States: 1.800.208.3418International: 1.508.433.62006

Local Performance with Nasuni FilersWith caching of data managed by UniFS, The Nasuni Filer looks and feels just like a traditional storage controller – ensuring that end-users and applications have the exact same experience on a Nasuni Filer as they would on an array from the traditional vendors. Specifically, The Nasuni Filer can be used to handle both NAS and SAN workloads. The on-premise Nasuni Filer delivers local storage performance with a minimal footprint. Even though customers store hundreds of terabytes with Nasuni, UniFS only caches the most important and most frequently accessed files. Using sophisticated and proprietary caching algorithms, users receive local performance with only a small percentage of the dataset in their local storage. This means less hardware and lower costs.

Security with UniFSAll data stored with UniFS is encrypted both in flight and at rest in the cloud using customer-controlled keys. This simple design principle means that neither Nasuni nor a cloud storage vendor ever has access to customer data. Furthermore, it transforms cloud storage from an unprotected pool of shared storage into a secure repository that even the most hardened security teams can approve.

Using established security protocols, The Nasuni Filer encrypts all data before transmission using military-grade AES-256 encryption. Nasuni further employs the open-source and highly efficient

OpenPGP standard for key management – a standard that supports an extensive and thorough review process in the public domain. Using this publicly available and time-tested standard ensures customer data will always be available.

Bandwidth Efficiency with UniFSIntegrated WAN optimization enables Nasuni Filers to be deployed all over the world – even in offices and locations with limited and inconsistent connectivity. A perpetually cloud-connected storage system could saturate your network – especially if the local storage controllers frequently need to talk to one another. Not so with UniFS.

Local Site

SharedSite

NFSVMs

iSCSILUN

Figure 2: The Nasuni Filer provides local access

Figure 3: UniFS secure storage in the cloud

Original file

In cache Optimizes for transportMasks actual size

Removes duplicate chunksReducesbandwidthneeds

ReducesbandwidthneedsMasks sizeand contents

Uses customer’skeysMasks sizeand contents

Masks contents

SecureRedundant

Chunking Deduplication Compression Encryption Renaming Saved to Cloud

Original file

In cache Masks contents

Chunking Deduplication Compression Encryption Renaming Saved to Cloud

Optimizes for transport

Uses customer’skeysMasks sizeand contents

SecureRedundant

Removes duplicate chunksReducesbandwidthneeds

Reducesbandwidthneeds

Original file

In cache Masks contents

Chunking Deduplication Compression Encryption Renaming Saved to Cloud

Optimizes for transport

Uses customer’skeysMasks sizeand contents

SecureRedundant

Removes duplicate chunksReducesbandwidthneeds

Reducesbandwidthneeds

Page 7: File Data Virtualization with UniFS – A True Global File ... · File Data Virtualization with UniFS® – A True Global File System White Paper: File Data Virtualization with UniFS®

[email protected]

White Paper: File Data Virtualization with UniFS® – A True Global File System

United States: 1.800.208.3418International: 1.508.433.62007

UniFS running on Nasuni Filers communicates directly with the cloud – never by connecting directly to other Filers, leveraging the unlimited bandwidth of the cloud and not the limited connectivity of your offices. In addition, before any bits are transmitted, data is chunked, compressed and deduplicated against the global file system so that nothing is sent twice.

Global deduplication means UniFS is comparing file changes not just locally, but to every file stored in the cloud – dramatically reducing the number of bits that need to be sent. Sub-file chunking allows UniFS to transmit only the deduplicated parts of a file that have changed – not the whole new version. Compression of every chunk before transmission removes extraneous data, reducing bandwidth needs.

ConclusionLegacy file data storage cannot keep up with today’s requirements. The rise and growth of the global enterprise, requiring worldwide shared, mobile access to file data with data center performance, availability, protection, disaster recovery and archive has placed demands on IT that cannot be met with traditional file storage solutions - even if there was enough money procure them and time to manage them. There is a current and pressing need for a better way to store file data.

File Data Virtualization allows IT organizations to manage a single, fully protected, fully versioned instance of every data asset. When high performance access is needed, the data is instantiated in a local high performance system. When access is needed across multiple locations, the copies of the data appear at all those locations and a global lock prevents conflicts in order to maintain one clean version of the data. File Data Virtualization establishes complete control over each data asset and, once and for all, liberates IT from having to make, track and manage all the copies. By taking advantage of the scalability and economics of cloud storage plus the File Data Virtualization capabilities of the Global File System UniFS – while adding in local filers, central management and an operations center, The Nasuni Service is unique in the market providing Enterprise Storage as-a-Service, that meets this challenge.

The Nasuni Service can provide the infinitely scalable, globally accessible, locally performing, always available file data storage enterprises require at a fraction of the cost and management burden of traditional approaches.

Figure 4: File Data Virtualization – The Power of One

Global File System

Central Management

Access Control Protection

Scalability

Backup

Off-siteReplication/DR

VersionControl

FileAuditing

Archiving

Page 8: File Data Virtualization with UniFS – A True Global File ... · File Data Virtualization with UniFS® – A True Global File System White Paper: File Data Virtualization with UniFS®

About Nasuni

Nasuni offers a powerful Storage Infrastructure as a Service: a uniform, complete storage solution delivered as a service, enabling distributed organizations to offload the management of the storage infrastructure (including backup, replication and offsite protection) and consume storage on-demand, without any change to the way users access information. With Nasuni, corporate data is uniformly available in all locations, enabling organizations to increase user productivity, streamline costs and improve business agility.

Our team is made up of enterprise storage, security and networking industry veterans with a shared vision of transforming the way enterprise organizations view data storage. We believe that storage should be as easy to purchase, consume and manage as the electricity that keeps the lights on.

Nasuni Corporation © 2014, All Rights Reserved

313 Speen Street, Natick, MA 01760-1538

WP_2.14 1.508.433.6200 www.nasuni.com

About Nasuni

Nasuni is an enterprise storage company that provides globally-distributed organizations with a simple, unified storage solution. By combining on-premises hardware with cloud storage, Nasuni delivers a secure, all-in-one data storage solution that provides local performance for users, simplified and centralized management for IT, and an easily scalable, complete storage service for the global enterprise. With Nasuni, data is uniformly available in all locations, enabling organizations to increase user productivity, streamline costs and improve business agility.