Customizable Diskless Solution for HPC...

1
Customizable Diskless Solution for HPC Clusters Emanuel Mariano Ravera, Facundo Ferrer Argentina Software Development Center (ASDC) Intel Argentina Abstract. Diskless clusters provide a stable, easily replicable and auto- deployable solution for High Performance Computing (HPC). On the other hand, the Intel® Cluster Ready program simplifies the selection, deployment, and management of HPC clusters by establishing a standard specification for them ensuring component and application interoperability. This work proposes to use a framework for on-the-fly creation of diskless solutions that allows users with little or no knowledge to create and deploy customized HPC clusters based on the Intel® Cluster Ready Specification. Introduction and Motivation The installation of HPC clusters requires trained people and lots of exploration effort. Installation and maintenance of the cluster have high costs including the need of specific expertise. Diskless clusters provides a stable, easily replicable and auto-deployable solution over a Linux architecture through a live CD system. The Intel® Cluster Ready architecture establishes a standard specification for HPC clusters. The Intel Cluster Checker ® tool, a key component of the Intel Cluster Ready program, is an automated tool that validates a cluster settings against the Intel Cluster Ready specification, and checks the general wellness of the cluster. The proposed customization framework enables the user to engineer a personalized diskless solution from scratch based on specific requirements and taking as foundation the Intel® Cluster Ready architecture. The framework provides the capability to create different software stacks containing the Linux operating system plus customized configurations. This allows the user for instance, to deploy a cluster using a well-known and pre-loaded software stack to run specific HPC applications. Conclusions The main result of this work is to provide users with an simple method to enter the HPC world. The developed framework gives the user a customized diskless solution that enables him to create and deploy a diskless clustering solution with little no need of expertise in HPC. Another advantage is that the deployed cluster also complies with the Intel® Cluster Ready architecture. The architecture guarantee that registered HPC applications in the program will be executed successfully. These customized diskless solutions generates a great opportunity to approach the HPC world of computing and their application to industry, increasing the expansion of the usage of HPC in emerging markets. This opens a good opportunity to use these customized diskless solutions to deploy on-demand HPC systems. This usage model allows the user to create their customized image and deploy a cluster that fit his requirements. Next Steps The research will continue moving to use the diskless solution generated by the customization framework as base of a compute on demand model that satisfy specific end user needs and uses rented hardware from third parties. Generator Selection The simplicity showed in the generation and customization process, plus the flexibility and good results obtained using the Intel® Cluster Checker tool, made us select PelicanHPC as foundation of the customization framework. PelicanHPC is a Debian-based diskless solution that uses the live-helper* framework created by the Debian* Live Project. Diskless Solution Generation Process The generation process involves components and actions to create a diskless solution from scratch. The figure illustrates the relation among these components and how the process works with the inputs and outputs. The framework simplifies the customization of a diskless clustering solution to meet user needs by setting parameters and adding new capabilities. Framework Implementation Diskless Solution Validation Deployment Time Reduction: Average deployment time for a 32 node cluster using the diskless solution was 15 minutes against 3 or more hours required by an expert. Easy to use: Previous knowledge to use the framework was not required. Always Intel® Cluster Ready: Non-expert users can develop a customized diskless solution that matches Intel® Cluster Ready specification. Infiniband Support: the OFED software stack is included and configured as part of the generated diskless solution. Easy to run applications: the customization framework allows HPC applications to be included in the solution when the cluster is deployed. Thanks -To Intel for giving us the opportunity of making this research as part of an Internship activity. -To the ACE (Argentina Cluster Engineering) Team that helped in the research, specially to Ricardo Medel, Dan Hirsch, Cesar Martinez and Andres More that unselfishly gave their time in the review of this work. 40JAIIO - HPC 2011 - ISSN: 1851-9326 - Página 157

Transcript of Customizable Diskless Solution for HPC...

Page 1: Customizable Diskless Solution for HPC Clusters40jaiio.sadio.org.ar/sites/default/files/T2011/HPC/Posters/861.pdf · solutionovera Linux architecture through a live CD system. The

Customizable Diskless Solution for HPC Clusters

Emanuel Mariano Ravera, Facundo Ferrer

Argentina Software Development Center (ASDC)Intel Argentina

Abstract. Diskless clusters provide a stable, easily replicable and auto-

deployable solution for High Performance Computing (HPC). On the other

hand, the Intel® Cluster Ready program simplifies the selection,

deployment, and management of HPC clusters by establishing a standard

specification for them ensuring component and application interoperability.

This work proposes to use a framework for on-the-fly creation of diskless

solutions that allows users with little or no knowledge to create and deploy

customized HPC clusters based on the Intel® Cluster Ready Specification.

Introduction and Motivation

The installation of HPC clusters requires trained people and lots ofexploration effort. Installation and maintenance of the cluster have highcosts including the need of specific expertise.Diskless clusters provides a stable, easily replicable and auto-deployablesolution over a Linux architecture through a live CD system.The Intel® Cluster Ready architecture establishes a standard specificationfor HPC clusters. The Intel Cluster Checker ® tool, a key component of theIntel Cluster Ready program, is an automated tool that validates a clustersettings against the Intel Cluster Ready specification, and checks thegeneral wellness of the cluster.

The proposed customization framework enables the user to engineer apersonalized diskless solution from scratch based on specific requirementsand taking as foundation the Intel® Cluster Ready architecture. Theframework provides the capability to create different software stackscontaining the Linux operating system plus customized configurations. Thisallows the user for instance, to deploy a cluster using a well-known andpre-loaded software stack to run specific HPC applications.

Conclusions

The main result of this work is to provide users with an simple method toenter the HPC world. The developed framework gives the user acustomized diskless solution that enables him to create and deploy adiskless clustering solution with little no need of expertise in HPC.Another advantage is that the deployed cluster also complies with theIntel® Cluster Ready architecture. The architecture guarantee thatregistered HPC applications in the program will be executed successfully.These customized diskless solutions generates a great opportunity toapproach the HPC world of computing and their application to industry,increasing the expansion of the usage of HPC in emerging markets.

This opens a good opportunity to use these customized diskless solutionsto deploy on-demand HPC systems. This usage model allows the user tocreate their customized image and deploy a cluster that fit hisrequirements.

Next Steps

The research will continue moving to use the diskless solution generatedby the customization framework as base of a compute on demand modelthat satisfy specific end user needs and uses rented hardware from thirdparties.

Generator Selection

The simplicity showed in the generation and customization process, plus theflexibility and good results obtained using the Intel® Cluster Checker tool,made us select PelicanHPC as foundation of the customization framework.PelicanHPC is a Debian-based diskless solution that uses the live-helper*framework created by the Debian* Live Project.

Diskless Solution Generation Process

The generation process involves components and actions to create adiskless solution from scratch. The figure illustrates the relation amongthese components and how the process works with the inputs and outputs.The framework simplifies the customization of a diskless clustering solution tomeet user needs by setting parameters and adding new capabilities.

Framework Implementation

Diskless Solution Validation

•Deployment Time Reduction: Average deployment time for a 32 nodecluster using the diskless solution was 15 minutes against 3 or morehours required by an expert.•Easy to use: Previous knowledge to use the framework was not required.•Always Intel® Cluster Ready: Non-expert users can develop acustomized diskless solution that matches Intel® Cluster Readyspecification.•Infiniband Support: the OFED software stack is included and configuredas part of the generated diskless solution.•Easy to run applications: the customization framework allows HPCapplications to be included in the solution when the cluster is deployed.

Thanks-To Intel for giving us the opportunity of making this research as part of an Internship activity.-To the ACE (Argentina Cluster Engineering) Team that helped in the research, specially to Ricardo Medel, Dan Hirsch, Cesar Martinez and Andres More thatunselfishly gave their time in the review of this work.

40JAIIO - HPC 2011 - ISSN: 1851-9326 - Página 157