Intel® enterprise edition for lustre software product brief and data sheet




Intel® Enterprise Edition for Lustre* Software

A New Generation of Lustre Software Expands High Performance Computing into the Commercial Enterprise

Intel® Enterprise Edition for Lustre* software (Intel® EE for Lustre* software) unleashes the Lustre parallel file system as an enterprise platform for a broad spectrum of commercial organizations, both large and small. It allows businesses that need large-scale, high-bandwidth storage to tap into the power and scalability of Lustre, but with the simplified installation, configuration and monitoring features of Intel® Manager for Lustre* software, a management solution purpose-built for the Lustre file system. Intel EE for Lustre software includes proven support from the Lustre experts at Intel, including worldwide 24x7 technical support.

High Performance Parallel Storage for the Enterprise

Intel EE for Lustre software brings the benefits of Lustre and high performance computing (HPC) to data-intensive businesses. For decision makers seeking to achieve their business goals more quickly and cost effectively, Intel EE for Lustre software offers a parallel file system that enables more powerful computing for improved results and data-driven business and IT intelligence. Intel EE for Lustre software provides open interfaces that promote easy integration and high levels of interoperability with existing infrastructures. Because Lustre is open source software, IT can grow data center storage systems over time and on budget, employing a variety of networking and hardware options.

KEY FEATURES

• Built on open-source Lustre* software

• Centralized, GUI-based administration for management simplicity

• Enormous storage capacity

• Real-time system monitoring for maximum availability

• Open, documented interfaces for deep integration

• Rigorously tested, stable software proven across diverse industries

• I/O throughput in excess of 1 terabyte per second

• Global 24X7 technical support

[Figure: components of Intel® Enterprise Edition for Lustre* Software. Diagram labels: Intel® Manager for Lustre* Software (configure, monitor, troubleshoot, manage); REST API (extensibility); CLI; management and monitoring services; storage plug-in (integration); Hadoop adapter (Lustre storage for MapReduce applications); Lustre file system (full distribution of open source Lustre software). Legend: Intel value-added software; open source software.]

PRODUCT BRIEF
Intel® Enterprise Edition for Lustre* Software
High Performance Data Division


[Figure 1 diagram labels: Lustre clients (1-100,000); Lustre router; MDS 1 (active) and MDS 2 (standby), failover supported; metadata targets (MDT); object storage servers (OSS) (1-1,000s); OSS 1 and OSS 2 with shared storage enabling OSS failover, failover supported; networks: GigE, InfiniBand or GbE]

What is Lustre?

The Lustre file system was purpose-built to provide sustained performance and stability at large scale for storage in HPC clusters. Lustre is POSIX-compliant, and capable of handling extremely large amounts of data and huge numbers of files shared concurrently across clustered servers. It offers wide scalability, both in performance and storage capacity. According to top500.org, Lustre is the most widely used file system in supercomputing, powering over 60 percent of the TOP100 supercomputing sites worldwide¹. Lustre has traditionally been employed in HPC environments that generate and process massive amounts of data for research, governmental, and industrial purposes, and is one of the best breakthrough technologies for addressing the exascale and emerging “big data” challenges of tomorrow.

However, HPC is no longer the exclusive province of such data-rich projects as genomics, climate change simulations, and mechanical modeling. Increasingly, organizations and businesses of all kinds generate high-volume data flows, and rely on scalable storage to deliver very high I/O rates and massive capacity to super-scale applications.

With the release of Intel Enterprise Edition for Lustre software, Intel provides enterprise customers with a commercial-grade version of Lustre optimized to address key storage and data throughput challenges of HPC-class computing in business.

Intel Enterprise Edition for Lustre software provides a parallel file system that delivers massive scalability in performance and storage capacity for commercial HPC.

INTEL AND OPEN SOURCE LUSTRE

Lustre has been developed in an open, collaborative community and is freely available in open-source software under the GPLv2 license. Intel is committed to maintaining Lustre as an open system while providing expert support and tools to enhance management. Intel takes pride in being a long-standing member of the open-source community, and believes in open source development as a means to create rich business opportunities, advance promising technologies, and bring together top talent from diverse fields to solve computing challenges. Intel’s contributions to the open source community include reliable hardware architectures, professional development tools, work on essential open-source components, collaboration and co-engineering with leading companies, investment in academic research and commercial businesses, and helping to build a thriving ecosystem around open source.

Figure 1. Typical Lustre configuration


Performance

Intel EE for Lustre software is designed to enable fully parallel I/O throughput across thousands of clients, servers and storage devices. Metadata and data are stored on separate servers to allow optimization of each system for the different workloads they present. Intel EE for Lustre can also scale down efficiently to provide fast parallel storage for smaller organizations.

• Many Lustre configurations are running in production at 500 to 750 gigabytes per second, with leading edge installations achieving throughput in excess of 1 terabyte per second

• Delivers data faster to critical applications, leading to improved decision-making based on near real-time analysis

• Massive data flows can utilize a high percentage of underlying storage and network bandwidth, for low performance overhead

• Provides an innovative edge by allowing an enterprise to run larger and more complex simulations faster and easier
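
As a rough worked example of what the throughput figures above mean in practice, the sketch below computes how long it would take to stream a data set at a given sustained rate. It uses decimal units (1 TB = 10^12 bytes); the 1 PB data set size and the helper name `transfer_seconds` are illustrative assumptions, not figures or names from the product.

```python
GB = 10**9    # decimal gigabyte, in bytes
TB = 10**12   # decimal terabyte, in bytes
PB = 10**15   # decimal petabyte, in bytes


def transfer_seconds(data_bytes: int, throughput_bytes_per_s: int) -> float:
    """Time to move data_bytes at a constant sustained throughput."""
    if throughput_bytes_per_s <= 0:
        raise ValueError("throughput must be positive")
    return data_bytes / throughput_bytes_per_s


# Hypothetical 1 PB data set at the throughput levels quoted in the text.
for label, rate in [("500 GB/s", 500 * GB), ("750 GB/s", 750 * GB), ("1 TB/s", TB)]:
    print(f"1 PB at {label}: {transfer_seconds(PB, rate):.0f} s")
```

Under these assumptions, a petabyte streams in roughly 2,000 seconds at 500 GB/s and 1,000 seconds at 1 TB/s. Real workloads rarely sustain peak rates, so treat the result as a lower bound.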

Capacity

The object-based storage architecture of Intel EE for Lustre software can scale to tens of thousands of clients and petabytes of data.

• At 512 petabytes of storage for the current version of Lustre, storage capacity is enormous

• Lustre was developed to handle the demands of scientific data, and has been tested and trusted at extreme levels of throughput, with no upper limits to its storage scale

Affordability

Intel EE for Lustre software is based on open source Lustre software, and is hardware, server, and network fabric neutral. Enterprises can scale their storage deployments horizontally, yet continue to have simple-to-manage storage.

• Intel EE for Lustre promotes multi-vendor solutions, so administrators can customize their storage infrastructure to accommodate their unique storage requirements and budgets

• Intel Manager for Lustre software adds tools and GUI for fast, efficient management; Intel EE for Lustre software does not require specialized training or expertise to operate

Maturity

Lustre has been in use in the world’s largest datacenters for over a decade and hardened in the harshest big data environments; recent versions are highly trusted, reliable and well architected. Today’s Lustre has been rigorously tested and proven for diverse markets and real-world use cases.

• Intel EE for Lustre delivers commercial-ready Lustre in a package that can scale efficiently both up and down to suit your business workloads, with built-in manageability.

• Intel EE for Lustre software has a clear product roadmap, with predictable releases

• Intel EE for Lustre software brings together the best and brightest of Lustre expertise coupled with the resources and credibility of Intel

Intel® Manager for Lustre*

Intel Manager for Lustre provides a unified, consistent view of Lustre storage systems and simplifies the installation, configuration, monitoring and overall management of Lustre. The manager consolidates all Lustre information in a central, browser-accessible location for ease of management and reduced complexity.

Intel® Manager for Lustre* software consolidates Lustre information in a central, browser-accessible location for ease of management.


Key Features

Intuitive, browser-based administration

• Simple but powerful graphical and scriptable command line interfaces

• Point-and-click simplicity for cluster configuration and management

• Centralized definition and management of common administrative tasks

Real-time system monitoring

• Monitor storage health and key performance indicators (KPIs) in real time

• View high level system performance or individual components

• Generate historical and real-time charts and reports

Advanced troubleshooting tools

• Consolidated view of cluster-wide storage log files

• Intelligent log-scanning for efficient problem isolation and analysis

• Configurable event notifications

Open, documented APIs

• REST-compliant APIs for easy integration with other storage systems and software management tools

• Storage plug-in architecture provides easy extensibility

Apache Hadoop* Adapter

When organizations operate both Lustre and Apache Hadoop* within a shared infrastructure, there is a strong case for using Lustre as the file system for Hadoop analytics as well as HPC storage.

Intel EE for Lustre software includes an Intel-developed adapter which allows users to run Map/Reduce* applications directly on Lustre. This optimizes the performance of Map/Reduce operations while delivering faster, more scalable and easier to manage storage.

Intel Manager for Lustre software interfaces allow insights into high-level system performance or in-depth focus into individual components.

• Hadoop users can access any Lustre files directly from Hadoop, without the need to copy them to the Hadoop environment

• Using Lustre in combination with Hadoop makes storage management simpler, since the platform runs a single Lustre file system instance rather than a separate Hadoop instance for each cluster, and makes more productive use of storage assets.

For more information on Intel® Enterprise Edition for Lustre* Software, visit the Intel Lustre Portal at Lustre.intel.com

¹ www.top500.org.

INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL’S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT.

A “Mission Critical Application” is any application in which failure of the Intel Product could result, directly or indirectly, in personal injury or death. SHOULD YOU PURCHASE OR USE INTEL’S PRODUCTS FOR ANY SUCH MISSION CRITICAL APPLICATION, YOU SHALL INDEMNIFY AND HOLD INTEL AND ITS SUBSIDIARIES, SUBCONTRACTORS AND AFFILIATES, AND THE DIRECTORS, OFFICERS, AND EMPLOYEES OF EACH, HARMLESS AGAINST ALL CLAIMS COSTS, DAMAGES, AND EXPENSES AND REASONABLE ATTORNEYS’ FEES ARISING OUT OF, DIRECTLY OR INDIRECTLY, ANY CLAIM OF PRODUCT LIABILITY, PERSONAL INJURY, OR DEATH ARISING IN ANY WAY OUT OF SUCH MISSION CRITICAL APPLICATION, WHETHER OR NOT INTEL OR ITS SUBCONTRACTOR WAS NEGLIGENT IN THE DESIGN, MANUFACTURE, OR WARNING OF THE INTEL PRODUCT OR ANY OF ITS PARTS.

Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked “reserved” or “undefined”. Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The information here is subject to change without notice. Do not finalize a design with this information.

The products described in this document may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order. Copies of documents which have an order number and are referenced in this document, or other Intel literature, may be obtained by calling 1-800-548-4725, or go to: http://www.intel.com/design/literature.htm

FTC Optimization Notice

Intel’s compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel.

Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice.

Notice revision #20110804

General Performance Disclaimer: For more complete information about performance and benchmark results, visit Performance Test Disclosure http://www.intel.com/benchmarks

Copyright © 2013 Intel Corporation. All rights reserved. Intel, the Intel logo, and Xeon are trademarks of Intel Corporation in the U.S. and other countries.

*Other names and brands may be claimed as the property of others.

Printed in USA 0813/CM/MB/PDF Please Recycle 329078-002US


A New Generation of Lustre* Software Expands HPC Into the Commercial Enterprise

Not so long ago, storage for high performance computing (HPC) meant complexity and massive data sets, and was the concern of only a small group of computer users. Super-scale computing was the province of government-sponsored research in national labs, data-intensive simulations for weather forecasting and climate modeling, or certain information-intense industries, such as defense, aeronautics, and oil and gas.

Today, HPC is undergoing democratization: Extracting knowledge and information from ever-expanding flows of data is now seen to be a key source of competitive advantage for modern businesses of any size.

Enterprises of all kinds now generate huge volumes of data. They rely on high-performance data processing applications to analyze and derive value from their data flows. They require a storage infrastructure that can scale endlessly and deliver large volume I/O for high-throughput data processing.

Lustre powers over 60% of the top 100 supercomputers worldwide¹

However, roughly half of enterprise storage systems today are based on the Network File System (NFS), a type of distributed file system that consolidates data resources onto centralized networked servers, and allows remote clients to mount data files over the network and interact with these files as though they are available locally. While effective in smaller storage environments, NFS can quickly become a major bottleneck in high volume systems because it does not scale well, and requires increasingly costly management overhead, even as its performance diminishes.

WHITE PAPER
Intel® Enterprise Edition for Lustre* Software
High Performance Data Division

Designed specifically for high performance computing, the open source Lustre* parallel file system is one of the most popular, powerful and scalable data storage systems currently available, and is in widespread use today in supercomputing scenarios where high performance and enormous storage capacity are required. However, from its early days of development in research labs and academia, Lustre has also gained the reputation of being difficult to manage. Though high performance, scalable storage could be useful for many business applications, these early versions of Lustre were considered too difficult to manage for enterprise use.

With the release of Intel® Enterprise Edition for Lustre* software (Intel® EE for Lustre* software), Lustre has moved beyond the lab and into the enterprise. Intel provides business customers with a commercial-grade version of Lustre optimized to address key storage and data throughput challenges of HPC-class computing in business. Intel EE for Lustre contains an open source distribution of Lustre, tested and validated by Intel, with the latest features and hardened for deployment in production. Intel® Manager for Lustre* software, a set of integrated management tools, simplifies installation, deployment, monitoring and management of Lustre. Intel EE for Lustre software delivers the Lustre parallel storage file system, backed with the full support and expertise of Intel, with the stability, efficiency, and reliability required by today’s enterprises.

The limitations of NFS-based storage and I/O

HPC clusters are designed to provide extreme computational power to large-scale applications. This computational power often results in the creation of huge numbers of files and/or extremely large individual files that stress the capabilities of conventional file systems beyond their limits.

But while the speed of processors and memory has risen sharply in recent years, the performance of I/O systems has lagged behind. Even an HPC infrastructure can operate only as fast as its slowest component, usually its storage file system. Disk drives still essentially spin at the same speeds as 20 years ago, and poor I/O performance can severely degrade the overall throughput of even the fastest clusters.

The Network File System, developed by Sun Microsystems in the 1980s, is the de facto distributed file system for Linux* based computing, and Linux is the predominant operating system used in HPC and large enterprise computing environments.

NFS is very stable, and for general-purpose enterprise and business computing, it is adequate for a wide variety of tasks. However, as commercial technical computing workloads scale upward into the HPC realm and storage system capacity tops 100 terabytes, many users discover that traditional data management solutions based on distributed file systems like NFS come up short because of their inherent scalability constraints.

NFS and other distributed file systems work by designating a single node to function as the I/O server for the storage cluster. All I/O, both reads and writes, goes through that single node. While this system is relatively simple to manage in a single cluster deployment, pushing all of an enterprise’s I/O through one server node quickly presents a bottleneck for data-intensive workloads. With a single gateway between an application and its target storage system, storage performance remains restricted by the processing capacity of the node.

This single server approach can also mean a single point of failure, as a failed I/O server leaves data unavailable to applications.

Scaling up performance and capacity in NFS environments begins as an additive process: just add more server clusters to the network, each with its own I/O server node. This linear expansion of clusters can work well enough until management overhead and costs become onerous (each of the disparate NFS server clusters must be managed individually) and data bottlenecking compounds across the network.

INTEL AND OPEN SOURCE PARALLEL STORAGE

With Intel® EE for Lustre* software, Intel builds on its leadership in high performance server platforms, software tools for creating parallel applications, and datacenter optimization to extend its innovation to parallel storage. Intel, with the backing and support of OpenSFS and EOFS, leads the open, collaborative development of the Lustre file system; in this role Intel manages the source code repositories on behalf of the community, ensuring coordinated feature development, rigorous testing and predictable releases. Intel EE for Lustre software plays an essential role in the growing, Intel-supported open-source ecosystem that presents a multi-vendor approach to such leading-edge technical challenges as HPC, big data, and the cloud.

Introduction to Lustre

The Lustre global parallel file system was first conceived in 1999 at Carnegie Mellon University in response to growing awareness of the limits of NFS, and has become the predominant file system in HPC environments, or any supercomputing environment where high I/O bandwidth and scale are required. The Lustre file system is unmatched in speed, scalability, and availability and can support tens of thousands of client systems, tens of petabytes of storage, and more than a terabyte per second of aggregate I/O throughput. Lustre software powers over 60 percent of the top 100 supercomputers and is the most widely used file system for TOP500 supercomputing.

Lustre is an object-based file system that splits file metadata (such as the file system namespace, file ownership and access permission) from the actual file data and stores them on different servers. File metadata is stored on a Metadata Server (MDS), and the file data is split into multiple objects and stored on Object Storage Targets (OST).

[Figure 1 diagram labels: application workloads; NFS network; NFS server storage]

Figure 1. When scaling up an NFS-based environment, each of the disparate NFS server clusters must be managed individually, adding to data bottlenecks as well as management overhead and costs.

1 TB = 1,000,000,000,000 bytes = 10^12 bytes = 1,000 gigabytes

Leading-edge Lustre configurations can deliver data throughput in excess of 1 terabyte (1,000,000,000,000 bytes) per second.

When a client opens a file, Lustre contacts the MDS to look up the file in the filesystem namespace, verify access permissions and return the file layout. This layout tells the client how the file’s data is distributed over OSTs. From then on, all I/O is transacted directly between the client and the OSTs without having to interact with the MDS again. Because metadata and object data are stored on separate servers, each system can be optimized for the different workloads they present.
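
The role of the file layout can be illustrated with a short sketch. It assumes a simple round-robin (RAID-0 style) striping scheme with a fixed stripe size; the names `Layout`, `stripe_size`, `ost_ids` and `locate` are hypothetical, chosen for illustration, and are not part of any Lustre API.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Layout:
    """Hypothetical file layout, as a metadata server might return it."""
    stripe_size: int      # bytes written to one OST before moving to the next
    ost_ids: List[int]    # OSTs holding this file's objects, in stripe order


def locate(layout: Layout, offset: int) -> Tuple[int, int]:
    """Map a file offset to (OST id, offset within that OST's object)."""
    if offset < 0:
        raise ValueError("offset must be non-negative")
    stripe_index = offset // layout.stripe_size         # which stripe overall
    ost_slot = stripe_index % len(layout.ost_ids)       # round-robin over OSTs
    full_rounds = stripe_index // len(layout.ost_ids)   # completed passes over all OSTs
    object_offset = full_rounds * layout.stripe_size + offset % layout.stripe_size
    return layout.ost_ids[ost_slot], object_offset


# One metadata lookup yields the layout; afterwards the client works out
# the target OST for every read or write without asking the MDS again.
layout = Layout(stripe_size=1 << 20, ost_ids=[3, 7, 11, 12])
print(locate(layout, 0))               # first byte of the file
print(locate(layout, 5 * (1 << 20)))   # sixth stripe wraps around to the second OST
```

Because the mapping is pure arithmetic on the layout, many clients can resolve offsets concurrently and send their requests to different OSTs in parallel.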

This is the primary advantage that Lustre has over a file system such as NFS, where all I/O has to go through a single NFS server or head node. Lustre allows multiple clients to access multiple OSS nodes at the same time independent of one another, thereby allowing the aggregate throughput of the file system to scale with the simple addition of more hardware. Performance is essentially limited only by the amount and characteristics of the storage hardware available. Lustre does not slow down despite data growth and grows in storage capacity as computing needs grow. Redundant servers and failover-enabled storage result in a file system that is highly available, with no single point of failure.
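
The scaling argument can be made concrete with a toy model. This is a deliberately simplified sketch: it ignores network limits, striping patterns and metadata traffic, and every number in it is a hypothetical value in GB/s rather than a measured result.

```python
from typing import Sequence


def single_server_throughput(client_demand: Sequence[float], server_limit: float) -> float:
    """All clients funnel through one I/O server, so its limit caps the total."""
    return min(sum(client_demand), server_limit)


def parallel_throughput(client_demand: Sequence[float], oss_limits: Sequence[float]) -> float:
    """Clients reach many object storage servers at once, so server limits add up."""
    return min(sum(client_demand), sum(oss_limits))


# Hypothetical workload: 100 clients that each want 1 GB/s,
# served by machines that can each sustain 5 GB/s.
demand = [1.0] * 100
print("single server:", single_server_throughput(demand, 5.0))
for oss_count in (1, 8, 32):
    print(oss_count, "OSS:", parallel_throughput(demand, [5.0] * oss_count))
```

In this model the single-server figure never rises above one machine's limit, while the parallel figure grows with each added server until client demand becomes the bound.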

The Lustre Network (LNet) is a set of protocols and APIs that provide a network for connecting clustered servers and clients to Lustre file systems. LNet supplies pluggable drivers to support ultra low latency network infrastructures such as InfiniBand (verbs) or Ethernet. The drivers are loaded into the driver stack for each network type that is in use. Key features of LNet include support for Remote Direct Memory Access (RDMA) to provide fast memory-to-memory interconnects over InfiniBand or Ethernet networks for maximum performance and reduced CPU overhead. LNet also delivers high availability and transparent recovery in coordination with failover storage servers, along with the ability to route across multiple network types.

[Figure 2 diagram labels: Lustre clients (1-100,000); Lustre router; MDS 1 (active) and MDS 2 (standby), failover supported; metadata targets (MDT); object storage servers (OSS) (1-1,000s); OSS 1 and OSS 2 with shared storage enabling OSS failover, failover supported; networks: GigE, InfiniBand or GbE]

Figure 2. Typical Lustre configuration


HPC AND THE MISSING MIDDLE

In 2008, the Council on Competitiveness released a series of studies on U.S. manufacturing that found the vast majority of small to medium companies in the U.S. were missing out on high performance computing’s potential to drive innovation and make American companies more competitive.² Of some 300,000 manufacturing companies in the nation, five percent are global industrial giants that have been using HPC for years for advanced modeling, simulation and analysis. The study found, however, that most small and medium companies in the U.S. not only lack sophisticated HPC capabilities, but some 65 percent didn’t even use rudimentary desktop-based modeling in their manufacturing research and development, instead relying on physical prototyping.

The Council on Competitiveness coined the term “the missing middle” to identify this large group of manufacturers who lack HPC capabilities and, increasingly, global competitiveness. According to the council, “HPC represents a crucial edge that can build and sustain competitive advantage through innovative product design, production techniques, cost savings, improved time-to-market cycles, and overall quality.”³ Making HPC R&D more widely available to the missing middle has the potential to transform U.S. manufacturing while creating jobs and allowing a wider range of companies to compete globally.

Intel Enterprise Edition for Lustre Software

Intel EE for Lustre software optimizes the Lustre parallel file system as an enterprise platform for a broad spectrum of commercial organizations. It allows enterprises that need large-scale, high-bandwidth storage to tap into the power and scalability of Lustre, but with the simplified installation, configuration and monitoring features of Intel Manager for Lustre, a management solution purpose-built for the Lustre file system. Intel Manager for Lustre helps bring the performance benefits of Lustre to data-intensive businesses and organizations without the need for highly specialized technical administrators.

A stable, commercial-ready version of Lustre, bundled with smart management tools, helps bring the benefits of high performance computing to a broad range of businesses, including smaller and mid-sized companies with more limited technology resources. HPC capacity can deliver a higher level of computing power and throughput, and make information and insights derived from big data and compute-intensive applications (such as advanced modeling, simulation, and data analysis) available to a new tier of enterprise users. Intel EE for Lustre software, in combination with other supercomputing technologies, offers the potential to drive innovation, deliver higher quality products and designs, and sustain competitive advantage for a wide array of businesses.


Affordability

The Lustre distribution in Intel EE for Lustre software is open source, hardware neutral, and supports storage and servers from multiple vendors. There is no vendor lock-in, so administrators can customize the storage infrastructure to accommodate individual budgets.

Intel Manager for Lustre software provides browser-based tools for fast, efficient management, greatly reducing the once-imposing level of care and nurturing that Lustre deployments formerly demanded. Intel EE for Lustre software does not require specialized training or expertise to operate, and businesses can scale their Lustre storage deployments horizontally, yet continue to manage them with the same efficiency and precision as before.

Maturity

Intel EE for Lustre software delivers the most advanced Lustre features, rigorously tested and proven for diverse markets.

Intel EE for Lustre software brings together the best and brightest of Lustre expertise coupled with the resources, support and credibility of Intel. Intel EE for Lustre software has a clear product roadmap, with predictable releases. It includes best-in-class support from the Lustre experts at Intel, including worldwide 24X7 technical support services.

Intel Manager for Lustre Software

Intel Manager for Lustre software provides a unified, consistent view of Lustre storage systems and simplifies the installation, configuration, monitoring and overall management of Lustre. The manager consolidates all Lustre information in a central, browser-accessible location for ease of management and reduced com-

PUSHING STORAGE BOUNDARIES: THE FASTFORWARD PROGRAM

Administered by the United States Department of Energy, the FastForward program is designed to accelerate the research and development of critical technologies to enable extreme scale, or exascale, computing. The program seeks to address the nation’s most pressing scientific challenges by advancing simulation-based scientific discovery made possible by exascale supercomputers.

FastForward is contracted through a consortium of seven national laboratories, with subcontracts awarded to a network of private companies with expertise in high performance computing (HPC). In 2012, Whamcloud was awarded the Storage and I/O Research & Development subcontract for the FastForward program. Whamcloud, a global leader in parallel storage for HPC and Lustre research and support, was shortly afterward acquired by Intel. The Intel® High Performance Data Division (Intel® HPDD), the new home of Whamcloud at Intel, has continued the storage and I/O research for the FastForward subcontract. The two-year project includes key R&D necessary for a new object storage paradigm for HPC exascale computing; the new storage technology will also address next-generation storage mechanisms required by the Big Data market. All components developed in the project will be open sourced and benefit the entire Lustre community.

Benefits of Intel Enterprise Edition for Lustre Software for Commercial Organizations

Performance

Intel EE for Lustre software is designed to enable fully parallel I/O throughput across many clients, servers and storage devices. Many Lustre configurations are running in production at 500 to 750 gigabytes per second, with leading edge installations achieving throughput in excess of 1 terabyte per second.

This means extremely high volumes of data can be delivered to critical high performance applications, leading to improved decision-making based on near real-time analysis. High performance data flows allow an enterprise to run larger and more complex applications faster and more easily, providing an innovative edge for businesses. Intel EE for Lustre can also scale down efficiently to provide fast parallel storage for smaller organizations.

High throughput data flows can contribute to a higher return on investment (ROI) for HPC infrastructure. Massive data flows can utilize a high percentage of the underlying storage and networking hardware performance, with low overhead.
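To put the throughput figures above in concrete terms, the following minimal Python sketch estimates how long it would take to stream a dataset at a given sustained aggregate rate. The 1 petabyte dataset size, the decimal unit constants, and the function name `transfer_seconds` are illustrative assumptions, not figures or interfaces from the product.

```python
# Illustrative arithmetic only: ideal streaming time at a sustained rate.
# Assumes decimal units (1 GB = 10**9 bytes) and ignores startup costs,
# contention, and metadata overhead.

GB = 10**9   # bytes in a gigabyte (decimal)
TB = 10**12  # bytes in a terabyte (decimal)
PB = 10**15  # bytes in a petabyte (decimal)


def transfer_seconds(dataset_bytes: float, throughput_bytes_per_s: float) -> float:
    """Return the ideal time in seconds to move dataset_bytes at the given rate."""
    if throughput_bytes_per_s <= 0:
        raise ValueError("throughput must be positive")
    return dataset_bytes / throughput_bytes_per_s


# Hypothetical 1 PB dataset at the rates quoted in the text.
rates = {
    "500 GB/s": 500 * GB,
    "750 GB/s": 750 * GB,
    "1 TB/s": 1 * TB,
}
for label, rate in rates.items():
    seconds = transfer_seconds(1 * PB, rate)
    print(f"{label}: {seconds:.0f} s ({seconds / 60:.1f} min)")
```

Under these assumptions, a 1 petabyte dataset streams in roughly 17 to 33 minutes across the quoted range; real workloads will see lower effective rates depending on access patterns.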

Capacity

Lustre was developed to handle the demands of scientific data, and has been tested and trusted at extreme levels of throughput. The object-based storage architecture of Intel EE for Lustre software can scale to tens of thousands of clients. At 512 petabytes of storage for the current version of Lustre, storage capacity is essentially unlimited.

USING LUSTRE IN CONJUNCTION WITH APACHE HADOOP*

From Wall Street to the Great Wall, enterprises and institutions of all sizes are faced with the benefits – and challenges – promised by ‘Big Data’. But before users can take advantage of the near limitless potential locked within their data, they must have affordable, scalable and powerful software tools to manage the data.

High performance infrastructure workloads have expanded and are now key technologies used by today’s forward-looking commercial computer users. Parallel storage solutions powered by Lustre storage software have found a new home in these data-intensive business operations, and Apache Hadoop* has become the framework of choice for big data analytics. Hadoop transforms enormous amounts of data into manageable distributed datasets that applications can more easily analyze.

When organizations operate both Lustre and Hadoop within a shared infrastructure, there is a strong case for using Lustre as the file system for Hadoop analytics as well as HPC storage. Hadoop users can access any Lustre files directly from Hadoop, without the need to copy them over to the Hadoop environment. Using Lustre in combination with Hadoop also makes storage management simpler, since the platform will be running a single Lustre file system instance rather than Hadoop instances for each cluster, and makes more productive use of storage assets.

Moreover, Hadoop’s own file system, referred to as HDFS, is inconsistent with the HPC paradigm of decoupling computation from storage, as HDFS expects storage disks to be locally attached to individual compute nodes.

In addition, since HDFS is not POSIX-compliant (meaning that it does not conform to standards that maintain compatibility between operating systems), it entails the performance overhead of moving extremely large datasets in and out of Lustre for the purposes of staging I/O. Fortunately, Hadoop uses a storage abstraction layer for accessing persistent data, thus allowing the potential for plugging in different types of file systems. Lustre can be made to comply with Hadoop’s storage requirements by implementing its Java* file system API. Since Lustre is POSIX-compliant and can be mounted like an NFS share, it is able to exploit Java’s inherent support for native file systems.

The only additional step for mounting Lustre as the file system for Hadoop analytics is to convey to the Hadoop task scheduler that Lustre is indeed a distributed file system and the input data are accessible uniformly from all the compute nodes. This allows tasks to be scheduled on any node independent of data locality, so all Hadoop “compute” nodes can access any data, eliminating the need to move the data itself between nodes. Additional optimization is possible by allowing reducers to read intermediate map outputs directly from the shared file system and eliminating the overhead of streaming large files over HTTP.

Intel EE for Lustre software includes an adapter for Apache Hadoop* which allows users to run Map/Reduce* applications directly on Lustre. This optimizes the performance of Map/Reduce operations while delivering faster, more scalable and easier to manage storage.
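The practical consequence of POSIX compliance described above is that an analysis job on any node can open files on a mounted Lustre file system with ordinary file I/O, with no copy-in step. The Python sketch below illustrates only that idea; it is not the Hadoop adapter, and the mount point shown in the usage comment (`/mnt/lustre/demo`) and the function name `count_words` are hypothetical.

```python
import os
from collections import Counter


def count_words(root: str) -> Counter:
    """Count whitespace-separated words in every file under root.

    Uses only standard POSIX-style file access (os.walk, open), so it
    behaves the same on a local directory or on any mounted shared
    file system that presents a POSIX interface.
    """
    totals: Counter = Counter()
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # errors="replace" keeps the scan going past undecodable bytes.
            with open(path, "r", encoding="utf-8", errors="replace") as handle:
                for line in handle:
                    totals.update(line.split())
    return totals


# Usage (hypothetical mount point, assumed for illustration):
#     counts = count_words("/mnt/lustre/demo")
#     print(counts.most_common(10))
```

Because every node sees the same paths, the same call could run unchanged on any compute node, which is the property that lets tasks be scheduled independent of data locality.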

plexity. It helps lower the management costs of Lustre while accelerating the benefits of parallel storage software.

Key features of Intel Manager for Lustre software include:

• Intuitive, GUI-based administration. The easily navigated user interface centralizes definitions and management of common administrative tasks and provides insights across the entire storage system using simple but powerful graphical views and scriptable command line interfaces. Cluster configuration, provisioning, and management are performed with point-and-click simplicity.

• Real-time system monitoring. Keep abreast of end-to-end storage system health and access key performance indicators (KPIs) in real time. The manager interface allows insights into high-level system performance or in-depth focus into individual components. Robust reporting tools make it easy to generate historical and real-time charts and reports.

• Advanced troubleshooting tools. Proactively identify and correct faults before they become larger issues. Deal quickly with events and solve problems through a consolidated view of cluster-wide storage log files, with intelligent log-scanning capabilities for efficient problem isolation and analysis. Use repeatable self-test metrics to monitor incremental changes within the system. Configurable event notifications make it easy to track and schedule automated patches and security fixes.

• Open, documented APIs. Deploy Intel EE for Lustre software in existing networks using open and scriptable REST-compliant APIs for simplified integration with other storage systems and software management tools. Storage plug-in architecture provides easy extensibility.

For more information on Intel® Enterprise Edition for Lustre* Software, visit the Intel Lustre Portal at Lustre.intel.com

Lustre is Open for Business

With the release of Intel Enterprise Edition for Lustre software, Lustre is in a stronger position than it has ever been, as high performance storage is set to become a vital competitive differentiator for a broad range of businesses and commercial organizations. With its stable performance, simplified management, and support from Intel, Intel EE for Lustre software is poised to help move HPC into new markets, helping companies to improve data analysis; speed design, production and decision-making; and spur competitive advantage. Intel EE for Lustre software also stands ready to extend high-volume data processing into technology areas such as big data, business intelligence, and private cloud computing, where high data throughput is required.

Intel EE for Lustre software is built to solve today’s most demanding storage challenges and accelerate performance of critical applications and workflows to bring the benefits of HPC to a broader community of businesses. With veteran Lustre engineers and developers working at Intel and contributing to the Lustre code, Lustre will continue its growth in both HPC and commercial environments. It remains the best breakthrough technology for addressing the exascale and emerging big data challenges of tomorrow.

1 www.top500.org
2 Council on Competitiveness and USC-ISI In-Depth Study of Technical Computing End Users and HPC
3 High Performance Computing to Enable Next-Generation Manufacturing

INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL’S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT.

A “Mission Critical Application” is any application in which failure of the Intel Product could result, directly or indirectly, in personal injury or death. SHOULD YOU PURCHASE OR USE INTEL’S PRODUCTS FOR ANY SUCH MISSION CRITICAL APPLICATION, YOU SHALL INDEMNIFY AND HOLD INTEL AND ITS SUBSIDIARIES, SUBCONTRACTORS AND AFFILIATES, AND THE DIRECTORS, OFFICERS, AND EMPLOYEES OF EACH, HARMLESS AGAINST ALL CLAIMS COSTS, DAMAGES, AND EXPENSES AND REASONABLE ATTORNEYS’ FEES ARISING OUT OF, DIRECTLY OR INDIRECTLY, ANY CLAIM OF PRODUCT LIABILITY, PERSONAL INJURY, OR DEATH ARISING IN ANY WAY OUT OF SUCH MISSION CRITICAL APPLICATION, WHETHER OR NOT INTEL OR ITS SUBCONTRACTOR WAS NEGLIGENT IN THE DESIGN, MANUFACTURE, OR WARNING OF THE INTEL PRODUCT OR ANY OF ITS PARTS.

Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked “reserved” or “undefined”. Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The information here is subject to change without notice. Do not finalize a design with this information.

The products described in this document may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order. Copies of documents which have an order number and are referenced in this document, or other Intel literature, may be obtained by calling 1-800-548-4725, or go to: http://www.intel.com/design/literature.htm

FTC Optimization Notice

Intel’s compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel.

Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice.

Notice revision #20110804

General Performance Disclaimer: For more complete information about performance and benchmark results, visit Performance Test Disclosure http://www.intel.com/benchmarks

Copyright © 2013 Intel Corporation. All rights reserved. Intel, the Intel logo, and Xeon are trademarks of Intel Corporation in the U.S. and other countries.

* Other names and brands may be claimed as the property of others.

Printed in USA 0813/CM/MB/PDF Please Recycle 329079-002US
