Cluster Vision

Post on 04-Oct-2014

Bright Cluster Manager
Advanced cluster management made easy

Roland Richardson – ClusterVision UK

www.clustervision.com 2010

Frankfurt

Cluster Installation

ClusterVision provision and support clusters across Europe
● UK, Germany, France, The Netherlands, Italy, Spain, etc.
● Middle East
● US

Install OS, configure network drivers, install libraries and MPI implementations, install execution daemons for WLM

Some tasks are annoyingly different
● Queue configuration, user admin, hardware configuration, switches, PDUs, etc.

ClusterVision wanted to automate all those standard tasks while accommodating the variation and new features

The Toolkit approach

Most cluster management solutions use the “toolkit” approach (Linux distro + tools)
● Examples: Rocks, PCM, OSCAR, UniCluster, xCAT, etc.
● Tools typically used: Ganglia, Cacti, Nagios, Cfengine, System Imager, etc.

Issues with the “toolkit” approach:
● Tools rarely designed to work together
● Tools rarely designed for HPC
● Tools rarely designed to scale
● Each tool has its own command line interface and GUI
● Each tool has its own daemon and database
● Roadmap dependent on developers of the tools

Making a collection of unrelated tools work together
● Requires a lot of expertise and scripting
● Rarely leads to a really easy-to-use and scalable solution

Annoying Differences

Each cluster deployed is unique
● Different applications
● Different workload
● Different infrastructure

For some this creates a complex management environment
● No common practice
● Difficult to document

For some it's fine

Architecture

[Diagram: CMDaemon; connection labels: procedure call, SOAP+SSL, event]

Bright CM Elements

Cluster Management Daemon
Cluster Management Shell
Cluster Management GUI
Linux distribution, with many enhancements
● Red Hat & SUSE Enterprise
● CentOS & Scientific Linux
Node provisioning system
Workload management system
● Sun Grid Engine, Torque/Maui
● MOAB, PBS Pro, LSF
HPC user environment
Cluster management & monitoring functionality
Years of HPC expertise
Consistent integration of all the above
------------------------------------------------------------ +
Bright Cluster Manager

HPC User Environment

Let users focus on performing computations

Rich collection of HPC software
● Compilers (GNU, Intel*, Portland*, Open64, etc.)
● Parallel middleware (MPI libraries, threading libraries, OpenMP, Global Arrays, etc.)
● Mathematical libraries (ACML, MKL*, LAPACK, BLAS, GOTO, etc.)
● Development tools (debuggers, profilers, etc.)
● Environment modules

NVIDIA CUDA, OpenCL & GPGPU drivers

Intel Cluster Ready Compliant
● Compliant applications run out of the box

Bright Buzz:

• "It's hands-down the best GPGPU management software out there." -- Joe Lipman, VP of Sales at Advanced HPC

• "With Bright Cluster Manager now offering full support for ScaleMP vSMP Foundation, setting up and managing a powerful SMP cluster has never been so easy." -- Shai Fultheim, CEO, ScaleMP

• "Bright Cluster Manager allows us to provision thousands of nodes within hours and rapidly deploy software updates." -- Deborah Newman, Director Professional Services at PCPC Direct

Management Interface

Graphical User Interface (GUI)
● Offers administrator full cluster control
● Standalone desktop application
● Manages multiple clusters simultaneously
● Runs on Linux, Windows, MacOS X
● Built on top of Mozilla XUL engine

Command Line Interface (CLI)
● All GUI functionality also available through Command Line Interface (CLI)
● Interactive and scriptable in batch mode

[Screenshot: Admin GUI]

Advanced Features

Redundant head nodes
Cluster health checking
● Health checks before jobs are run
● Frequently scheduled health checks
Hardware burn-in environment
Daemon with low resource consumption
Multiple, load-balanced provisioning nodes
● Allows scaling to thousands of nodes
Node discovery using Ethernet switch port detection
Live & incremental image updates
Automated BIOS updates and configurations
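To make "health checks before jobs are run" concrete, here is a minimal Python sketch of the general idea: run a set of checks on each candidate node and only hand passing nodes to the workload manager. It does not show how Bright Cluster Manager implements this. The function `run_prejob_checks`, the check names, and the sample data are all hypothetical and chosen for illustration only.

```python
from typing import Callable, Dict, List, Tuple

# A health check takes a node name and returns True when the node passes.
HealthCheck = Callable[[str], bool]


def run_prejob_checks(
    nodes: List[str], checks: Dict[str, HealthCheck]
) -> Tuple[List[str], Dict[str, List[str]]]:
    """Split nodes into those fit to run a job and those that failed a check.

    Returns (healthy_nodes, failures), where failures maps each unhealthy
    node to the names of the checks it failed.
    """
    healthy: List[str] = []
    failures: Dict[str, List[str]] = {}
    for node in nodes:
        failed = [name for name, check in checks.items() if not check(node)]
        if failed:
            failures[node] = failed
        else:
            healthy.append(node)
    return healthy, failures


# Hypothetical sample data standing in for real node measurements.
FREE_SCRATCH_GB = {"node001": 120, "node002": 2, "node003": 80}
INTERCONNECT_UP = {"node001": True, "node002": True, "node003": True}

# Hypothetical checks: enough scratch space, and interconnect link is up.
EXAMPLE_CHECKS: Dict[str, HealthCheck] = {
    "scratch_space": lambda node: FREE_SCRATCH_GB.get(node, 0) >= 10,
    "interconnect": lambda node: INTERCONNECT_UP.get(node, False),
}

if __name__ == "__main__":
    ok, bad = run_prejob_checks(sorted(FREE_SCRATCH_GB), EXAMPLE_CHECKS)
    print("schedulable:", ok)
    print("held back:", bad)
```

In this sketch a node that fails any check is held back and the failing check names are reported, so the same routine could also serve the frequently scheduled checks by calling it on a timer instead of at job start.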

Unique Features

Fundamental, integrated solution
Complete management solution
Scalable to thousands of nodes
Minimal CPU overhead due to light-weight daemon
Support for redundant head nodes
Image based, incremental provisioning
Support for diskless and InfiniBand-only slave nodes
Single GUI and CLI for all cluster management tasks
Support for Red Hat Enterprise, SUSE Enterprise, CentOS and Scientific Linux
Cluster health checking
Role based authentication
Zoom-in function for monitoring graphs
Visualization of metrics in Rackview
Graphical parallel shell
Fully graphical installation in 30 minutes

Questions?

The End