Cluster Vision

25
Bright Cluster Manager Bright Cluster Manager Advanced cluster management made easy Roland Richardson – ClusterVision UK www.clustervision.com 2010 1

Transcript of Cluster Vision

Page 1: Cluster Vision

Bright Cluster ManagerBright Cluster ManagerAdvanced cluster management made easy

Roland Richardson – ClusterVision UK

www.clustervision.com 20101

Page 2: Cluster Vision

Frankfurt

www.clustervision.com 20102

Page 3: Cluster Vision

Cluster Installation

Cl t Vi i i i d t l t E ClusterVision provision and support clusters across Europe● UK, Germany, France, The Netherlands, Italy, Spain, etc. etc.

Middle East● Middle East● US

Install OS, Configure Network drivers, install libraries and MPI implementations install execution daemons for WLM

Some tasks are annoyingly differentqueue configuration user admin hardware configuration Switches PDUs● queue configuration, user admin, hardware configuration, Switches, PDUs etc

ClusterVision wanted to automate all those standard tasks while accommodating the variation and new features

www.clustervision.com 20103

Page 4: Cluster Vision

The Toolkit approachpp Most cluster management solutions use the “toolkit” approach

(Linux distro + tools)(Linux distro + tools) Examples: Rocks, PCM, OSCAR, UniCluster, xCAT, etc. Tools typically used: Ganglia Cacti Nagios Cfengine System Imager Tools typically used: Ganglia, Cacti, Nagios, Cfengine, System Imager,

etc. Issues with the “toolkit” approach:

Tools rarely designed to work together Tools rarely designed for HPC

T l l d i d t l Tools rarely designed to scale Each tool has its own command line interface and GUI Each tool has its own daemon and databaseEach tool has its own daemon and database Roadmap dependent on developers of the tools

Making a collection of unrelated tools work together Requires a lot of expertise and scripting Rarely leads to a really easy to use and scalable solution

www.clustervision.com 20104

Rarely leads to a really easy-to-use and scalable solution

Page 5: Cluster Vision

Annoying Differencesy g

Each cluster deployed is unique● Different Applications● Different Workload● Different Infrastructure

For some this creates a complex management environment For some this creates a complex management environment● No common practice

Difficult to document● Difficult to document

For some it's fineo so e s e

www.clustervision.com 20105

Page 6: Cluster Vision

www.clustervision.com 20106

Page 7: Cluster Vision

www.clustervision.com 20107

Page 8: Cluster Vision

www.clustervision.com 20108

Page 9: Cluster Vision

Architecture

CMDaemon

procedure call

SOAP+SSL

event

www.clustervision.com 20109

Page 10: Cluster Vision

Bright CM Elementsg Cluster Management Daemon Cluster Management Shell Cluster Management Shell Cluster Management GUI Linux distribution, with many enhancements, y

Red Hat & SUSE Enterprise CentOS & Scientific Linux

Node provisioning system Node provisioning system Workload management system

Sun Grid Engine, Torque/Maui MOAB, PBS Pro, LSF

HPC user environment Cluster management & monitoring functionality Cluster management & monitoring functionality Years of HPC expertise Consistent integration of all the aboveg------------------------------------------------------------------------------ +Bright Cluster Manager

www.clustervision.com 201010

Page 11: Cluster Vision

HPC User Environment

Let users focus on performing computationsLet users focus on performing computations

Rich collection of HPC software Compilers (GNU, Intel*, Portland*, Open64, etc.) Parallel middleware (MPI libraries, threading libraries, OpenMP,

Gl b l A t )Global Arrays, etc.) Mathematical libraries (ACML, MKL*, LAPACK, BLAS, GOTO,

etc )etc.) Development tools (debuggers, profilers, etc.) Environment modules Environment modules

NVIDIA CUDA, OpenCL & GPGPU drivers, p

Intel Cluster Ready CompliantCompliant applications run out of the box

www.clustervision.com 201011

Compliant applications run out of the box

Page 12: Cluster Vision

Bright Buzz:g

• "It's hands-down the best GPGPU management software out there." - Joe Lipman, VP of Sales at Advanced HPC

• "With Bright Cluster Manager now offering full support for ScaleMP vSMP Foundation, setting up and managing a powerful SMP cluster has never been so easy " -- Shai powerful SMP cluster has never been so easy. Shai Fultheim, CEO, ScaleMP

• "Bright Cluster Manager allows us to provision thousands of nodes within hours and rapidly deploy software

d " D b h N Di P f i l updates." -- Deborah Newman, Director Professional Services at PCPC Direct

www.clustervision.com 201012

Page 13: Cluster Vision

Management Interface

Graphical User Interface (GUI)Graphical User Interface (GUI) Offers administrator full cluster control Standalone desktop application

Admin GUI

Standalone desktop application Manages multiple clusters simultaneously

R Li Wi d M OS X Runs on Linux, Windows, MacOS X Built on top of Mozilla XUL engine

Command Line Interface (CLI)( ) All GUI functionality also available through

Command Line Interface (CLI)Command Line Interface (CLI) Interactive and scriptable in batch mode

www.clustervision.com 201013

Page 14: Cluster Vision

www.clustervision.com 201014

Page 15: Cluster Vision

www.clustervision.com 201015

Page 16: Cluster Vision

www.clustervision.com 201016

Page 17: Cluster Vision

www.clustervision.com 201017

Page 18: Cluster Vision

www.clustervision.com 201018

Page 19: Cluster Vision

www.clustervision.com 201019

Page 20: Cluster Vision

www.clustervision.com 201020

Page 21: Cluster Vision

www.clustervision.com 201021

Page 22: Cluster Vision

Advanced Features

Advanced Features Redundant head nodes Cluster health checkingg

Health checks before jobs are run Frequently scheduled health checks Hardware burn-in environment

Daemon with low resource consumption Multiple, load-balanced provisioning nodes

Allows scaling to thousands of nodesN d di i Eth t it h t d t ti Node discovery using Ethernet switch port detection

Live & incremental image updatesA t t d BIOS d t d fi ti Automated BIOS updates and configurations

www.clustervision.com 201022

Page 23: Cluster Vision

Unique Featuresq Fundamental, integrated solution Complete management solution Scalable to thousands of nodes Minimal CPU overhead due to light weight daemon Minimal CPU overhead due to light-weight daemon Support for redundant head nodes Image based, incremental provisioningg , p g Support for diskless and InfiniBand-only slave nodes Single GUI and CLI for all cluster management tasks

S f S S C OS Support for Red Hat Enterprise, SUSE Enterprise, CentOS and Scientific Linux

Cluster health checkingCluster health checking Role based authentication Zoom-in function for monitoring graphs Visualization of metrics in Rackview Graphical parallel shell Fully graphical installation in 30 minutes

www.clustervision.com 201023

Fully graphical installation in 30 minutes

Page 24: Cluster Vision

iQuestions?

www.clustervision.com 201024

Page 25: Cluster Vision

h dThe End