Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment RedHat, SuSE, Debian,...

18
Cluster Software Overview

Transcript of Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment RedHat, SuSE, Debian,...

Page 1: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

Cluster Software Overview

Page 2: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

Cluster Software Stack

Linux Kernel/Environment RedHat, SuSE, Debian, etc.

Page 3: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

Cluster Software Stack

HPC Device Drivers Interconnect driver (e.g., Myrinet, Infiniband, Quadrics) Storage drivers (e.g., PVFS)

Page 4: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

Cluster Software Stack

Job Scheduling and Launching Sun Grid Engine (SGE) Portable Batch System (PBS) Load Sharing Facility (LSF)

Page 5: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

Cluster Software Stack

Cluster Software Management E.g., Rocks, OSCAR, Scyld

Page 6: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

Cluster Software Stack

Cluster State Management and Monitoring Monitoring: Ganglia, Clumon, Nagios, Tripwire, Big Brother Management: Node naming and configuration (e.g., DHCP)

Page 7: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

Cluster Software Stack

Message Passing and Communication Layer E.g., Sockets, MPICH, PVM

Page 8: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

Cluster Software Stack

Parallel Code / Web Farm / Grid / Computer Lab Locally developed code

Page 9: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

Cluster Software Stack

Questions: How to deploy this stack across every machine in the cluster? How to keep this stack consistent across every machine?

Page 10: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

Software Deployment

Known methods: By hand “Add-on” method

Bring up a frontend, then add cluster packages OpenMosix, OSCAR, Warewulf

Integrated Cluster packages are added at frontend installation time

Rocks, Scyld

Page 11: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

OpenMosix

Overview Single system image - all nodes look like one large

multiprocessor Jobs migrate from machine to machine (based on machine

load) No changes required for apps to use system

Processors supported x86 Opteron port in progress Itanium port unclear

Interconnects supported All IP-based networks

Page 12: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

OpenMosix Installation

Install all nodes with Linux Can use RedHat, SuSE, Debian, Gentoo, etc.

Download and install OpenMosix add-on packages on each node (or configure and compile custom kernel) Kernel patches

Supports diskful and diskless

Reboot all nodes Now running OpenMosix kernel

Page 13: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

OSCAR

Overview Imaged based installation

Bring up frontend by hand Add OSCAR packages to frontend Construct image for compute nodes within a directory

on the frontend Redhat 9.0 and Mandrake 9.0 support Includes LAM/MPI, MPICH, PVM, Maui/PBS, PVFS, Clumon

Processors supported: x86, Itanium (in beta) Interconnects: Ethernet, Myrinet

Page 14: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

OSCAR

Diskless support in development Opteron support in development High-availability support in alpha testing

Page 15: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

Warewulf Overview

Install frontend first Recommend using RPM-based distribution

Imaged based installation “Virtual node filesystem”

Attacks problem of generic slave node management

Standard cluster software not included Added separately Use ‘chroot’ commands to add in extra software

Supports diskful and diskless Processors supported: x86, Opteron

Page 16: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

Scyld

Commercial distribution Single system image design

“Bproc” Based on Red Hat Enterprise Server 3.0

Processors: x86 and Opteron Itanium in development

Interconnects: Ethernet and Infiniband MPI and PVM Diskful and diskless support

Page 17: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

Rocks

Cluster on a CD Based on Red Hat Enterprise Workstation 3.0

Processors: x86, Opteron, Itanium Interconnects: Ethernet and Myrinet

Infiniband in development

Compute node management via Red Hat’s kickstart mechanism

Diskfull only

Page 18: Cluster Software Overview. Cluster Software Stack Linux Kernel/Environment  RedHat, SuSE, Debian, etc.

Many Others

Clustermatic Uses LinuxBIOS

Clic From mandrakesoft

Fully Automatic Installation Debian-based

Callident Commericial version of Rocks

Qlusters Commercial version of openMosix