Post on 27-Jul-2020
Requesting Resources on an
HPC Facility
Michael Griffiths and Norbert Gyenge
Corporate Information and Computing Services
The University of Sheffield
www.sheffield.ac.uk/cics/research
(Using the Slurm Workload Manager)
Review: Objectives
1. Understand what High Performance Computing is
2. Be able to access remote HPC systems by different methods
3. Run applications on a remote HPC system
4. Manage files using the Linux operating system
5. Know how to use the different kinds of file storage systems
6. Run applications using a scheduling system or workload manager
7. Know how to get more resources and how to get resources dedicated to your research
8. Know how to enhance your research through shell scripting
9. Know how to get help and training
Outline
1. Using the Job Scheduler – Interactive Jobs
2. Batch Jobs
3. Task Arrays
4. Running Parallel Jobs
5. Beyond Bessemer – Accessing Tier 2 Resources
6. Course examples are available via:
git clone --single-branch --branch bessemer https://github.com/rcgsheffield/hpc_intro
1. USING THE JOB SCHEDULER
• Interactive Jobs
• https://docs.hpc.shef.ac.uk/en/latest/bessemer/slurm.html#request-an-interactive-shell
• Batch Jobs
• https://docs.hpc.shef.ac.uk/en/latest/bessemer/slurm.html#submitting-non-interactive-jobs
• SLURM Documentation
• https://slurm.schedmd.com/pdfs/summary.pdf
• https://slurm.schedmd.com/man_index.html
RUNNING JOBS
A NOTE ON INTERACTIVE JOBS
• Software that requires intensive computing should be run on the worker nodes, not the head node.
• Run compute-intensive interactive jobs on the worker nodes using the command:
• srun --pty bash -i
• The maximum (and also the default) time limit for interactive jobs is 8 hours.
SLURM
• Bessemer login nodes are gateways to the cluster of worker nodes.
• The login nodes' main purpose is to allow access to the worker nodes, NOT to run CPU-intensive programs.
• All CPU-intensive computations must be performed on the worker nodes. This is achieved by:
• srun --pty bash -i for interactive jobs
• sbatch submission.sh for batch jobs
• Once you log into Bessemer, take advantage of the power of a worker node for interactive work simply by typing srun --pty bash -i and working in the shell window. The next set of slides assumes that you are already working on one of the worker nodes.
PRACTICE SESSION 1: RUNNING APPLICATIONS ON BESSEMER (PROBLEM 1)
• Case Studies
• Analysis of Patient Inflammation Data
• Running an R application: how to submit jobs and run R interactively
• List available and loaded modules; load the module for the R package
• Start the R application and plot the inflammation data
MANAGING YOUR JOBS
SLURM OVERVIEW
SLURM is the workload management, job scheduling and batch control system. (Others are available, such as PBS, Torque/Maui and Platform LSF.)
• Starts up interactive jobs on available workers
• Schedules all batch-oriented (i.e. non-interactive) jobs
• Fault-tolerant, highly scalable cluster management and job scheduling system
• Optimizes resource utilization
SCHEDULING BATCH JOBS ON THE CLUSTER
[Diagram: a SLURM master node schedules jobs (Job N, O, U, X, Y, Z) from three queues (Queue-A, Queue-B, Queue-C) onto slots on the SLURM worker nodes; scheduling is governed by queues, policies, priorities, shares/tickets, resources and users/projects.]
MANAGING JOBS: MONITORING AND CONTROLLING YOUR JOBS
• There are a number of commands for querying and modifying the status of a running or waiting job. These are:
• squeue (query job status; with no arguments it lists all users' jobs)
• squeue --jobs <jobid>
• squeue --user=<username>
• scancel (delete a job)
• scancel <jobid>
DEMONSTRATION 1
Using the R package to analyse patient data
sbatch example:
sbatch myjob
The first few lines of the submit script myjob contain:
#!/bin/bash
#SBATCH --time=10:00:00
#SBATCH --output=myoutputfile
#SBATCH --error=myerroroutput
and you simply type: sbatch myjob
Running Jobs: batch job example
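The fragment above can be fleshed out into a complete submission script. A minimal sketch follows; the file name myjob is from the slides, but the memory request and echo commands are illustrative additions, not part of the course material:

```shell
#!/bin/bash
# Illustrative Slurm submission script ("myjob"). The #SBATCH lines are
# directives read by the scheduler; when the file is run as an ordinary
# bash script they are treated as comments.
#SBATCH --time=10:00:00          # wall-clock limit of 10 hours
#SBATCH --output=myoutputfile    # file to capture standard output
#SBATCH --error=myerroroutput    # file to capture standard error
#SBATCH --mem=2000               # real memory in MB (the cluster default is 2 GB)

echo "Job started on host: $(hostname)"
# ... the application commands for the job go here ...
echo "Job finished"
```

Submit it with sbatch myjob; Slurm then writes the job's standard output and error streams to the two named files.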
PRACTICE SESSION: SUBMITTING JOBS TO BESSEMER
(PROBLEM 2 & 3)
• Patient Inflammation Study: run the R example as a batch job
• Case Study
• Fish population simulation
• Submitting jobs to SLURM
• Instructions are in the readme file in the slurm folder of the course examples
• From an interactive session
• Load the compiler module
• Compile the fish program
• Run test1, test2 and test3
MANAGING JOBS: REASONS FOR JOB
FAILURES
• SLURM cannot find the binary file specified in the job script
• You ran out of file storage. It is possible to exceed your filestore allocation limits during a job that is producing large output files. Use the quota command to check this.
• Required input files are missing from the startup directory
• Environment variable is not set correctly (LM_LICENSE_FILE etc)
• Hardware failure
FINDING OUT THE MEMORY REQUIREMENTS OF A JOB
• Real Memory Limits:
• The default real memory allocation is 2 GB
• Request 64 GB of memory in a batch file with:
• #SBATCH --mem=64000
• Real memory can also be requested as --mem=<NN>G (e.g. --mem=64G)
Determining the memory requirements of a job:
• scontrol show jobid -dd <jobid>
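As a sketch, the two request forms above can appear in a submission script as follows, with the post-run queries added as comments; the program name ./my_program is a placeholder, and the sacct query is an assumption (an alternative accounting view, not mentioned in the slides):

```shell
#!/bin/bash
# Illustrative memory requests (use one form, not both):
#SBATCH --mem=64000     # 64000 MB of real memory
##SBATCH --mem=64G      # equivalent request in gigabytes (disabled here)

# After the job has run, query its memory use from a login node, e.g.:
#   scontrol show jobid -dd <jobid>                          # detailed job record
#   sacct -j <jobid> --format=JobID,ReqMem,MaxRSS,Elapsed    # accounting view

./my_program    # hypothetical application binary
```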
MANAGING JOBS: RUNNING CPU-PARALLEL JOBS
• Many-processor tasks
• Shared memory
• Distributed memory
#!/bin/bash
#SBATCH --nodes=1
#SBATCH --ntasks-per-node=40
#SBATCH --mem=64000
#SBATCH --mail-user=username@sheffield.ac.uk
module load apps/openmpi/4.0.1/binary
• Jobs are limited to a single node, with a maximum of 40 tasks
• Compilers that support MPI:
• PGI, Intel, GNU
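Putting the directives above together, a complete MPI submission script for Bessemer might look like the following sketch; the mail address is the slides' placeholder, and the launch line (srun ./diffuse, using the course's diffuse binary) is an assumption about how the job body is written:

```shell
#!/bin/bash
# Illustrative MPI batch script for Bessemer (single node, up to 40 tasks).
#SBATCH --nodes=1                              # Bessemer jobs use one node
#SBATCH --ntasks-per-node=40                   # number of MPI tasks on the node
#SBATCH --mem=64000                            # 64 GB of real memory
#SBATCH --mail-user=username@sheffield.ac.uk   # notification address (placeholder)

module load apps/openmpi/4.0.1/binary   # OpenMPI module used in the course

srun ./diffuse    # launch the MPI program built earlier with mpicc
```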
DEMONSTRATION 3
• Test 6 provides an opportunity to practise submitting parallel jobs to the scheduler.
• To run testmpi6, compile the MPI example:
• Load the OpenMPI compiler module
• module load apps/openmpi/4.0.1/binary
• Compile the diffuse program
• mpicc diffuse.c -o diffuse -lm
• Submit the job: sbatch testmpi6
• Use squeue to monitor the job and examine the output
Running a parallel job
MANAGING JOBS: RUNNING ARRAYS OF JOBS
• Many processors running a copy of a task independently
• Add the --array parameter to the script file (with #SBATCH at the beginning of the line)
• Example: #SBATCH --array=1-4:1
• This will create 4 tasks from one job
• Each task will have its environment variable $SLURM_ARRAY_TASK_ID set to a single unique value ranging from 1 to 4.
• There is no guarantee that task number m will start before task number n, where m < n
• https://slurm.schedmd.com/job_array.html
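The behaviour described above can be sketched as a job script. The input file naming (data_<n>.txt) is hypothetical, and the fallback default is an addition that lets the script be exercised outside Slurm:

```shell
#!/bin/bash
# Illustrative task-array script: --array=1-4:1 creates 4 tasks,
# each started with SLURM_ARRAY_TASK_ID set to 1, 2, 3 or 4.
#SBATCH --array=1-4:1

# Fall back to 1 when run outside Slurm, so the script can be tested locally.
TASK_ID=${SLURM_ARRAY_TASK_ID:-1}

# Each task processes its own (hypothetical) input file independently.
echo "Task ${TASK_ID} processing data_${TASK_ID}.txt"
```

Because the tasks are independent, each array element can be scheduled on whichever worker node becomes free first, which is why task m is not guaranteed to start before task n.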
PRACTICE SESSION: SUBMITTING A TASK
ARRAY TO BESSEMER (PROBLEM 4)
• Case Study
• Fish population simulation
• Submitting jobs to Slurm
• Instructions are in the readme file in the Slurm folder of the course examples
• From an interactive session
• Run the Slurm task array example
• Run test4, test5
BEYOND BESSEMER
• Bessemer and ShARC are adequate for many compute problems
• Purchasing dedicated resource
• National Tier 2 facilities for more demanding compute problems
• ARCHER: a larger facility for grand challenge problems (peer review process to access)
https://www.sheffield.ac.uk/cics/research/hpc/costs
HIGH PERFORMANCE
COMPUTING TIERS
• Tier 1 computing
• Archer
• Tier 2 Computing
• Peta-5, JADE
• Tier 3 Computing
• Bessemer, ShARC
PURCHASING RESOURCE
• Buying nodes using framework
• Research groups purchase HPC equipment against their research grants; this hardware is integrated with the Iceberg cluster
• Buying slice of time
• Research groups can purchase servers for a length of time specified by the research group (cost is 1.0p/core per hour)
• Servers are reserved for dedicated usage by the research group using a provided project name
• When reserved nodes are idle they become available to the general short queues. They are quickly released for use by the research group when required.
• For information e-mail research-it@Sheffield.ac.uk
https://www.sheffield.ac.uk/cics/research/hpc/costs
NATIONAL HPC SERVICES
• Tier-2 Facilities
• http://www.hpc-uk.ac.uk/
• https://goo.gl/j7UvBa
• ARCHER: UK National Supercomputing Service
• Hardware: Cray XC30
• 2632 standard nodes
• Each node contains two Intel E5-2697 v2 12-core processors, i.e. 2632 × 2 × 12 = 63,168 cores
• 64 GB of memory per node
• 376 high-memory nodes with 128 GB of memory
• Nodes are connected to each other via the ARIES low-latency interconnect
• Research Data File System: 7.8 PB of disk
• http://www.archer.ac.uk/
• EPCC: HPC facilities
• http://www.epcc.ed.ac.uk/facilities/national-facilities
• Training and expertise in parallel computing
LINKS FOR SOFTWARE
DOWNLOADS
• MobaXterm
https://mobaxterm.mobatek.net/
• Putty
http://www.chiark.greenend.org.uk/~sgtatham/putty/
• WinSCP
http://winscp.net/eng/download.php
• TigerVNC
http://sourceforge.net/projects/tigervnc/