Logging in to SCW

Overview

Teaching: 20 min
Exercises: 15 min
Questions
  • How can do I login to SCW?

Objectives
  • Understand how to login to SCW

  • Understand the difference between the login node and each cluster’s head node.

Logging in

Your username is usually firstname.surname. You should have been emailed details of your login prior to this workshop.

$ ssh username@login.hpcwales.co.uk

Windows users should use PuTTY and enter login.hpcwales.co.uk in the hostname box.

Change your password

If you haven’t already changed your password from the one that was emailed to you, do so now by running the passwd command. Passwords need to contain some numbers, letters and punctuation characters. If you forget your password email support@hpcwales.co.uk to reset it.

What’s available?

The hpcwhosts command will list the available clusters.

$ hpcwhosts
HPC Wales Clusters Available

Phase    System Location & Type             Login Node(s)
------------------------------------------------------------------
1        Cardiff High Throughput            cwl001   cwl002   cwl003
1        Bangor Medium Processing           bwl001   bwl002
2        Swansea Capability/Capacity/GPU    ssl001   ssl002   ssl003
2        Cardiff Capacity/GPU               csl001   csl002
Cluster Number of Nodes Cores per node Architecture RAM Other
Cardiff High Throughput 162 12 Westmere 36GB  
Cardiff High Throughput 4 2 Nehalem 128GB  
Cardiff High Throughput 1 8 Nehalem 512GB  
Cardiff Capacity 384 16 Sandy Bridge 64GB  
Cardiff GPU 16 16 Sandy Bridge 64GB Nvidia Tesla M GPU
Swansea Capability 16 16 Sandy Bridge 128GB  
Swansea Capability 240 16 Sandy Bridge 64GB  
Swansea Capacity 128 16 Sandy Bridge 64GB  
Swansea GPU 16 16 Sandy Bridge 64GB Nvidia Tesla M2090 (512 core, 6GB RAM)
Bangor 54 12 Westmere 36GB  

Total: 15520 cores, 304.7 Trillion Floating Point Operations Per Second (TFlops)

SCW vs HPCW

SCW is still in the process of being purchased. We are probably getting Intel Sandybridge Xeon processors. Approximately double the speed of a Sandybridge processor. Expect total speed around 700 TFLOPs. The top500 list compiles a list of the world’s 500 fastest computers. The November 2017 (https://www.top500.org/list/2017/11/?page=4) list has a 700 FLOP system at position 383.

Slurm

Slurm is the management software used on HPC Wales. It lets you submit (and monitor or cancel) jobs to the cluster and chooses where to run them.

Other clusters might run different job management software such as LSF, Sun Grid Engine or Condor, although they all operate along similar principles.

How busy is the cluster?

The sinfo command tells us the state of the cluster. It lets us know what nodes are available, how busy they are and what state they are in.

Clusters are sometimes divided up into partitions. This might separate some nodes which are different to the others (e.g. they have more memory, GPUs or different processors).

PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST
work*        up   infinite      2 drain* bwc[022,052]
work*        up   infinite      1  down* bwc016
work*        up   infinite     13    mix bwc[001-002,010-012,031-036,050-051]
work*        up   infinite     38  alloc bwc[003-009,013-015,017-021,023-030,037-049,053-054]
long         up   infinite      2 drain* bwc[022,052]
long         up   infinite      1  down* bwc016
long         up   infinite     13    mix bwc[001-002,010-012,031-036,050-051]
long         up   infinite     38  alloc bwc[003-009,013-015,017-021,023-030,037-049,053-054]

Exercises

Logging into HPC Wales

  1. Login to login.hpcwales.co.uk using your SSH client. Your username is usually firstname.surname and has been emailed to you.
  2. Run the hpcwhosts and pick a system to login to.
  3. Login to that host by running ssh <hostname>
  4. Run the sinfo command to see how busy things are.
  5. Try sinfo --long, what extra information does this give?

Key Points

  • ssh login.hpcwales.co.uk for the login node, from there connect to the cluster of your choice

  • hpcwhosts command shows a list of clusters