Logging in to SCW
Overview
Teaching: 20 min
Exercises: 15 minQuestions
How can do I login to SCW?
Objectives
Understand how to login to SCW
Understand the difference between the login node and each cluster’s head node.
Logging in
Your username is usually firstname.surname
. You should have been emailed details of your login prior to this workshop.
$ ssh username@login.hpcwales.co.uk
Windows users should use PuTTY and enter login.hpcwales.co.uk in the hostname box.
Change your password
If you haven’t already changed your password from the one that was emailed to you, do so now by running the passwd
command. Passwords need to contain some numbers, letters and punctuation characters. If you forget your password email support@hpcwales.co.uk to reset it.
What’s available?
The hpcwhosts
command will list the available clusters.
$ hpcwhosts
HPC Wales Clusters Available
Phase System Location & Type Login Node(s)
------------------------------------------------------------------
1 Cardiff High Throughput cwl001 cwl002 cwl003
1 Bangor Medium Processing bwl001 bwl002
2 Swansea Capability/Capacity/GPU ssl001 ssl002 ssl003
2 Cardiff Capacity/GPU csl001 csl002
Cluster | Number of Nodes | Cores per node | Architecture | RAM | Other |
---|---|---|---|---|---|
Cardiff High Throughput | 162 | 12 | Westmere | 36GB | |
Cardiff High Throughput | 4 | 2 | Nehalem | 128GB | |
Cardiff High Throughput | 1 | 8 | Nehalem | 512GB | |
Cardiff Capacity | 384 | 16 | Sandy Bridge | 64GB | |
Cardiff GPU | 16 | 16 | Sandy Bridge | 64GB | Nvidia Tesla M GPU |
Swansea Capability | 16 | 16 | Sandy Bridge | 128GB | |
Swansea Capability | 240 | 16 | Sandy Bridge | 64GB | |
Swansea Capacity | 128 | 16 | Sandy Bridge | 64GB | |
Swansea GPU | 16 | 16 | Sandy Bridge | 64GB | Nvidia Tesla M2090 (512 core, 6GB RAM) |
Bangor | 54 | 12 | Westmere | 36GB |
Total: 15520 cores, 304.7 Trillion Floating Point Operations Per Second (TFlops)
SCW vs HPCW
SCW is still in the process of being purchased. We are probably getting Intel Sandybridge Xeon processors. Approximately double the speed of a Sandybridge processor. Expect total speed around 700 TFLOPs. The top500 list compiles a list of the world’s 500 fastest computers. The November 2017 (https://www.top500.org/list/2017/11/?page=4) list has a 700 FLOP system at position 383.
Slurm
Slurm is the management software used on HPC Wales. It lets you submit (and monitor or cancel) jobs to the cluster and chooses where to run them.
Other clusters might run different job management software such as LSF, Sun Grid Engine or Condor, although they all operate along similar principles.
How busy is the cluster?
The sinfo
command tells us the state of the cluster. It lets us know what nodes are available, how busy they are and what state they are in.
Clusters are sometimes divided up into partitions. This might separate some nodes which are different to the others (e.g. they have more memory, GPUs or different processors).
PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
work* up infinite 2 drain* bwc[022,052]
work* up infinite 1 down* bwc016
work* up infinite 13 mix bwc[001-002,010-012,031-036,050-051]
work* up infinite 38 alloc bwc[003-009,013-015,017-021,023-030,037-049,053-054]
long up infinite 2 drain* bwc[022,052]
long up infinite 1 down* bwc016
long up infinite 13 mix bwc[001-002,010-012,031-036,050-051]
long up infinite 38 alloc bwc[003-009,013-015,017-021,023-030,037-049,053-054]
- work* means this is the default partition.
- AVAIL tells us if the partition is available.
- TIMELIMIT tells us if there’s a time limit for jobs
- NODES is the number of nodes in the this partition.
- STATE, drain means the node will become unavailable once the current job ends. down is off, allow is allocated and mix is …
Exercises
Logging into HPC Wales
- Login to login.hpcwales.co.uk using your SSH client. Your username is usually firstname.surname and has been emailed to you.
- Run the
hpcwhosts
and pick a system to login to.- Login to that host by running
ssh <hostname>
- Run the
sinfo
command to see how busy things are.- Try
sinfo --long
, what extra information does this give?
Key Points
ssh login.hpcwales.co.uk for the login node, from there connect to the cluster of your choice
hpcwhosts command shows a list of clusters