profiles/cluster/client: Start slurmd after transient cirrus mount error
Description of changes
lab6p8 booted but slurmd.service failed due to mnt-cirrus.mount being temporarily failed during boot. This MR should make systemd continue to try starting slurmd.service after mnt-cirrus fails.
Additionally, it stops slurmd
when the cirrus mount fails, removing it from the cluster.
Things done
-
Tested -
Updated documentation (Wiki/NetBox) -
Breaking change