Skip to content

profiles/cluster/client: Start slurmd after transient cirrus mount error

André Breda requested to merge ist189409/nixrnl:slurm-upholds into master

Description of changes

lab6p8 booted but slurmd.service failed due to mnt-cirrus.mount being temporarily failed during boot. This MR should make systemd continue to try starting slurmd.service after mnt-cirrus fails.

Additionally, it stops slurmd when the cirrus mount fails, removing it from the cluster.

Things done

  • Tested
  • Updated documentation (Wiki/NetBox)
  • Breaking change

Merge request reports

Loading