Add support for GPUs in Slurm
Description of changes
Slurm currently does not detect or track GPU usage. Combined with ConstrainDevices=yes
, I believe this makes it impossible to utilize GPUs within Slurm, and the reason for jobs started with --exclusive
to not be currently scheduled in these nodes.
This MR changes Slurm configuration to recognize GPUs as a type of resource, and to autodetect them in machines with NVIDIA GPUs. It may be interesting to further investigate this later to also detect the Intel iGPU.
Things done
-
Tested -
Updated documentation (Wiki/NetBox) -
Breaking change