Add support for ROCR_VISIBLE_DEVICES environment variable manipulation, similar to that of CUDA_VISIBLE_DEVICES. Add support equivalent to that of the Nvidia NVML / MPS libraries assuming sufficient API availability.
Just tidying up. I'm marking this as complete - the gpu/rsmi plugin has been available since the 20.02 release last year as is working as intended. *** This ticket has been marked as a duplicate of ticket 7714 ***
Opening this ticket up publicly, and adding a couple of documentation links: AMD's ROCm SMI library is what the Slurm gpu/rsmi plugin depends on for device info: https://github.com/RadeonOpenCompute/rocm_smi_lib The rsmi.h header itself is the best description of the API they've defined: https://github.com/RadeonOpenCompute/rocm_smi_lib/blob/master/include/rocm_smi/rocm_smi.h