Ticket 20040 - Remove dependency on NVML for basic NVIDIA GPU detection and management
Summary: Remove dependency on NVML for basic NVIDIA GPU detection and management
Status: RESOLVED FIXED
Alias: None
Product: Slurm
Classification: Unclassified
Component: GPU
Version: 24.11.x
Hardware: Linux Linux
Severity: 5 - Enhancement
Assignee: Scott Hilton
QA Contact: Documentation
URL:
Duplicates: 20175
Depends on:
Blocks:
 
Reported: 2024-05-31 09:55 MDT by Tim Wickberg
Modified: 2024-10-30 12:42 MDT
CC List: 2 users

See Also:
Site: SchedMD
Slinky Site: ---
Alineos Sites: ---
Atos/Eviden Sites: ---
Confidential Site: ---
Coreweave sites: ---
Cray Sites: ---
DS9 clusters: ---
Google sites: ---
HPCnow Sites: ---
HPE Sites: ---
IBM Sites: ---
NOAA Site: ---
NoveTech Sites: ---
Nvidia HWinf-CS Sites: ---
OCF Sites: ---
Recursion Pharma Sites: ---
SFW Sites: ---
SNIC sites: ---
Tzag Elita Sites: ---
Linux Distro: ---
Machine Name:
CLE Version:
Version Fixed: 24.11.0rc1
Target Release: 24.11
DevPrio: 2 - Vital
Emory-Cloud Sites: ---


Comment 1 Tim Wickberg 2024-06-14 15:48:17 MDT
*** Ticket 20175 has been marked as a duplicate of this ticket. ***
Comment 14 Ben Roberts 2024-10-30 12:42:43 MDT
The following commits have been checked in ahead of 24.11.0-0rc1 and will be visible on the website with the release of 24.11.

commit d10c3e5d3ea63f92637266b7ae38be0815554183
Author: Stephen Kendall <stephen@schedmd.com>
Date:   Wed Oct 23 15:00:24 2024 -0600

    Docs - Reformat GPU plugins list in nested list
    
    Ticket 20040

commit 376044d3b172f713013cd97c483d9b69adeadb53
Author: Scott Hilton <scott@schedmd.com>
Date:   Wed Oct 23 11:43:54 2024 -0600

    Docs - Add documentation for Autodetect=nvidia
    
    Ticket 20040