| Summary: | Option to avoid jobs starting on selected GPU devices | ||
|---|---|---|---|
| Product: | Slurm | Reporter: | VUB HPC <hpcadmin> |
| Component: | GPU | Assignee: | Tim Wickberg <tim> |
| Status: | OPEN --- | QA Contact: | |
| Severity: | 5 - Enhancement | ||
| Priority: | --- | CC: | brian.gregory, djacobsen, kilian, scott |
| Version: | 25.11.5 | ||
| Hardware: | Linux | ||
| OS: | Linux | ||
| Site: | VUB | Slinky Site: | --- |
| Alineos Sites: | --- | Atos/Eviden Sites: | --- |
| Confidential Site: | --- | Coreweave sites: | --- |
| Cray Sites: | --- | DS9 clusters: | --- |
| Google sites: | --- | HPCnow Sites: | --- |
| HPE Sites: | --- | IBM Sites: | --- |
| NOAA SIte: | --- | NoveTech Sites: | --- |
| Nvidia HWinf-CS Sites: | --- | OCF Sites: | --- |
| Recursion Pharma Sites: | --- | SFW Sites: | --- |
| SNIC sites: | --- | Tzag Elita Sites: | --- |
| Linux Distro: | --- | Machine Name: | |
| CLE Version: | Version Fixed: | ||
| Target Release: | --- | DevPrio: | --- |
| Emory-Cloud Sites: | --- | ||
|
Description
VUB HPC
2026-05-07 03:41:24 MDT
Hey Alex - We're discussing something of this nature for the 26.11 release, but can't promise a specific implementation just yet. But the broad idea would be to add independent status tracking for each GRES device - you'd then have options to mark them down individually while persisting them in the configuration. Right now the only way to readily achieve this is to alter the gres.conf/slurm.conf definition for the node, which I know isn't ideal. - Tim Hi Tim, Happy to hear that there are plans in motion for this feature. A target on the 26.11 release would be convenient to us as well. Thanks for the update. Alex Domingo VUB-HPC |