| Summary: | RFE: new flag to allocate subset of GRES on exclusive nodes | ||
|---|---|---|---|
| Product: | Slurm | Reporter: | Felix Abecassis <fabecassis> |
| Component: | Accounting | Assignee: | Marcin Stolarek <cinek> |
| Status: | RESOLVED FIXED | QA Contact: | |
| Severity: | 5 - Enhancement | ||
| Priority: | --- | CC: | jbernauer, lcapps, lyeager, tim |
| Version: | 22.05.0 | ||
| Hardware: | Linux | ||
| OS: | Linux | ||
| See Also: |
https://bugs.schedmd.com/show_bug.cgi?id=15873 https://bugs.schedmd.com/show_bug.cgi?id=15874 |
||
| Site: | NVIDIA (PSLA) | Alineos Sites: | --- |
| Atos/Eviden Sites: | --- | Confidential Site: | --- |
| Coreweave sites: | --- | Cray Sites: | --- |
| DS9 clusters: | --- | HPCnow Sites: | --- |
| HPE Sites: | --- | IBM Sites: | --- |
| NOAA SIte: | --- | OCF Sites: | --- |
| Recursion Pharma Sites: | --- | SFW Sites: | --- |
| SNIC sites: | --- | Linux Distro: | --- |
| Machine Name: | CLE Version: | ||
| Version Fixed: | 23.02pre1 | Target Release: | 23.02 |
| DevPrio: | 1 - Paid | Emory-Cloud Sites: | --- |
|
Description
Felix Abecassis
2022-06-03 14:47:35 MDT
"consume_requested" might be a better name, I was thinking of `ReqTres` but Req stands for "Requested" and not "Required". Felix, this would be possible, however would require being sponsored as paid development by Nvidia. Is this something you are interested in sponsoring? Updating ticket metadata to reflect status as a potential future enhancement. Hey Felix - We're working on wrapping this up, but stumbled on one subtle implementation detail that we wanted to check with you on. Each Gres can have an (optional) Type field. Common device definitions look like: Name=gpu Type=k20 File=/dev/nvidia0 Internally, the internal flags field - where this new "explicit" flag is being added - is mapped to the Gres - not each individual (Gres,Type) tuple. This means that, in our current implementation, if you specify any Explicit Gres like: Name=gpu Type=k20 File=/dev/nvidia0 Flags=Explicit That the explicit flag applies not only to the k20 types, but all Gres=gpu defined on the node. So any further definitions like: Name=gpu Type=h100 File=/dev/nvidia1 would automatically inherit the "explicit" flag, and be treated as such in the configuration. We're hoping that's not an issue for your expected use case here, but wanted to confirm that with you in case you have some use for this flag that doesn't match up to this. - Tim Thanks for asking, I think that's fine. Felix, I'm happy to let you know that requested feature got merged into our public repository[1] and will be part of Slurm 23.02 release. I'll go ahead and mark the ticket as fixed. Should you have any questions please don't hesitate to reopen. cheers, Marcin [1]https://github.com/SchedMD/slurm/commit/75be81090106b9b083698e66e8821f0113af72b1 |