Ticket 6734

Summary: Affinity based on GRES for non-GRES jobs
Product: Slurm Reporter: Albert Gil <albert.gil>
Component: GPUAssignee: Unassigned Developer <dev-unassigned>
Status: OPEN --- QA Contact:
Severity: 5 - Enhancement    
Priority: --- CC: hpc-staff
Version: 19.05.x   
Hardware: Linux   
OS: Linux   
See Also: https://bugs.schedmd.com/show_bug.cgi?id=6693
https://bugs.schedmd.com/show_bug.cgi?id=6548
Site: NYU Alineos Sites: ---
Atos/Eviden Sites: --- Confidential Site: ---
Coreweave sites: --- Cray Sites: ---
DS9 clusters: --- HPCnow Sites: ---
HPE Sites: --- IBM Sites: ---
NOAA SIte: --- OCF Sites: ---
Recursion Pharma Sites: --- SFW Sites: ---
SNIC sites: --- Linux Distro: ---
Machine Name: CLE Version:
Version Fixed: Target Release: ---
DevPrio: --- Emory-Cloud Sites: ---

Description Albert Gil 2019-03-21 10:48:29 MDT
I'm creating this enhanecment from bug 6693 where NYU was asking for the following enhancement.

The need a way to ensure or encourage as much as possible that jobs that are not asking for a GRES but are allocated to a node with some GRES with CPUs assigned in the gres.conf are not bind to those CPUs but to "free CPUs" (CPUs not specified in the gres.conf).

An initial idea could be adding some new option(s) to current  --gres-flags or --distribution to add this bindig logic: "not use or try to avoid GRES-assigned CPUs for jobs not asking for that GRES".
Comment 1 Moe Jette 2019-03-21 10:57:32 MDT
cons_tres does not have anything like this today, but perhaps the cores could be assigned a weight similar to the node weight. Cores associated with more GRES types (and systems could possibly have many GRES types with various bindings) could have a higher weight and be avoided when possible. That would be a fairly major effort and is unlikely to happen soon.