In bug 15439 comment 11 we discussed the need for ResumeProgram and SuspendProgram to have access to node features, so that they can perform different suspend/resume actions such as IPMI or cloud commands.

For example, we have defined node features such as:

$ scontrol show node x002
NodeName=x002 Arch=x86_64 CoresPerSocket=12
   CPUAlloc=24 CPUTot=24 CPULoad=24.01
   AvailableFeatures=xeon2650v4,opa,xeon24,power_ipmi
   ActiveFeatures=xeon2650v4,opa,xeon24,power_ipmi
(lines deleted)

It would be most helpful if the SLURM_RESUME_FILE, as well as a new SLURM_SUSPEND_FILE, included the node features information, so that ResumeProgram and SuspendProgram do not need to query slurmctld explicitly to read such features.

Thanks,
Ole
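For illustration, here is a minimal Python sketch of the explicit query that a ResumeProgram or SuspendProgram currently has to make in order to learn a node's features. It is only a sketch under stated assumptions: the function names (parse_node_features, query_node_features, power_method) are hypothetical, and reading a "power_" feature prefix as the suspend/resume method is an assumed site convention suggested by the power_ipmi feature above, not something Slurm defines.

```python
import subprocess

# Sample text taken from the "scontrol show node x002" output quoted above.
SAMPLE = """NodeName=x002 Arch=x86_64 CoresPerSocket=12
   CPUAlloc=24 CPUTot=24 CPULoad=24.01
   AvailableFeatures=xeon2650v4,opa,xeon24,power_ipmi
   ActiveFeatures=xeon2650v4,opa,xeon24,power_ipmi
"""


def parse_node_features(scontrol_output):
    """Extract the feature lists from 'scontrol show node' text (hypothetical helper)."""
    features = {"AvailableFeatures": [], "ActiveFeatures": []}
    for token in scontrol_output.split():
        key, sep, value = token.partition("=")
        # scontrol prints "(null)" when a node has no features defined.
        if sep and key in features and value != "(null)":
            features[key] = [name for name in value.split(",") if name]
    return features


def query_node_features(nodename):
    """Ask slurmctld for one node's features via scontrol (the explicit query)."""
    result = subprocess.run(
        ["scontrol", "show", "node", nodename],
        capture_output=True, text=True, check=True,
    )
    return parse_node_features(result.stdout)


def power_method(active_features, prefix="power_"):
    """Return the part after the assumed 'power_' prefix, or None if absent."""
    for feature in active_features:
        if feature.startswith(prefix):
            return feature[len(prefix):]
    return None


if __name__ == "__main__":
    # Parses the quoted sample instead of calling scontrol, so it runs anywhere.
    active = parse_node_features(SAMPLE)["ActiveFeatures"]
    print(active, power_method(active))
```

If the features were written into SLURM_RESUME_FILE as requested, the query_node_features call would become unnecessary and only the feature-to-action step would remain.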
Ole is your site interested in funding/sponsoring this feature?
Hi Jason,

(In reply to Jason Booth from comment #2)
> Ole is your site interested in funding/sponsoring this feature?

I have considered this question. The suggested enhancement would give the power_save plugin's scripts, SuspendProgram and ResumeProgram, some extra relevant data to work with. However, my power saving scripts in https://github.com/OleHolmNielsen/Slurm_tools/tree/master/power_save instead use a single call of sdiag to obtain the data directly from slurmctld. Therefore I do not have a strong need for the suggested enhancement.

Thanks for considering this anyhow.

Best regards,
Ole
Ole, I am moving this over to our sev 5 categories so that we can track this appropriately.