The slurm-users mailing list has a discussion of the command "sacctmgr show event where node=<nodename>" for listing node events. This is an *extremely useful* command! However, the sacctmgr documentation[1] seems to be slightly incorrect when stating: event: Events like downed or draining nodes on clusters. As pointed out in the mailing list: Be aware that down and drainED nodes are there, but not drainING. Can you please reformulate the documentation so that the precise behavior is given? IMHO, having the state "Draining" from the database would also be very useful, if it actually gets recorded. Can you kindly clarify this possibility? Thanks a lot, Ole [1] https://slurm.schedmd.com/sacctmgr.html#OPT_event
Hello Ole, I have done some quick tests to make sure and yes, the DB only gets populated with the event once the node actually finishes up draining. >> IMHO, having the state "Draining" from the database would also be very useful, >> if it actually gets recorded. Can you kindly clarify this possibility? I cannot comment on that *yet*. I would need to discuss this internally first. I will let you know when I have some feedback. Best regards, Ricard.
Hi Ricard, (In reply to Ricard Zarco Badia from comment #1) > I have done some quick tests to make sure and yes, the DB only gets > populated with the event once the node actually finishes up draining. > > >> IMHO, having the state "Draining" from the database would also be very useful, > >> if it actually gets recorded. Can you kindly clarify this possibility? > > I cannot comment on that *yet*. I would need to discuss this internally > first. I will let you know when I have some feedback. Thanks for confirming this. In the meantime, would you be able to work on updating the documentation[1]? Thanks, Ole
Hello Ole, >> In the meantime, would you be able to work on updating the documentation? Sure, I have proposed some changes to be reviewed. I will let you know as soon as I have any news. And coming back to this: >> IMHO, having the state "Draining" from the database would also be very useful, >> if it actually gets recorded. Can you kindly clarify this possibility? We did some digging and found out that we already have an enhancement request for this. I cannot say when/if/how it will be done, but this is present in our development backlog. Best regards, Ricard.
Hello Ole, We have added the following documentation changes in this commit [1]. It is not in the web page yet, but it will be there as soon as we push a new version of the docs. I think this covers all right now, I will be closing the ticket. Please feel free to reopen and reply if necessary. Best regards, Ricard. [1] https://github.com/SchedMD/slurm/commit/2c115fd5e1e4b9261128e398a79a65cb4f38e05a