Some pending jobs show "Reserved for maintenance" even though no reservation is in the system. 70919 F1cpu HPhi k014819 PD 0:00 1 (ReqNodeNotAvail, Reserved for maintenance) 75614 F1cpu XC6_L2_D k003900 PD 0:00 1 (ReqNodeNotAvail, Reserved for maintenance) 75618 F1cpu XC6_L4_D k003900 PD 0:00 1 (ReqNodeNotAvail, Reserved for maintenance) 75677 F1cpu XC4_L10_ k003900 PD 0:00 1 (ReqNodeNotAvail, Reserved for maintenance) 75679 F1cpu GP_XC4_L k003900 PD 0:00 1 (ReqNodeNotAvail, Reserved for maintenance)
Did you have any maintenance reservation in the recent past? I think that you're hitting an issue that got improved by 3d6902ebe9d and was already released in Slurm 20.02.6? The upgrade should be enough avoid it in the future. As the workaround for those jobs, you can call `scontrol release JOBID` on those jobs. After that, they will get displayed with "(None)" reason which will be updated in the next scheduler cycle. Let me know if that worked. After the confirmation, I'll close the case as a duplicate of Bug 9720. cheers, Marcin
Did the workaround from comment 1 work for you? cheers, Marcin
Yes. It was fixed by this workaround. Thanks.
Thanks for the confirmation. To avoid the issue in the future please upgrade to Slurm 20.02.6. I'm closing the bug as a duplicate now. cheers, Marcin *** This ticket has been marked as a duplicate of ticket 9720 ***