Hello, In the past we've allowed some jobs (those that requested it) to be preempted by a reservation (e.g., maintenance or system-exclusive time). Is there any way to do this in SLURM? We are planning on using QOS-based preemption for some things, but would also like to allow reservations to preempt jobs within that preemptable QOS. The goal is to further increase our ability to backfill heading into system-exclusive testing or maintenance periods. Thanks, Doug
When you say "preempted" here, you presumably mean canceled at that time? Suspend + resume or gang scheduling obviously wouldn't work if you're bringing the machine down. Could the jobs simply be submitted with a modest --time-min set? sched/backfill would then try to give them as much time as is available while still starting them as soon as possible, which seems like what you're trying to accomplish, to a limited extent. You may want to combine that (possibly through a job_submit plugin) with a dedicated lower-priority partition to limit churn.
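To make the --time-min suggestion concrete, here is a rough sketch of the idea in Python. It is a simplified model, not Slurm's actual backfill algorithm: the function name granted_minutes and the example values are made up for illustration, and the real scheduler also weighs priorities, node availability, and other queued jobs.

```python
from typing import Optional


def granted_minutes(minutes_until_reservation: int,
                    time_limit: int,
                    time_min: int) -> Optional[int]:
    """Simplified model of how --time-min lets a job fit ahead of a reservation.

    All values are in minutes. Returns the time limit the job could run with
    if started now, or None if it cannot fit before the reservation begins.
    """
    if minutes_until_reservation >= time_limit:
        # The full --time request fits before the reservation starts.
        return time_limit
    if minutes_until_reservation >= time_min:
        # The full request does not fit, but the window is at least
        # --time-min, so the job can run with a shortened limit.
        return minutes_until_reservation
    # Not even --time-min fits; the job waits until after the reservation.
    return None


# Corresponds roughly to a submission such as:
#   sbatch --time=12:00:00 --time-min=01:00:00 job.sh
for gap in (900, 300, 30):
    print(gap, granted_minutes(gap, time_limit=720, time_min=60))
```

The point of the sketch is that a job with a small --time-min can still start when only a short window remains before maintenance, whereas a job with only a fixed --time would have to wait until the reservation ends.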
I think this is a great solution -- I'll work with our user services folks to get users to do this! Thanks, Doug