Ticket 845 - scontrol release job=123 does nothing, but gives no error
Summary: scontrol release job=123 does nothing, but gives no error
Status: RESOLVED FIXED
Alias: None
Product: Slurm
Classification: Unclassified
Component: Other (show other tickets)
Version: 14.11.x
Hardware: Linux Linux
: 3 - Medium Impact
Assignee: David Bigagli
QA Contact:
URL:
Depends on:
Blocks:
 
Reported: 2014-06-01 21:10 MDT by Phil Schwan
Modified: 2014-06-03 09:04 MDT (History)
2 users (show)

See Also:
Site: DownUnder GeoSolutions
Slinky Site: ---
Alineos Sites: ---
Atos/Eviden Sites: ---
Confidential Site: ---
Coreweave sites: ---
Cray Sites: ---
DS9 clusters: ---
Google sites: ---
HPCnow Sites: ---
HPE Sites: ---
IBM Sites: ---
NOAA SIte: ---
NoveTech Sites: ---
Nvidia HWinf-CS Sites: ---
OCF Sites: ---
Recursion Pharma Sites: ---
SFW Sites: ---
SNIC sites: ---
Tzag Elita Sites: ---
Linux Distro: ---
Machine Name:
CLE Version:
Version Fixed: 14.03.4
Target Release: ---
DevPrio: ---
Emory-Cloud Sites: ---


Attachments

Note You need to log in before you can comment on or make changes to this ticket.
Description Phil Schwan 2014-06-01 21:10:00 MDT
running your 666bbe05fe2 + our usual modifications

Maybe because array tasks are in a mixture of states?

# squeue -u davef
PARTITION  PRIORITY   NAME                     USER ST       TIME  NODES NODELIST(REASON JOBID
teambond   400        dp_hal_srme_match       davef CG       0:00     64 clus1149        336247_21
teambond   400        dp_hal_srme_match       davef PD       0:00      1 (Priority)      336247_[13,15-17,19-20,22-816]
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_1
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_2
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_3
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_4
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_5
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_6
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_7
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_8
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_9
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_10
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_11
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_12
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_14
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_18
# scontrol release job=336247
# squeue -u davef
PARTITION  PRIORITY   NAME                     USER ST       TIME  NODES NODELIST(REASON JOBID
teambond   400        dp_hal_srme_match       davef CG       0:00     64 clus1149        336247_21
teambond   400        dp_hal_srme_match       davef PD       0:00      1 (Priority)      336247_[13,15-17,19-20,22-816]
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_1
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_2
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_3
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_4
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_5
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_6
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_7
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_8
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_9
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_10
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_11
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_12
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_14
teambond   0          dp_hal_srme_match       davef SE       0:00      1 (JobHeldUser)   336247_18
Comment 1 Phil Schwan 2014-06-01 21:11:43 MDT
I see!  The problem is actually one of syntax.

"scontrol release job=336247" does nothing, silently.

"scontrol release 336247" works.
Comment 2 David Bigagli 2014-06-03 09:03:15 MDT
The syntax is now consistent allowing 3 types of specification:

scontrol release jobid=150698
scontrol requeuehold state=specialexit jobid=150698

scontrol release job=150698
scontrol requeuehold state=specialexit job=150698

scontrol release 150698
scontrol requeuehold state=specialexit 150698

scontrol requeue jobid=150698
scontrol requeue job=150698
scontrol requeue 150698

Thanks,
       David
Comment 3 David Bigagli 2014-06-03 09:04:13 MDT
Forgot to mention the commit: b1534743250.

David