Ticket 17299

Summary: When one array job is pending, run scancel one sub-job, however, I cannot find this sub-job any more
Product: Slurm Reporter: ppju <pan_qu>
Component: User CommandsAssignee: Jacob Jenson <jacob>
Status: OPEN --- QA Contact:
Severity: 6 - No support contract    
Priority: ---    
Version: 21.08.8   
Hardware: Linux   
OS: Linux   
Site: -Other- Slinky Site: ---
Alineos Sites: --- Atos/Eviden Sites: ---
Confidential Site: --- Coreweave sites: ---
Cray Sites: --- DS9 clusters: ---
Google sites: --- HPCnow Sites: ---
HPE Sites: --- IBM Sites: ---
NOAA SIte: --- NoveTech Sites: ---
Nvidia HWinf-CS Sites: --- OCF Sites: ---
Recursion Pharma Sites: --- SFW Sites: ---
SNIC sites: --- Tzag Elita Sites: ---
Linux Distro: CentOS Machine Name:
CLE Version: Version Fixed:
Target Release: --- DevPrio: ---
Emory-Cloud Sites: ---

Description ppju 2023-07-27 03:22:03 MDT
The job submitting script is:
=====================================
#!/bin/sh
#SBATCH --job-name=myarrayjob
#SBATCH --ntasks=1
#SBATCH --cpus-per-task=1
#SBATCH --array=1-10

sleep 10000
====================================

And the get array job 203


Run the following commands:
scancel 203_2
scancel 203_3



Then I cannot find these sub-jobs which have been cancled with squeue and sacct
[slurm@parasvr01 ~]$ squeue -j 203_2
JOBID PARTITION     NAME     USER ST       TIME  NODES NODELIST(REASON)



The scontrol show job 203
===========================
JobId=203 ArrayJobId=203 ArrayTaskId=1,4-10 JobName=myarrayjob
   UserId=slurm(889) GroupId=slurm(889) MCS_label=N/A
   Priority=4294901732 Nice=0 Account=(null) QOS=normal
   JobState=PENDING Reason=Priority Dependency=(null)
   Requeue=1 Restarts=0 BatchFlag=1 Reboot=0 ExitCode=0:0
   RunTime=00:00:00 TimeLimit=UNLIMITED TimeMin=N/A
   SubmitTime=2023-07-27T15:36:42 EligibleTime=2023-07-27T15:36:43
   AccrueTime=2023-07-27T15:36:43
   StartTime=2024-07-26T16:08:04 EndTime=Unknown Deadline=N/A
   SuspendTime=None SecsPreSuspend=0 LastSchedEval=2023-07-27T16:34:49 Scheduler=Backfill:*
   Partition=pay AllocNode:Sid=parasvr01:23279
   ReqNodeList=(null) ExcNodeList=(null)
   NodeList=(null)
   NumNodes=1 NumCPUs=1 NumTasks=1 CPUs/Task=1 ReqB:S:C:T=0:0:*:*
   TRES=cpu=1,mem=1M,node=1,billing=1
   Socks/Node=* NtasksPerN:B:S:C=0:0:*:* CoreSpec=*
   MinCPUsNode=1 MinMemoryNode=0 MinTmpDiskNode=0
   Features=(null) DelayBoot=00:00:00
   OverSubscribe=OK Contiguous=0 Licenses=(null) Network=(null)
   Command=/var/lib/slurm/array.sh
   WorkDir=/var/lib/slurm
   StdErr=/var/lib/slurm/slurm-203_4294967294.out
   StdIn=/dev/null
   StdOut=/var/lib/slurm/slurm-203_4294967294.out
   Power=


Hope the canceled jobs can be displayed, although they have been canceled. Thank you.