Ticket 16422 - jobs containing -hetjob in job-name fail
Summary: jobs containing -hetjob in job-name fail
Status: OPEN
Alias: None
Product: Slurm
Classification: Unclassified
Component: Accounting (show other tickets)
Version: - Unsupported Older Versions
Hardware: All Linux
: 6 - No support contract
Assignee: Jacob Jenson
QA Contact:
URL:
Depends on:
Blocks:
 
Reported: 2023-04-01 00:54 MDT by Jan Streffing
Modified: 2023-04-01 00:54 MDT (History)
0 users

See Also:
Site: -Other-
Alineos Sites: ---
Atos/Eviden Sites: ---
Confidential Site: ---
Coreweave sites: ---
Cray Sites: ---
DS9 clusters: ---
HPCnow Sites: ---
HPE Sites: ---
IBM Sites: ---
NOAA SIte: ---
NoveTech Sites: ---
Nvidia HWinf-CS Sites: ---
OCF Sites: ---
Recursion Pharma Sites: ---
SFW Sites: ---
SNIC sites: ---
Tzag Elita Sites: ---
Linux Distro: ---
Machine Name:
CLE Version:
Version Fixed:
Target Release: ---
DevPrio: ---
Emory-Cloud Sites: ---


Attachments

Note You need to log in before you can comment on or make changes to this ticket.
Description Jan Streffing 2023-04-01 00:54:56 MDT
The following is a valid hetjob sbatch script for slurm 20.11.9-Bull.1.3 on DKRZ levante:

#!/bin/bash
#SBATCH --partition=compute
#SBATCH --time=08:00:00
#SBATCH --job-name=TCO1279-DART-1D
#SBATCH --account=ab0995
#SBATCH --mail-type=NONE
#SBATCH --exclusive
#SBATCH --nodes=10
#SBATCH --partition=compute
#SBATCH hetjob
#SBATCH --nodes=40
#SBATCH --partition=compute
#SBATCH hetjob
#SBATCH --nodes=10
#SBATCH --partition=compute
#SBATCH --constraint=512G 

While 

#!/bin/bash
#SBATCH --partition=compute
#SBATCH --time=08:00:00
#SBATCH --job-name=TCO1279-DART-1D-hetjob
#SBATCH --account=ab0995
#SBATCH --mail-type=NONE
#SBATCH --exclusive
#SBATCH --nodes=10
#SBATCH --partition=compute
#SBATCH hetjob
#SBATCH --nodes=40
#SBATCH --partition=compute
#SBATCH hetjob
#SBATCH --nodes=10
#SBATCH --partition=compute
#SBATCH --constraint=512G 

results in: 
sbatch TCO1279-DART-1D-hetjob_compute_20000101-20000101.run 
sbatch.bin: error: Batch job submission failed: Invalid account or account/partition combination specified

job-names containing the -hetjob within lead to the error.