| Summary: | Possible to deny jobs with failed MaxTRES association limits? | ||
|---|---|---|---|
| Product: | Slurm | Reporter: | Doug Jacobsen <dmjacobsen> |
| Component: | Configuration | Assignee: | Brian Christiansen <brian> |
| Status: | RESOLVED INFOGIVEN | QA Contact: | |
| Severity: | 3 - Medium Impact | ||
| Priority: | --- | CC: | alex, brian, da, tim |
| Version: | 15.08.2 | ||
| Hardware: | Cray XC | ||
| OS: | Linux | ||
| Site: | NERSC | Slinky Site: | --- |
| Alineos Sites: | --- | Atos/Eviden Sites: | --- |
| Confidential Site: | --- | Coreweave sites: | --- |
| Cray Sites: | --- | DS9 clusters: | --- |
| Google sites: | --- | HPCnow Sites: | --- |
| HPE Sites: | --- | IBM Sites: | --- |
| NOAA SIte: | --- | NoveTech Sites: | --- |
| Nvidia HWinf-CS Sites: | --- | OCF Sites: | --- |
| Recursion Pharma Sites: | --- | SFW Sites: | --- |
| SNIC sites: | --- | Tzag Elita Sites: | --- |
| Linux Distro: | --- | Machine Name: | |
| CLE Version: | Version Fixed: | ||
| Target Release: | --- | DevPrio: | --- |
| Emory-Cloud Sites: | --- | ||
|
Description
Doug Jacobsen
2015-10-28 15:12:38 MDT
nevermind, it appears that setting DenyOnLimit in the job QOS had the effect of denying association limits. Thanks, Doug As you figured out, the DenyOnLimit qos flag will reject a job submission if the job violates a qos or association Max* limit.
brian@compy:~/slurm/15.08/compy$ sacctmgr modify user brian account=test_acct set maxtres=cpu=0
Modified user associations...
C = compy A = test_acct U = brian
Would you like to commit changes? (You have 30 seconds to decide)
(N/y): y
brian@compy:~/slurm/15.08/compy$ sbatch --account=test_acct --qos=test_qos --wrap="hostname"
Submitted batch job 99112
brian@compy:~/slurm/15.08/compy$ squeue
JOBID PARTITION NAME USER ST TIME NODES NODELIST(REASON)
99112 debug wrap brian PD 0:00 1 (AssocMaxCpuPerJobLimit)
brian@compy:~/slurm/15.08/compy$ sacctmgr modify qos test_qos set flags=denyonlimit
Modified qos...
test_qos
Would you like to commit changes? (You have 30 seconds to decide)
(N/y): y
brian@compy:~/slurm/15.08/compy$ sbatch --account=test_acct --qos=test_qos --wrap="hostname"
sbatch: error: Batch job submission failed: Job violates accounting/QOS policy (job submit limit, user's size and/or time limits)
|