Summary: | GrpSubmit exceeded with job arrays | ||
---|---|---|---|
Product: | Slurm | Reporter: | Ryan Cox <ryan_cox> |
Component: | Limits | Assignee: | Moe Jette <jette> |
Status: | RESOLVED FIXED | QA Contact: | |
Severity: | 4 - Minor Issue | ||
Priority: | --- | CC: | da |
Version: | 2.6.x | ||
Hardware: | Linux | ||
OS: | Linux | ||
Site: | BYU - Brigham Young University | Alineos Sites: | --- |
Atos/Eviden Sites: | --- | Confidential Site: | --- |
Coreweave sites: | --- | Cray Sites: | --- |
DS9 clusters: | --- | HPCnow Sites: | --- |
HPE Sites: | --- | IBM Sites: | --- |
NOAA SIte: | --- | NoveTech Sites: | --- |
Nvidia HWinf-CS Sites: | --- | OCF Sites: | --- |
Recursion Pharma Sites: | --- | SFW Sites: | --- |
SNIC sites: | --- | Tzag Elita Sites: | --- |
Linux Distro: | --- | Machine Name: | |
CLE Version: | Version Fixed: | 2.6.6 | |
Target Release: | --- | DevPrio: | --- |
Emory-Cloud Sites: | --- | ||
Attachments: | grpsubmit exceeded |
I have been able to reproduce this problem. There are use cases that definitely fail. Fixed with one line change. This will be in version 2.6.6 when released, probably within a day or two. The commit with the fix is shown below. Append ".patch" to the URL to generate a patch file. https://github.com/SchedMD/slurm/commit/9469053d55233c208e58dedff5a9fae5ce29f0a2 Thanks for the quick fix. We'll upgrade when 2.6.6 is released. |
Created attachment 627 [details] grpsubmit exceeded It seems that users can get around GrpSubmit limits due to job arrays. I attached a file showing a failed attempt to exceed the job limit first without job arrays then a successful attempt with job arrays. We're on 2.6.4 8fba61066b30a4874632c59b770b9ed46fb5adf1. We have seen other accounts do this as well, not just staff.