| Summary: | Wrong CPUs/Task value | ||
|---|---|---|---|
| Product: | Slurm | Reporter: | Stephane Thiell <sthiell> |
| Component: | Scheduling | Assignee: | Dominik Bartkiewicz <bart> |
| Status: | RESOLVED FIXED | QA Contact: | |
| Severity: | 3 - Medium Impact | ||
| Priority: | --- | CC: | akkornel, alex, kilian, sthiell |
| Version: | 17.11.4 | ||
| Hardware: | Linux | ||
| OS: | Linux | ||
| See Also: |
https://bugs.schedmd.com/show_bug.cgi?id=4976 https://bugs.schedmd.com/show_bug.cgi?id=7876 https://bugs.schedmd.com/show_bug.cgi?id=19722 |
||
| Site: | Stanford | Slinky Site: | --- |
| Alineos Sites: | --- | Atos/Eviden Sites: | --- |
| Confidential Site: | --- | Coreweave sites: | --- |
| Cray Sites: | --- | DS9 clusters: | --- |
| Google sites: | --- | HPCnow Sites: | --- |
| HPE Sites: | --- | IBM Sites: | --- |
| NOAA SIte: | --- | NoveTech Sites: | --- |
| Nvidia HWinf-CS Sites: | --- | OCF Sites: | --- |
| Recursion Pharma Sites: | --- | SFW Sites: | --- |
| SNIC sites: | --- | Tzag Elita Sites: | --- |
| Linux Distro: | --- | Machine Name: | |
| CLE Version: | Version Fixed: | 17.11.7 18.08.0pre2 | |
| Target Release: | --- | DevPrio: | --- |
| Emory-Cloud Sites: | --- | ||
|
Description
Stephane Thiell
2018-03-09 11:19:05 MST
Hi Stephane - Can you attach your current slurm.conf file for the cluster? I'm going to see if Dominik can chase down a reason this could happen. If you have a way to trigger this again, it might be helpful if you could attach logs captured while the TraceJobs and Backfill DebugFlags were turned on temporarily. Hi Without data Tim mentioned I can't be sure but I can reproduce similar/same behavior. I am not sure if this is a bug or just effect of submitting to multiple partition with different MaxMemPerCPU. I will look at this and let you know what we can do about it. Dominik Hi Tim and Dominik, Thanks much for looking into this! I just sent you the current slurm.conf by email. Stephane Hi Stephane, this should have been fixed in commit: https://github.com/SchedMD/slurm/commit/bf4cb0b1b01f3e starting from 17.11.7. You can apply it at your earliest convenience by appending ".patch" to the github URL. We're gonna go ahead and mark this as resolved/fixed. Please, reopen if there's any new issue you find after applying it. Thanks. |