| Summary: | reconfigure while jobs are running leads to leaky TRESRunMins data | ||
|---|---|---|---|
| Product: | Slurm | Reporter: | Luke Yeager <lyeager> |
| Component: | Accounting | Assignee: | Scott Hilton <scott> |
| Status: | RESOLVED FIXED | QA Contact: | |
| Severity: | 2 - High Impact | ||
| Priority: | --- | ||
| Version: | 20.02.3 | ||
| Hardware: | Linux | ||
| OS: | Linux | ||
| See Also: | https://bugs.schedmd.com/show_bug.cgi?id=9356 | ||
| Site: | NVIDIA (PSLA) | Alineos Sites: | --- |
| Atos/Eviden Sites: | --- | Confidential Site: | --- |
| Coreweave sites: | --- | Cray Sites: | --- |
| DS9 clusters: | --- | HPCnow Sites: | --- |
| HPE Sites: | --- | IBM Sites: | --- |
| NOAA SIte: | --- | OCF Sites: | --- |
| Recursion Pharma Sites: | --- | SFW Sites: | --- |
| SNIC sites: | --- | Linux Distro: | --- |
| Machine Name: | CLE Version: | ||
| Version Fixed: | 20.02.4 20.11.0pre1 | Target Release: | --- |
| DevPrio: | --- | Emory-Cloud Sites: | --- |
| Attachments: |
job430139
Reconfigure Issue Fix v1 |
||
|
Description
Luke Yeager
2020-07-28 10:40:53 MDT
Increasing to Sev2. We don't have a good workaround for this. Luke, I've been able to reproduce the issue like Ben did and will be looking into it. -Scott Created attachment 15260 [details]
Reconfigure Issue Fix v1
Luke,
Here is a patch that fixes the issue. You are free to try it and if you do, let me know if it works on your setup.
-Scott
Created attachment 15265 [details]
Reconfigure Issue Fix v2
Targeted at 20.02
Luke, The fix will be included in the 20.02.4 release which is coming up soon. This is the commit ID: 8f28de91efa07984020b247f272738a93e4dd5f8 Take care, Scott Verified as fixed in 20.02.4. Thanks! |