This issue may be related to bug 20207.

We have enabled AcctGatherEnergyType=acct_gather_energy/ipmi in slurm.conf, and I started to test some accounting with sacct printing the ConsumedEnergyRaw values. The ConsumedEnergyRaw values mostly look sensible, but we have found some empty values, as in this example:

/usr/bin/sacct --user <omitted> --partition a100 -np -X -S 061424 -E 061924 -o JobID,Group,Partition,AllocNodes,AllocCPUS,Submit,Eligible,Start,End,CPUTimeRAW,State,Nodelist,ConsumedEnergyRaw -s to

7393882|catvip|a100|1|32|14-Jun-2024_15:02|14-Jun-2024_15:02|14-Jun-2024_15:02|14-Jun-2024_15:14|23104|TIMEOUT|sd651||
7393903|catvip|a100|1|32|14-Jun-2024_15:16|14-Jun-2024_15:16|14-Jun-2024_15:16|14-Jun-2024_19:16|461088|TIMEOUT|sd651||
7403327|catvip|a100|1|32|17-Jun-2024_12:47|17-Jun-2024_12:47|17-Jun-2024_13:03|18-Jun-2024_13:04|2765216|TIMEOUT|sd652||

This was found both with 23.11.7 and after upgrading to 23.11.8 at this time:

# rpm -qi slurm-slurmd | grep Install
Install Date: Mon 17 Jun 2024 11:37:04 AM CEST

Other jobs on these nodes report non-empty values for ConsumedEnergyRaw. I can't see any reason offhand for this behavior.
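For anyone who wants to list the affected jobs quickly, here is a minimal sketch (not part of Slurm) that scans the parsable sacct output for records with an empty ConsumedEnergyRaw field. It assumes the exact -o field list used in the command above and the trailing "|" that -p appends to each line. The names FIELDS, SAMPLE and jobs_missing_energy are made up for this illustration.

```python
# Field order must match the -o list given to sacct in the report above.
FIELDS = [
    "JobID", "Group", "Partition", "AllocNodes", "AllocCPUS",
    "Submit", "Eligible", "Start", "End", "CPUTimeRAW",
    "State", "Nodelist", "ConsumedEnergyRaw",
]

# The three records quoted in the report.
SAMPLE = """\
7393882|catvip|a100|1|32|14-Jun-2024_15:02|14-Jun-2024_15:02|14-Jun-2024_15:02|14-Jun-2024_15:14|23104|TIMEOUT|sd651||
7393903|catvip|a100|1|32|14-Jun-2024_15:16|14-Jun-2024_15:16|14-Jun-2024_15:16|14-Jun-2024_19:16|461088|TIMEOUT|sd651||
7403327|catvip|a100|1|32|17-Jun-2024_12:47|17-Jun-2024_12:47|17-Jun-2024_13:03|18-Jun-2024_13:04|2765216|TIMEOUT|sd652||
"""


def jobs_missing_energy(sacct_output):
    """Return (JobID, Nodelist) pairs whose ConsumedEnergyRaw is empty."""
    missing = []
    for line in sacct_output.splitlines():
        line = line.strip()
        if not line:
            continue
        values = line.split("|")
        # With -p every line ends in "|", which yields one extra empty item.
        if len(values) == len(FIELDS) + 1 and values[-1] == "":
            values = values[:-1]
        if len(values) != len(FIELDS):
            # Skip lines that do not match the expected layout.
            continue
        record = dict(zip(FIELDS, values))
        if record["ConsumedEnergyRaw"] == "":
            missing.append((record["JobID"], record["Nodelist"]))
    return missing


if __name__ == "__main__":
    for job_id, node in jobs_missing_energy(SAMPLE):
        print(job_id, node)
```

In practice the sacct output would be piped in or read from a file instead of using SAMPLE; grouping the result by node may help show whether the empty values cluster on particular hosts.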
Hi Ole,

If you don't mind, let's work on ticket 20207, which seems to be the same issue. I am closing this one and responding in the other bug.

*** This ticket has been marked as a duplicate of ticket 20207 ***
I'm out of the office, back on June 24.

Best regards,
Ole Holm Nielsen