Summary: | Enhance energy, power and time data in slurm.h | ||
---|---|---|---|
Product: | Slurm | Reporter: | Robert Schoene <robert.schoene> |
Component: | Accounting | Assignee: | Unassigned Developer <dev-unassigned> |
Status: | OPEN --- | QA Contact: | |
Severity: | 5 - Enhancement | ||
Priority: | --- | CC: | robert.schoene, thomas.cadeau, yiannis.georgiou |
Version: | 15.08.x | ||
Hardware: | All | ||
OS: | All | ||
Site: | Universitat Dresden (Germany) | Alineos Sites: | --- |
Atos/Eviden Sites: | --- | Confidential Site: | --- |
Coreweave sites: | --- | Cray Sites: | --- |
DS9 clusters: | --- | HPCnow Sites: | --- |
HPE Sites: | --- | IBM Sites: | --- |
NOAA SIte: | --- | NoveTech Sites: | --- |
Nvidia HWinf-CS Sites: | --- | OCF Sites: | --- |
Recursion Pharma Sites: | --- | SFW Sites: | --- |
SNIC sites: | --- | Tzag Elita Sites: | --- |
Linux Distro: | --- | Machine Name: | |
CLE Version: | Version Fixed: | ||
Target Release: | --- | DevPrio: | --- |
Emory-Cloud Sites: | --- | ||
Attachments: | Change type of energy variables from uint32 to uint64 |
Description
Robert Schoene
2014-11-12 22:29:24 MST
Created attachment 1980 [details]
Change type of energy variables from uint32 to uint64
Changes are made in:
-slurm.h structures
-acct_gather_energy plugins
-ext_sensors plugins
-pack/unpack functions with new version checks
There is very few changes in accounting parts since energy are already saved as uint64 or double.
I update also inside src/common/slurm_topology.h in strucure switch_record.
But I don't see code using this variable.
Thomas, does it makes sense to change the watts variables as well? On June 19, 2015 4:40:58 AM PDT, bugs@schedmd.com wrote: >http://bugs.schedmd.com/show_bug.cgi?id=1258 > >Thomas Cadeau <thomas.cadeau@bull.net> changed: > > What |Removed |Added >---------------------------------------------------------------------------- > CC| |thomas.cadeau@bull.net > >--- Comment #1 from Thomas Cadeau <thomas.cadeau@bull.net> --- >Created attachment 1980 [details] > --> http://bugs.schedmd.com/attachment.cgi?id=1980&action=edit >Change type of energy variables from uint32 to uint64 > >Changes are made in: > -slurm.h structures > -acct_gather_energy plugins > -ext_sensors plugins > -pack/unpack functions with new version checks > >There is very few changes in accounting parts since energy are already >saved as >uint64 or double. > >I update also inside src/common/slurm_topology.h in strucure >switch_record. >But I don't see code using this variable. > >-- >You are receiving this mail because: >You are on the CC list for the bug. >You are the assignee for the bug. I don't think so, there is no sum on watts for several nodes, and I don't think we will see nodes, or other equpments with more than 4.29 GJ. Or maybe on a complete cluster if we want to sum all powers. That makes since. The only reason I bring it up is because that is the way it is stored in the HDF5 code. On 06/19/15 06:24, bugs@schedmd.com wrote: > > *Comment # 3 <http://bugs.schedmd.com/show_bug.cgi?id=1258#c3> on bug > 1258 <http://bugs.schedmd.com/show_bug.cgi?id=1258> from Thomas Cadeau > <mailto:thomas.cadeau@bull.net> * > I don't think so, there is no sum on watts for several nodes, and I don't think > we will see nodes, or other equpments with more than 4.29 GJ. > Or maybe on a complete cluster if we want to sum all powers. > ------------------------------------------------------------------------ > You are receiving this mail because: > > * You are on the CC list for the bug. > * You are the assignee for the bug. > This has been added. Is the time_t still an issue here? If not I'll close the bug. Thomas, Robert, any input? Sorry for the late reply, I've been at paternity leave. As I see it, the initial problem is still there. There is no way to represents time stamps with a finer granularity than 1 second, which means that bug #2 (initial post) is still there. There is also no way to represent a more fine grained power or energy measurement. The only bug that is tackled is the overflow. I have a couple of questions regarding the request for finer grained timestamps. 1. What does having finer grained timestamps buy? The total energy will still be the same. And the hd5 profile will just show a little more energy for that time interval. 2. The hdf5 profile timestamps are in seconds. How would you expect this to be presented if it was in ms? Thanks, Brian If I remember correcty, the energy value in SLURM acct_gather_energy.ipmi is calculated as power times time. As I described in issue (2) a 1-second granularity leads to a lower precision on energy values. I have no problem to store the power information at a per second scale in HDF. It's just about the energy values. Regarding question 1: The energy will only be the same for workflows which provide a constant power consumption over time. When the power consumption varies, the result is unpredictable. Regarding question 2: It's ok for me that SLURM-written hdf files use seconds. Thats the external representation. I only want the INTERNAL resolution to be higher. Still, I would like to have a higher resolution in the external representation as well, but that might cause some issues with tools depending on the current data format, so I understand if that's not feasible. |