Ticket 16641 - [NNL] power management plugin for intel
Summary: [NNL] power management plugin for intel
Status: RESOLVED INFOGIVEN
Alias: None
Product: Slurm
Classification: Unclassified
Component: Configuration (show other tickets)
Version: 23.02.1
Hardware: Linux Linux
: 4 - Minor Issue
Assignee: Ben Roberts
QA Contact:
URL:
Depends on:
Blocks:
 
Reported: 2023-05-03 08:54 MDT by Eric Hausgaard
Modified: 2023-05-04 08:59 MDT (History)
0 users

See Also:
Site: UNNPP
Slinky Site: ---
Alineos Sites: ---
Atos/Eviden Sites: ---
Confidential Site: ---
Coreweave sites: ---
Cray Sites: ---
DS9 clusters: ---
Google sites: ---
HPCnow Sites: ---
HPE Sites: ---
IBM Sites: ---
NOAA SIte: ---
NoveTech Sites: ---
Nvidia HWinf-CS Sites: ---
OCF Sites: ---
Recursion Pharma Sites: ---
SFW Sites: ---
SNIC sites: ---
Tzag Elita Sites: ---
Linux Distro: RHEL
Machine Name: NNL
CLE Version:
Version Fixed:
Target Release: ---
DevPrio: ---
Emory-Cloud Sites: ---


Attachments

Note You need to log in before you can comment on or make changes to this ticket.
Description Eric Hausgaard 2023-05-03 08:54:16 MDT
I reviewed https://slurm.schedmd.com/power_mgmt.html and it indicates there is only a plugin for cray systems.  is there a power plugin for intel based CPUs and IPMI?
Comment 1 Ben Roberts 2023-05-03 09:30:15 MDT
Hi Eric,

We do have the ability to power up and down nodes with power management scripts at different times.  I think the page of the documentation that would be more useful to you is the Slurm Power Saving Guide:
https://slurm.schedmd.com/power_save.html

The SuspendProgram and ResumeProgram allow you to write your own scripts that can make calls to IPMI to control the nodes.  This page should give you a good starting point, but if you have any questions about anything please let me know.

Thanks,
Ben
Comment 2 Eric Hausgaard 2023-05-03 10:56:05 MDT
I think I was looking for this instead:
https://slurm.schedmd.com/slurm.conf.html#OPT_AcctGatherEnergyType
Comment 3 Ben Roberts 2023-05-03 13:28:29 MDT
Ah, sorry I misunderstood what you were looking for.  I thought you were looking for a way to power nodes on and off rather than collecting statistics about their power usage.  I'm glad you found what you wanted.  Let me know if there's anything else I can do to help or if this ticket is ok to close.

Thanks,
Ben
Comment 4 Eric Hausgaard 2023-05-04 05:15:13 MDT
I was looking for power statistics,  but having the ability to shut down a node with IPMI would be useful. Not that I think I'd be able to use it in production.
Comment 5 Ben Roberts 2023-05-04 08:59:19 MDT
Hi Eric,

Powering on and off nodes is an option if you ever think it might be feasible in your environment.  I'll go ahead and close this ticket for now.  Let us know if there is anything else we can do to help.

Thanks,
Ben