| Summary: | [NNL] power management plugin for intel | ||
|---|---|---|---|
| Product: | Slurm | Reporter: | Eric Hausgaard <eric.hausgaard> |
| Component: | Configuration | Assignee: | Ben Roberts <ben> |
| Status: | RESOLVED INFOGIVEN | QA Contact: | |
| Severity: | 4 - Minor Issue | ||
| Priority: | --- | ||
| Version: | 23.02.1 | ||
| Hardware: | Linux | ||
| OS: | Linux | ||
| Site: | UNNPP | Alineos Sites: | --- |
| Atos/Eviden Sites: | --- | Confidential Site: | --- |
| Coreweave sites: | --- | Cray Sites: | --- |
| DS9 clusters: | --- | HPCnow Sites: | --- |
| HPE Sites: | --- | IBM Sites: | --- |
| NOAA SIte: | --- | OCF Sites: | --- |
| Recursion Pharma Sites: | --- | SFW Sites: | --- |
| SNIC sites: | --- | Linux Distro: | RHEL |
| Machine Name: | NNL | CLE Version: | |
| Version Fixed: | Target Release: | --- | |
| DevPrio: | --- | Emory-Cloud Sites: | --- |
|
Description
Eric Hausgaard
2023-05-03 08:54:16 MDT
Hi Eric, We do have the ability to power up and down nodes with power management scripts at different times. I think the page of the documentation that would be more useful to you is the Slurm Power Saving Guide: https://slurm.schedmd.com/power_save.html The SuspendProgram and ResumeProgram allow you to write your own scripts that can make calls to IPMI to control the nodes. This page should give you a good starting point, but if you have any questions about anything please let me know. Thanks, Ben I think I was looking for this instead: https://slurm.schedmd.com/slurm.conf.html#OPT_AcctGatherEnergyType Ah, sorry I misunderstood what you were looking for. I thought you were looking for a way to power nodes on and off rather than collecting statistics about their power usage. I'm glad you found what you wanted. Let me know if there's anything else I can do to help or if this ticket is ok to close. Thanks, Ben I was looking for power statistics, but having the ability to shut down a node with IPMI would be useful. Not that I think I'd be able to use it in production. Hi Eric, Powering on and off nodes is an option if you ever think it might be feasible in your environment. I'll go ahead and close this ticket for now. Let us know if there is anything else we can do to help. Thanks, Ben |