| Summary: | Steer job to partition based on size | ||
|---|---|---|---|
| Product: | Slurm | Reporter: | Matt Ezell <ezellma> |
| Component: | Other | Assignee: | Unassigned Developer <dev-unassigned> |
| Status: | OPEN --- | QA Contact: | |
| Severity: | 5 - Enhancement | ||
| Priority: | --- | ||
| Version: | 18.08.6 | ||
| Hardware: | Linux | ||
| OS: | Linux | ||
| Site: | NOAA | Alineos Sites: | --- |
| Atos/Eviden Sites: | --- | Confidential Site: | --- |
| Coreweave sites: | --- | Cray Sites: | --- |
| DS9 clusters: | --- | HPCnow Sites: | --- |
| HPE Sites: | --- | IBM Sites: | --- |
| NOAA SIte: | ORNL | OCF Sites: | --- |
| Recursion Pharma Sites: | --- | SFW Sites: | --- |
| SNIC sites: | --- | Linux Distro: | --- |
| Machine Name: | CLE Version: | ||
| Version Fixed: | Target Release: | --- | |
| DevPrio: | --- | Emory-Cloud Sites: | --- |
|
Description
Matt Ezell
2019-04-08 06:39:57 MDT
Unless it's explicitly provided, it's not possible to determine (from Slurm) a job's node count at submission time since it will be calculated later. You might be able to write something that would move the job to a different partition later after the node calculation has been made, though. Take a look at the partition spec "Alternate" in slurm.conf: https://slurm.schedmd.com/slurm.conf.html You may be able to use it with your novel partition when using a job filter. You will still need to do your own size calculation to figure out if you have a large job but this spec may help with using a different partition. Thanks Chad. I feared that would be the answer, but it makes sense. I'm going to close this. I would like to reopen this, possibly as an RFE, to understand if it is feasible or not. On heterogeneous clusters, the number of nodes for a job could be dependent on which nodes it gets scheduled to. Would it make sense to calculate the minimum and maximum node count based on the resources available in the cluster? Matt - I do not think we would want to do this at job submission because this would introduce a "smaller scheduler" to determine this, however, we will look into this some and let you know. |