| Summary: | Job allocations seem to span disjoint networks, despite using topology/tree | ||
|---|---|---|---|
| Product: | Slurm | Reporter: | Kilian Cavalotti <kilian> |
| Component: | Scheduling | Assignee: | Dominik Bartkiewicz <bart> |
| Status: | RESOLVED FIXED | QA Contact: | |
| Severity: | 4 - Minor Issue | ||
| Priority: | --- | CC: | bart |
| Version: | 17.11.4 | ||
| Hardware: | Linux | ||
| OS: | Linux | ||
| Site: | Stanford | Slinky Site: | --- |
| Alineos Sites: | --- | Atos/Eviden Sites: | --- |
| Confidential Site: | --- | Coreweave sites: | --- |
| Cray Sites: | --- | DS9 clusters: | --- |
| Google sites: | --- | HPCnow Sites: | --- |
| HPE Sites: | --- | IBM Sites: | --- |
| NOAA SIte: | --- | NoveTech Sites: | --- |
| Nvidia HWinf-CS Sites: | --- | OCF Sites: | --- |
| Recursion Pharma Sites: | --- | SFW Sites: | --- |
| SNIC sites: | --- | Tzag Elita Sites: | --- |
| Linux Distro: | --- | Machine Name: | |
| CLE Version: | Version Fixed: | 17.11.5 | |
| Target Release: | --- | DevPrio: | --- |
| Emory-Cloud Sites: | --- | ||
|
Description
Kilian Cavalotti
2018-03-02 12:41:04 MST
Hi Yes, that is expected when TopoOptional is used, unless jobs request for some switches. Dominik Hi Dominik, (In reply to Dominik Bartkiewicz from comment #1) > Yes, that is expected when TopoOptional is used, > unless jobs request for some switches. Ah I see, thanks! Do you think it would be useful to add some clarification to the documentation? The TopoOptional description in the slurm.conf man page doesn't mention anything about disjoint networks, it would probably be worth a mention that using this option could span jobs over disjoint networks. Same thing for topology.conf something like: "no job will span leaf switches without a common parent (unless the TopologyParam=TopoOptional option is used)." Thanks! -- Kilian Hi As you suggested we added this info to doc https://github.com/SchedMD/slurm/commit/2d09a777443ded4b1 It is in 17.11.5 and up. Dominik (In reply to Dominik Bartkiewicz from comment #5) > Hi > > As you suggested we added this info to doc > https://github.com/SchedMD/slurm/commit/2d09a777443ded4b1 > It is in 17.11.5 and up. Great, thanks! Cheers, -- Kilian |