Please provide the equivalent of: $ scontrol show topology but for a specific job: $ scontrol show topology -j <jobid> We are trying to define a set of processes to test and automate verification of the slurmsmwd features as implemented in ticket bug#5285. Is there a means to collect the communications tree for a given job? The purpose of this request is to satisfy a goal regarding resilient codes and regression tests. Our plan is to systematically disable specific nodes in the communication tree from leaf nodes up to root nodes and verify that the slurmsmwd correctly detects that the node is unavailable and communicates this state to the slurmctld. A minimal test would disable a random node within the job allocation, avoiding leaf nodes in the communication tree. Does such a test already exist in a (private?) branch of the slurm verification test suite? Thank you.
> We are trying to define a set of processes to test and automate verification > of the slurmsmwd features as implemented in ticket bug#5285. Is there a > means to collect the communications tree for a given job? The purpose of > this request is to satisfy a goal regarding resilient codes and regression > tests. The topology plugin is, in the absence of RoutePlugin=route/topology, not related to the communication pattern used by slurmctld / slurmd. If you'd like such a diagnostic capability, that's something that we can look at addressing under a development contract, although I'd need some time to see if this is desirable from our side, and write up an appropriate SoW. > Our plan is to systematically disable specific nodes in the communication > tree from leaf nodes up to root nodes and verify that the slurmsmwd > correctly detects that the node is unavailable and communicates this state > to the slurmctld. A minimal test would disable a random node within the job > allocation, avoiding leaf nodes in the communication tree. Does such a test > already exist in a (private?) branch of the slurm verification test suite. No.