Summary: | mpi_launch time out, unless sbcast used prior - request barrier or equiv. like Cray alps did | ||
---|---|---|---|
Product: | Slurm | Reporter: | S Senator <sts> |
Component: | Cray ALPS | Assignee: | Unassigned Developer <dev-unassigned> |
Status: | OPEN --- | QA Contact: | |
Severity: | 5 - Enhancement | ||
Priority: | --- | CC: | fullop, lena |
Version: | 17.02.6 | ||
Hardware: | Cray XC | ||
OS: | Linux | ||
Site: | LANL | Alineos Sites: | --- |
Atos/Eviden Sites: | --- | Confidential Site: | --- |
Coreweave sites: | --- | Cray Sites: | --- |
DS9 clusters: | --- | Google sites: | --- |
HPCnow Sites: | --- | HPE Sites: | --- |
IBM Sites: | --- | NOAA SIte: | --- |
NoveTech Sites: | --- | Nvidia HWinf-CS Sites: | --- |
OCF Sites: | --- | Recursion Pharma Sites: | --- |
SFW Sites: | --- | SNIC sites: | --- |
Tzag Elita Sites: | --- | Linux Distro: | --- |
Machine Name: | CLE Version: | ||
Version Fixed: | Target Release: | --- | |
DevPrio: | --- | Emory-Cloud Sites: | --- |
Description
S Senator
2017-07-18 10:47:07 MDT
Did you see at srun's --bcast option? The application does not launch until after the file transfer completes. Copy executable file to allocated compute nodes. If a file name is specified, copy the executable to the specified destination file path. If no path is specified, copy the file to a file named "slurm_bcast_<job_id>.<step_id>" in the current working. For example, "srun --bcast=/tmp/mine -N3 a.out" will copy the file "a.out" from your current directory to the file "/tmp/mine" on each of the three allocated compute nodes and execute that file. This option applies to step allocations. Have you been able to try the --bcast option that Moe mentioned? So we have used bcast in its various incarnations but many user applications have too many file requirements to make this practical. We have been profiling our launches and have recently identified some problems in the configuration of the environment and we are starting to resolve those issues. We expect this to help, but do not know if it will be enough to stabilize our launches. The discussions here have centered on the need for a synchronization mechanism similar to how Alps implemented it. We understand that this could be rather involved to implement, but wanted to communicate where we are at on this currently. We also realize that there are ways to address file system contention. But since that is not the only delay-inducing variable at work in job launches, a more encompassing solution for synchronization might be necessary. We will know more and report as our environment work progresses. Marking this as an enhancement. Please let keep us posted as you get you new information. |