| Summary: | Cannot get MPI to run via srun (pmix) | ||
|---|---|---|---|
| Product: | Slurm | Reporter: | Mike Jarsulic <mjarsulic> |
| Component: | PMIx | Assignee: | Skyler Malinowski <skyler> |
| Status: | RESOLVED INFOGIVEN | QA Contact: | |
| Severity: | 3 - Medium Impact | ||
| Priority: | --- | CC: | mcmullan |
| Version: | 21.08.6 | ||
| Hardware: | Linux | ||
| OS: | Linux | ||
| Site: | University of Chicago | Alineos Sites: | --- |
| Atos/Eviden Sites: | --- | Confidential Site: | --- |
| Coreweave sites: | --- | Cray Sites: | --- |
| DS9 clusters: | --- | HPCnow Sites: | --- |
| HPE Sites: | --- | IBM Sites: | --- |
| NOAA SIte: | --- | OCF Sites: | --- |
| Recursion Pharma Sites: | --- | SFW Sites: | --- |
| SNIC sites: | --- | Linux Distro: | RHEL |
| Machine Name: | Randi | CLE Version: | |
| Version Fixed: | Target Release: | --- | |
| DevPrio: | --- | Emory-Cloud Sites: | --- |
|
Description
Mike Jarsulic
2022-12-09 09:39:00 MST
Hi Mike, When Slurm is configured and compiled, it must be able to find the PMIx libraries to be able to integrate properly. Either from default path or explicitly set via --with-pmix=PATH. Do you know if the RPMs were created on a machine with PMIx present? Our documentation has some information about [MPI](https://slurm.schedmd.com/mpi_guide.html#pmix) integration that discussed the build process. If it was not built with support, then that would most likely be the cause of the problem. You can also run the following command to see the available plugins to the Slurm client: srun --mpi=list If the old (21.08) RPMs were built without PMIx support, then it may be easier to just build the new (22.05) RPMs with PMIx support and then go from there. -- Skyler Dropping severity, awaiting reply. I will assume this is no longer an issue given the severity and lack of reply and close the ticket. If this is still an issue, please reopen the ticket and I will be more than happy to assist. Thanks, Skyler |