Summary: | Building Slurm RPMs with PMIx fails | ||
---|---|---|---|
Product: | Slurm | Reporter: | VUB HPC <hpcadmin> |
Component: | Build System and Packaging | Assignee: | Oriol Vilarrubi <jvilarru> |
Status: | RESOLVED FIXED | QA Contact: | |
Severity: | 4 - Minor Issue | ||
Priority: | --- | CC: | alex, matt, ward.poelmans |
Version: | 20.11.7 | ||
Hardware: | Linux | ||
OS: | Linux | ||
See Also: |
https://bugs.schedmd.com/show_bug.cgi?id=9293 https://bugs.schedmd.com/show_bug.cgi?id=11111 |
||
Site: | VUB | Alineos Sites: | --- |
Atos/Eviden Sites: | --- | Confidential Site: | --- |
Coreweave sites: | --- | Cray Sites: | --- |
DS9 clusters: | --- | HPCnow Sites: | --- |
HPE Sites: | --- | IBM Sites: | --- |
NOAA SIte: | --- | NoveTech Sites: | --- |
Nvidia HWinf-CS Sites: | --- | OCF Sites: | --- |
Recursion Pharma Sites: | --- | SFW Sites: | --- |
SNIC sites: | --- | Tzag Elita Sites: | --- |
Linux Distro: | CentOS | Machine Name: | |
CLE Version: | Version Fixed: | 21.08.1 | |
Target Release: | --- | DevPrio: | --- |
Emory-Cloud Sites: | --- |
Description
VUB HPC
2021-07-06 04:26:44 MDT
Hello, The autodetect not working is intended behavior as we think that is better to only autodetect the required things like munge and that if you want autodetect to happen with something optional like pmix (that maybe for some reason you have it installed in the compilation node but not on the compute nodes) you specify the --with pmix As you said setting --with pmix did not worked either, but it was not because of the configure but because of the slurm.spec file. As you can see here, the pmix installation gets detected byt the configure script: [jvilarru@centos slurm-build]$ ../slurm-21.08.0-0rc2/configure --with-pmix 2>/dev/null | grep "pmix installation" checking for pmix installation... /usr The problem is that there was an error while generating the "Requires" tag for the slurmd package, that was caused due to some variable that was unescaped in the slurm.spec file. I'm working on testing that the fix I made can pass all the testing, and when that happens I will tell you in which slurm version it will ship. Thanks for the feedback. I just checked again and the execution of `configure --with-pmix` on our side does not work as described $./configure --with-pmix [...] checking for pmix installation... configure: WARNING: unable to locate pmix installation [...] This is with version 20.11.8 (commit 15a9f49). Looking at the configure script it actually seems that this is the intended behavior for this version though. I see a list of default paths for PMIx in the configure script 21686: _x_ac_pmix_dirs="/usr /usr/local" However, those are only used for executions of `configure` without "--with-pmix". Using `configure --with-pmix` gets a "yes" passed down as argument of --with-pmix, which replaces the default paths in $_x_ac_pmix_dirs with the string "yes". Hence, the search for PMIx is carried out under the prefix "yes/" and fails. Hello, You're completely right, I was checking with the 21.08rc2 version not with 20.11. The configure script has been modified in regards to pmix in this commit https://github.com/SchedMD/slurm/commit/e8036c5adb0585b7af208d1e2eb3bcd9afc687f5 This commit went into the version 21.08.0 and hopefully, the fix for the requires in the .rpm will also get into 21.08 but I cannot tell you in which minor version. I'll keep you updated. *** Ticket 9293 has been marked as a duplicate of this ticket. *** Hello, The fix for the "Requires" tag in slurmd for pmix and ucx is present already in the Slurm master branch, and it will ship with Slurm 21.08.1 version. So in version 21.08.01 you will find the problems you stated resolved, from the configure fix that comes in 21.08.0, commit e8036c5adb0585b7af208d1e2eb3bcd9afc687f5 and the fix for the Requires, coming from commit 5f58273fc5c9e8b30655837fe0493099628a5b4c. I'll close this ticket as fixed, but if you find that the it does not work properly in your system, do not hesitate to re-open this bug by simply replying to this comment or creating a new one. |