Summary: | On UV SLURM_CPU_BIND_LIST and SLURM_CPU_BIND are too big | ||
---|---|---|---|
Product: | Slurm | Reporter: | Josko Plazonic <plazonic> |
Component: | slurmd | Assignee: | David Bigagli <david> |
Status: | RESOLVED FIXED | QA Contact: | |
Severity: | 3 - Medium Impact | ||
Priority: | --- | CC: | da |
Version: | 2.6.6 | ||
Hardware: | Linux | ||
OS: | Linux | ||
Site: | Princeton (PICSciE) | Alineos Sites: | --- |
Atos/Eviden Sites: | --- | Confidential Site: | --- |
Coreweave sites: | --- | Cray Sites: | --- |
DS9 clusters: | --- | HPCnow Sites: | --- |
HPE Sites: | --- | IBM Sites: | --- |
NOAA SIte: | --- | NoveTech Sites: | --- |
Nvidia HWinf-CS Sites: | --- | OCF Sites: | --- |
Recursion Pharma Sites: | --- | SFW Sites: | --- |
SNIC sites: | --- | Tzag Elita Sites: | --- |
Linux Distro: | --- | Machine Name: | |
CLE Version: | Version Fixed: | 14.03.1 | |
Target Release: | --- | DevPrio: | --- |
Emory-Cloud Sites: | --- | ||
Attachments: | Move size test earlier in setenvf or it doesn't work |
Description
Josko Plazonic
2014-03-14 15:28:06 MDT
These bitmaps identify which CPUs are associated with each task/rank of the application, all in a single environment variable. That works fine for most systems, but not so well for nodes with huge CPU counts. The solution that comes to mind for me is to create a different environment variable for each rank. That means Slurm will need to set a large number of environment variables, but its far more scalable. SLURM_CPU_BIND_RANK_1 SLURM_CPU_BIND_RANK_2 etc. There are equivalent bitmaps for memory binding also which would need the same sort of modification. That sounds great. If you don't have time to incorporate that solution in the next 14.* release we'd be also happy with something like: if ((strlen(str_bind_list)+strlen("SLURM_CPU_BIND_LIST="))>=MAX_ARG_STRLEN)) { notice("SLURM_CPU_BIND_LIST too big, skipping adding to environment"); } else { // define it } ... (In reply to Josko Plazonic from comment #2) > That sounds great. > > If you don't have time to incorporate that solution in the next 14.* release > we'd be also happy with something like: > > if ((strlen(str_bind_list)+strlen("SLURM_CPU_BIND_LIST="))>=MAX_ARG_STRLEN)) > { > notice("SLURM_CPU_BIND_LIST too big, skipping adding to environment"); > } else { > // define it > } > ... I doubt we will have time to get this into the next release, but here is a patch as you describe below to at least prevent the exec from failing. This will be in the next release. https://github.com/SchedMD/slurm/commit/34a883e0557638774132cb68b54ab88f74ec7e79.patch Almost - please correct me if I am reading this wrong - but with this change setenvf will now return ENOMEM which is non zero and therefore if (setenvf(&env->env, "SLURM_CPU_BIND_LIST", str_bind_list)) { error("Unable to set SLURM_CPU_BIND_LIST"); rc = SLURM_FAILURE; } will now be triggered and we'll have a SLURM_FAILURE. In our view this is a non fatal error, at least with these vars, and we need the execution to continue... (In reply to Josko Plazonic from comment #4) > Almost - please correct me if I am reading this wrong - but with this change > setenvf will now return ENOMEM which is non zero and therefore > if (setenvf(&env->env, "SLURM_CPU_BIND_LIST", > str_bind_list)) { > error("Unable to set SLURM_CPU_BIND_LIST"); > rc = SLURM_FAILURE; > } > will now be triggered and we'll have a SLURM_FAILURE. In our view this is a > non fatal error, at least with these vars, and we need the execution to > continue... This code only logs the event, so the user sees it, but the job will still run (albeit without specific environment variables). Created attachment 738 [details]
Move size test earlier in setenvf or it doesn't work
We tested today on our SGI UV 14.03 and we still couldn't launch large scale jobs (300+ CPUs). Same error. Turns out that we need to move the overflow test earlier - right at the start. Once there it works and despite warning, exactly as you told us, jobs start and work. Attached patch is what we used. Hvala Jozko, this is the right fix I think. It will be in release 14.03.1 commit 407a814b4b5. David |