| Summary: | could not find partition | ||
|---|---|---|---|
| Product: | Slurm | Reporter: | frank.schluenzen |
| Component: | slurmctld | Assignee: | Dominik Bartkiewicz <bart> |
| Status: | RESOLVED FIXED | QA Contact: | |
| Severity: | 4 - Minor Issue | ||
| Priority: | --- | CC: | bart, sergey.yakubov, sven.sternberger |
| Version: | 17.02.2 | ||
| Hardware: | Linux | ||
| OS: | Linux | ||
| Site: | DESY | Alineos Sites: | --- |
| Atos/Eviden Sites: | --- | Confidential Site: | --- |
| Coreweave sites: | --- | Cray Sites: | --- |
| DS9 clusters: | --- | HPCnow Sites: | --- |
| HPE Sites: | --- | IBM Sites: | --- |
| NOAA SIte: | --- | OCF Sites: | --- |
| Recursion Pharma Sites: | --- | SFW Sites: | --- |
| SNIC sites: | --- | Linux Distro: | --- |
| Machine Name: | CLE Version: | ||
| Version Fixed: | 17.02.7 17.11.0-pre2 | Target Release: | --- |
| DevPrio: | --- | Emory-Cloud Sites: | --- |
| Attachments: | slurm.conf | ||
|
Description
frank.schluenzen
2017-07-24 07:35:57 MDT
Hi Could you send me this job (650600) description? "scontrol show job" is probably not available now but maybe something has been saved in log file or you have batch script. Dominik Hi, I captured the job-header: #!/bin/bash #SBATCH -p cfel #partition #SBATCH -n 128 #SBATCH --time 1-13:00 #time (D-hh:MM) #SBATCH -o 5x5x5_CG.%N.%j.out #STDOUT #SBATCH -e 5x5x5_CG.%N.%j.err #STDERR #SBATCH --job-name 1x1x1_CG #job_name module load mpi/openmpi-x86_64 mpirun `which mdrun_openmpi` ... logs are not available anymore. There where no particular nodes or features requested as far as I can see. Cheers, Frank. > From: bugs@schedmd.com > To: "frank schluenzen" <frank.schluenzen@desy.de> > Sent: Tuesday, 25 July, 2017 12:40:02 > Subject: [Bug 4019] could not find partition > [ mailto:bart@schedmd.com | Dominik Bartkiewicz ] changed [ > https://bugs.schedmd.com/show_bug.cgi?id=4019 | bug 4019 ] > What Removed Added > CC bart@schedmd.com > Assignee support@schedmd.com bart@schedmd.com > [ https://bugs.schedmd.com/show_bug.cgi?id=4019#c1 | Comment # 1 ] on [ > https://bugs.schedmd.com/show_bug.cgi?id=4019 | bug 4019 ] from [ > mailto:bart@schedmd.com | Dominik Bartkiewicz ] > Hi > Could you send me this job (650600) description? > "scontrol show job" is probably not available now but maybe > something has been saved in log file or you have batch script. > Dominik > You are receiving this mail because: > * You reported the bug. > From: bugs@schedmd.com > To: "frank schluenzen" <frank.schluenzen@desy.de> > Sent: Tuesday, 25 July, 2017 12:40:02 > Subject: [Bug 4019] could not find partition > [ mailto:bart@schedmd.com | Dominik Bartkiewicz ] changed [ > https://bugs.schedmd.com/show_bug.cgi?id=4019 | bug 4019 ] > What Removed Added > CC bart@schedmd.com > Assignee support@schedmd.com bart@schedmd.com > [ https://bugs.schedmd.com/show_bug.cgi?id=4019#c1 | Comment # 1 ] on [ > https://bugs.schedmd.com/show_bug.cgi?id=4019 | bug 4019 ] from [ > mailto:bart@schedmd.com | Dominik Bartkiewicz ] > Hi > Could you send me this job (650600) description? > "scontrol show job" is probably not available now but maybe > something has been saved in log file or you have batch script. > Dominik > You are receiving this mail because: > * You reported the bug. Hi Yesterday we fixed some bug which could couse this error. https://github.com/SchedMD/slurm/commit/13b78dd2064c8bc7 This will be in next 17.02 release. Dominik Hi Let me know if this patch solves your issue. Dominik I'm marking this resolved by commit 13b78dd2064c8bc7, which was included in the 17.02.7 maintenance release last week. Please reopen if you're still seeing issues, or there is anything further we can address. - Tim |