Ticket 3032

Summary: Intel MPI under 16.05
Product: Slurm Reporter: Ade <ade.fewings>
Component: User CommandsAssignee: Jacob Jenson <jacob>
Status: RESOLVED WONTFIX QA Contact:
Severity: 6 - No support contract    
Priority: ---    
Version: 16.05.4   
Hardware: Linux   
OS: Linux   
Site: -Other- Slinky Site: ---
Alineos Sites: --- Atos/Eviden Sites: ---
Confidential Site: --- Coreweave sites: ---
Cray Sites: --- DS9 clusters: ---
Google sites: --- HPCnow Sites: ---
HPE Sites: --- IBM Sites: ---
NOAA SIte: --- NoveTech Sites: ---
Nvidia HWinf-CS Sites: --- OCF Sites: ---
Recursion Pharma Sites: --- SFW Sites: ---
SNIC sites: --- Tzag Elita Sites: ---
Linux Distro: --- Machine Name:
CLE Version: Version Fixed:
Target Release: --- DevPrio: ---
Emory-Cloud Sites: ---

Description Ade 2016-08-30 05:26:50 MDT
Noted a small issue in the behaviour of all versions of Intel MPI (4.1 - 5.1) with Slurm since upgrading to 16.05 from 15.08.  Specifically, if specifying  '--ntasks-per-node' and '--ntasks', we seem to always get a warning from srun:

srun: Warning: can't honor --ntasks-per-node set to 16 which doesn't match the requested tasks 1 with the number of requested nodes 1.  Ignoring --ntasks-per-node.

...which appears to suggest that the job specs are not being passed correctly through to Intel MPI's mpiexec.hydra for job launching, which in turn I think calls srun?

Mentioned this on slurm-dev and one other user confirmed they see the same, and that they also -cpus-per-task is not doing what it should when using OpenMPI, although I haven't tested that.
Comment 1 Jacob Jenson 2017-11-03 15:01:58 MDT
Before this ticket can be sent to the support team we need to put a support contract in place for your site. Please let me know if you would like to discuss Slurm support.
Comment 2 Ade 2017-11-03 15:02:10 MDT
Hello

I'm on annual leave on Friday 3rd November, so will respond next week!

Urgent Support issues should go to the Support Desk support@hpcwales.co.uk

Cheers

Ade

________________________________

   [HPC Wales - www.hpcwales.co.uk] <http://www.hpcwales.co.uk>

________________________________

The contents of this email and any files transmitted with it are confidential and intended solely for the named addressee only.  Unless you are the named addressee (or authorised to receive this on their behalf) you may not copy it or use it, or disclose it to anyone else.  If you have received this email in error, please notify the sender by email or telephone.  All emails sent by High Performance Computing Wales have been checked using an Anti-Virus system.  We would advise you to run your own virus check before opening any attachments received as we will not in any event accept any liability whatsoever, once an email and/or attachment is received.

High Performance Computing Wales is a private limited company incorporated in Wales on 8 March 2010 as company number 07181701.

Our registered office is at Finance Office, Bangor University, Cae Derwen, College Road, Bangor, Gwynedd. LL57 2DG. UK.

High Performance Computing Wales is part funded by the European Regional Development Fund through the Welsh Government.