Intel MPI version 4.1.3, Slurm version 2.6.9-1
I am trying to follow the Intel MPI documentation to run a job under Slurm with -bootstrap jmi, but I
get the error messages shown below:
After salloc -N 1:
export I_MPI_HYDRA_JMI_LIBRARY=/opt/intel/impi/4.1.3/lib/intel64/lib/libjmi_slurm.so
mpiexec.hydra -bootstrap slurm -n 2 hostname   # this works
mpiexec.hydra -bootstrap jmi -n 2 hostname     # this does not work
srun: error: Unable to create job step: Requested node configuration is not available
srun: error: Unable to create job step: Requested node configuration is not available
If I look at the Slurm logs, it is trying to get a node assignment for the FQDN of the node, even though
I only use short names in slurm.conf. I am not sure if this has anything to do with the JMI/Slurm interaction.
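A minimal sketch of how the name mismatch could be checked from inside the allocation: it prints the node's FQDN and short name next to the node names Slurm reports for the job. The helper short_name is a hypothetical name introduced here for illustration; the Slurm part assumes scontrol is on the PATH and SLURM_JOB_NODELIST is set (as it normally is under salloc), and is skipped otherwise.

```shell
#!/bin/sh
# Hypothetical helper: strip the domain part from a host name,
# e.g. node01.example.com -> node01. A short name is returned unchanged.
short_name() {
    printf '%s\n' "${1%%.*}"
}

# Fall back to plain `hostname` if -f is not supported on this system.
fqdn=$(hostname -f 2>/dev/null || hostname)
echo "FQDN:       $fqdn"
echo "short name: $(short_name "$fqdn")"

# Only meaningful inside a Slurm allocation (assumption: run under salloc).
if command -v scontrol >/dev/null 2>&1 && [ -n "${SLURM_JOB_NODELIST:-}" ]; then
    echo "Slurm node names for this job:"
    scontrol show hostnames "$SLURM_JOB_NODELIST"
fi
```

If the names Slurm lists match the short name but not the FQDN, that would be consistent with the job step request being made with a name Slurm does not know, though it does not prove that JMI is the component passing the FQDN.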
If I instead launch with srun and Slurm's PMI library,
I_MPI_PMI_LIBRARY=/opt/slurm/14.03.1-2/lib64/libpmi.so
srun -n 2 mympiprog
it works too.