Dear Slurm Team, we encountered quite a strange behavior. Sometimes a submitted job (sbatch) gets stuck before writing the output file. squeue shows the job in R state. scancel puts the job in CG state and stays there forever. Logs show that the last message shown for that job is that the prolog completed. We figured out that when a job gets stuck, the slurmd.service shows a second slurmd process. We never encountered the problem with salloc. Cancelling such a stuck job and killing the "additional" slurmd processes solves the problem. If you need any other information, please let us know! Thanks a lot, Mario