| Summary: | Job's pending reason lists unavailable nodes that are not in the job's partition | ||
|---|---|---|---|
| Product: | Slurm | Reporter: | Kilian Cavalotti <kilian> |
| Component: | Scheduling | Assignee: | Alejandro Sanchez <alex> |
| Status: | RESOLVED CANNOTREPRODUCE | QA Contact: | |
| Severity: | 4 - Minor Issue | ||
| Priority: | --- | CC: | alex, patrice.peterson |
| Version: | 16.05.4 | ||
| Hardware: | Linux | ||
| OS: | Linux | ||
| Site: | Stanford | Slinky Site: | --- |
| Alineos Sites: | --- | Atos/Eviden Sites: | --- |
| Confidential Site: | --- | Coreweave sites: | --- |
| Cray Sites: | --- | DS9 clusters: | --- |
| Google sites: | --- | HPCnow Sites: | --- |
| HPE Sites: | --- | IBM Sites: | --- |
| NOAA SIte: | --- | NoveTech Sites: | --- |
| Nvidia HWinf-CS Sites: | --- | OCF Sites: | --- |
| Recursion Pharma Sites: | --- | SFW Sites: | --- |
| SNIC sites: | --- | Tzag Elita Sites: | --- |
| Linux Distro: | --- | Machine Name: | Sherlock |
| CLE Version: | Version Fixed: | ||
| Target Release: | --- | DevPrio: | --- |
| Emory-Cloud Sites: | --- | ||
|
Description
Kilian Cavalotti
2016-09-06 15:00:51 MDT
Hi Kilian. We're looking into this and will come back to you. Kilian, can you please upload your slurm.conf and indicate a specific sinfo state and detailed job submission to reproduce this? I'm able to reproduce on 15.08 but not on 16.05 (where some changes to job reason logic have been made). We've a local copy of sherlock from 11 days ago and xstream from 19 days ago. Not sure if this is happening in any of these clusters or another one. Thanks. I see the Machine Name in the bug is sherlock, anyhow a specific sinfo state + job submission would help. Also an updated slurm.conf just in case something changed during these days. Hi Alejandro, (In reply to Alejandro Sanchez from comment #4) > I see the Machine Name in the bug is sherlock, anyhow a specific sinfo state > + job submission would help. Also an updated slurm.conf just in case > something changed during these days. Yes, it's on Sherlock and the configuration didn't change since last time. What options would you need for sinfo and for the job submission info? Cheers, Kilian Just 'sinfo' just before the submission and the whole request/batch script with the parameters you are using for the job submission. Let's see if I'm able to reproduce with this and then be able to work on the problem. (In reply to Alejandro Sanchez from comment #6) > Just 'sinfo' just before the submission and the whole request/batch script > with the parameters you are using for the job submission. Let's see if I'm > able to reproduce with this and then be able to work on the problem. Mmmh, I can't seem to be ab;e to replicate the issue right now. I guess it's ok to close the ticket, I'll reopen if it happens again. Cheers, -- Kilian (In reply to Kilian Cavalotti from comment #7) > Mmmh, I can't seem to be ab;e to replicate the issue right now. I guess it's > ok to close the ticket, I'll reopen if it happens again. > > Cheers, > -- > Kilian Ok, closing the ticket as WORKSFORME. Please reopen if you happen to reproduce this. |