| Summary: | Handling of "buffer size limit exceeded" errors | ||
|---|---|---|---|
| Product: | Slurm | Reporter: | Goran Pocina <gpocina> |
| Component: | slurmdbd | Assignee: | Tim Wickberg <tim> |
| Status: | RESOLVED DUPLICATE | QA Contact: | |
| Severity: | 4 - Minor Issue | ||
| Priority: | --- | CC: | gpocina |
| Version: | 15.08.13 | ||
| Hardware: | Linux | ||
| OS: | Linux | ||
| Site: | D E Shaw Research | Alineos Sites: | --- |
| Atos/Eviden Sites: | --- | Confidential Site: | --- |
| Coreweave sites: | --- | Cray Sites: | --- |
| DS9 clusters: | --- | HPCnow Sites: | --- |
| HPE Sites: | --- | IBM Sites: | --- |
| NOAA SIte: | --- | OCF Sites: | --- |
| Recursion Pharma Sites: | --- | SFW Sites: | --- |
| SNIC sites: | --- | Linux Distro: | --- |
| Machine Name: | CLE Version: | ||
| Version Fixed: | Target Release: | --- | |
| DevPrio: | --- | Emory-Cloud Sites: | --- |
| Attachments: | slurmdbd.conf | ||
|
Description
Goran Pocina
2017-02-21 13:15:51 MST
Created attachment 4080 [details]
slurmdbd.conf
I'm assuming you meant "15.08.13"? That maintenance release wasn't in the list previously, that's been corrected now. Is this query being constantly re-run, or does this recur after restarting slurmdbd? The problem has not recurred since restarting slurmdbd, however I've also not yet confirm that our daily utilization query kicked it off. Looking at old slurmdbd log files it seems to have happened 1 week and 1 hour prior to yesterday's event, so I have a pretty good chance of finding the script responsible. I'll update here once I do. Okay, I just wanted to make sure this wasn't blocking slurmdbd from normal service. There's an enhancement bug 2346 open that covers adding some configuration options to help prevent these from triggering, although we haven't made any commitment to addressing this just yet. Goran - I'm marking this closed as a duplicate of 3624. We'll try to get some mitigation in place to keep the log level spam to a minimum. As mentioned, bug 2346 discusses longer-term plans to mitigate this issue with some configuration options to limit the queries directly. - Tim *** This ticket has been marked as a duplicate of ticket 3624 *** |