Ticket 22419 - Unable to add some users to account
Summary: Unable to add some users to account
Status: OPEN
Alias: None
Product: Slurm
Classification: Unclassified
Component: Accounting (show other tickets)
Version: 24.05.0
Hardware: Linux Linux
: 4 - Minor Issue
Assignee: Brian Gregory
QA Contact:
URL:
Depends on:
Blocks:
 
Reported: 2025-03-21 15:14 MDT by Larry Laugesen
Modified: 2025-03-31 15:53 MDT (History)
1 user (show)

See Also:
Site: Cadence
Alineos Sites: ---
Atos/Eviden Sites: ---
Confidential Site: ---
Coreweave sites: ---
Cray Sites: ---
DS9 clusters: ---
HPCnow Sites: ---
HPE Sites: ---
IBM Sites: ---
NOAA SIte: ---
NoveTech Sites: ---
Nvidia HWinf-CS Sites: ---
OCF Sites: ---
Recursion Pharma Sites: ---
SFW Sites: ---
SNIC sites: ---
Tzag Elita Sites: ---
Linux Distro: ---
Machine Name:
CLE Version:
Version Fixed:
Target Release: ---
DevPrio: ---
Emory-Cloud Sites: ---


Attachments

Note You need to log in before you can comment on or make changes to this ticket.
Description Larry Laugesen 2025-03-21 15:14:32 MDT
I found a list of users I cannot add to an account:

sacctmgr -i add user operator Accounts+=site_users
Request didn't affect anything
Nothing added.
Make sure the accounts requested here have been added to the system/cluster. (sacctmgr add account...)

To prove this account works:

[root@cadence-rh76-lsf-1 tmp]# sacctmgr -i add user llawrenc Accounts+=site_users
 Adding User(s)
  llawrenc
 Settings
 Associations =
  C = slurmatnorth01 A = site_users           U = llawrenc 

This is a valid account:

[root@cadence-rh76-lsf-1 tmp]# id operator
uid=11(operator) gid=0(root) groups=0(root)

How do I debug why this account cannot be added?
Comment 1 Brian Gregory 2025-03-24 08:03:42 MDT
(In reply to Larry Laugesen from comment #0)
> I found a list of users I cannot add to an account:
> 
> sacctmgr -i add user operator Accounts+=site_users
> Request didn't affect anything
> Nothing added.
> Make sure the accounts requested here have been added to the system/cluster.
> (sacctmgr add account...)
> 
> To prove this account works:
> 
> [root@cadence-rh76-lsf-1 tmp]# sacctmgr -i add user llawrenc
> Accounts+=site_users
>  Adding User(s)
>   llawrenc
>  Settings
>  Associations =
>   C = slurmatnorth01 A = site_users           U = llawrenc 
> 
> This is a valid account:
> 
> [root@cadence-rh76-lsf-1 tmp]# id operator
> uid=11(operator) gid=0(root) groups=0(root)
> 
> How do I debug why this account cannot be added?

Larry,

This should be possible. I just confirmed that I could create this user and add it to an account my test system. 

Can you send me the output of the following commands on your system?

uname -a

sacctmgr show assoc tree


Also, as a quick check, ensure the "operator" user exists on the controller and accounting machines and that the UIDs match. (if they aren't the machine mentioned above)
Comment 2 Larry Laugesen 2025-03-25 08:37:11 MDT
[root@cadence-rh86-spr-2 tmp]# id operator
uid=11(operator) gid=0(root) groups=0(root)

As for the association tree, this account has 16000+ users in it, is it possible I am running up against a size limit?
Comment 3 Brian Gregory 2025-03-25 08:52:18 MDT
(In reply to Larry Laugesen from comment #2)
> [root@cadence-rh86-spr-2 tmp]# id operator
> uid=11(operator) gid=0(root) groups=0(root)

I'm assuming you have centralized user management? LDAP?
Are your controller and accounting machines the same?
What backend are you using for slurmdbd?

> As for the association tree, this account has 16000+ users in it, is it
> possible I am running up against a size limit?

I don't think that is the case, but we can test by adding a different user that doesn't already have an association.

What O/S are you running? (The username and/or group name "operator" is special for some linux distributions)
Comment 4 Brian Gregory 2025-03-25 09:39:51 MDT
In your original post, you mention a "list of users." 
What are the other users you are having trouble with?
Comment 5 Larry Laugesen 2025-03-31 15:14:23 MDT
The OS of the server in questions is Redhat 8.6

Below is some of the accounts:

uucp
sys
pbctemgr
sys
tcinview
Comment 6 Brian Gregory 2025-03-31 15:53:33 MDT
(In reply to Larry Laugesen from comment #5)
> The OS of the server in questions is Redhat 8.6
> 
> Below is some of the accounts:
> 
> uucp
> sys
> pbctemgr
> sys
> tcinview

"operator," "uucp," and "sys" are special Linux accounts that shouldn't be used for slurm.

Are slurmdbd and slurmctld both running on the same machine?
Are you issuing the sacctmgr command on the same machine?

If any of these are different machines, you need to make sure any accounts you want to add to slurm exists on each machine and has the same UUID.

Is the RH 8.6 machine the same as the one mentioned above named "cadence-rh76-lsf-1"?
Are you using LDAP or similar to sync accounts across machines?