[prev in list] [next in list] [prev in thread] [next in thread] 

List:       npaci-rocks-discussion
Subject:    [Rocks-Discuss]  rocks sync users and 411 problems
From:       Rhys Morris <R.Morris () bristol ! ac ! uk>
Date:       2017-07-27 16:44:17
Message-ID: AM3PR06MB10260397F7013DDB62071BE2DFBE0 () AM3PR06MB1026 ! eurprd06 ! prod ! outlook ! com
[Download RAW message or body]


Hi All,

I have a rocks 6.2 cluster running centos 6.6. It has been working fine, but \
recently, I added a new user and his jobs would not run. It turns out his account had \
not appeared in /etc/passwd on any of the nodes despite me running rocks sync users \
without errors as below:

rocks sync users
make: Entering directory `/var/411'
rm -rf /etc/411.d/*
make
make[1]: Entering directory `/var/411'
/opt/rocks/sbin/411put --comment="#" /etc/auto.home
411 Wrote: /etc/411.d/etc.auto..home
Size: 7031/5030 bytes (encrypted/plain)
/opt/rocks/sbin/411put --comment="#" /etc/auto.master
411 Wrote: /etc/411.d/etc.auto..master
Size: 555/235 bytes (encrypted/plain)
/opt/rocks/sbin/411put --comment="#" /etc/auto.misc
411 Wrote: /etc/411.d/etc.auto..misc
Size: 1357/829 bytes (encrypted/plain)
/opt/rocks/sbin/411put --comment="#" /etc/auto.net
411 Wrote: /etc/411.d/etc.auto..net
Size: 2686/1808 bytes (encrypted/plain)
/opt/rocks/sbin/411put --comment="#" /etc/auto.share
411 Wrote: /etc/411.d/etc.auto..share
Size: 5710/4051 bytes (encrypted/plain)
/opt/rocks/sbin/411put --comment="#" /etc/auto.smb
411 Wrote: /etc/411.d/etc.auto..smb
Size: 1649/1044 bytes (encrypted/plain)
/opt/rocks/sbin/411put --comment="#" /etc/ssh/shosts.equiv
411 Wrote: /etc/411.d/etc.ssh.shosts..equiv
Size: 1130/660 bytes (encrypted/plain)
/opt/rocks/sbin/411put --comment="#" /etc/ssh/ssh_known_hosts
411 Wrote: /etc/411.d/etc.ssh.ssh_known_hosts
Size: 19274/14091 bytes (encrypted/plain)
/opt/rocks/sbin/411put --nocomment /etc/passwd
411 Wrote: /etc/411.d/etc.passwd
Size: 15448/11263 bytes (encrypted/plain)
/opt/rocks/sbin/411put --nocomment /etc/group
411 Wrote: /etc/411.d/etc.group
Size: 11849/8598 bytes (encrypted/plain)
/opt/rocks/sbin/411put --nocomment /etc/shadow
411 Wrote: /etc/411.d/etc.shadow
Size: 13742/9999 bytes (encrypted/plain)
make[1]: Leaving directory `/var/411'
make: Leaving directory `/var/411'

But the logs contain lots of messages like the following:

Jul 27 17:39:43 compute-0-8 411-alert-handler[13333]: Error: \
http://10.1.1.1:372/411.d/etc.auto..master updating Could not get file \
'http://10.1.1.1:372/411.d/etc.auto..master': 400 Bad

So I set the nodes to rebuild on boot and rebooted a node. On coming up, the node had \
no user accounts at all. If I run 411get as below, I get the following error, \
--verbose does not provide any more information.

411get --all
Error: Could not get file 'http://10.1.1.1:372/411.d//': 400 Bad

I'm guessing something has gone wrong with the 411 service on the server. Http is \
running, I have restarted everything I can think of and rebooted the head node. I \
have run some updates on the head node, which I think may be the cause of the \
problem. Can someone tell me some useful diagnostics to work out what is wrong.

Thanks for any help,
Rhys




--------------------------------------------
Rhys Morris
Astrophysics/Unix Support Specialist
Room 4.13, HH Wills Physics Lab
University of Bristol
--------------------------------------------


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic