List:       npaci-rocks-discussion
Subject:    [Rocks-Discuss] Re: Nodes down in the Torque queue after kickstart
From:       Bart Brashers <bbrashers () ramboll ! com>
Date:       2017-04-10 16:16:33
Message-ID: 49A331846AB86243A537DEC40498922E1ED66D () GRPCPHMBX18 ! ramboll-group ! global ! network

After a reboot, I often have to do several cycles of these:

# service pbs_server restart
# service maui restart
# rocks run host compute 'service pbs_mom restart'
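
The cycle above could be automated along these lines. This is only a rough sketch, not something from the original post: it assumes the second service in the cycle is the Maui scheduler, that `pbsnodes -l` prints nothing and exits 0 once no node is down, offline, or unknown, and the names `MAX_CYCLES`, `WAIT_SECS`, `SCHED_SERVICE`, `nodes_all_up`, and `restart_until_up` are made up for illustration.

```shell
#!/bin/bash
# Sketch only: repeat the restart cycle until no node is reported down.
# MAX_CYCLES, WAIT_SECS and SCHED_SERVICE are hypothetical tunables.
MAX_CYCLES=${MAX_CYCLES:-5}
WAIT_SECS=${WAIT_SECS:-30}
SCHED_SERVICE=${SCHED_SERVICE:-maui}   # assumption: scheduler service name

# Succeeds only if "pbsnodes -l" runs and lists no down/offline/unknown nodes.
nodes_all_up() {
    local out
    out=$(pbsnodes -l 2>/dev/null) || return 1   # unreachable server counts as "not up"
    [ -z "$out" ]
}

# Run up to MAX_CYCLES restart cycles, checking node state before each one.
restart_until_up() {
    local cycle
    for cycle in $(seq 1 "$MAX_CYCLES"); do
        if nodes_all_up; then
            echo "all nodes up after $((cycle - 1)) restart cycle(s)"
            return 0
        fi
        echo "cycle $cycle: some nodes not up, restarting services"
        service pbs_server restart
        service "$SCHED_SERVICE" restart
        rocks run host compute 'service pbs_mom restart'
        sleep "$WAIT_SECS"   # give the moms time to report back to pbs_server
    done
    nodes_all_up   # final check after the last cycle
}
```

To try it, source the file as root on the frontend and call `restart_until_up`; it returns non-zero if nodes are still listed as down after the last cycle.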

Bart Brashers

> -----Original Message-----
> From: npaci-rocks-discussion-bounces@sdsc.edu [mailto:npaci-rocks-
> discussion-bounces@sdsc.edu] On Behalf Of Max Pinheiro Jr
> Sent: Monday, April 10, 2017 7:18 AM
> To: npaci-rocks-discussion@sdsc.edu
> Subject: [Rocks-Discuss] Nodes down in the Torque queue after kickstart
> 
> Dear all,
> 
> I tried to reinstall the system on the compute nodes of my cluster
> using the rocks command "rocks run host compute
> '/boot/kickstart/cluster-kickstart-pxe'". The installation seemed to work
> normally. However, after syncing and rebooting the nodes, all compute
> nodes appear as down in the Torque PBS system. I am not an expert in
> Rocks and I have not configured the Rocks XML file for restoring the
> system. I searched the forum for tips but could not find anything that
> works for my case. Even after re-configuring Torque with
> pbs_server -t create and restarting the pbs_server and pbs_mom services,
> the nodes still show as down.
> 
> Could anyone help me with this issue? I will appreciate any help you may
> provide.
> 
> All the best,
> 
> Max
