Tuesday, September 28, 2010

Unable to clear the offline mode for a compute node using the pbsnodes command

Suppose you deliberately took a node offline, perhaps for some repair work, and now you are unable to bring it online again. Instead, on your Torque server, you see errors such as this:

PBS_Server: LOG_ERROR::is_request, bad attempt to connect from (address not trusted - check entry in server_priv/nodes

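As the log says, the server does not trust the address the node is connecting from, so start with the server_priv/nodes file on the Torque server. The entry for that node may be wrong or missing. If all your other nodes are working well and only this one fails, you should also take a look at the client settings on the node itself.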
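
A quick way to check whether the node is listed in server_priv/nodes at all is a small shell helper like the sketch below. The function name node_in_nodes_file is made up for this post, and the path shown in the comment assumes the common default Torque home of /var/spool/torque, so adjust it to your installation. It only confirms that the node name appears as the first field of a line; it does not check that the name resolves to the address the node actually connects from.

```shell
# Hypothetical helper (not part of Torque): succeed if NODE appears as the
# first field of a line in the given nodes file.
# Lines starting with '#' never match because their first field begins with '#'.
node_in_nodes_file() {
    nodes_file=$1
    node=$2
    awk -v n="$node" '$1 == n { found = 1 } END { exit !found }' "$nodes_file"
}

# Example use on the Torque server (path is an assumption, adjust as needed):
#   node_in_nodes_file /var/spool/torque/server_priv/nodes node_name \
#       && echo "entry found" || echo "entry missing"
```

If the entry is missing or misspelled, fix it in server_priv/nodes and restart pbs_server so the change is picked up.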

Once you have cleared the error, just fire off the command
# pbsnodes -c node_name
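
To confirm the flag is really gone, you can look at the output of pbsnodes -l, which lists nodes that are down, offline, or unknown. The helper below is only a sketch: the name still_offline is made up for this post, and it assumes each line of pbsnodes -l output is the node name followed by its state (for example "offline" or "down,offline").

```shell
# Hypothetical helper (not part of Torque): read `pbsnodes -l` style output on
# stdin and succeed if NODE is listed with a state containing "offline".
# Assumes each input line looks like: <node_name> <state>
still_offline() {
    node=$1
    awk -v n="$node" '$1 == n && $2 ~ /offline/ { found = 1 } END { exit !found }'
}

# Example use on the Torque server:
#   pbsnodes -l | still_offline node_name \
#       && echo "still offline" || echo "offline flag cleared"
```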

