Wednesday, November 30, 2011

pbs_mom LOG_ERROR sys_copy, command /usr/bin/scp -rpB

I encountered 1 of my parallel job failed and this error appeared on the log file for my compute nodes. 

pbs_mom: LOG_ERROR::sys_copy, command '/usr/bin/scp -rpB...............failed with status=1, 
giving up after 4 attempts

My SSH public/private key authentication is working without a hitch. Similarly, my /etc/hosts and firewall is as what I expected. But I realise my /etc/resolv.conf and /etc/sysconfig/network are incorrect. I got a hint of this possibility when I was reading this forum http://www.mail-archive.com/mauiusers@supercluster.org/msg00998.html . A quick amendment and everything seems ok at least for a while. Will write if this solution is incorrect. :)

No comments: