Sunday, March 13, 2011

Manually Deleting Torque amd PBS jobs using MAUI

Tracing Jobs
To trace a job with MAUI commands including the nodes the jobs are residing, you can use the commands
# showq -r

Alternatively, you can use the MAUI commands to trace the job activity
# trace job_id


Deleting Jobs
To delete a job with MAUI commands, you can use the commands,
# canceljob job_id

Alternatively, you can also use PBS commands to delete a job
# qdel job_id


PBS mom control
If not able to delete a stale job which has no process, you can use the momctl command
# momctl

If you are unable to delete the stale job with has no process, you can use momctl to do diagnostic. Basically The momctl command allows remote shutdown, reconfiguration, diagnostics, and querying of the pbs_mom daemon. For more information on momctl, do look at momctl by http://www.clusterresources.com/:

Example 1: Diagnosis of pbs_mom
# momctl -h node1 -d 1
Example 2: Cycle the pbs_mom on node 1
# momctl -h nod1 -C

Manually deleting the jobs
To manually delete the jobs, you should shutdown the pbs server
# service pbs_server stop

Remove the job spool files
# rm /var/spool/pbs/server_priv/jobs/111.host.SC 
# rm /var/spool/pbs/server_priv/jobs/111.host.JB

Restart the pbs_server
# service pbs_server restart

Further Information:
  1. Deleting PBS/Maui Jobs

No comments: