Dear community,
I am trying to run Tutorial 1 on a PBS cluster. The jobs can be submitted successfully to the server, but they got killed immediately. The error messages are like this:
/var/spool/PBS/mom_priv/jobs/8588204.pbs.SC: line 63: checkjob: command not found
/var/spool/PBS/mom_priv/jobs/8588204.pbs.SC: line 64: qstat: command not found
grep: …/workflow/control/all.ctrl: No such file or directory
grep: …/workflow/control/all.ctrl: No such file or directory
mkdir: cannot create directory ‘/tangm5’: Permission denied
Our IT team commented as follows:
“The file you are trying to access in the mom_priv area is actually located on the nodes and is protected for the pbs daemon. Its basically the submit script copied over to actually do the run.
I think you might have other issues in your program. The reason being that for one the output file generated is called job-1.1_${PBS_JOBID}.out and not the resolved PBS_JOBID. Also it looks like your PATH variable is not honoured and that is why the qstat commands can’t be found.”
Does anyone have any suggestions for my issues?
Thanks,
Ming