Page Navigation: | |
---|---|
|
Confidence Level High This article been formally reviewed and is signed off on by a relevant subject matter expert. |
---|
As briefly mentioned before SciFlo is a workflow framework. It uses a XML workflow definition document to know the sequence of steps. Each step of a workflow is known as a process. The workflow definition documents defines the flow of the processes and how to run each process. More about the workflow definition documents can be found here: [placeholder]
SciFlo Bash Script
Any workflow job is executed by the run_sciflo.sh
script.
It first sets up the BASE_PATH
, PYTHONPATH
and other environment requirements for the sciflo job.
It then reads in the inputs mentioned below:
Location to the adaptation module
Path to the directory containing the workflow definition document
Name of the Algorithm/Job/PGE to run
Code Block |
---|
MODULE_PATH="$1"
WF_DIR="$2"
WF_NAME="$3" |
It exports the adaptation module’s path to the python path so the scripts can resolve dependency imports.
Code Block |
---|
export PYTHONPATH=${MODULE_PATH}:$PYTHONPATH |
It constructs the location of the workflow definition document with WF_DIR
and WF_NAME
i.e. $WF_DIR/$WF_NAME.sf.xml
It runs the run_sciflo.py
script with the required inputs.
Code Block |
---|
python $BASE_PATH/run_sciflo.py $WF_DIR/$WF_NAME.sf.xml _context.json output > run_sciflo_$WF_NAME.log 2>&1
STATUS=$?
echo -n "Finished running $PGE run_sciflo.py: " 1>&2
date 1>&2
if [ $STATUS -ne 0 ]; then
echo "Failed to run $PGE run_sciflo.py" 1>&2
cat run_sciflo_$WF_NAME.log 1>&2
echo "{}"
exit $STATUS
fi |
SciFlo Python Script
The run_sciflo.py
is the python script that executes the sciflo job. It takes 2 positional arguments:
Workflow definition document - has the file extension
.wf.xml
Job’s context file aka
_context.json
Directory to write outputs of the job i.e.
output
This script imports the sciflo_util
and calls the run_sciflo
function.
Code Block |
---|
from chimera.commons.sciflo_util import run_sciflo
def main(sfl_file, context_file, output_folder):
sfl_file = os.path.abspath(sfl_file)
context_file = os.path.abspath(context_file)
output_file = os.path.abspath(output_folder)
LOGGER.info("sfl_file: %s" % sfl_file)
LOGGER.info("context_file: %s" % context_file)
return run_sciflo(sfl_file, ["sf_context=%s" % context_file], output_folder) |
The command to run sciflo is $HOME/verdi/bin/sflExec.py -s -f -o output --args "sfl_context=_context.json", "pge.wf.xml"
In the sciflo_util.run_sciflo
function:
This command is constructed and run.
Code Block |
---|
# build paths to executables
sflexec_path = os.path.join(
os.environ['HOME'], 'verdi', 'bin', 'sflExec.py')
# execute sciflo
cmd = [sflexec_path, "-s", "-f", "-o", output_dir,
"--args", '"%s"' % ','.join(sfl_args), sfl_file]
print("Running sflExec.py command:\n%s" % ' '.join(cmd))
status = os.system(' '.join(cmd)) |
Based on status of the Sciflo run it logs out the error and copies the files generated by the sciflo run into the specified output directory.
Code Block |
---|
if status != 0:
extract_error('%s/sciflo.json' % output_dir)
status = 1
# copy smap_sciflo work and exec dir
try:
copy_sciflo_work(output_dir)
except Exception:
pass |