Confidence Level TBD This article has not been reviewed for accuracy, timeliness, or completeness. Check that this information is valid before acting on it. |
---|
Operators can use the Resource Management interface (Figaro) to identify failed jobs that share a given precondition failure. The matching jobs can then be processed in bulk with any of the lightweight job management functions found in the On-Demand window.
Instructions
A precondition failure “is when a PGE fails not because of a runtime error but due to missing inputs, corrupt inputs, etc. These show up as failed jobs… Not something easily visible. These aren’t necessarily distinguished from a failed job.”
1. Inside Figaro, first select the “job-failed” facet (see #1 in image) under the left-hand menu column labeled “status”. This narrows the scope of jobs displayed within Figaro to only those with that status.
Info |
---|
Note: the left-hand menu column updates dynamically according to the chosen facets. If an option is not visible, confirm that no undesired facets are selected by mistake. |
2. Next, select the type of error. In this example, the error “SoftTimeLimitExceeded()” (#2) is faceted on.
3. Each facet appears across the top of the updated job results in a blue box (#3). After the desired job-failed facet is selected, the total number of matching jobs (#4) can be seen under the facet tags. Next, click the “On-Demand” button (#5) to select the desired lightweight job action to perform.
Info |
---|
The applied facets appear in a row of blue boxes. In this example, 92 total jobs are faceted on. |
4. In the On-Demand window, add a unique user-defined tag (#6) and select the lightweight job to perform from the drop-down menu (#7).
5. (TODO: confirm accuracy and wording of following text) When selecting the job from the Action drop-down menu, it is recommended to use the latest release of the desired job type. The release date is noted in square brackets following the job type name. In this example (#8), the “Purge jobs” action is chosen, and the latest release date is [release-20180529].
6. Select “Process Now” to complete the selected task on the targeted failed jobs.
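
The selection logic in the steps above can be sketched in a few lines of Python. This is only an illustration of how facets narrow a job list and how the newest release tag can be picked out of the Action names; it does not call Figaro. The record fields (`id`, `status`, `error`), the sample jobs, the helper names `apply_facets` and `latest_release`, and the older `[release-20180301]` entry are all assumptions made for the example.

```python
import re
from datetime import datetime

# Hypothetical job records; the field names are illustrative, not Figaro's schema.
jobs = [
    {"id": "job-1", "status": "job-failed", "error": "SoftTimeLimitExceeded()"},
    {"id": "job-2", "status": "job-failed", "error": "SoftTimeLimitExceeded()"},
    {"id": "job-3", "status": "job-failed", "error": "Missing input file"},
    {"id": "job-4", "status": "job-completed", "error": None},
]


def apply_facets(records, **facets):
    """Keep only the records whose fields match every selected facet."""
    return [
        record
        for record in records
        if all(record.get(field) == value for field, value in facets.items())
    ]


# Matches a release tag such as "[release-20180529]" and captures the date digits.
RELEASE_TAG = re.compile(r"\[release-(\d{8})\]")


def latest_release(action_names):
    """Return the action name carrying the most recent release date, or None."""
    dated = []
    for name in action_names:
        match = RELEASE_TAG.search(name)
        if match:
            released = datetime.strptime(match.group(1), "%Y%m%d").date()
            dated.append((released, name))
    if not dated:
        return None
    return max(dated)[1]


# Steps 1 and 2: facet on the status, then on the error type.
failed = apply_facets(jobs, status="job-failed")
matching = apply_facets(failed, error="SoftTimeLimitExceeded()")

# Step 5: choose the newest release among the (hypothetical) action entries.
actions = ["Purge jobs [release-20180301]", "Purge jobs [release-20180529]"]
chosen = latest_release(actions)

print(len(matching), "matching jobs; action:", chosen)
```

Each call to `apply_facets` narrows the previous result, which mirrors how each selected facet in the left-hand column further restricts the displayed jobs.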