Confidence Level: TBD. This article has not been reviewed for accuracy, timeliness, or completeness. Check that this information is valid before acting on it.


Operators can use the Resource Management interface (Figaro) to find failed jobs, including jobs that failed because of a precondition failure, by faceting on job status and error type. The matching jobs can then be processed in bulk using any of the lightweight job management functions in the On-Demand window.

Instructions

A precondition failure “is when a PGE fails not because of a runtime error but due to missing inputs, corrupt inputs, etc. These show up as failed jobs… Not something easily visible. These aren’t necessarily distinguished from a failed job.”

1. In Figaro, select the “job-failed” facet (see #1 in the image) in the left-hand menu column labeled “status”. This narrows the jobs displayed in Figaro to only those with that status.

[Image: callout #1]

Note: The left-hand menu column updates dynamically according to the chosen facets. If an option is not visible, confirm that no undesired facets are selected by mistake.

2. Next, select the type of error. In this example, the error “SoftTimeLimitExceeded()” (#2) is selected as a facet.

[Image: callout #2]

3. Each facet appears across the top of the updated job results in a blue box (#3). After the desired job-failed facet is selected, the total number of matching jobs (#4) can be seen under the facet tags. Next, click the “On-Demand” button (#5) to select the desired lightweight job action to perform.

[Image: callouts #3 to #5]

4. In the On-Demand window, add a unique user-defined tag (#6) and select the lightweight job to perform from the drop-down menu (#7).

[Image: callouts #6 and #7]

5. When selecting the job from the Action drop-down menu, (TODO: confirm accuracy and wording of the following text) it is recommended to use the latest release of the desired job type. The release date is noted in square brackets following the job type name. In this example (#8), the “Purge jobs” action is chosen, and the latest release date is [release-20180529].

[Image: callout #8]

6. Select “Process Now” to run the selected action on the targeted failed jobs.
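The faceting and release-selection logic in the steps above can be sketched in a few lines of Python. This is only an illustration of the idea, not Figaro's implementation or API: the job records, their field names, the helper functions apply_facets and latest_release, and every action name other than “Purge jobs [release-20180529]” are hypothetical and exist only for this example.

```python
import re
from datetime import datetime

# Hypothetical job records; the field names are illustrative, not Figaro's schema.
JOBS = [
    {"id": "job-1", "status": "job-failed", "error": "SoftTimeLimitExceeded()"},
    {"id": "job-2", "status": "job-completed", "error": None},
    {"id": "job-3", "status": "job-failed", "error": "SoftTimeLimitExceeded()"},
    {"id": "job-4", "status": "job-failed", "error": "MissingInput()"},
]

# Hypothetical contents of the Action drop-down menu. Only the
# "Purge jobs [release-20180529]" entry comes from the article.
ACTIONS = [
    "Purge jobs [release-20180315]",
    "Purge jobs [release-20180529]",
    "Retry jobs [release-20180529]",
]

# Matches the release date noted in square brackets after the job type name.
RELEASE_TAG = re.compile(r"\[release-(\d{8})\]")


def apply_facets(jobs, **facets):
    """Keep only the jobs whose fields equal every selected facet value."""
    return [
        job for job in jobs
        if all(job.get(field) == value for field, value in facets.items())
    ]


def latest_release(actions, job_type):
    """Return the action entry for job_type with the most recent release date."""
    candidates = []
    for name in actions:
        match = RELEASE_TAG.search(name)
        if match and name.startswith(job_type):
            released = datetime.strptime(match.group(1), "%Y%m%d").date()
            candidates.append((released, name))
    if not candidates:
        raise ValueError(f"no release found for job type {job_type!r}")
    # Tuples sort by release date first, so max() picks the newest release.
    return max(candidates)[1]


# Steps 1 and 2: facet on status, then on error type.
matched = apply_facets(JOBS, status="job-failed", error="SoftTimeLimitExceeded()")
# Step 3: the total number of matching jobs shown under the facet tags.
print(len(matched), [job["id"] for job in matched])
# Step 5: choose the latest release of the desired job type.
print(latest_release(ACTIONS, "Purge jobs"))
```

Each added facet narrows the result set further, which is why an unintended facet can hide jobs or menu options that would otherwise be visible.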


Have Questions? Ask a HySDS Developer:

Anyone can join our public Slack channel to learn more about HySDS. JPL employees can join #HySDS-Community.

JPLers can also ask HySDS questions at Stack Overflow Enterprise.
