Question: What are all of the possible precondition failures? (This is mentioned in JIRA #601.)

Rough draft outline for JIRA #597: how to facet on all failed jobs with the same failure message, purge them, and then resubmit them:

  1. Navigate to Resource Manager

  2. Facet on jobs, then on job-failed, to see all failed jobs (#1).

  3. Next, identify a specific, unique, searchable term shared by all of the jobs you want to purge and resubmit. There is no hard and fast rule for this; it is somewhat trial and error. In this example (#2), I chose a unique string (a segment of the Job ID).

4. Enclose that search phrase in quotation marks and enter it in the search bar along the top to facet on all similar jobs:

5. With this list of all similar failed jobs (304 in this example), click “On Demand” and select “purge” from the drop-down menu to process them. (It is unclear whether adding a unique tag here is beneficial for later steps.)

6. Leave the other settings unchanged and click “Process Now”.

(From here on, I’m unclear how to retry these jobs; this is my best-guess understanding.)

7. Remove the job-failed facet in the Resource Manager. Then search for the unique tag created when purging the jobs (in quotation marks), “purge_tag_test” in this example. This shows all of the similar jobs that failed and have now been purged.

8. Now click “On Demand”, add another unique tag, and select “Retry Jobs/Tasks” from the action drop-down menu.

9. This retries the failed and purged jobs. You can remove the “purge_tag_test” facet and add the unique tag created in step #8 to facet on the newly retried similar jobs.

Instructions

 

Confidence Level: TBD. This article has not been reviewed for accuracy, timeliness, or completeness. Check that this information is valid before acting on it.


Faceting On & Purging Similar Failed Jobs:

  1. Navigate to the Resource Manager (Figaro).

  2. Inside Figaro, facet on the desired parameters in the left-hand column under the status menu (see #1 in image).

  3. Narrow the scope to a specific, unique job failure type by selecting the targeted message in the left-hand column under the error column (#2). {{INSERT UPDATED SCREENSHOTS}}

Note: Multiple job types can share an error message. Users can confirm the faceted error message belongs to only one job type by checking the “type” menu in the left-hand column.

4. After the similar failed jobs have been faceted on, click the “On Demand” button (#3).

5. In the On-Demand window, add a unique key phrase in the “Tag” text box (#4). The tag makes these jobs easier to track and manage later.

6. Select “Purge jobs” from the action drop-down menu (#5). {{INSERT SCREENSHOTS}}

7. Modify the remaining job parameters as needed. Click “Process Now” (#6).
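
The selection logic behind the steps above can be illustrated with a minimal Python sketch. It is not the Figaro or HySDS API: the job records, their field names (`id`, `status`, `type`, `error`), and the helper functions are all hypothetical and exist only to show the idea. Only the `job-failed` status and the `purge_tag_test` tag come from this page. The sketch facets on failed jobs with one error message, checks that the message belongs to a single job type, and attaches a tracking tag.

```python
from collections import Counter

# Hypothetical job records; field names are illustrative, not the HySDS schema.
JOBS = [
    {"id": "job-a1", "status": "job-failed", "type": "ingest", "error": "disk quota exceeded"},
    {"id": "job-a2", "status": "job-failed", "type": "ingest", "error": "disk quota exceeded"},
    {"id": "job-b1", "status": "job-failed", "type": "publish", "error": "timeout"},
    {"id": "job-b2", "status": "job-failed", "type": "ingest", "error": "timeout"},
    {"id": "job-c1", "status": "job-completed", "type": "ingest", "error": None},
]


def facet(jobs, **criteria):
    """Return the jobs whose fields match every key/value pair in criteria."""
    return [j for j in jobs if all(j.get(k) == v for k, v in criteria.items())]


def job_types(jobs):
    """Count how many jobs of each type are in the selection."""
    return Counter(j["type"] for j in jobs)


def select_for_purge(jobs, error, tag):
    """Facet on failed jobs with one error message and attach a tracking tag.

    Raises ValueError when the error message is shared by several job types,
    mirroring the manual check against the "type" menu.
    """
    selected = facet(jobs, status="job-failed", error=error)
    types = job_types(selected)
    if len(types) > 1:
        raise ValueError(f"error {error!r} is shared by job types: {sorted(types)}")
    # Return tagged copies so the original records are left untouched.
    return [{**j, "tag": tag} for j in selected]
```

In this sketch, `select_for_purge(JOBS, "disk quota exceeded", "purge_tag_test")` selects the two failed `ingest` jobs, while the `"timeout"` message raises a `ValueError` because two job types share it. That is the situation the note about the “type” menu warns about.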


 

Have Questions? Ask a HySDS Developer:

Anyone can join our public Slack channel to learn more about HySDS. JPL employees can join #HySDS-Community.

JPLers can also ask HySDS questions at Stack Overflow Enterprise.
