Adding a Task to a Workflow

This topic provides instructions for configuring each type of task that is available for adding to a workflow.

Adding a Task that Runs an Unstructured Pipeline

Follow the instructions below to add a task that runs an unstructured pipeline.

  1. In the Administration application, expand the Tools menu and click Workflow Manager. Anzo displays the Workflows screen, which lists any existing workflows. For example:

  2. Expand the workflow that you want to add a task to. For example:

  3. Click Add Task. The Create Task dialog box is displayed:

  4. Configure the Task by completing the following fields as needed:
    • Task Type: The drop-down list at the top of the dialog box specifies the type of task to create. Distributed Unstructured Pipeline Load Service is selected by default. Accept the default value.
    • Load Service Name: This field specifies the name for the task.
    • Target Unstructured Pipeline: This field specifies the unstructured pipeline that this task should run. Click the drop-down list and select the desired pipeline.
    • Keep Last N-Datasets: This field specifies the number of file-based linked data sets (FLDS) from this pipeline to retain on disk before deleting the oldest ones.
    • Load Threshold: This field specifies the percentage of the pipeline that must complete successfully for the ingestion to be considered a success.
    • Distributed Unstructured Pipeline Stop Timeout: This field specifies the number of milliseconds to wait for a pipeline to stop.
    • Distributed Unstructured Pipeline Percent Timeout: This field specifies the number of milliseconds to wait before timing out if there is no change in the percentage of documents processed.
    • Index: This field specifies a numeric value that represents the order in which this task should run in the workflow.
  5. Click Create to add the task to the workflow. For example, the image below shows a workflow with one task.

You can repeat this process to add tasks that run additional unstructured pipelines.

Adding a Task that Refreshes or Reloads a Graphmart

Follow the instructions below to add a task that refreshes or reloads a graphmart.

  1. In the Administration application, expand the Tools menu and click Workflow Manager. Anzo displays the Workflows screen, which lists any existing workflows. For example:

  2. Expand the workflow that you want to add a task to. For example:

  3. Click Add Task. The Create Task dialog box is displayed:

  4. At the top of the dialog box, click the drop-down list and select Graphmart Load Service to set up a task that reloads or refreshes a graphmart. The dialog box presents the options that are valid for graphmart Load Service Tasks:

  5. Configure the Task by completing the following fields as needed:
    • Load Service Name: This field specifies the name for the task.
    • Target Graphmart: This field specifies the graphmart that this task should reload or refresh. Click the drop-down list and select the desired graphmart.
    • Target AnzoGraph: This field specifies the AnzoGraph instance that hosts this graphmart.
    • Keep Last N-Datasets: This field is not relevant for Graphmart Load Service tasks.
    • Load Threshold: This field is not relevant for Graphmart Load Service tasks.
    • Graphmart Action: This field specifies whether to refresh or reload the target graphmart. For refresh, click the drop-down list and select Refresh Target Graphmart. To perform a reload, click the drop-down list and select Reload Target Graphmart.
    • Activate: This option indicates whether the target graphmart needs to be activated before the refresh or reload is attempted. If the target graphmart is offline when the workflow is run, this task will fail unless Activate is enabled.
    • Deactivate: This option indicates whether to deactivate the graphmart after the task is complete. If you want Anzo to deactivate the target graphmart after the reload or refresh is complete, select the Deactivate checkbox.
    • Index: This field specifies a numeric value that represents the order in which this task should run in the workflow.
  6. Click Create to add the task to the workflow. For example, the image below shows a workflow with one Task.

You can repeat this process to add tasks that refresh or reload additional graphmarts.

Adding a Task that Pauses the Workflow

Follow the instructions below to create a task that adds a pause between tasks in a workflow. For example, you may want to add a pause between one task that reloads a graphmart and another task that refreshes a graphmart that depends on the updated data from the reloaded graphmart.

  1. In the Administration application, expand the Tools menu and click Workflow Manager. Anzo displays the Workflows screen, which lists any existing workflows. For example:

  2. Expand the workflow that you want to add a task to. For example:

  3. Click Add Task. The Create Task dialog box is displayed:

  4. At the top of the dialog box, click the drop-down list and select Pause Load Service. The dialog box presents the options that are valid for Pause Load Service Tasks:

  5. Configure the Task by completing the following fields as needed:
    • Load Service Name: This field specifies the name for the task.
    • Pause Time: This field defines the number of milliseconds to pause between tasks. For example, 10000 is 10 seconds.
    • Keep Last N-Datasets: This field is not relevant for Pause Load Service tasks.
    • Load Threshold: This field is not relevant for Pause Load Service tasks.
    • Index: This field specifies a numeric value that represents the order in which this task should run in the workflow.
  6. When you have finished configuring the task, click Create to add the task to the workflow. For example, the image below shows a workflow with a pause between reloading one graphmart and refreshing another.

You can repeat this process to create additional pause tasks.