Introduction

The Workflow Runner (WR) component of the DataHub allows users and applications to execute workflows using the processing available on the Hub. This allows for complex computation to be carried out on datasets remotely, with the outputs available to the users and applications via the Resource Catalogue (RC) and S3 upon completion.


Interaction with the WR is currently handled via HTTPS with requests supported to deploy, update, delete and execute workflows as well as for monitoring the execution of a job once started. Logs are also available upon workflow completion.


We also have support for public workflows to be defined within a user workspace. These are currently configured with policy documents saved in S3 which allow one user to specify other users who are able to run the workflow within their own workspace, and retrieve output results in their own separate catalogue. If you wish to configure one of your workflows to be public this needs to be managed by a Hub administrator, so please reach out to Tom Jellicoe (Thomas.Jellicoe@telespazio.com) for assistance. For the time being, all other workflows are private to users who have access to the workspace that deployed them.


Workflows can be deployed to the Hub via HTTPS and eventually via uploading them to a Git repository, ready to be harvested directly into a workspace.