About Jobs
- What is a job?
-
A job is a computation task performed inside projects on Saagie using one of the embedded or external technologies. Jobs run through a command line and can be launched individually or as part of a data pipeline.
Jobs are listed with some basic information, such as the name of the job (a), the technology icon (b), the type of the job (c), the job status (d), and the last executed job instance (e). You can also run (f), upgrade, duplicate the latest job version, or delete the job (g) from the job library.
A clock |

When creating a job, you choose a category and a technology for it. For each category, the available technologies depend on the technologies you selected when you created the project.
If the technology you need does not appear in the list of available technologies, that means it was not selected when the project was created.
You will either have to choose another technology or update the project settings to include the technology you need. Besides, the category and technology of a job cannot be changed after the job is created. If you want to change them, you will have to create a new job. |
A job category does not impact the job execution, it helps you to organize your jobs. There are three default job categories:
-
Extraction: For jobs that retrieve data.
-
Processing: For jobs that process data.
-
Smart Apps: For jobs that use or expose data.
When choosing your technology, it is important to note that the technologies have different requirements. For more information, see the following table:
Technology | File type | Default shell command |
|
---|---|---|---|
Bash |
Any file type (Optional) |
|
|
Generic |
Docker image |
none |
|
Java/Scala |
|
|
|
Python |
|
|
|
R |
|
|
|
Spark |
|
|
|
Sqoop |
Any file type (Optional) |
|
|
Talend |
|
|
Click a job to access its:
Overview Page
By default, the page opens when you click a job in the project’s job library.
The first part of the page (1) provides general information about the job, such as the name and alias, the description, the job category, the last instance status, the runtime mode (manual or scheduled), the start and end date of the last instance and its duration, the version used, the job creator, and other job settings. You can also duplicate the latest job version or delete the job from the kebab menu (a).
A job alias is unique for each job within a project and allows you to refer to a job within another job.
It can be used in pipelines when the |
You also have information about the logs of the job. You can choose to display Saagie Logs, Pipeline Variable Logs, or Error Logs Only for each, and you can download them.

For external jobs, you can choose to display the External Connection Logs alone or simultaneously with the Saagie Logs. |
The second part of the page (2) provides information about the job consumption through different graphic resources. Use these graphs to check your job consumption during and after its execution.
![]() |
Monitoring the RAM consumption of your job can help you anticipate potential memory issues.
Indeed, a job that consumes more than the available RAM limit goes into an For more information on monitoring your platform resources, see About the Monitoring Module. |
The third part of the page (3) provides information about the pipelines related to the job and information about the technology, and either the job package if it is an embedded job, or the external connection if it is an external job.
|

Instances Page
- What is an instance?
-
An instance is a single run of a job or pipeline in a project. The execution information and logs of all instances are saved on your platform.
Whenever you run a job, a new instance of that job is created. All instances are saved and remain accessible: they are listed on the right side of the page (1). You can view the information of an instance by selecting it in the list. You can also delete instances from this list.

By default, the page opens on the last executed instance of the selected job.
The first part of the page (2) provides general information about the instance of the job, such as the instance number (specified by #001
in the title), its execution status, the start and end times, the duration, the version of the job used, and the external connection if it is an external job.
The second part of the page (3) provides information about the logs of the job. You can choose to display Saagie Logs, Pipeline Variable Logs, or Error Logs Only for each, and you can download them.

For external jobs, you can choose to display the External Connection Logs alone or simultaneously with the Saagie Logs. |
The third part of th page (4) provides information about the consumption of the selected job instance through different graphic resources. Use these graphs to check your job consumption during and after its execution.
![]() |
Versions Page
- What is a version?
-
A version is a single iteration of a job, pipeline, or app. Each new update is stored as a version, enabling you to roll back to previous iterations and keep track of successive changes.
Whenever you upgrade a job (1), a new version of that job is created and automatically defined as the Current
version.
All versions are saved and remain accessible: they are listed on the right side of the page (2).
You can view the information of a version by selecting it in the list. You can also delete versions from this list.
You can switch back and forth between versions as required by selecting a version from the list and clicking Rollback to this version (3).
This action of rolling back to another version of the job will make the selected version the new Current
version.
You can also define a version as major to highlight the most stable version of a job. Select a version from the list and click Set as major version (4) to label the version as Major version. Sparks will appear in front of the major version. Similarly, click Unset as a major version to remove the label from a version. |

By default, the page opens on the version of the job in use, tagged with the Current
badge
The first part of the page (5) provides general information about the version, such as the release note, the creation date and creator, and the runtime context.
The second part of the page (6) display:
-
For embedded jobs, information about the job package. You can edit it and modify the runtime context of the job technology, the package, the command lines, and add a release note.
-
For external jobs, information about the external connection. You can edit it and modify the name, the access key ID, the secret access key, and the region.