Glossary


A B C D E F G H I J K L M N O P Q R S T U V W X Y Z


For Kubernetes-specific terms, see the Kubernetes glossary.

A


app

Docker image that can be mounted inside projects using one of the integrated technologies. Persistent storage can be assigned to an app to keep data in a defined volume path when the app is stopped.

Argo

Container-native workflow engine for orchestrating parallel jobs on Kubernetes. For more information, see the Argo documentation.

authorization

Rights granted to groups through the Security module. These rights include access to specific features of the app, access to datasets, and editing rights on individual projects.

C


Conf’o’rama

Component created by Saagie to import files for jobs and apps. Files are managed on a Kubernetes platform and accessible across that platform. Conf’o’rama’s API is exposed and can be used by Saagie users working with Kubernetes clusters.

D


dataset

Collection of data accessible on the Governance data portal.

dataset access

Set of rules defining which groups can access which datasets in Saagie.

Docker credentials

Login information stored in projects allowing you to pull Docker images from private registries when creating apps.

domain

Entity in the Governance module that groups datasets. Domain organization is flexible to fit the company’s data organization (for example, by geography, branch, or team).

E


environment variable

Dynamic values that can be referenced in jobs, limiting the need to modify job execution code each time the value of the variable changes.
Environment variables can be created at the platform (global) or project level to ensure proper control over who can access and use the variables.

F


Fluent Bit

Multi-platform log processor and forwarder that collects data and logs from different sources, then unifies and sends them to multiple destinations. For more information, see the Fluent Bit documentation.

FreeIPA

RedHat tool used for Identity Management. For more information, see FreeIPA.

G


group

Entity in which a selection of users is placed. Groups are granted authorizations in the Projects and Governance modules. When an administrator grants authorizations to a group, each user in that group receives those authorizations.

H


Helm

Helm Charts help you define, install, and upgrade Kubernetes applications. For more information, see the Helm documentation.

I


instance

A single run of a job or pipeline in a project. The execution information and app and job logs for all instances are saved on your platform.

J


job

Computation task performed inside projects on Saagie using one of the integrated technologies. Jobs run through a command line and can be launched individually (in both manual and scheduled modes) or as part of a data pipeline.

K


Kafka

Also called Apache Kafka, it is a distributed streaming platform used to facilitate communication between systems and applications. It must be used with ZooKeeper. In Saagie, Kafka is used to manage communication between Saagie components. For more information, see the Kafka documentation.

Keycloak

Security-oriented federation and identity provider (IdP) server. In Saagie, Keycloak is used with LDAP to get security tokens. For more information, see the Keycloak documentation

M


Manager

Legacy user interface of Saagie. It allows the creation of extraction, processing, and smart app jobs, as well as access to data lake and data mart services. In the Manager module, all jobs are listed in a single menu, while in the Projects module, they are organized by project to sort them more easily.

Mesos/Chronos/Marathon

Tool used in Saagie legacy orchestration. Chronos schedules jobs in Mesos. Marathon ensures that jobs restart automatically if a job fails in Mesos.

MinIO

Object storage server. It is compatible with Amazon S3 cloud storage service. In Saagie, MinIO is installed in the Kubernetes cluster to store files. For more information, see the MinIO documentation.

MongoDB

Document-oriented NoSQL database that uses JSON-like documents with optional schemas. Queries are made in JavaScript. For more information, see the MongoDB documentation.

MySQL

Relational database management system. For more information, see the MySQL documentation.

P


pipeline

End-to-end process encompassing multiple jobs that are connected and executed within projects. A pipeline can include advanced orchestration logic, such as conditions or parallel job executions.

platform
  1. Individual Saagie environment. A customer can have several platforms—for example, experimentation, pre-production, and production—with the possibility to promote jobs between each of them when moving from one phase to another.

  2. Kubernetes CustomResourceDefinition (CRD) created by Saagie. The Saagie platform is created during the installation process and information is saved in MongoDB. This automatically prompts the creation of a Kubernetes platform.

platform-k8s-controller

Kubernetes controller created by Saagie to manage CRD platforms in a Kubernetes cluster.

PostgreSQL

Relational database management system. For more information, see the PostgreSQL documentation.

project
  1. Entity set up within a platform in which you can create jobs, pipelines, and apps. Projects are isolated from each other allowing you to manage their access rights.

  2. Kubernetes CustomResourceDefinition (CRD) created by Saagie. When you create a project on Saagie, a message is sent to Kafka. Kafka reads the message and prompts for the creation of a project on Kubernetes. Pipelines, apps, and jobs are then executed using the Kubernetes project. Each project environment is isolated, which means that projects cannot communicate with other projects.

project-k8s-controller

Kubernetes controller created by Saagie to manage CRD projects in a Kubernetes cluster.

projects-and-jobs

Component created by Saagie to manage projects and their elements, such as jobs, apps, or pipelines. Projects-and-jobs' API is exposed and can be used by all Saagie users.

R


Realm

Secures and manages security isolation for a given Saagie customer. In Saagie, it is a string used in the application URL and request headers.

S


Scredz

Component created by Saagie to store Docker credentials used for private images in jobs and apps.

storage

Persistent storage that can be allocated to an app to retain data in a defined volume path.

U


user

Single Saagie account created on one or more platforms. Authorizations are granted to users by administrators.

V


version

Single iteration of a job, pipeline, or app. Each new update is stored as a version, enabling you to roll back to previous iterations and keep track of successive changes.

W


workflow

Kubernetes resource created by Argo that facilitates the creation and management of workflows. In Saagie, we use it to operate pipelines and jobs.

Z


ZooKeeper

Centralized service used to maintain configuration information and provide distributed synchronization and group services. In Saagie, ZooKeeper is mainly used to maintain and operate a Kafka cluster. For more information, see the ZooKeeper documentation.