About the Monitoring Module

Platform Overview Page

The The "Overview" page icon is a square divided into several other squares. Overview page of a platform provides information about the resource consumption of the selected platform.

By default, the Platform Overview page opens when you click the The "Monitoring" module icon is a heart with an electrocardiogram in it. Monitoring module.

The page gives you an overview of your platform’s node consumption and reservation. It is composed of a two-column table describing the consumption of CPU CPU and RAM RAM resources.
This is the entry point for monitoring the overall capacity of your platform and the health status of the associated nodes. Based on these information, you can adjust the limits and requests accordingly.

To have proper access to monitoring details, your administrator must have configured Saagie to isolate the workload between platforms. This is done by specializing your nodes per platform. If workload isolation has not been configured, the The "Monitoring" module icon is a heart with an electrocardiogram in it. Monitoring module will not be fully operational, as all your cluster nodes will be displayed on your platform without disassociation. For more information, see Node Isolation.

Example 1. Saagie in isolated and non isolated mode.

Here is the example of Saagie installed with two platforms: One installation with your isolated workload and the other without.

Saagie installed in isolated mode

If you have four nodes in your cluster, the The "Overview" page icon is a square divided into several other squares. Platform Overview page will only show the node(s) that are dedicated to the selected platform. If a node is not labeled, it will not be considered for Saagie runs and will not appear.

monitoring overview platform iso p1
Figure 1. The platform Demo1 has one specialized node.
monitoring overview platform iso p2
Figure 2. The platform Demo2 has two specialized nodes.
Saagie installed in non-isolated mode

If you have four nodes in your cluster, the The "Overview" page icon is a square divided into several other squares. Platform Overview page will show all the cluster nodes on each platform without dissociation, even if no resources are used on those nodes by your platform.

monitoring overview platform niso p1
Figure 3. The platform Demo1 displays all the cluster nodes without dissociation.
monitoring overview platform niso p2
Figure 4. The platform Demo2 displays all the cluster nodes without dissociation.

App, Job, and Pipeline Resource Consumption Graphs

The The "Overview" page icon is a square divided into several other squares. Overview and The "Instances" page icon is three overlapping squares. Instances pages for apps, jobs, and pipelines also include graphs that allow you to track consumption as the job, app, or pipeline runs on the node.
It can help you quickly identify bottlenecks, debug jobs and apps going OOM Out Of Memory, and better optimize resource usage on your platform.

You can click the graph line to display tooltips with more information, and you can zoom in on a specific period.

monitoring graph consumption jobs apps overview