Prometheus Metrics

Prometheus metrics exported by Jobset

Prometheus Metrics

JobSet exposes prometheus metrics to monitor the health of the controller.

JobSet controller health

Use the following metrics to monitor the health of the jobset controller:

Metric name Type Description Labels
controller_runtime_reconcile_errors_total Counter The total number of reconciliation errors encountered by each controller. controller: name of controller (i.e. use value jobset to obtain metrics for jobset controller)
controller_runtime_reconcile_time_seconds Histogram The latency of a reconciliation attempt in seconds. controller: name of controller (i.e. use value jobset to obtain metrics for jobset controller)
Last modified February 9, 2024: add jobset docsite (7edbdd2)