Prometheus Metrics
Prometheus metrics exported by JobSet
JobSet exposes Prometheus metrics that you can use to monitor the health of the controller.
JobSet controller health
Use the following metrics to monitor the health of the JobSet controller:
Metric name | Type | Description | Labels |
---|---|---|---|
`controller_runtime_reconcile_errors_total` | Counter | The total number of reconciliation errors encountered by each controller. | `controller`: name of the controller (use the value `jobset` to select metrics for the JobSet controller) |
`controller_runtime_reconcile_time_seconds` | Histogram | The latency of a reconciliation attempt, in seconds. | `controller`: name of the controller (use the value `jobset` to select metrics for the JobSet controller) |
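
As a rough illustration of how the `controller` label narrows these metrics to JobSet, the Python sketch below parses a small piece of Prometheus text exposition output and reads the two metrics for `controller="jobset"`. The sample text and all of its numbers are made up for illustration, and the helper names `parse_samples` and `value_for` are hypothetical. Fetching the text from a real metrics endpoint is left out, since the address depends on how the controller is deployed. The parser is deliberately simplified: it ignores optional timestamps and escaped characters in label values.

```python
import re

# Hypothetical scrape output. The values are invented for illustration only.
SAMPLE = """\
# TYPE controller_runtime_reconcile_errors_total counter
controller_runtime_reconcile_errors_total{controller="jobset"} 3
controller_runtime_reconcile_errors_total{controller="other"} 7
# TYPE controller_runtime_reconcile_time_seconds histogram
controller_runtime_reconcile_time_seconds_bucket{controller="jobset",le="0.1"} 45
controller_runtime_reconcile_time_seconds_bucket{controller="jobset",le="+Inf"} 50
controller_runtime_reconcile_time_seconds_sum{controller="jobset"} 2.5
controller_runtime_reconcile_time_seconds_count{controller="jobset"} 50
"""

# metric_name{label="value",...} value
LINE_RE = re.compile(r'^([a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(.*)\})?\s+(\S+)$')
LABEL_RE = re.compile(r'(\w+)="([^"]*)"')


def parse_samples(text):
    """Return a list of (name, labels, value) tuples, skipping comment lines."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        match = LINE_RE.match(line)
        if match is None:
            continue  # Not a simple "name{labels} value" line; skip it.
        name, raw_labels, value = match.groups()
        labels = dict(LABEL_RE.findall(raw_labels or ""))
        samples.append((name, labels, float(value)))
    return samples


def value_for(samples, name, controller="jobset"):
    """Sum the samples of one metric whose controller label matches."""
    return sum(
        value
        for sample_name, labels, value in samples
        if sample_name == name and labels.get("controller") == controller
    )


samples = parse_samples(SAMPLE)

# Counter: total reconciliation errors for the JobSet controller.
errors = value_for(samples, "controller_runtime_reconcile_errors_total")

# Histogram: Prometheus exposes _sum and _count series next to the buckets,
# so the mean latency is the total observed seconds divided by the attempts.
total_seconds = value_for(samples, "controller_runtime_reconcile_time_seconds_sum")
attempts = value_for(samples, "controller_runtime_reconcile_time_seconds_count")
mean_latency = total_seconds / attempts if attempts else 0.0

print(f"reconcile errors: {errors}")
print(f"mean reconcile latency (s): {mean_latency}")
```

Because the error metric is a counter, its absolute value matters less than how fast it grows between scrapes. A mean computed from `_sum` and `_count` covers the whole lifetime of the process, so in practice a monitoring system would usually compare values across a time window instead.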
Last modified February 9, 2024: add jobset docsite (7edbdd2)