Pulp SLIs
Pulp uses Service Level Indicators (SLIs) to monitor the health and performance of its components. These SLIs are defined in the metrics catalog and are used to measure availability and latency for the service.
Overview
Section titled “Overview”The Pulp service has the following SLIs:
| SLI | Description | Type |
|---|---|---|
pulp_app_api | Pulp application API service requests | Apdex + Error Rate |
pulp_app_content | Pulp application content API requests | Apdex + Error Rate |
pulp_nginx | Nginx ingress controller load balancer requests | Apdex + Error Rate |
pulp_cloudsql | GCP CloudSQL PostgreSQL database operations | Request Rate |
pulp_gcs | GCS bucket storage operations | Request Rate |
pulp_redis | GCP Redis Memorystore caching operations | Request Rate |
Observability
Section titled “Observability”The SLIs appear in the Pulp Overview dashboard.
Application SLIs
Section titled “Application SLIs”pulp_app_api
Section titled “pulp_app_api”The pulp_app_api SLI monitors the Pulp API service, which handles administrative operations such as repository management, content synchronization, and user management.
Metrics:
api_request_duration_milliseconds_bucket- Request latency histogramapi_request_duration_milliseconds_count- Total request count
Significant Labels:
http_method- HTTP request method (GET, POST, PUT, DELETE, etc.)http_target- API endpoint pathhttp_status_code- HTTP response status code
Error Rate: Tracks 5xx HTTP status codes as errors.
pulp_app_content
Section titled “pulp_app_content”The pulp_app_content SLI monitors the Pulp Content API service, which handles package downloads and content delivery to clients (e.g., yum/dnf clients fetching packages).
Metrics:
content_request_duration_milliseconds_bucket- Request latency histogramcontent_request_duration_milliseconds_count- Total request count
Apdex Thresholds:
Significant Labels:
http_method- HTTP request methodhttp_route- Content API routehttp_status_code- HTTP response status code
Error Rate: Tracks 5xx HTTP status codes as errors.
Other Metrics
Section titled “Other Metrics”In addition to the SLIs above, the following metrics are available in the Pulp Overview dashboard.
Task Queue Metrics
Section titled “Task Queue Metrics”Longest Unblocked Task Wait Time
Section titled “Longest Unblocked Task Wait Time”Tracks how long the oldest unblocked task has been waiting in the queue. Lower values are better.
Metric:
Unblocked Task Queue Length
Section titled “Unblocked Task Queue Length”Tracks the number of unblocked tasks waiting to be processed. Lower values are better.
Metric:
Infrastructure SLIs
Section titled “Infrastructure SLIs”pulp_nginx
Section titled “pulp_nginx”Monitors the nginx ingress controller that load balances traffic to Pulp services.
Metrics:
nginx_ingress_controller_request_duration_seconds_bucket- Request latency histogramnginx_ingress_controller_requests- Total request count
Apdex Threshold:
Significant Labels:
method- HTTP methodpath- Request pathstatus- HTTP status code
pulp_cloudsql
Section titled “pulp_cloudsql”Monitors the GCP CloudSQL PostgreSQL instance used by Pulp.
Metrics:
Significant Labels:
database_id- Cloud SQL database identifierdatabase- Database nameoperation_type- Type of SQL operation
pulp_gcs
Section titled “pulp_gcs”Monitors the GCS bucket used for package storage.
Metrics:
Significant Labels:
bucket_name- GCS bucket namemethod- API method
pulp_redis
Section titled “pulp_redis”Monitors the GCP Redis Memorystore instance used for caching and session management.
Metrics:
Significant Labels:
instance_id- Redis instance identifier