Skip to content

ci-apdex-violating-slo

Runner Manager’s queues violating the SLI of the ci-runners service

Section titled “Runner Manager’s queues violating the SLI of the ci-runners service”

To Check the overall health of the runners:

This alert has the following possible causes, in the first few minutes it is important to determine the high-level cause before investigating further, the following are the common three causes of this alert:

Look for quota-exceeded errors in logs to determine if we are hitting any GCP gitlab-ci project quotas that are causing scaling issues: https://log.gprd.gitlab.net/goto/8f65b43718b6e95ccf5f6972e7ca1887

Check the Quotas Runbook for more details.

If we believe there is a GCP scaling or quota issue:

If we believe there is a problem with PostgreSQL:

See https://gitlab.com/gitlab-com/runbooks/-/blob/master/docs/ci-runners/ci-abuse-handling.md