Skip to content

AI-Assisted Service

AI Assisted is a dedicated Rails fleet that provides an AI Abstraction Layer in front of AI Gateway. It handles AI-specific requests that are often long running and depend on external services (such as Vertex AI and Anthropic). To prevent these requests from occupying Puma workers on the main fleet, potentially impacting performance, AI Assisted runs on an isolated fleet to safely serve these endpoints without affecting core GitLab traffic.

For diagram, refer to architecture blueprint.

Currently, all traffic served by the AI Assisted service supports Code Suggestions features, such as Code Generation and Completion, which are owned by the Code Creation Group. However, if additional feature teams begin contributing endpoints under /api/v4/ai_assisted in the future, service ownership may need to be reassessed.

The service is implemented across the following projects:

AI-Assisted Service is deployed in:

  • Staging
  • Production

AI-Assisted Service is currently deployed across 3 pods in the following regions:

  1. us-east1-b
  2. us-east1-c
  3. us-east1-d

AI-Assisted Service includes the following SLIs/SLOs: