Overview

Deploy Confident AI on your AWS infrastructure for complete control over your evaluation and observability platform.

Architecture

The AWS deployment uses Terraform to provision a production-ready infrastructure with the following components:

AWS Self-Hosting Reference Architecture

Core components

VPC & Networking

Public/private/database subnets with NAT Gateway, Internet Gateway, and S3 VPC Endpoint.

EKS Cluster

Managed Kubernetes with autoscaling node groups and EBS CSI driver.

RDS PostgreSQL

Managed database in isolated subnets with automated backups, encryption, and password rotation.

ALB

HTTPS load balancer managed by ALB Controller with ACM certificates.

ArgoCD

GitOps tool for managing Kubernetes deployments.

S3 Storage

Private bucket with VPC endpoint for application data.

Deployed services

The Kubernetes cluster runs the following services:

ServiceNamespaceDescription
confident-backendconfident-aiExpress.js API service handling core platform logic
confident-frontendconfident-aiNext.js web application for the Confident AI dashboard
confident-evalsconfident-aiFastAPI service for running LLM evaluations
confident-otelconfident-aiOpenTelemetry collector for trace ingestion
redisconfident-aiIn-memory cache for session and queue management
ArgoCDargocdGitOps continuous delivery
ClickHouse Operatorclickhouse-operatorManages the analytics database
ALB Controllerkube-systemCreates ALBs from Kubernetes Ingress resources
External Secretsconfident-aiSyncs credentials from AWS Secrets Manager