
Introduction

Generate synthetic goldens from your own data sources

Overview

Synthetic data generation automatically creates goldens from your existing data sources: documents stored in Google Drive, messages in Slack channels, pages in Notion, or files in SharePoint. Instead of writing goldens by hand one at a time, you connect a data source and let Confident AI generate evaluation-ready goldens at scale.

How It Works

At a high level, synthetic data generation follows three steps:

1. Connect a Data Source

Navigate to Project Settings > Data Sources and connect an external source such as Google Drive, Slack, Notion, or SharePoint. Each source type requires its own set of credentials.

2. Create a Generation Config

Under Datasets > Automations, create a generation configuration that points to your data source. You can control parameters like the maximum number of goldens generated per context chunk.

3. Generate

Click Generate. Confident AI pulls documents from your data source, chunks them into contexts, and uses an LLM to synthesize goldens, each with an input, an expected output, and context fields.
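
To make the flow above concrete, here is a minimal local sketch of the chunk-then-synthesize idea in Python. It is not Confident AI's implementation: `Golden`, `GenerationConfig`, `chunk_document`, `stub_synthesize`, and `generate_goldens` are hypothetical names, the fixed-size character chunking is an assumption, and the LLM call is replaced by a stub so the example runs offline. The only detail taken from this page is that a config can cap the number of goldens generated per context chunk.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Golden:
    # Fields mirror the ones named on this page: input, expected output, context.
    input: str
    expected_output: str
    context: List[str]


@dataclass
class GenerationConfig:
    # Hypothetical config object; only the per-context cap comes from the docs.
    max_goldens_per_context: int = 2
    chunk_size: int = 200  # characters per chunk (assumed unit, for illustration)


def chunk_document(text: str, chunk_size: int) -> List[str]:
    """Split a document into fixed-size character chunks, dropping empty ones."""
    chunks = [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]
    return [c.strip() for c in chunks if c.strip()]


def stub_synthesize(context: str, n: int) -> List[Golden]:
    """Stand-in for the LLM call: returns n placeholder goldens for one context."""
    return [
        Golden(
            input=f"Question {i + 1} about: {context[:30]}",
            expected_output=f"Answer {i + 1} grounded in the context",
            context=[context],
        )
        for i in range(n)
    ]


def generate_goldens(
    documents: List[str],
    config: GenerationConfig,
    synthesize: Callable[[str, int], List[Golden]] = stub_synthesize,
) -> List[Golden]:
    """Chunk each document into contexts and synthesize goldens per context."""
    goldens: List[Golden] = []
    for doc in documents:
        for context in chunk_document(doc, config.chunk_size):
            produced = synthesize(context, config.max_goldens_per_context)
            # Enforce the cap even if the synthesizer returns more than asked.
            goldens.extend(produced[: config.max_goldens_per_context])
    return goldens


# Example: one 460-character document split into 200-character contexts.
docs = ["Refunds are processed within 5 business days. " * 10]
config = GenerationConfig(max_goldens_per_context=2, chunk_size=200)
goldens = generate_goldens(docs, config)
print(f"Generated {len(goldens)} goldens")
```

The synthesizer is passed in as a callable so the stub can be swapped for a real LLM-backed function without touching the chunking or the cap. On the platform itself, all of this happens behind the Generate button.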

Supported Data Sources

Google Drive

Connect a shared folder and generate goldens from .txt, .pdf, and .docx files.

Slack

Generate goldens from channel message histories.

Notion

Pull page content and generate goldens from your knowledge base.

SharePoint

Connect via Azure AD and generate goldens from SharePoint files.
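
Since the Google Drive connector is described above as reading .txt, .pdf, and .docx files, it can help to check a folder's contents before connecting it. The snippet below is a small local pre-check sketch, not part of the platform: `SUPPORTED_DRIVE_EXTENSIONS` and `is_supported_drive_file` are hypothetical names, and treating extensions as case-insensitive is an assumption.

```python
from pathlib import PurePosixPath

# File types listed for the Google Drive connector on this page.
SUPPORTED_DRIVE_EXTENSIONS = {".txt", ".pdf", ".docx"}


def is_supported_drive_file(filename: str) -> bool:
    """Return True if the file's extension is one of the listed types.

    Case-insensitive matching is an assumption made for this sketch.
    """
    return PurePosixPath(filename).suffix.lower() in SUPPORTED_DRIVE_EXTENSIONS


files = ["faq.pdf", "notes.TXT", "roadmap.pptx", "policy.docx"]
supported = [f for f in files if is_supported_drive_file(f)]
skipped = [f for f in files if not is_supported_drive_file(f)]
print("supported:", supported)
print("skipped:", skipped)
```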

Next Steps

Data Source Connectors

Learn how to set up credentials and connect each supported data source.

Manage Datasets

Learn how to review, finalize, and manage the goldens that were generated.