This section covers how to programmatically manage goldens in datasets using the Evals API. When pushing, set finalized=True to make goldens available for evaluation, or finalized=False to queue them for review.

Push goldens to a dataset. If the dataset does not already exist, Confident AI will create it for you.
For single-turn datasets:
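A minimal sketch of a single-turn push, assuming the deepeval Python client exposes EvaluationDataset, Golden, and a push method that accepts alias and finalized (check this against your installed version). The helper name push_single_turn_goldens and its parameters are hypothetical, introduced only for illustration.

```python
from typing import List


def push_single_turn_goldens(
    alias: str, inputs: List[str], finalized: bool = True
) -> None:
    """Push one Golden per input string to the dataset named `alias`."""
    # Imported inside the function so this module still loads where
    # deepeval is not installed (assumption: deepeval is the client in use).
    from deepeval.dataset import EvaluationDataset, Golden

    goldens = [Golden(input=text) for text in inputs]
    dataset = EvaluationDataset(goldens=goldens)

    # finalized=True makes the goldens available for evaluation;
    # finalized=False queues them for review instead.
    dataset.push(alias=alias, finalized=finalized)
```

Calling push_single_turn_goldens("My Dataset", ["What is your refund policy?"]) would send one golden to a dataset with that alias. The call needs a valid Confident AI API key and network access.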
For multi-turn datasets:
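A comparable sketch for multi-turn data, assuming deepeval provides a ConversationalGolden class whose scenario field describes the conversation to simulate, and that EvaluationDataset accepts a list of them. The helper name and the idea of building one golden per scenario string are illustrative assumptions, not the documented workflow.

```python
from typing import List


def push_multi_turn_goldens(
    alias: str, scenarios: List[str], finalized: bool = True
) -> None:
    """Push one ConversationalGolden per scenario to the dataset `alias`."""
    # Imported inside the function so this module still loads where
    # deepeval is not installed (assumption: deepeval is the client in use).
    from deepeval.dataset import ConversationalGolden, EvaluationDataset

    goldens = [ConversationalGolden(scenario=text) for text in scenarios]
    dataset = EvaluationDataset(goldens=goldens)

    # Same flag as for single-turn pushes: finalized=False queues for review.
    dataset.push(alias=alias, finalized=finalized)
```

Keeping single-turn and multi-turn goldens in separate datasets mirrors the split this section makes between the two dataset types.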
You can include custom column values when pushing goldens. If a custom column does not already exist on the dataset, Confident AI will create it for you.
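As a rough illustration, custom column values can be attached to a golden before it is pushed. This sketch assumes the Golden class accepts a custom_column_key_values dictionary that maps column names to string values. Treat that field name as an assumption to verify against your deepeval version. The helper name and the column names in the usage note are hypothetical.

```python
from typing import Dict


def push_golden_with_custom_columns(
    alias: str, input_text: str, custom_columns: Dict[str, str]
) -> None:
    """Push a single golden carrying custom column values."""
    # Imported inside the function so this module still loads where
    # deepeval is not installed (assumption: deepeval is the client in use).
    from deepeval.dataset import EvaluationDataset, Golden

    golden = Golden(
        input=input_text,
        # Assumed field name: maps custom column name -> value.
        custom_column_key_values=custom_columns,
    )
    EvaluationDataset(goldens=[golden]).push(alias=alias)
```

For example, push_golden_with_custom_columns("My Dataset", "How do I reset my password?", {"priority": "high"}) would push one golden with a priority column set to high.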
Delete a dataset programmatically via the Evals API.
This action cannot be undone. All goldens or conversational goldens in the dataset will be permanently deleted.
You can push or manage datasets in any project by configuring a CONFIDENT_API_KEY.
Either set CONFIDENT_API_KEY as an environment variable, or pass confident_api_key directly when creating the EvaluationDataset. When both are provided, the confident_api_key passed to EvaluationDataset always takes precedence over the environment variable.
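The precedence rule can be sketched in plain Python. This is not deepeval's implementation, only a model of the behaviour described above. The function name resolve_confident_api_key is hypothetical.

```python
import os
from typing import Optional


def resolve_confident_api_key(explicit_key: Optional[str] = None) -> Optional[str]:
    """Model of the rule: an explicitly passed key beats the environment."""
    if explicit_key is not None:
        # Corresponds to passing confident_api_key to EvaluationDataset.
        return explicit_key
    # Fall back to the CONFIDENT_API_KEY environment variable, if set.
    return os.environ.get("CONFIDENT_API_KEY")
```

This makes it possible to keep a default project key in the environment and override it for a single dataset that belongs to another project.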
Now that you know how to push goldens, learn how to pull them for evaluation.