Run LLM Evals

Run online evals for your test cases using the metrics in metricCollection.

Headers

CONFIDENT_API_KEY (string, required)
The API key of your Confident AI project.

Request

metricCollection (string, required)
The name of the metric collection you wish to use for evaluation.

llmTestCases (list of objects, optional)
A list of single-turn test cases to evaluate. If you are evaluating multi-turn test cases, leave this null.

conversationalTestCases (list of objects, optional)
A list of multi-turn test cases to evaluate. If you are evaluating single-turn test cases, leave this null.

hyperparameters (map from strings to any, optional)
Any hyperparameters, such as the model or prompt, that you wish to associate with the test run.

identifier (string, optional)
A unique identifier for the test run.
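A minimal sketch of a single-turn request using Python's requests library. The endpoint URL and the test case field names ("input", "actualOutput") are illustrative assumptions, not taken from this reference; substitute the values for your project.

```python
import os
import requests

# Placeholder URL -- substitute the actual Run LLM Evals endpoint.
EVALS_URL = "https://api.confident-ai.com/v1/evaluate"

payload = {
    "metricCollection": "My Metric Collection",
    # Single-turn test cases; the field names below are assumptions.
    "llmTestCases": [
        {
            "input": "What is the capital of France?",
            "actualOutput": "Paris is the capital of France.",
        }
    ],
    # Evaluating single-turn cases, so the multi-turn list stays null.
    "conversationalTestCases": None,
    # Optional metadata to associate with this test run.
    "hyperparameters": {"model": "gpt-4o", "prompt": "summarizer-v2"},
    "identifier": "nightly-run-001",
}

response = requests.post(
    EVALS_URL,
    # The project API key travels in the CONFIDENT_API_KEY header.
    headers={"CONFIDENT_API_KEY": os.environ["CONFIDENT_API_KEY"]},
    json=payload,
)
response.raise_for_status()
```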

Response

This endpoint returns an object.
success (boolean)
True if the test cases were successfully evaluated.

data (object)

deprecated (boolean)
True if this endpoint is deprecated.
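A short sketch of handling the returned object, continuing from the request above. The shape of data is not documented here, so the sketch only passes it through; inspect it before relying on specific fields.

```python
def handle_eval_response(result: dict) -> dict:
    """Process the object returned by the Run LLM Evals endpoint."""
    if result.get("deprecated"):
        # The endpoint flags its own deprecation; plan a migration if set.
        print("Warning: this endpoint is deprecated.")
    if not result.get("success"):
        raise RuntimeError(f"Evaluation failed: {result}")
    # "data" holds the evaluation output; its exact shape is not
    # specified in this reference.
    return result["data"]


data = handle_eval_response(response.json())
```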