Confident AI Blog - Resources to help teams stay confident in AI
SlackJust In: New Slack Community! Connect with AI engineers building with Confident AI, join now →

Stay Confident

Subscribe to our weekly newsletter to stay confident in the AI systems you build.

The People's Choice of Top LLM Evaluation Tools in 2025

The People's Choice of Top LLM Evaluation Tools in 2025

In this article, we'll bring you a hand-picked, carefully curated list of top LLM evaluation tools in the market.

Jeffrey Ip

Jeffrey Ip

Jan 15, 2025
.
6 min read
The Comprehensive LLM Safety Guide: Navigate AI regulations and Best Practices for LLM Safety

The Comprehensive LLM Safety Guide: Navigate AI regulations and Best Practices for LLM Safety

In this article, you'll teach you about LLM regulations and how to maintain the safety of your LLM applications.

Kritin Vongthongsri

Kritin Vongthongsri

Nov 2, 2024
.
15 min read
How to Jailbreak LLMs One Step at a Time: Top Techniques and Strategies

How to Jailbreak LLMs One Step at a Time: Top Techniques and Strategies

In this article, I'll show you how to jailbreak your LLM application to detect it for vulnerabilities.

Kritin Vongthongsri

Kritin Vongthongsri

Oct 30, 2024
.
16 min read
What is LLM Observability? - The Ultimate LLM Observability Guide

What is LLM Observability? - The Ultimate LLM Observability Guide

In this article, I'll share what you should definitely look for in your next LLM Observability solution.

Kritin Vongthongsri

Kritin Vongthongsri

Oct 29, 2024
.
9 min read
Top LLM Chatbot Evaluation Metrics: Conversation Testing Techniques

Top LLM Chatbot Evaluation Metrics: Conversation Testing Techniques

In this article, you'll learn about LLM red teaming and how it can be carried out using DeepTeam.

Jeffrey Ip

Jeffrey Ip

Oct 5, 2024
.
10 min read
LLM-as-a-Judge Simply Explained: The Complete Guide to Run LLM Evals at Scale

LLM-as-a-Judge Simply Explained: The Complete Guide to Run LLM Evals at Scale

In this article, I'll debunk what LLM judges are and go through why they are the best for LLM evaluation.

Jeffrey Ip

Jeffrey Ip

Sep 1, 2024
.
13 min read
The Definitive LLM Security Guide: OWASP Top 10 2025, Safety Risks and How to Detect Them

The Definitive LLM Security Guide: OWASP Top 10 2025, Safety Risks and How to Detect Them

In this article, I'll go through all the major pillars of LLM security you must know and how to mitigate them.

Kritin Vongthongsri

Kritin Vongthongsri

Aug 19, 2024
.
12 min read
LLM Red Teaming: The Complete Step-By-Step Guide To LLM Safety

LLM Red Teaming: The Complete Step-By-Step Guide To LLM Safety

In this article, you'll learn about LLM red teaming and how it can be carried out using DeepTeam.

Kritin Vongthongsri

Kritin Vongthongsri

Jun 29, 2024
.
16 min read
Evaluating LLM Systems: Essential Metrics, Benchmarks, and Best Practices

Evaluating LLM Systems: Essential Metrics, Benchmarks, and Best Practices

In this article, you'll learn how to evaluate LLM systems using LLM evaluation metrics and benchmark datasets.

Jeffrey Ip

Jeffrey Ip

Jun 24, 2024
.
16 min read
Using LLMs for Synthetic Data Generation: The Definitive Guide

Using LLMs for Synthetic Data Generation: The Definitive Guide

In this article, I'm show you everything you need on how to generate realistic synthetic datasets using LLMs.

Kritin Vongthongsri

Kritin Vongthongsri

May 9, 2024
.
12 min read