@here Look At This Trace
TGIF! Thank god it’s features, here’s what we shipped this week:
Confident AI goes multi-player—and kills the context switch while it’s at it. Comments are now live across traces, spans, threads, and test cases, and when someone @-mentions you, it lands in your Slack with a direct link back to the exact trace. No more “screenshot this span and DM it to me,” no more five-tab scavenger hunts, no more “wait, which trace ID?” The conversation happens exactly where the data lives. That loop works because we also gave Slack & Discord a full glow-up this week—1-click setup, way more signals you can pipe through. And to the voice AI crowd: WebSocket response mode for AI Connections just shipped. We’re coming for you. Custom Dashboards also picked up enough new widgets that the beta sticker is barely hanging on. Oh, and Claude Opus 4.7 is now available everywhere—Arena, Experiments, Evaluations, Platform. Plus Prompt Auto-Refinement on failing test cases, traces, and spans, and image support on annotations. Scroll down, there’s a lot.

Added
- Comments - Stop screenshotting spans into Slack DMs. Comments are now live on traces, spans, threads, and test cases—with full permissions and @-mentions that ping your teammate’s Slack with a deep link straight back to the exact trace. No context switching, no “which trace again?”, no losing the thread across three tabs. The conversation happens where the data lives. Oh, and you can mute or be muted. Finally, a proper comment section.
- Revamped Slack & Discord Integrations - Our Slack and Discord integrations got a full rebuild: 1-click setup, way less config, and a lot more you can actually pipe through them—alerts, eval results, and @-mentions from comments, all landing in the channels your team already lives in. Channel your inner ops engineer.
- WebSocket Response Mode for AI Connections - Voice AI, we’re coming for you. AI Connections now speak WebSocket—true bidirectional, low-latency streaming for the stuff HTTP was never going to handle: voice agents, real-time assistants, long-running generations, anything where “wait for the full response” isn’t an option. If you’re building voice AI and you’re not on Confident AI yet, this is your sign. Socket to ‘em.
- Metric FN/FP/TP/TN Over Time for Online Evals - Online Evals now plot false negatives, false positives, true positives, and true negatives over time. Catch metric drift before it catches you. Positively informative.
- Native Annotation Test Cases - Annotations are now first-class test cases. Turn human feedback directly into evaluation data without any glue code or CSV gymnastics. Noted.
- Tables & Big Number Widgets for Custom Dashboards - Two new widget types land in Custom Dashboards: Tables for row-by-row detail and Big Number for the one metric that matters most. Dashboards are inching closer to general availability—count on it.
- Bar & Stacked Bar Graphs for Custom Dashboards - Bar and stacked bar charts join the Custom Dashboards widget lineup. Stack, compare, and break down your metrics any way you like. Raise the bar.
- Prompt Auto-Refinement - Point at a failing test case (single-turn or multi-turn), trace, or span, and Confident AI will auto-refine the prompt for you—no more staring at a broken output and guessing which instruction to tweak. Your prompts, on autopilot. Refined to taste.
- Image Support on Annotations - Annotations can now include images. Attach a screenshot of what went wrong, what it should’ve looked like, or the exact UI state that broke things. Human feedback with receipts. Picture perfect.
- Claude Opus 4.7 Everywhere - Opus 4.7 is now available across Arena, Experiments, Evaluations, and the Platform. Pick your battles, pick your model. A true magnum opus.
Changed
- Inline Table Editing - Editing values directly in tables got a serious polish pass—snappier, smarter, fewer misclicks, and a much better keyboard flow. The kind of upgrade you feel on every row.
- PortKey Model Slug Fetching - Automatically fetch the model slugs available to your org’s PortKey provider across Evaluation, Platform, and Arena. No more copy-pasting model names or guessing what’s available. Slug it out no more.
- Invitations for Organizations & Projects - Invitations now work at both the organization and project level. Bring people into the whole org or scope them to a single project—whichever fits the relationship. Invite-ing flexibility.