Build the reason AI can be trusted

You'll work alongside people who care deeply about the problem and each other. No ego, no busywork — just hard problems, fast shipping, and a team that has your back.

WHY US

Why Confident AI.

Confident AI is a small, fast-moving team building the infrastructure that makes AI trustworthy. We started by building DeepEval, one of the most used packages for LLM evaluation in the world, used by companies such as OpenAI, Google, and Microsoft.

  • The problem matters. AI is shipping to production faster than anyone can verify it works. We're building the trust layer.
  • Small team, outsized impact. A handful of people used by hundreds of thousands of developers — from solo builders to OpenAI and Google.
  • Speed is the culture. Ideas go from conversation to production in days, not months.
  • Real ownership. You pick up a problem, you own it end-to-end — architecture, implementation, shipping, and the metrics that prove it worked.

If you want to do the best work of your career and actually see it matter, this is the place.

OUR CULTURE

What We Value.

No excuses, no BS
If something is wrong, say it so someone can help. We don't sugarcoat, we don't dance around problems, and we don't let ego get in the way of fixing what's broken. Directness isn't rude here — it's respected.
Ownership
You don't wait to be told. You see the problem, you pick it up, you see it through. You test your own work, catch your own mistakes, and ship things you'd stake your name on. Nobody here is checking behind you — because they shouldn't have to.
First principles thinking
We don't do things because that's how they're done. Every decision gets pressure-tested. If the best answer is uncomfortable or unfamiliar, good — that's usually the right direction.
Customer obsession
We exist to solve our customers' problems. We talk to them directly, we respond fast, and we never leave them guessing or ghosted. If a customer has a problem, it's our problem — and they'll always know where they stand with us.
Radical transparency
Hiding a problem won't make it go away. We surface issues early, share context openly, and trust each other with the full picture. No politics, no back-channels — just the truth, delivered with respect.
Never stop sharpening
Nobody here will nag you to get better. We hire people who are already wired that way — who read, ask questions, seek feedback, and come back sharper every week. Growth here isn't a performance review conversation. It's just how you operate.
OPEN POSITIONS

Join our team.

Engineering

Founding Product Engineer (Frontend)

San Francisco$175K–$250K base + equityEngineering

Overview

Confident AI is building the infrastructure that makes AI trustworthy. Engineering teams spend hours a day inside our platform looking at traces, evals, and test results — the frontend isn't a layer on top of the product. It is the product.

We're hiring a Founding Product Engineer to own that product end-to-end. You'll talk to users, decide what gets built, design the UI yourself, and ship it — from the component layer down to the API routes that power it. No PM writing specs, no designer handing you mocks.

As a founding engineer, the quality bar you set and the interfaces you ship will be the reason engineering teams choose us over the alternatives.

What you'll be doing

  • Own user-facing features from product decision to design to shipped code. You make the design calls — there are no Figma files waiting for you.
  • Build data-dense, fast, polished interfaces for traces, evaluation results, and testing workflows — the screens our users live in every day.
  • Talk to users, understand their workflows, and turn what you learn into product decisions. You're the engineer in the room closest to the customer.
  • Work across the stack. Most of your time is in the frontend, but you'll write the API routes, queries, and server logic your features need without waiting on anyone.
  • Architect the frontend to last — the components, state management, and patterns you establish need to hold up as the product and team grow around them.
  • Set the standard for product quality and user experience that every engineer we hire after you builds to.

You should be someone who

  • 3+ years building production web applications, ideally at a fast-moving product company or startup.
  • Deep proficiency with React, Next.js, TypeScript, and CSS. You don't reach for a UI library because you can't write the styles yourself — you reach for one when it's the right call.
  • A genuine eye for design. You notice when spacing is off by 2px, you have opinions on motion and hierarchy, and you can take a feature from idea to polished UI without a designer.
  • You understand frontend at scale — rendering performance, state management, data fetching, and architecture that doesn't collapse as the product grows.
  • Enough backend fluency to move fast — databases, APIs, caching, auth. You won't be scaling them, but you build against them confidently.
  • Fluent with AI coding tools like Claude Code and Cursor as part of your daily workflow. At our size, every engineer operates at a multiplied level.
  • You think in user experiences, not components. The question you ask is 'what should this feel like to use,' not 'what props does this take.'
  • High-agency and comfortable with ambiguity. We're seed-stage — you identify what matters, make a plan, and ship.

Your work will

  • Be the reason our platform feels like the best product in AI infrastructure, not just the most capable one.
  • Own the user experiences that engineering teams interact with for hours every day.
  • Set the product engineering bar the rest of the team builds to as we scale from seed to market leader.

By joining us, you will

  • Full ownership: you decide what gets built and how it feels, and you ship it yourself.
  • A seat at the table: direct access to the founding team at the stage where every product decision compounds.
  • The problem: you'll work on what makes all other AI work trustworthy. The impact ceiling is massive.
Developer Relations

Founding Developer Advocate

San Francisco$130K–$175K base + equityDeveloper Relations

Overview

Confident AI is building the infrastructure that makes AI trustworthy. We created DeepEval, the open-source evaluation framework, and we're building the commercial platform that engineering teams use to ship reliable AI products. We have strong product-market fit, a developer community that's growing fast, and teams actively choosing us.

We're looking for a Founding Developer Advocate to own the developer experience from first touch to activation — across both our open-source framework and our commercial platform. You'll create the content, build the community, and represent us at events alongside the founding team.

This is a founding role with a seat at the table. You'll have full freedom to decide what to build, what to say, and how to say it. You won't be executing someone else's content calendar — you'll define the strategy and own the results. When developers tell you something about the product isn't working, you're in the room changing the roadmap, not filing a ticket.

What you'll be doing

  • Own developer content strategy and execution across both DeepEval (open-source) and the Confident AI platform (commercial product). These are distinct products with different audiences and different adoption paths — you'll understand both and create content that serves each.
  • Create onboarding content, demo videos, tutorials, and technical walkthroughs that help developers get value from the product fast.
  • Build and grow our developer community. Be present in the forums, Discord channels, GitHub discussions, and social platforms where our users spend time. Engage with them as a peer, not a marketer.
  • Represent Confident AI at developer events, meetups, and conferences alongside the founders. We're all out there building relationships and talking to developers — you'll be a key part of that.
  • Write technical blog posts, thought leadership, and sharp content that positions us as the authority in AI evaluation and testing infrastructure. Real insight, not recycled takes.
  • Be the voice of the developer internally. You'll have direct influence on product decisions based on what you're hearing from the community.
  • Own competitive positioning in developer conversations. Make sure we show up in every discussion where engineering teams are evaluating AI infrastructure solutions.
  • Coordinate with the founding team on product launches across both open-source and commercial products.

You should be someone who

  • 3+ years of experience in developer relations or developer advocacy at a developer tools, open-source, or infrastructure company. This is non-negotiable — the autonomy we're offering requires that you've done this before and done it well.
  • Has an existing network in the developer tools and AI community. You know people, and people know you. When you vouch for a product, it carries weight.
  • Understands the difference between open-source community building and commercial product marketing, and can navigate both authentically.
  • Can actually write — clear, sharp technical content that developers respect, not marketing copy they scroll past.
  • Comfortable on camera and on stage. You'll be producing video content and speaking at events regularly — this isn't optional.
  • Proficient with AI tools like Claude Code and Cursor as part of your daily workflow.
  • Technical enough to understand the product deeply and speak credibly to engineering teams about AI evaluation, testing, and observability.
  • Self-directed and high-agency. You don't wait to be told what to do — you identify what matters, make a plan, and ship.
  • Comfortable with ambiguity and fast iteration. We're a seed-stage startup; the playbook doesn't exist yet.

Your work will

  • Be the reason developers go from signing up to becoming active, engaged users of both DeepEval and the Confident AI platform.
  • Shape how the developer community perceives us and the category we're defining.
  • Directly influence product direction based on what you're hearing from developers every day.
  • Build the community and content engine that scales with the company from seed to market leader.

By joining us, you will

  • Full autonomy: You own the strategy. We're not hiring you to follow a playbook — we're hiring you to write it.
  • A seat at the table: Direct access to the founding team, influence on product decisions, and a voice in company strategy.
  • The problem: You'll work on the problem that makes all other AI work trustworthy. The impact ceiling here is massive.
HIRING PROCESS

Our Hiring Process.

The entire process is usually fully remote and all communication happens over email or via video chat in Google Meet. We know that you may be interviewing elsewhere as well so are respectful of your time and will get back no later than 2 days of each step along the process.

The entire process has 4 steps and takes around 1.5 weeks in total:

  1. Initial 15-30 minute phone screening interview.
  2. One 30-45 minute technical interview.
  3. One week fully-paid work trial.
  4. Full-time offer.

No hires will be made without a work trial. You'll be working with the founders directly throughout the entire process. For any questions, email hiring@confident-ai.com.

Interested? Let's talk.