Build the reason AI can be trusted

You'll work alongside people who care deeply about the problem and each other. No ego, no busywork — just hard problems, fast shipping, and a team that has your back.

WHY US

Why Confident AI.

Confident AI is a small, fast-moving team building the infrastructure that makes AI trustworthy. We started by building DeepEval, one of the most used packages for LLM evaluation in the world, used by companies such as OpenAI, Google, and Microsoft.

  • The problem matters. AI is shipping to production faster than anyone can verify it works. We're building the trust layer.
  • Small team, outsized impact. A handful of people used by hundreds of thousands of developers — from solo builders to OpenAI and Google.
  • Speed is the culture. Ideas go from conversation to production in days, not months.
  • Real ownership. You pick up a problem, you own it end-to-end — architecture, implementation, shipping, and the metrics that prove it worked.

If you want to do the best work of your career and actually see it matter, this is the place.

OUR CULTURE

What We Value.

No excuses, no BS
If something is wrong, say it so someone can help. We don't sugarcoat, we don't dance around problems, and we don't let ego get in the way of fixing what's broken. Directness isn't rude here — it's respected.
Ownership
You don't wait to be told. You see the problem, you pick it up, you see it through. You test your own work, catch your own mistakes, and ship things you'd stake your name on. Nobody here is checking behind you — because they shouldn't have to.
First principles thinking
We don't do things because that's how they're done. Every decision gets pressure-tested. If the best answer is uncomfortable or unfamiliar, good — that's usually the right direction.
Customer obsession
We exist to solve our customers' problems. We talk to them directly, we respond fast, and we never leave them guessing or ghosted. If a customer has a problem, it's our problem — and they'll always know where they stand with us.
Radical transparency
Hiding a problem won't make it go away. We surface issues early, share context openly, and trust each other with the full picture. No politics, no back-channels — just the truth, delivered with respect.
Never stop sharpening
Nobody here will nag you to get better. We hire people who are already wired that way — who read, ask questions, seek feedback, and come back sharper every week. Growth here isn't a performance review conversation. It's just how you operate.
OPEN POSITIONS

Join our team.

Marketing

Founding Marketing Lead

US or Remote$100-170k USD (+equity)Marketing

What you'll be doing

  • Own marketing end to end. We have strong product and a developer community that loves what we've built. Now we need someone to turn that into a brand the entire AI industry recognizes.
  • Own brand positioning, messaging, and narrative for the company and our products.
  • Build and scale distribution channels — SEO, content, partnerships, community, events.
  • Develop go-to-market strategy for our commercial platform alongside the founders.
  • Write and publish content that makes us the authority in our space — thought leadership, not just docs and tutorials.
  • Own competitive positioning and make sure we show up in every relevant conversation.
  • Define what growth looks like, measure it, and find new levers.

You should be someone who

  • Has done marketing at a dev tools or infrastructure company and understands how open-source adoption becomes commercial revenue.
  • Can actually write — clear, sharp copy that technical audiences respect.
  • Thinks in positioning and distribution, not just content calendars.
  • Comfortable building the marketing function from scratch, not inheriting one.
  • Moves fast, communicates clearly, and doesn't wait to be told what to do.
  • Willing to work 6 days a week in a high-intensity startup.

Your work will

  • Shape how the market thinks about us and the category we're defining.
  • Be the reason engineering teams choose us over the alternatives.
  • Build the brand and distribution engine behind one of the fastest-growing companies in AI infrastructure.

By joining us, you will

  • Work on the problem that makes all other AI work trustworthy.
  • Work directly with the founders at a stage where your decisions shape the company.
  • Be compensated well, with generous equity. This also means we expect a lot from you.
Engineering

Founding Platform Engineer

US or Remote$100-200k USD (+equity)Engineering

What you'll be doing

  • Own the reliability and uptime of Confident AI's platform. Everything works, 24/7 — that's your job.
  • Design and scale infrastructure to handle the high, unpredictable loads that come with AI workloads — large evaluation runs, bursty traffic, heavy data ingestion.
  • Build and maintain monitoring, alerting, and incident response so we catch problems before customers do.
  • Architect systems that scale horizontally — queuing, caching, database optimization, auto-scaling.
  • Support on-premise deployments for enterprise customers with strict security and compliance requirements.
  • Write infrastructure that is tested, documented, and doesn't require you to be awake for it to work.

You should be someone who

  • Has built and scaled production systems that handle real load — not toy projects, real traffic with real consequences when things break.
  • Knows PostgreSQL and ClickHouse deeply — performance tuning, query optimization, scaling patterns.
  • Experienced with Docker, Kubernetes, and AWS (EKS, ECS, or equivalent).
  • Understands distributed systems and the tradeoffs involved in making them reliable at scale.
  • Has dealt with multi-tenant architecture, data isolation, and security compliance (SOC 2, HIPAA).
  • Takes ownership. If the platform is down, you're already looking at it — nobody needs to ping you.
  • Willing to work 6 days a week in a high-intensity startup.

Your work will

  • Be the reason enterprise customers trust us with their AI evaluation infrastructure.
  • Keep a platform running that thousands of teams depend on daily.
  • Solve some of the hardest scaling problems in AI tooling — evaluation workloads are large, spiky, and unforgiving.

By joining us, you will

  • Own the platform at a company where reliability is the product.
  • Work directly with the founders at the stage where infrastructure decisions define the company.
  • Be compensated well, with generous founding equity. This also means we expect a lot from you.
Engineering & Growth

Founding Open-Source Growth Engineer

US or Remote$100-200k USD (+equity)Engineering & Growth

What you'll be doing

  • Build features for DeepEval across LLM evaluation and red teaming.
  • Write documentation and blog posts that the open-source community actually wants to read.
  • Distribute content across Reddit, Twitter, LinkedIn, and developer communities.
  • Own our Discord and GitHub community — answer questions, triage issues, build relationships.
  • Define what growth means for each channel, measure it, and find new distribution levers.
  • Form partnerships and integrations with other open-source projects.

You should be someone who

  • Codes proficiently in Python and TypeScript — this is an engineering role, not just a marketing role.
  • Writes well and enjoys it. Documentation, blog posts, community replies — you care about how things read.
  • Has a green GitHub profile and is already active in open-source.
  • Picks things up fast. You'll learn SEO, GEO, and growth strategies on the job.
  • Has genuine curiosity — you read papers, explore new tools, and stay close to what's happening in AI.
  • Communicates clearly and directly.
  • Willing to work 6 days a week in a high-intensity startup.

Your work will

  • Be used by hundreds of thousands of developers, from individual builders to teams at OpenAI and Google.
  • Educate thousands of people on how to properly evaluate their LLM applications.
  • Help grow DeepEval into the standard for AI evaluation.

By joining us, you will

  • Shape the future of LLM testing and evaluation.
  • Work directly with the founders, with a real path to an executive role.
  • Be compensated well, with generous founding equity. This also means we expect a lot from you.
HIRING PROCESS

Our Hiring Process.

The entire process is usually fully remote and all communication happens over email or via video chat in Google Meet. We know that you may be interviewing elsewhere as well so are respectful of your time and will get back no later than 2 days of each step along the process.

The entire process has 4 steps and takes around 1.5 weeks in total:

  1. Initial 15-30 minute phone screening interview.
  2. One 30-45 minute technical interview.
  3. One week fully-paid work trial.
  4. Full-time offer.

No hires will be made without a work trial. You'll be working with the founders directly throughout the entire process. For any questions, email [email protected].

Interested? Let's talk.