Build the reason AI can be trusted

You'll work alongside people who care deeply about the problem and each other. No ego, no busywork — just hard problems, fast shipping, and a team that has your back.

WHY US

Why Confident AI.

Confident AI is a small, fast-moving team building the infrastructure that makes AI trustworthy. We started by building DeepEval, one of the most widely used LLM evaluation packages in the world, adopted by companies such as OpenAI, Google, and Microsoft.

  • The problem matters. AI is shipping to production faster than anyone can verify it works. We're building the trust layer.
  • Small team, outsized impact. A handful of people building tools used by hundreds of thousands of developers, from solo builders to teams at OpenAI and Google.
  • Speed is the culture. Ideas go from conversation to production in days, not months.
  • Real ownership. You pick up a problem, you own it end-to-end — architecture, implementation, shipping, and the metrics that prove it worked.

If you want to do the best work of your career and actually see it matter, this is the place.

OUR CULTURE

What We Value.

No excuses, no BS
If something is wrong, say it so someone can help. We don't sugarcoat, we don't dance around problems, and we don't let ego get in the way of fixing what's broken. Directness isn't rude here — it's respected.
Ownership
You don't wait to be told. You see the problem, you pick it up, you see it through. You test your own work, catch your own mistakes, and ship things you'd stake your name on. Nobody here is checking behind you — because they shouldn't have to.
First principles thinking
We don't do things because that's how they're done. Every decision gets pressure-tested. If the best answer is uncomfortable or unfamiliar, good — that's usually the right direction.
Customer obsession
We exist to solve our customers' problems. We talk to them directly, we respond fast, and we never leave them guessing or ghosted. If a customer has a problem, it's our problem — and they'll always know where they stand with us.
Radical transparency
Hiding a problem won't make it go away. We surface issues early, share context openly, and trust each other with the full picture. No politics, no back-channels — just the truth, delivered with respect.
Never stop sharpening
Nobody here will nag you to get better. We hire people who are already wired that way — who read, ask questions, seek feedback, and come back sharper every week. Growth here isn't a performance review conversation. It's just how you operate.

OPEN POSITIONS

Join our team.

Marketing

Founding Head of Marketing

San Francisco (or Bay Area) | $120K–$175K base + 0.5%–1.5% equity | Marketing

Overview

Confident AI is building the infrastructure that makes AI trustworthy. We have strong product-market fit, a developer community that loves what we've built, and engineering teams actively choosing us. Now we need someone to turn that momentum into a brand the entire AI infrastructure industry recognizes.

We're looking for a Founding Head of Marketing to own marketing end-to-end — brand positioning, developer marketing, content strategy, SEO, distribution, and go-to-market — as our first dedicated marketing hire. You'll work directly with the founders at a stage where your decisions shape the company, the brand, and the category we're defining.

This is not a role where you inherit a playbook. You'll build the marketing function from scratch at a seed-stage startup, define what growth looks like, and create the distribution engine behind one of the fastest-growing companies in AI infrastructure. If you want to do the most consequential marketing work of your career at an early-stage developer tools company, this is it.

What you'll be doing

  • Own brand positioning, messaging, and narrative for the company and our products. You'll define how the market perceives us and the category we're creating in AI infrastructure.
  • Build and scale developer marketing distribution channels — SEO, technical content marketing, partnerships, developer community, and events — and figure out which levers drive adoption at each stage of growth.
  • Develop go-to-market strategy for our commercial platform alongside the founders, turning open-source developer adoption into commercial revenue.
  • Write and publish content that establishes us as the authority in AI infrastructure and developer tooling. This means thought leadership and sharp technical storytelling, not just documentation and tutorials.
  • Own competitive positioning and make sure we show up in every conversation where engineering teams are evaluating AI infrastructure solutions — including in AI-generated recommendations and search results.
  • Define growth marketing metrics, measure them honestly, and find new channels before the old ones plateau.
  • Represent the company at developer events, meetups, and conferences. You'll be a visible face of the brand — on stage, on camera, and in the communities where our users spend time.
  • Create demo videos, tutorials, and technical walkthroughs that help developers discover and adopt the product. You don't need to be a YouTuber, but you should be comfortable putting your face on content and shipping it consistently.
  • Coordinate product launches across engineering, bringing structure to how we ship, announce, and communicate publicly.

You should be someone who

  • Has 5+ years in marketing at a developer tools, open-source, or infrastructure company.
  • Understands how open-source developer adoption becomes commercial revenue, and has been part of making that happen.
  • Has built a marketing function or growth channel from scratch, not just operated within an existing one.
  • Has hands-on experience across multiple marketing disciplines: content strategy, product positioning, developer campaigns, and distribution.
  • Can actually write: clear, sharp copy that technical audiences respect, not marketing fluff they ignore.
  • Thinks in positioning and distribution, not content calendars.
  • Is comfortable translating complex technical concepts into narratives that resonate with engineering leaders, developer advocates, and individual developers alike.
  • Is comfortable on camera and on stage. You'll be producing video content and representing us at events; this isn't optional.
  • Is self-directed and high-agency. You don't wait to be told what to do; you identify what matters, make a plan, and execute.
  • Is comfortable with ambiguity and fast iteration. We're a seed-stage startup; the playbook doesn't exist yet.
  • Is an excellent communicator who keeps founders and teammates in the loop without being asked.
  • Is willing to work at startup intensity. We're a small team building something important, and we expect a lot from each other.

Your work will

  • Shape how the market thinks about us and the category we're defining.
  • Be the reason engineering teams choose us over the alternatives.
  • Build the brand and distribution engine that scales with the company from seed to market leader.

By joining us, you will

  • Earn a $120K–$175K base salary, depending on experience. We want this to be competitive enough that you're not distracted, but honest about our stage.
  • Receive 0.5%–1.5% equity (stock options), vesting over 4 years. You're employee #6 at a seed-stage company, and the equity reflects that.
  • Work side-by-side with the founding team at the stage where every decision compounds.
  • Work on the problem that makes all other AI work trustworthy. The impact ceiling here is massive.

Engineering

Founding Platform Engineer

US or Remote | $100K–$200K USD (+equity) | Engineering

What you'll be doing

  • Own the reliability and uptime of Confident AI's platform. Everything works, 24/7 — that's your job.
  • Design and scale infrastructure to handle the high, unpredictable loads that come with AI workloads — large evaluation runs, bursty traffic, heavy data ingestion.
  • Build and maintain monitoring, alerting, and incident response so we catch problems before customers do.
  • Architect systems that scale horizontally — queuing, caching, database optimization, auto-scaling.
  • Support on-premise deployments for enterprise customers with strict security and compliance requirements.
  • Write infrastructure that is tested, documented, and doesn't require you to be awake for it to work.

You should be someone who

  • Has built and scaled production systems that handle real load — not toy projects, real traffic with real consequences when things break.
  • Knows PostgreSQL and ClickHouse deeply — performance tuning, query optimization, scaling patterns.
  • Is experienced with Docker, Kubernetes, and AWS (EKS, ECS, or equivalent).
  • Understands distributed systems and the tradeoffs involved in making them reliable at scale.
  • Has dealt with multi-tenant architecture, data isolation, and security compliance (SOC 2, HIPAA).
  • Takes ownership. If the platform is down, you're already looking at it — nobody needs to ping you.
  • Is willing to work 6 days a week in a high-intensity startup.

Your work will

  • Be the reason enterprise customers trust us with their AI evaluation infrastructure.
  • Keep a platform running that thousands of teams depend on daily.
  • Solve some of the hardest scaling problems in AI tooling — evaluation workloads are large, spiky, and unforgiving.

By joining us, you will

  • Own the platform at a company where reliability is the product.
  • Work directly with the founders at the stage where infrastructure decisions define the company.
  • Be compensated well, with generous founding equity. This also means we expect a lot from you.

Engineering & Growth

Founding Open-Source Growth Engineer

US or Remote | $100K–$200K USD (+equity) | Engineering & Growth

What you'll be doing

  • Build features for DeepEval across LLM evaluation and red teaming.
  • Write documentation and blog posts that the open-source community actually wants to read.
  • Distribute content across Reddit, Twitter, LinkedIn, and developer communities.
  • Own our Discord and GitHub community — answer questions, triage issues, build relationships.
  • Define what growth means for each channel, measure it, and find new distribution levers.
  • Form partnerships and integrations with other open-source projects.

You should be someone who

  • Codes proficiently in Python and TypeScript — this is an engineering role, not just a marketing role.
  • Writes well and enjoys it. Documentation, blog posts, community replies — you care about how things read.
  • Has a green GitHub profile and is already active in open-source.
  • Picks things up fast. You'll learn SEO, GEO, and growth strategies on the job.
  • Has genuine curiosity — you read papers, explore new tools, and stay close to what's happening in AI.
  • Communicates clearly and directly.
  • Is willing to work 6 days a week in a high-intensity startup.

Your work will

  • Be used by hundreds of thousands of developers, from individual builders to teams at OpenAI and Google.
  • Educate thousands of people on how to properly evaluate their LLM applications.
  • Help grow DeepEval into the standard for AI evaluation.

By joining us, you will

  • Shape the future of LLM testing and evaluation.
  • Work directly with the founders, with a real path to an executive role.
  • Be compensated well, with generous founding equity. This also means we expect a lot from you.

HIRING PROCESS

Our Hiring Process.

The process is usually fully remote, and all communication happens over email or video chat on Google Meet. We know you may be interviewing elsewhere, so we respect your time and will get back to you within 2 days of each step.

The entire process has 4 steps and takes around 1.5 weeks in total:

  1. Initial 15-30 minute phone screening interview.
  2. One 30-45 minute technical interview.
  3. One-week, fully paid work trial.
  4. Full-time offer.

No hires will be made without a work trial. You'll be working with the founders directly throughout the entire process. For any questions, email [email protected].

Interested? Let's talk.