Confident AI

Active

The LLM Eval and Observability Platform for AI Quality

Winter 2025Founded 20247 peopleSan Francisco, CA, USA

confident-ai.com ↗LinkedIn ↗X ↗GitHub ↗See on the Idea Map B2B momentum

About

Confident AI allows companies of all sizes to benchmark, safeguard, and improve LLM applications, with best-in-class metrics and guardrails powered by DeepEval. Built by the creators of DeepEval (12.6k stars, >3m monthly downloads), Confident AI is able to offer battle-tested, open-source evaluation algorithms while providing the infrastructure needed for teams to stay confident their LLM systems.

Founders · 2

Jeffrey IpCEO & Cofounder

Google

Microsoft

Creator of DeepEval, the open-source LLM evaluation framework. and grew it to over 400k monthly downloads and counting. Previously SWE @ Google, Microsoft.

LinkedIn ↗X ↗

Kritin VongthongsriCo-Founder

Princeton

Building the #1 LLM Evaluation Platform & empowering teams to red-team and safeguard LLM apps. AI Researcher and CHI-published author, previously built NLP pipelines for fintech startup and researched self-driving cars/HCI during @ Princeton (ORFE'24 + CS).

LinkedIn ↗

Launch

Launched on Y Combinator · Feb 2025

View launch post ↗

An evaluation platform for engineers, QAs, and PMs to pinpoint which iteration of their AI to put in production.

Confident AI is a cloud-based evaluation platform for engineering teams to iterate LLM apps by using DeepEval’s deterministic, use-case-specific metrics; it enables collaboration on pre- and post-deployment evals, benchmarks, and reporting. The launch highlights integration with the DeepEval framework, collaboration features, and case-study ROI potential.