Atla

Inactive

The improvement engine for AI agents

Summer 2023Founded 202310 peopleLondon, England, United Kingdom

www.atla-ai.com/ ↗LinkedIn ↗X ↗GitHub ↗Crunchbase ↗See on the Idea Map B2B momentum

About

Find and fix your agent’s most critical failures in hours, not days. Atla helps developers cut time spent on manually reviewing traces. Atla’s LLM judge evaluates your agent step-by-step, uncovers error patterns across runs, and suggests specific fixes—so you know exactly what to fix and why. Atla supports the most popular agent frameworks teams build with, including LangChain, CrewAI, and OpenAI Agents. With real-time monitoring, automated error detection, and prompt experimentation, Atla gives teams the visibility and control needed to confidently ship agentic systems that work. We’re a team of researchers, engineers, entrepreneurs and operational leaders. Our expertise in evals was honed through training our own purpose-built LLM Judges, Selene and Selene Mini, which are available open-source and have been downloaded 60,000+ times.

Founders · 2

Maurice BurgerFounder

Harvard

UPenn🎓 University of Pennsylvania. Half

Co-founder of Atla (S23). Startup veteran @ Syrup, Trim, and Merantix. Masters in CS @ University of Pennsylvania. Half an MBA @ Harvard Business School.

LinkedIn ↗

Roman EngelerFounder

Stanford

Co-founder & CTO of atla (S23). AI safety researcher @ MATS. MSc. Robotics @ ETH, Stanford, Imperial.

LinkedIn ↗

Launch

Launched on Y Combinator · Feb 2025

View launch post ↗

Half of AI’s answers are brilliant, half aren’t. We trained a model to tell them apart.

Selene is an LLM evaluator that scores, classifies, and compares AI outputs. The launch announces an API/SDK to integrate Selene into AI workflows and an Alignment Platform to create custom evaluation metrics, targeting teams requiring reliable evaluation to reduce hallucinations, inconsistencies, and unsafe outputs.