Polymath

Active

Simulation environments to train & evaluate long-horizon AI agents

Winter 2026 · Founded 2026 · 2 people · San Francisco, CA, USA

polymathlabs.ai/

AI insight (can contain mistakes)

Agentic Simulation Environments · SaaS · AI labs training long-horizon autonomous agents · Medium competition

Moat

Large-scale simulated worlds built on the team's experience post-training frontier models; scalable data systems.

Key risk

Few customers; a gap between simulated and real-world conditions; large labs moving agent training in-house.

Why now

Frontier labs are training long-horizon agents; simulation is critical for safety and evaluation before deployment.

Competitors

OpenAI, DeepMind, Unity, Unreal Engine, custom in-house environments

About

We’re heading towards a future where AI agents will be able to perform useful work over long horizons, with little or no human supervision. To increase the reliability, performance, and safety of autonomous agents, they must be trained in simulation environments that reflect the real world. Polymath builds simulated worlds for agents to practice and learn through experience. We're a team of researchers and engineers from UC Berkeley, Hume AI, Plaid, and Amazon. We have years of industry experience post-training frontier models and building large-scale data systems. Polymath is backed by Y Combinator.

Founders · 2

Dylan Ma · Founder

Amazon

Berkeley

Co-Founder / CEO @ Polymath. Previously @ Hume AI, AWS, UC Berkeley

Naren Yenuganti · Founder

Amazon

Berkeley

Co-Founder / CTO @ Polymath. Previously @ Plaid, Amazon, UC Berkeley

Launch

Launched on Y Combinator · Feb 2026

Polymath builds simulated worlds where AI agents learn to operate autonomously over long horizons using running applications, real tools, and multi-step tasks. The company launched Horizon-SWE, a benchmark that places frontier models in a simulated software company to measure how well they perform end-to-end software engineering tasks across the full development lifecycle.

Formerly “Palette AI”

B2B

Related startups

Polymorph · Winter 2026

Personalization infra that improves retention, LTV, and CAC

AI Personalization Infrastructure · Active

Abundant · Fall 2024

Agent simulation and RL for researchers

Active
