Relace

Active

Models and infra for coding agents

Winter 2023Founded 20227 peopleSan Francisco, CA, USA

relace.ai ↗LinkedIn ↗See on the Idea Map B2B momentum

About

Relace makes it easy to deploy production-ready coding agents. Our models are co-optimized with infrastructure to achieve SoTA performance: 10k+ token/s code merging and retrieval across million-line repositories in seconds.

From their website

as of Jun 7, 2026relace.ai ↗

API/InfraUsage-based · Token-based pricing by model: Jacq relace-apply-3: $0.80/million input tokens, $1.20/million output tokens; relace-search: $1.00/million input, $3.00/million output; relace-rank: $0.05/million input; relace-embed: $0.18/million input. Hosted free tier available.

Relace provides purpose-built AI models and infrastructure for coding agents, focusing on fast code retrieval, merging, and autonomous workflows to accelerate development and reduce errors.

Relace offers in-house models optimized for coding workflows, including fast codebase retrieval and a universal code merging model that edits files at 10,000 tokens per second. The system includes source control designed for agents with lightweight push/pull, automatic indexing for two-stage retrieval, and high-throughput rate limits. It supports self-hosted, VPC-isolated, and hosted deployments, enabling on-premise use while maintaining encryption in transit and at rest. Users can experiment via a hosted API, with a free tier, and pricing depends on token usage across multiple models such as Jacq, relace-apply, relace-search, relace-rank, and relace-embed.

Who it’s for: engineering teams and organizations building autonomous coding agents, CI/CD workflows, and developer tooling that require fast retrieval, merging, and reliable code generation.

Features

out-of-the-box codebase retrieval
fast code merging at high tokens/second
small, fast in-house models tailored for coding tasks
two-stage retrieval with automatic indexing
lightweight repo push/pull with sandboxed agents
on-premise and VPC-isolated deployments
SOC 2 compliant and enterprise security

Pricing page and hosted API availability; free tier and onboarding notes; mentions of enterprise/self-hosted deployments and SOC 2 compliance.

Founders · 2

Preston ZhouFounder

Physics -> ML

Eitan BorgniaFounder

Caltech

Former PhD student in machine learning at UChicago, with a math degree from Caltech.

LinkedIn ↗

Launch

Launched on Y Combinator · May 2025

View launch post ↗

Kicking off with Instant Apply, Code Reranker, and Embeddings

Relace releases models for AI codegen workflows, including Instant Apply for merging code snippets at high speed and an Embeddings + Reranker pipeline to locate relevant context in large codebases, aiming to reduce latency and token costs. The products target AI code generation startups and developers needing robust, efficient context management and code merging.