Kalpa Labs

Active

Scaling Generalist Speech models

Fall 2025 · Founded 2025 · 2 people · San Francisco, CA, USA

kalpalabs.ai/

About

We're building the next frontier of speech models: generalist models that unlock in-context learning and strong instruction following while unifying existing speech capabilities such as speech-to-text, text-to-speech, and voice cloning.

Founders · 2

Prashant Shishodia · Founder

Google

Pushing the frontier of speech models @ KalpaLabs. Previously led full-stack ML @ Google, scaling to billions of queries per month.

Gautam Jha · Founder

Pushing the frontier of speech models @ KalpaLabs. Previously built nanosecond-latency software at HFT firms.

Launch

Launched on Y Combinator · Nov 2025

Scaling Foundational speech models for In-context Learning & Instruction Following

KalpaLabs announces a generalist speech model that handles speech-to-text, text-to-speech, and cross-modal tasks with in-context learning and steerability, aiming to unify STT, TTS, and voice actions into one system. They pretrained 800M–4.8B parameter models on 2M hours of audio and highlight scalable, aligned, and efficient speech modeling with long context and audio-in-context prompts.