platoseed
Optimize performance on GPUs - 10x faster
Identifying performance bottlenecks and strategizing ways to solve them takes 4-8x longer than actually writing the code to fix them. We're building an agent that is an expert at analyzing the performance GPU systems like inference engines at all levels of the stack - from CPU-GPU interactions down to GPU kernels. Pairing our agent with Cursor / Claude Code allows you to automate both the reasoning and code implementation steps of performance optimization. What used to take weeks can now be done in days. Along with our AI agent, we have features such as running diffs on system traces as well as sharing and collaboration features that make our VSCode extension the most powerful way to work on performance optimization.
nCompass is an AI-powered performance profiling IDE aimed at optimizing GPU performance. It emphasizes writing performant code with AI assistance and provides profiling and debugging within VSCode and Cursor.
An AI-powered performance profiling IDE that integrates with VSCode and Cursor, assisting developers to write and optimize high-performance code through profiling and debugging capabilities.
Who it’s for: Developers and engineering teams focusing on GPU performance optimization and high-performance computing
I am a recent PhD graduate from Imperial College London with experience in machine learning algorithms, compilers and hardware architectures. I've worked in compiler teams at Qualcomm and Huawei as well as served as a reviewer for ICML. My co-founder and I are building nCompass which is a platform for accelerating and hosting both open-source and custom large AI models. Our focus is on providing rate unlimited and low latency large AI inference with only one line of code.
I'm a recent Imperial College London PhD Graduate where I specialized in reconfigurable hardware architectures for accelerated machine learning and reduced precision training algorithms. I have worked as an AI feasibility consultant prototyping and evaluating AI spin-outs. We are building nCompass, a platform for accelerating and hosting both open-source and custom large AI models. Our focus is on providing rate-unlimited and low latency large AI inference with only one line of code.
nCompass is an API that requires only one-line-of-code to integrate low latency versions of open-source/custom models into your AI pipeline.
nCompass provides an API to deploy accelerated, open-source or custom LLMs with a single API key and one-line integration, targeting tools that need predictable timing and cost. The launch highlights hosting/open-source model acceleration, a pay-by-time pricing model, and support for OpenAI-style templates and Hugging Face models.
From the original launch (Mar 2024) — may be outdated.

AI coding copilot engineered for performance

AI for AI Infrastructure