platoseed
Low-latency AI engine for mobile devices & wearables
Check out the project: https://github.com/cactus-compute/cactus
Former quant & economist with a background in product and data engineering. Pilot, triathlete, chess enthusiast. Working on mobile inference @ Cactus.
Cross-platform, open-source framework for deploying LLMs, VLMs, embedding models, and other models locally in your apps.
Cactus is an open-source, cross-platform framework for running inference on smartphones and low-power devices. It supports any Hugging Face model (LLMs, VLMs, embeddings) and provides Flutter and React Native bindings. It enables local, private, offline deployment with optional cloud fallbacks, and supports quantization down to 2-bit for efficiency.