platoseed
Metadata for GenAI workflows
Deasy Labs was acquired by Collibra in July 2025 (global leader in enterprise data governance). Deasy Labs provides metadata orchestration for AI workflows. Deasie's platform provides the best way for AI teams to create and embed high-quality, customized metadata into their AI workflows (e.g., RAG, Agentic frameworks). Our three founders (from Amazon, McKinsey/QuantumBlack & MIT) previously built an ML data governance tool from 0 to 1 within McKinsey, which we deployed with 11 Fortune 500 companies. We saw in early 2023 the ability to create high-quality metadata (without reliance on domain experts) would be a key factor in achieving the accuracy & speed in GenAI applications required for production. Our investors include General Catalyst, Y Combinator, RTP Global and world experts in enterprise data. Website: https://deasylabs.com
Deasy Labs turns unstructured organizational data into AI-ready datasets by enriching, curating, and tagging content so AI systems can retrieve relevant, high-quality information. It focuses on mapping data, removing unwanted content, and enriching with metadata at scale to support RAG, search, and AI agents.
Connect to data sources (e.g., SharePoint, S3), OCR and parse files, chunk content, and normalize unstructured data. Automatically tag thousands of files per minute with metadata, build domain-specific taxonomies, detect sensitive data, and assess quality/relevance. Slice data by relevance, topic, time, quality, or sensitivity to create purpose-built datasets for RAG, search, and agents. Write enriched metadata back to source systems or downstream pipelines, and continuously refresh datasets to prevent staleness. Supports integration with existing models/LLMs and deployment in customersβ own environments with a single metadata standard across teams.
Who itβs for: Medium to large enterprises with substantial unstructured data needs, especially those using SharePoint, email archives, and PDFs, aiming to improve AI data quality, governance, and efficiency of retrieval and RAG pipelines.
customer-facing platform with enterprise features (petabyte scale, on-premise deployment, governance, data enrichment) and mentions of case studies/testimonials; hiring/traction indicators not explicit in text
Co-founder of Deasy Labs (YC S23): Data governance for language models in the enterprise. Exited Deasy Labs to Collibra in < 2 yrs (global leader in data governance). My documented startup journey: https://reecegriffiths.substack.com/ Prior experience: QuantumBlack: Product Manager for AI for Data Governance product (1.5 yrs); McKinsey & Company: Management consultant (2 yrs); X26: Founder of X26 (global entrepreneurs community); University of Cambridge: Masters in information engineering.
Mikko is the co-founder of Deasie. He is a technologist with deep experience (5+ years) in data and machine learning space from McKinsey and QuantumBlack, where he implemented and deployed models at large enterprises. Prior to moving into the world of data governance for language models with Deasie, he was the Lead data scientist building AI data quality tool at QuantumBlack. Mikko holds a graduate degree from MIT.
Co-founder of Deasy Labs: The Metadata Engine for search usecases. Prior experience: QuantumBlack: Software Engineer, Machine Learning at AI for Data Quality product; Previous Engineer at Amazon, Mercedes-Benz and ChatBot Startup (E-Bot 7); Bachelor from National University of Singapore and Graduate from UWC Mostar
Is your enterprise data ready for language models?
Deasie provides an automated platform that checks unstructured data for compliance (e.g., PII and proprietary data) and quality (relevance, timeliness, consistency) to govern which data is used for language model use cases, targeting enterprises with large data inventories.
From the original launch (Aug 2023) β may be outdated.
Formerly βDeasieβ Β· why startups rename β

The home for shared data logic

Open source tools & services for reliable AI Agents & AI Applications