platoseed
← All companies
Reworkd logo

Reworkd

Active

The simplest way to extract web data at scale

Backed by Y Combinator · AI Grant

Summer 2023Founded 20231 peopleSan Francisco, CA, USA

About

At Reworkd, we're working on multimodal LLM agents that serve as the simplest way to extract web data at scale. Customers come to us with lists of 100s to 1000s of websites along with a data schema. Our agents traverse these websites, understand their structure, and generate code to extract data from them. We've been working on LLM agents since their inception and have received over 30k stars on GitHub and 1M+ users across previous agent products. If you're interested in our pilot program, shoot us an email!

From their website

reworkd.ai
SaaSSubscription

Reworkd offers end-to-end web data extraction at scale, aiming to automate the entire scraping pipeline from site scanning to output delivery without requiring code or maintenance. The product emphasizes automation of extraction, handling dynamic content, and managing data infrastructure concerns for large-scale web data needs.

Reworkd automates the entire web data pipeline: it scans websites, generates code, runs extractors, validates results, and outputs data from a single system. Key capabilities include automated extraction of text, images, and documents across many sites, self-healing scrapers that repair data failures on the fly, handling of common scraping challenges (pagination, infinite scroll, rate limits, proxies), and an interactive analytics dashboard to monitor what’s being extracted and how it’s performing. It positions itself as eliminating the need to hand-write extraction code and manage infrastructure.

Who it’s for: Organizations with large-scale web data extraction needs, including teams that want to avoid custom scraping code and infrastructure overhead for maintaining data from hundreds or thousands of sites.

Features
  • end-to-end web data extraction
  • no-code data extraction
  • self-healing scrapers
  • automated code generation for extractors
  • data validation and output in one system
  • dynamic content handling
  • analytics dashboard for extraction visibility

Public notices about sunsetting the product by Feb 6, 2025; indicates ongoing product lifecycle changes and migration support, plus listed open roles and YC backing, suggesting early-stage startup activity transitioning or winding down product.

Founders · 1

Srijan Subedi
Srijan SubediFounder

Co-founder at Reworkd AI. Combined major in Science at UBC. Previously worked at STEMCELL Technologies and Heart Lung Innovation as a Clinical Researcher.

Launch

Launched on Y Combinator · Jul 2023
View launch post ↗

We help automate core business workflows with the help of AI Agents

Reworkd AI provides a no-code platform that connects businesses to AI Agent automation, enabling formalized, verifiably correct workflows that automate core processes and reduce manual intervention. The product targets companies needing automated, AI-driven handling of business tasks and promises reduced cycle times via AI Agents integrated with existing tech stacks.

From the original launch (Jul 2023) — may be outdated.

Formerly Reworkd, Reworkd AI

B2BArtificial IntelligenceGenerative AIB2BOpen Source

Related startups

Also in Summer 2023