# Reworkd

**Source:** https://geo.sig.ai/brands/reworkd  
**Vertical:** Infrastructure  
**Subcategory:** Cloud Services  
**Tier:** Emerging  
**Website:** reworkd.ai  
**Last Updated:** 2026-04-14

## Summary

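Reworkd is a San Francisco-based Y Combinator (S23) company that builds multimodal LLM web scraping agents for extracting structured data from thousands of sites. Founded by the team behind AgentGPT, it has raised $4M in total, including a 2024 seed round backed by Paul Graham, Nat Friedman, and General Catalyst, and competes with Apify and Bright Data in AI-powered enterprise web data extraction.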

## Company Overview

Reworkd is a San Francisco-based AI web data extraction company backed by Y Combinator (S23). It has raised $4 million in total funding: a $1.25 million pre-seed from YC in 2023, followed by a $2.75 million seed round in 2024 from Paul Graham, Nat Friedman, Daniel Gross, SV Angel, General Catalyst, and Panache Ventures. The company provides data engineering teams, AI researchers, and enterprise data operations with multimodal LLM-powered agents that autonomously extract structured data from thousands of websites at scale. Founded in 2023, the Reworkd team previously built AgentGPT, a viral autonomous AI agent platform that reached 100,000+ daily users, then pivoted to enterprise web scraping as the more commercially durable application of autonomous AI agents.

Reworkd's multimodal web scraping agents address the fundamental brittleness of traditional web scraping. Most scrapers (Beautiful Soup scripts, Scrapy spiders, Apify actors) are written against a specific website's current HTML structure, so they break whenever the site updates its layout, changes element IDs, introduces CAPTCHAs, or starts loading content dynamically with JavaScript. Reworkd's LLM-based agents instead perceive websites visually, using multimodal vision capabilities to understand page structure semantically rather than by parsing CSS selectors. They then generate unique scraping code per site that targets the content's meaning rather than its literal HTML structure. This lets an agent find "product price" on any e-commerce site, whether it sits in a span with class="price", a div with id="current-price", or a custom web component. An automatic traversal capability, in which the agent navigates pagination, filters, and site hierarchies to collect complete datasets without manually configured crawl paths, reduces the engineering setup time for new data sources.
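The brittleness described above can be illustrated with a small sketch. It contrasts a scraper hard-wired to `class="price"` with a crude content-based heuristic (a currency regex over the page text). The three HTML snippets, the class and function names, and the regex are all hypothetical, and the heuristic is only a stand-in for "matching on meaning": it is not Reworkd's LLM-based method, which this profile does not describe in implementation detail.

```python
import re
from html.parser import HTMLParser

# Three hypothetical product pages that express the same price differently.
PAGES = {
    "span_class": '<div><span class="price">$19.99</span></div>',
    "div_id": '<div id="current-price">$19.99</div>',
    "custom_element": '<product-price amount="19.99">$19.99</product-price>',
}


class ClassPriceScraper(HTMLParser):
    """Selector-style scraper: reads text only inside class="price" elements."""

    def __init__(self):
        super().__init__()
        self._inside = False
        self.price = None

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs.
        if ("class", "price") in attrs:
            self._inside = True

    def handle_endtag(self, tag):
        self._inside = False

    def handle_data(self, data):
        if self._inside and self.price is None and data.strip():
            self.price = data.strip()


class TextCollector(HTMLParser):
    """Collects all visible text, ignoring tag names and attributes."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


# Assumed price format: a currency symbol followed by digits and optional cents.
PRICE_RE = re.compile(r"[$€£]\s?\d+(?:\.\d{2})?")


def scrape_by_selector(html):
    parser = ClassPriceScraper()
    parser.feed(html)
    parser.close()
    return parser.price


def scrape_by_content(html):
    collector = TextCollector()
    collector.feed(html)
    collector.close()
    match = PRICE_RE.search(" ".join(collector.chunks))
    return match.group(0) if match else None


results = {
    name: (scrape_by_selector(html), scrape_by_content(html))
    for name, html in PAGES.items()
}

for name, (by_selector, by_content) in results.items():
    print(f"{name}: selector={by_selector!r} content={by_content!r}")
```

In this sketch the selector-based scraper finds the price only on the page whose markup it was written for, while the content-based lookup succeeds on all three. A regex is far more limited than a model that understands page semantics, but the failure mode it avoids is the same one the paragraph above describes.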

In 2025, Reworkd competes in the AI web scraping, data extraction automation, and web data platform market. Its competitors for adoption by data engineering teams include Apify (web scraping platform, $28M raised), Bright Data (enterprise data infrastructure, $200M raised), and Browserbase (headless browser infrastructure, $27M raised). The web scraping market has traditionally been dominated by fragile CSS-selector-based tools that require constant maintenance as websites change. Reworkd's LLM-based approach, which uses AI to understand webpage semantics rather than literal HTML structure, is comparable to the shift from rule-based to neural machine translation in NLP. Paul Graham's and Nat Friedman's personal investments reflect angel conviction in the team's ability to apply LLMs to the persistent data extraction problem, and the AgentGPT viral launch (100K+ daily users in its first weeks) demonstrated the founders' ability to build products that capture developer attention. The 2025 strategy focuses on three areas: enterprise data operations contracts for ongoing web data feeds (price monitoring, news aggregation, business intelligence), a managed scraping service for teams without internal data engineering resources, and expanded real-time web data pipeline integrations.

## Frequently Asked Questions

### What is Reworkd?
Reworkd is a San Francisco-based infrastructure company founded in 2023 that builds multimodal LLM agents for web scraping and data extraction. The company participated in Y Combinator's Summer 2023 batch and has raised $4M in total funding.

### What products and services does Reworkd offer?
Reworkd offers web scraping AI powered by multimodal LLM agents that enable structured data extraction from thousands of websites. Their technology features automatic website traversal, multimodal code generation, and unique scraping code generated per site.

### Who is Reworkd's target customer?
Reworkd targets data engineering teams, AI researchers, and enterprise data operations that need structured data from many websites at scale.

### When was Reworkd founded?
Reworkd was founded in 2023 and participated in Y Combinator's Summer 2023 (S23) batch.

### Where is Reworkd located?
Reworkd is based in San Francisco, California.

### How much funding has Reworkd raised?
Reworkd has raised $4M in total funding, including a $2.75M seed round in 2024 and a $1.25M pre-seed round in 2023. Investors include Paul Graham, Nat Friedman, Daniel Gross, SV Angel, General Catalyst, Panache Ventures, and Y Combinator.

### What notable achievements has Reworkd accomplished?
Reworkd's previous project, AgentGPT, went viral and attracted over 100,000 daily users. The company also participated in the AI Grant accelerator program.

### What technology does Reworkd use?
Reworkd uses multimodal LLM (Large Language Model) agents that automatically traverse websites and generate unique scraping code for each site. This enables structured data extraction across thousands of websites.

### How can I get started with Reworkd?
This profile does not include specific contact or onboarding details. The company's website is reworkd.ai; Reworkd is located in San Francisco, California, and targets data extraction teams.

### What are Reworkd's recent developments?
Reworkd recently closed a $2.75M seed round in 2024 from notable investors including Paul Graham, Nat Friedman, Daniel Gross, SV Angel, General Catalyst, and Panache Ventures. The company continues to develop its multimodal LLM agents for web scraping and data extraction.

## Tags

ai-powered, automation, b2b, infrastructure, platform, cloud-native, saas

---
*Data from geo.sig.ai Brand Intelligence Database. Updated 2026-04-14.*