84.9k
Stars
5.4k
Forks
today
Last Commit
606
Open Issues
Details
LanguageTypeScript
LicenseApache-2.0
Category: Web Scraping & Crawling
OSS Maturity:Strong
Build note: Scrapy and Crawlee are production-grade crawling frameworks. Firecrawl turns websites into LLM-ready data. Puppeteer/Playwright handle dynamic pages.
More in Web Scraping & Crawling
ApifyCommercial
Web scraping and automation cloud platform
axiosPackage
Promise-based HTTP client for Node.js
beautifulsoup4Package
HTML/XML parser; de facto standard for Python scraping
Bright DataCommercial
Proxy network + scraping infrastructure; enterprise
cheerioPackage
Fast jQuery-like HTML parser for Node.js
CollyOpen Source
Fast and elegant scraping framework for Go; Apache 2.0
CrawleeOpen Source
Web scraping and browser automation library; Apache 2.0
DiffbotCommercial
AI-powered web data extraction; knowledge graph