We’re excited to announce the launch of our new Search API endpoint that combines web search with Firecrawl’s powerful scraping capabilities.
Search Features:
- Search the web and get full content from results in one API call
- Choose specific output formats (markdown, HTML, links, screenshots)
- Customize search parameters (language, country, time range, number of results)
- Full SDK support for Python and Node.js
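As a rough illustration of the single-call flow described above, the sketch below builds (but does not send) a search request using only the Python standard library. The endpoint URL, the field names (`query`, `limit`, `lang`, `country`, `tbs`, `scrapeOptions.formats`), and the helper `build_search_request` are assumptions made for illustration and are not taken from these release notes; check the API reference before relying on them.

```python
import json
import urllib.request

# Assumed endpoint URL; confirm against the current API reference.
SEARCH_URL = "https://api.firecrawl.dev/v1/search"


def build_search_request(api_key, query, limit=5, lang="en", country="us",
                         tbs=None, formats=("markdown",)):
    """Build a POST request for a combined search-and-scrape call.

    Hypothetical helper: all payload field names are assumptions.
    """
    payload = {
        "query": query,
        "limit": limit,          # number of results
        "lang": lang,            # language
        "country": country,      # country
        # Output formats requested for each scraped result.
        "scrapeOptions": {"formats": list(formats)},
    }
    if tbs is not None:
        payload["tbs"] = tbs     # time range filter (assumed name)
    return urllib.request.Request(
        SEARCH_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # Placeholder key; nothing is sent over the network here.
    req = build_search_request("fc-YOUR-API-KEY", "firecrawl web scraping", limit=3)
    print(req.full_url)
    print(req.data.decode("utf-8"))
```

Passing the returned object to `urllib.request.urlopen` would send it. The Python and Node.js SDKs mentioned above are likely the more convenient route in practice.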
More Features
- Auto mode proxy for scraping (scrapeURL, js-sdk) #1551, #1602
- Timeout handling and content type improvements for scrapeURL/pdf #1570, #1604, #1592
- Redis improvements: separate non-eviction Redis support #1600
- Search improvements: ignoreBlockedURLs, ignore concurrency limit #1580, #1617
- New /cclog endpoint for concurrency logging #1589
- Metadata extraction now includes itemprop attributes #1624
- Self-hosted: deployable Playwright image #1625
Fixes & Improvements
- Better subdomain handling for LLMs.txt + bypass option #1557
- Improved URL validation and special character handling #1547
- Zombie worker cleanup + TTL handling for extract status #1575, #1599
- Fix concurrency queue logic and rate limiter override #1595, #1593
- Better logging for search pagination and robust fetch #1572, #1588
- Minor fixes: og:locale:alternate, adblock toggle, Playwright-only logic, malformed metadata arrays #1597, #1616, #1574
Testing & Docs
- Add MAX_RAM and MAX_CPU environment variable docs #1581
- Testing infrastructure improvements #1623
Full Changelog: https://github.com/mendableai/firecrawl/compare/v1.9.0...v.10.0