releases.shpreview
Firecrawl/Firecrawl Changelog

Firecrawl Changelog

Mon
Wed
Fri
JunJulAugSepOctNovDecJanFebMarAprMay
Less
More
Releases13Avg4/moVersionsv2.9.0 to v2.10

Enter a URL, describe what you want to track, and /monitor notifies your AI agent via webhook the moment pages or sites change. Use up to 90% fewer LLM tokens by only ingesting what actually changes.

Key Features:

  • Set a goal in plain English — Describe what to watch and /monitor configures the URLs, schema, and schedule for you
  • Up to 90% fewer LLM tokens — Your agent only ingests what changes on a page
  • Any cadence, with cost upfront — 5 minutes, hourly, daily, or custom cron schedule with estimated monthly cost shown upfront
  • Webhook or email delivery — Signed webhooks with custom headers or email with diff in body
  • Permalinks for every change — Diffs are first-class objects you can share or hand to another agent
v2.10

Firecrawl v2.10 ships a new /parse endpoint, Lockdown Mode, Question and Highlights formats, and four new official SDKs (Go, Ruby, PHP, .NET) plus reliability and security fixes.

Key Features:

  • /parse endpoint — Upload PDFs, Word docs, and spreadsheets up to 50 MB and get clean, LLM-ready Markdown, JSON, or summaries back. Powered by a new Rust-based engine that's up to 5x faster
  • Lockdown Mode — Set lockdown: true on /scrape to serve results exclusively from Firecrawl's index with no outbound requests and zero data retention by default
  • Question Format — Pass a natural-language prompt to /scrape and get a grounded answer back, with up to 100x fewer tokens per call
  • Highlights Format — Get back the exact sentences, code blocks, and table rows on a page that match your query, with original formatting preserved
  • Four New Official SDKs — Go, Ruby, PHP (with Laravel support), and .NET all joined the SDK family with v2 parity. The Rust SDK has been promoted to the official v2 SDK
v2.10.0

Firecrawl v2.10 ships a new /parse endpoint, Lockdown Mode, Question and Highlights formats, and four new official SDKs (Go, Ruby, PHP, .NET) - plus a long list of reliability and security fixes.

Highlights
  • /parse endpoint — Upload PDFs, Word docs, and spreadsheets up to 50 MB and get clean, LLM-ready Markdown, JSON, or summaries back. Powered by a new Rust-based engine that's up to 5x faster.
  • Lockdown Mode — Set lockdown: true on /scrape to serve results exclusively from Firecrawl's index with no outbound requests and zero data retention by default. Available everywhere, including the CLI (--lockdown) and MCP.
  • Question Format — Pass a natural-language prompt to /scrape and get a grounded answer back, with up to 100x fewer tokens per call.
  • Highlights Format — Get back the exact sentences, code blocks, and table rows on a page that match your query, with original formatting preserved — also using up to 100x fewer tokens per call.
  • Four New Official SDKs — Go, Ruby, PHP (with Laravel support), and .NET all joined the SDK family with v2 parity. The Rust SDK has been promoted to the official v2 SDK.

Highlights is a new format for /scrape that returns the exact sentences, code blocks, and table rows on a page that match your query, all while using up to 100x fewer tokens.

Highlights
  • Citable, hallucination-free output — Nothing in the response is rewritten, translated, or hallucinated. Every sentence is provably from the source page, in the page's own words.
  • Code blocks and tables preserved — Consecutive sentences from the same block re-join into paragraphs, consecutive code lines wrap in fenced blocks with their original language, and table rows rebuild into Markdown tables with headers auto-included.
  • Up to 100x fewer tokens per call — Returning just the matching lines instead of the full page lowers inference costs, speeds up responses, and keeps your context window lean.

Question is a format for /scrape that returns high-quality, grounded answers from any web page using up to 100x fewer tokens.

Highlights
  • High-quality, grounded answersquestion pulls the page content most relevant to your prompt and answers strictly from it, with zero hallucinations.
  • Up to 100x fewer tokens per callquestion returns just the answer, not the page, giving you significantly lower inference costs, faster responses, and a leaner agent context window on every request.
  • Built for AI agents — Skip the scrape-parse-prompt pipeline. Drop precise, page-grounded answers straight into agent loops with a single call.
  • Fully managed LLM stackquestion runs on a managed model chain with automatic fallback and a production-tuned system prompt. Token usage and cost roll into the same billing surface as /scrape.
  • Hardened against prompt injection — Page content is isolated with XML tagging and zero-width-space escaping, and the model is instructed to ignore any instructions embedded in the page.

Lockdown Mode is a cache-only option for /scrape that keeps security-sensitive requests inside Firecrawl. Set lockdown: true to serve results exclusively from Firecrawl's index, with zero data retention by default.

Highlights
  • No outbound request - Lockdown serves results from Firecrawl's index only and gates every outbound path, including HTTP and robots.txt.
  • Zero data retention by default - URLs aren't persisted, response data isn't stored, and the scrape job is cleaned up after delivery.
  • One flag, every surface - lockdown: true works the same across the API, every SDK (Python, Node, Go, Rust, Java, .NET, Ruby, PHP, Elixir), the CLI (--lockdown), and MCP.

The /parse endpoint turns documents into clean, structured data for AI agents and RAG pipelines. Powered by a new Rust-based engine that's up to 5x faster, it works across PDFs, Word docs, spreadsheets, and more.

Highlights
  • Clean, LLM-ready output — Get back Markdown, JSON, or a summary, with tables and reading order preserved. No post-processing required.
  • Rust-based engine — A high-performance Rust core delivers up to 5x faster parsing, cutting latency in document ingestion and embedding workflows.
  • Zero Data Retention support — Enterprise plans with ZDR enabled ensure parsed output is never stored, so data from contracts, medical records, and internal reports stays secure.
  • Upload files up to 50 MB — Supports PDF, DOCX, DOC, ODT, RTF, XLSX, XLS, and HTML.

Firecrawl web-agent is an open framework for building AI agents that search, scrape, and interact with the web. Powered by the same architecture behind our /agent endpoint.

Highlights
  • Bring any model — Anthropic, OpenAI, Google, or your own. You control the logic, tools, and infra.
  • One command, full stack$ firecrawl create agent gives you /scrape, /search, and /interact in a plan-act loop, parallel sub-agents for concurrent research, and your choice of Streaming UI, API server, or library templates.
  • Teachable by design — Add Skill playbooks and your agent learns reusable routines. Paginate e-commerce sites, run multi-source research, and extract structured data your way.

Fire-PDF is a Rust-based parsing engine that converts any PDF - scanned, text-based, or mixed - into structured markdown, up to 5x faster.

Highlights
  • 5x Faster — Our open-source Rust library pdf-inspector classifies each page in milliseconds and picks the fastest extraction path. Pages are processed in under 400ms on average.
  • Layout-Aware Accuracy — A neural document layout model detects tables, formulas, text blocks, and headers individually. Tables get full markdown output, formulas are preserved in LaTeX, and reading order is predicted neurally.
  • Zero Configuration — Every PDF sent through Firecrawl's API now goes through Fire-PDF automatically.
v2.9.0

Firecrawl v2.9.0 includes browser interaction via /interact, new scrape formats, smarter PDF handling, two new SDKs, and reliability fixes.

Key Features:

  • Browser Interaction via /interact — Scrape a page, then call /interact to click buttons, fill forms, navigate, or extract dynamic content using natural language or Playwright/Bash code. Sessions persist across calls with live view URLs and reusable browser profiles.
  • Query Format — Pass natural-language prompts to /scrape and get direct answers in data.answer.
  • Audio Format — Request audio output from any scrape as a field on the document.
  • onlyCleanContent Parameter — Strip navigation, ads, and non-semantic content from markdown output.
  • PDF Parsing Modes — Choose fast, auto, or ocr parsing with maxPages option for fine-grained extraction control.
  • Java & Elixir SDKs — Official SDKs with full v2 API support, joining JS, Python, Go, and Rust.

Introduce the new /interact endpoint that turns any scrape into a live browser session where agents can click, type, and navigate using natural language.

Key Features:

  • Natural Language Control — Describe what you want in plain English; the agent clicks, types, scrolls, and extracts data automatically without selectors or scripts.
  • Live Browser Sessions — Every session includes a live URL you can embed, share, or interact with in real time for debugging and demos.
  • Persistent Profiles — Log in once and pick up where you left off with cookies and localStorage carrying across scrapes with named profiles.
  • Full Playwright Control — Switch to code mode and run Playwright (Node.js or Python) or Bash for precision control.
  • Session Reuse — Chain multiple interact calls on the same scrape with the browser maintaining state between calls for complex multi-step workflows.

Full support for core endpoints including scrape, search, and crawl. Works with Maven, Gradle, and Java 17+.

Key Features:

  • Maven & Gradle Ready — Drop into any Java project via JitPack with standard dependency management.
  • Java 17+ Support — Built for modern Java environments.
  • Core Endpoint Coverage — Scrape, search, crawl, map, and agent endpoints all supported.

New PDF parsing engine delivers 3x faster parsing and significantly improved reliability. Rebuilt in Rust, it automatically adapts to any PDF from clean text files to scanned reports and complex layouts.

Key Features:

  • Rust-Based Parser — High-performance engine built in Rust delivers up to 3x faster parsing, reducing latency in data ingestion and embedding workflows.
  • Three Parsing Modes:
    • fast — text-only parsing for maximum performance.
    • auto — new default; starts in fast mode and automatically falls back to OCR when needed, intelligently detecting edge cases like embedded images, graphs, multi-column layouts, and unusual text encodings.
    • ocr — forces OCR parsing for fully image-based or scanned documents.
  • Built for Production Reliability — Extensively tested across thousands of real-world PDFs for consistent, accurate extraction.

Browser Sandbox gives agents a secure, fully managed browser environment for interactive web automation with no local setup, Chromium installs, or driver compatibility issues. Each session runs in an isolated, disposable sandbox that scales without infrastructure management.

Key Features:

  • Browser Sandbox — Launch secure, isolated browser sessions with Python, JavaScript, and bash execution. Pre-installed with agent-browser CLI and Playwright.
  • Multi-Language Support — Execute Python, JavaScript, or bash code remotely via API, CLI, or SDK with instant results.
  • agent-browser Integration — Pre-installed CLI with 40+ commands for AI agents to write simple bash commands instead of complex Playwright code.
  • Live View & CDP Access — Watch sessions in real time via embeddable stream URL or connect own Playwright instance over WebSocket.
  • Session Management — Configurable TTL controls, parallel sessions (up to 20 concurrent), and automatic cleanup. 2 credits per browser minute with 5 minutes free.

Significantly improved logo extraction accuracy for Branding Format v2, the endpoint for extracting brand identities from websites.

Key Features:

  • Significantly improved logo detection — More reliable logo extraction with fewer false positives and better handling of edge cases like logos embedded in background images.
  • Works with modern site builders — Branding Format now properly detects logos built with Wix, Framer, and other drag-and-drop platforms generating complex or non-semantic HTML.
  • Built for AI agents and developers — Captures colors, typography, spacing, and UI components in structured format to power AI agents and apps.
v2.8.0

Firecrawl v2.8.0 brings major improvements to agent workflows, developer tooling, and self-hosted deployments across the API and SDKs.

Key Features:

  • Parallel Agents — Execute thousands of /agent queries simultaneously with automatic failure handling and intelligent waterfall execution. Powered by Spark 1 Fast for instant retrieval, automatically upgrading to Spark 1 Mini for complex queries.
  • Firecrawl Skill — Enables agents to use Firecrawl for web scraping and data extraction.
  • Firecrawl CLI — Command-line interface with full scrape, search, crawl, and map support.
  • Spark Model Family — Three new models: Spark 1 Fast for instant retrieval, Spark 1 Mini for complex research queries, and Spark 1 Pro for advanced extraction tasks.
  • Agent Enhancements — Webhook support, model selection, and new MCP Server tools for autonomous web data gathering.

Bringing parallel processing to /agent, letting you batch hundreds or thousands of queries simultaneously. What took hours of sequential queries now completes in minutes with automatic failure handling and parallel execution.

Key Features:

  • Parallel Batch Processing — Run thousands of /agent queries simultaneously to enrich companies, research competitors, or build datasets at scale.
  • Intelligent Waterfall — Tries instant retrieval first, then automatically upgrades specific cells to full agent research (Spark One Mini) only when needed.
  • Real-Time Spreadsheet Interface — Work in familiar CSV format with instant visual feedback as cells populate in real-time.
  • Zero Configuration — Input data schema, write one prompt, hit run without workflow building.
  • Predictable Pricing — 10 credits per cell with Spark-1 Fast.

Introducing the Firecrawl Skill and CLI, a new way for AI agents to reliably access real-time web data. With a single install, agents like Claude Code, Antigravity, and OpenCode can access Firecrawl endpoints including scrape, search, crawl, and map.

Key Features:

  • One-Command Install — Install the skill with a single command to teach agents how to authenticate and use all of Firecrawl's endpoints.
  • Real-Time Web Data at Runtime — Agents can pull fresh, full-page content from docs, product pages, pricing, and articles exactly when needed.
  • Context-Efficient for Agents — Uses a file-based approach for context management and bash methods for efficient search and retrieval.
  • Works Across Complex & Dynamic Sites — Powered by Firecrawl's custom browser stack for reliable extraction from large, JavaScript-heavy sites.
  • Proven, Best-in-Class Coverage — Backed by benchmark results showing >80% coverage across real-world evaluations.
v2.7.0

Major release with enterprise features and platform improvements.

Key Features:

  • ZDR Search Support — Enterprise customers can now search with Zero Data Retention enabled end-to-end.
  • Partner Integrations API — Available in closed beta for native integrations in partner products.
  • Improved Branding Format — Better detection and support across all platforms.
  • Faster Screenshots — Enhanced viewport and full page screenshots with improved speed and accuracy.
  • Self-hosted Improvements — Significant enhancements for deployments and infrastructure.
  • Performance Enhancements — Platform-wide improvements for better user experience.
v2.6.0

Major release with unified billing, enhanced features, and significant reliability improvements.

Key Features:

  • Unified Billing Model — Credits and tokens merged into single system. Extract now uses credits (15 tokens = 1 credit), existing tokens work everywhere.
  • Enhanced Branding Format — Full support across Playground, MCP, JS and Python SDKs.
  • Reliability and Speed Improvements — All endpoints significantly faster with improved reliability.
  • Instant Credit Purchases — Buy credit packs directly from dashboard without waiting for auto-recharge.
  • Improved Markdown Parsing — Enhanced markdown conversion and main content extraction accuracy.
  • Change Tracking — Faster and more reliable detection of web page content updates.
  • Core Stability Fixes — Fixed core stability issues, PDF timeouts, and improved error handling.
Last Checked
2d ago
Latest
May 26, 2026
Tracking since Aug 29, 2024