New PDF parsing engine delivers 3x faster parsing and significantly improved reliability. Rebuilt in Rust, it automatically adapts to any PDF from clean text files to scanned reports and complex layouts.
Key Features:
fast — text-only parsing for maximum performance.auto — new default; starts in fast mode and automatically falls back to OCR when needed, intelligently detecting edge cases like embedded images, graphs, multi-column layouts, and unusual text encodings.ocr — forces OCR parsing for fully image-based or scanned documents.
Fetched April 11, 2026