Introducing /parse
The /parse endpoint turns documents into clean, structured data for AI agents and RAG pipelines. Powered by a new Rust-based engine that's up to 5x faster, it works across PDFs, Word docs, spreadsheets, and more.
Highlights
- Clean, LLM-ready output — Get back Markdown, JSON, or a summary, with tables and reading order preserved. No post-processing required.
- Rust-based engine — A high-performance Rust core delivers up to 5x faster parsing, cutting latency in document ingestion and embedding workflows.
- Zero Data Retention support — Enterprise plans with ZDR enabled ensure parsed output is never stored, so data from contracts, medical records, and internal reports stays secure.
- Upload files up to 50 MB — Supports PDF, DOCX, DOC, ODT, RTF, XLSX, XLS, and HTML.
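
The upload limits above lend themselves to a client-side pre-check before a file is sent to /parse. The following is a minimal sketch, not part of any official SDK: the helper name `check_upload` and both constants are hypothetical, and reading "50 MB" as 50 * 1024 * 1024 bytes is an assumption, since the announcement does not say how the limit is counted. The request format for /parse itself is not described here, so the sketch stops short of making a call.

```python
from pathlib import Path

# File extensions corresponding to the formats listed in the announcement.
SUPPORTED_EXTENSIONS = {".pdf", ".docx", ".doc", ".odt", ".rtf", ".xlsx", ".xls", ".html"}

# Assumption: "50 MB" is interpreted as 50 * 1024 * 1024 bytes.
# The service may count the limit differently (for example, 50 * 10**6).
MAX_UPLOAD_BYTES = 50 * 1024 * 1024


def check_upload(filename: str, size_bytes: int) -> list[str]:
    """Hypothetical helper: return a list of problems (empty if none found)."""
    problems = []
    ext = Path(filename).suffix.lower()  # case-insensitive extension match
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported extension: {ext or '(none)'}")
    if size_bytes > MAX_UPLOAD_BYTES:
        problems.append(
            f"file is {size_bytes} bytes, over the {MAX_UPLOAD_BYTES}-byte limit"
        )
    return problems
```

Calling `check_upload("report.pdf", 1024)` returns an empty list, while an unsupported or oversized file returns one message per problem, so a caller can reject the file locally instead of waiting for the upload to fail.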
