Changelog
Keyboard shortcuts for annotation queues - Process an annotation queue without the mouse: arrow through items, jump between score fields, pick options with number keys, and complete with Cmd+Enter. 4 Days Ago
Ask AI in the Filter Search Bar - Describe the filters you want in plain language and the search bar drafts them as editable query pills. Opt-in on Langfuse Cloud, powered by AWS Bedrock with zero data retention. 4 Days Ago
Trace AI SDK 7 with Langfuse - Use @langfuse/vercel-ai-sdk 5.9.0 to trace Vercel AI SDK 7 calls in Langfuse. 4 Days Ago
Multi-modal datasets - Create Langfuse dataset items with images, audio, video, documents, and other attachments for SDK-based multi-modal experiments. 7 Days Ago
Filter Search Bar - Filter traces and observations by typing. A fast query bar with operators, full-text search, wildcards, and autocomplete. 11 Days Ago
Langfuse Assistant - The Langfuse Assistant is now in public beta for everyone on Langfuse Cloud. Ask questions about your traces, observations, and metrics in plain language. 11 Days Ago
Monitors and Alerts - Create monitors that watch cost, quality, and latency metrics and notify your team over Slack, webhooks, or GitHub Actions when they move outside expected ranges. 11 Days Ago
Score from the browser - Use @langfuse/browser to send frontend feedback scores with only a public key. 12 Days Ago
Trace AI SDK v7 beta - Use the new @langfuse/vercel-ai-sdk beta package to trace Vercel AI SDK v7 beta calls in Langfuse. 12 Days Ago
Web Callouts - Call any HTTP endpoint from your trace, observation, or session views to trigger external workflows from Langfuse. 13 Days Ago
Mask exported spans in Python - Run media detection and configured masking on every span accepted by the Python SDK span processor, including third-party OpenTelemetry spans. 2 Weeks Ago
Delete evaluator templates - Remove evaluator templates you no longer need from the UI, the unstable public API, and MCP, with safeguards that block deletion while rules still depend on them. 2 Weeks Ago
Manage evaluators via MCP - Set up evaluators and evaluation rules from AI agents through the Langfuse MCP server, and create code evaluators through the unstable public API. 3 Weeks Ago
Scores API v3 - Cursor-based pagination, a typed value field, and list-based filters with numeric ranges on the new GET /api/public/v3/scores endpoint. 3 Weeks Ago
Use OpenAI models on Amazon Bedrock - Connect Bedrock-hosted OpenAI models to Langfuse LLM Connections with OpenAI Responses API support. 4 Weeks Ago
Launch Week 5 🚀 Langfuse MCP - The hosted Langfuse MCP server now includes new tools for working with observations, metrics, scores, datasets, comments, and annotation queues from AI agents. May 29, 2026
Launch Week 5 🚀 Code evaluators - Run deterministic Python or TypeScript checks on observations and experiments in Langfuse. May 28, 2026
Launch Week 5 🚀 Full-Text Search - Langfuse Cloud is rolling out ClickHouse full-text search, improving UI search and adding the matches operator to Observations API v2. May 27, 2026
Launch Week 5 🚀 Langfuse agent skill - Your agent's playbook for production-ready LLM apps. May 26, 2026
Launch Week 5 🚀 Experiments CI/CD integration - Run Langfuse experiments in GitHub Actions to catch quality regressions before releasing changes to production. May 25, 2026
Blob storage, PostHog, and Mixpanel exports default to enriched observations - New Langfuse Cloud projects use enriched observations for blob storage, PostHog, and Mixpanel exports. May 20, 2026
Sign in with ClickHouse Cloud - Use your ClickHouse Cloud account to sign in to Langfuse Cloud. May 18, 2026
Choose columns and compress blob storage exports - Pick which field groups are written to each row and enable gzip compression in scheduled S3, GCS, and Azure exports. May 15, 2026
Trace context on /api/public/v2/observations - Fetch a trace's tags, release, and trace name directly on each observation row. May 15, 2026
Introducing Langfuse Academy - Langfuse Academy is an open explanation of the AI engineering lifecycle. May 14, 2026
Self-Service Enterprise SSO Setup - Organization admins can now verify domains and configure Enterprise SSO directly in settings. May 8, 2026
Langfuse Cloud Japan - A new dedicated cloud region in Tokyo. April 27, 2026
Manage LLM-as-a-Judge evaluators via the API - Create, version, and update LLM-as-a-Judge evaluators and evaluation rules programmatically through the (unstable) public API. April 15, 2026
Amazon Bedrock API Keys - Use Amazon Bedrock API keys to connect Bedrock models to Langfuse. April 13, 2026
Experiments as a First-Class Concept - Experiments now live alongside Datasets as their own top-level feature. April 13, 2026
Free-Form Text Scores - Capture open-ended feedback and qualitative annotations with the new TEXT score type. April 10, 2026
Boolean LLM-as-a-Judge Scores - LLM-as-a-Judge evaluators can now return boolean scores. April 8, 2026
Updates to Dashboards - Detailed reference for how dashboards behave differently in "Fast Preview". March 23, 2026
Categorical LLM-as-a-Judge Scores - LLM-as-a-Judge evaluators can now return categorical scores. March 20, 2026
Simplify Langfuse for Scale - Langfuse now delivers faster product performance at scale. March 10, 2026
Langfuse CLI - Use the full Langfuse feature set from the CLI. February 17, 2026
Evaluate Individual Operations - Observation-level evaluations enable precise operation-specific scoring for production monitoring. February 13, 2026
Run Experiments on Versioned Datasets - Fetch datasets at specific version timestamps and run experiments on historical dataset versions. February 11, 2026
Corrected Outputs for Traces and Observations - Capture improved versions of LLM outputs directly in trace views. January 14, 2026
Inline Comments on Observation I/O - Anchor comments to specific text selections within trace and observation input, output, and metadata fields. January 7, 2026
Filter Observations by Tool Calls - Add filtering, table columns, and dashboard widgets for analyzing tool usage. December 22, 2025
v2 Metrics and Observations API (Beta) - New high-performance v2 APIs for metrics and observations. December 17, 2025
Dataset Item Versioning - Track dataset changes over time with automatic versioning. December 15, 2025
OpenAI GPT-5.2 support - Langfuse supports OpenAI GPT-5.2 from day one. December 12, 2025
Batch Add Observations to Datasets - Select multiple observations and add them to datasets with flexible field mapping. December 11, 2025
Pricing Tiers for Accurate Model Cost Tracking - Support for pricing tiers for models with context-dependent pricing. December 2, 2025
Hosted MCP Server for Prompt Management - Native MCP server with write capabilities for AI agents to fetch and update prompts. November 20, 2025
OpenAI GPT-5.1 support - Langfuse supports OpenAI GPT-5.1 from day one. November 14, 2025
Launch Week 4 🚀 Organize Your Datasets in Folders - Use slashes in dataset names to create folders for better organization. October 27, 2025
Launch Week 4 🚀 JSON Schema Enforcement for Dataset Items - Define JSON schemas for dataset inputs and expected outputs. November 6, 2025
Fetched June 30, 2026


