Rebuilt experiment screens with faster loading, standalone access, and enhanced filtering for efficient analysis.
Key improvements:
- Faster loading and filtering leveraging observation-centric data model
- Standalone experiments that don't require linked datasets; SDK-based experiments now visible in UI
- Polished UI with visual deltas on scores, cost, and latency, baseline comparison, and score threshold filtering
Run A/B tests between model versions, compare evaluation scores across prompt variants, or triage regressions with quicker feedback loops. Currently in open beta on Langfuse Cloud only.
Fetched April 13, 2026

