v4

Compare Experiments Faster

$ npx -y @buildinternet/releases show rel_lwhIl8PyMSxSh880ZUlmM

Rebuilt experiment screens with faster loading, standalone access, and enhanced filtering for efficient analysis.

Key improvements:

  • Faster loading and filtering, built on an observation-centric data model
  • Standalone experiments that don't require linked datasets; SDK-based experiments now visible in UI
  • Polished UI with baseline comparison, score threshold filtering, and visual deltas on scores, cost, and latency
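
To make baseline comparison and score threshold filtering concrete, here is a minimal sketch in plain Python. It does not use the Langfuse SDK or reflect how the UI computes its values; the `RunSummary` type, its fields, and both helper functions are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class RunSummary:
    """Hypothetical per-run aggregate; not a Langfuse type."""
    name: str
    avg_score: float
    total_cost_usd: float
    avg_latency_s: float


def deltas_vs_baseline(baseline: RunSummary, candidate: RunSummary) -> dict:
    """Return candidate minus baseline for score, cost, and latency."""
    return {
        "score": candidate.avg_score - baseline.avg_score,
        "cost_usd": candidate.total_cost_usd - baseline.total_cost_usd,
        "latency_s": candidate.avg_latency_s - baseline.avg_latency_s,
    }


def filter_by_score(runs: list, min_score: float) -> list:
    """Keep only runs whose average score meets the threshold."""
    return [run for run in runs if run.avg_score >= min_score]


# Illustrative numbers only, not measured results.
baseline = RunSummary("prompt-v1", avg_score=0.75, total_cost_usd=1.5, avg_latency_s=2.0)
candidate = RunSummary("prompt-v2", avg_score=0.5, total_cost_usd=2.0, avg_latency_s=1.5)

deltas = deltas_vs_baseline(baseline, candidate)
passing = filter_by_score([baseline, candidate], min_score=0.6)
```

In this sketch a negative score delta flags a possible regression in the candidate run, while negative cost or latency deltas are improvements.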

Run A/B tests between model versions, compare evaluation scores across prompt variants, or triage regressions with quicker feedback loops. The rebuilt experiments are currently in open beta on Langfuse Cloud only.

Fetched April 13, 2026