Boolean LLM-as-a-Judge Scores

LLM-as-a-Judge evaluators can now return boolean scores for true / false decisions. This makes it easier to model simple binary decisions directly as native boolean scores and analyze them across existing score tooling.

Key features:

Choose Boolean when creating a custom LLM-as-a-Judge evaluator
Store true / false outcomes as native boolean scores
Analyze boolean evaluator outputs in dashboards, filters, and score analytics

Use cases: Detect User Disagreement, Out-of-Scope Requests, or Insufficient Answers as true/false decisions. Boolean scores complement existing numeric and categorical score types.

More from Langfuse

More from Langfuse