LLM-as-a-Judge evaluators can now return boolean scores for true / false decisions. This lets you model simple binary decisions directly as native boolean scores and analyze them with existing score tooling.
Key features:
- Boolean as a score type when creating a custom LLM-as-a-Judge evaluator
- true / false outcomes as native boolean scores

Use cases: Detect User Disagreement, Out-of-Scope Requests, or Insufficient Answers as true / false decisions. Boolean scores complement existing numeric and categorical score types.
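
To make the idea of a native boolean score concrete, here is a minimal Python sketch of how a judge's true / false verdict might be represented and aggregated. It is illustrative only and does not use the product's SDK: `BooleanScore`, `parse_boolean_verdict`, and `true_rate` are hypothetical names, and the assumption that the judge replies with the literal text "true" or "false" is ours.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class BooleanScore:
    """Hypothetical container for one true / false judge decision."""
    name: str
    value: bool
    reasoning: str = ""


def parse_boolean_verdict(name: str, raw: str, reasoning: str = "") -> BooleanScore:
    """Turn a judge's raw text verdict into a boolean score.

    Assumes the judge answers with the literal word "true" or "false"
    (case-insensitive, surrounding whitespace ignored).
    """
    normalized = raw.strip().lower()
    if normalized == "true":
        return BooleanScore(name, True, reasoning)
    if normalized == "false":
        return BooleanScore(name, False, reasoning)
    # Reject anything else instead of guessing a value.
    raise ValueError(f"Not a boolean verdict: {raw!r}")


def true_rate(scores: Sequence[BooleanScore]) -> float:
    """Fraction of scores whose value is True."""
    if not scores:
        raise ValueError("No scores to aggregate")
    return sum(score.value for score in scores) / len(scores)


# Example: four judge verdicts for a hypothetical "out_of_scope" evaluator.
verdicts = ["true", "false", " True ", "true"]
scores = [parse_boolean_verdict("out_of_scope", v) for v in verdicts]
print(true_rate(scores))  # 0.75
```

Keeping the value as a real boolean, instead of encoding it as 0 / 1 or as a "yes" / "no" category, means aggregates such as the share of true outcomes need no extra mapping step.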
