Usage reporting for thinking tokens, MCP tunnels, and self-hosted sandboxes expanded
The Messages API response now includes usage.output_tokens_details.thinking_tokens, reporting how many of the billed output tokens were extended thinking. When streaming, the breakdown appears only on the final message_delta event. No beta header is required.
Fetched June 6, 2026



