May 28, 2026
Product changes
- IP address filtering for enterprise plans -- Enterprise organizations can now restrict all API key usage to approved IP address ranges from Privacy settings, reducing the blast radius of leaked keys.
- Negation filters in activity and logs -- Filters now support exclusion — click "is" on any filter chip to toggle it to "is not", letting you exclude specific models, providers, API keys, or workspaces from your view. Docs
- Presets: Responses API support -- Creating presets from inference now supports the Responses API via
POST /api/v1/presets/:slug/responses, and new documentation covers creating and updating presets directly from inference request bodies. Docs - Prompt caching:
session_idsticky routing documentation -- Added docs on usingsession_idto control sticky provider routing, which activates on any successful request rather than requiring a cache hit. Docs - Anthropic thinking tokens in API responses -- API responses for Anthropic models now include
output_tokens_details.thinking_tokens, providing precise reasoning token counts for Claude Opus 4.8 and later. - Fixed: Responses API streaming crashes in OpenAI SDKs -- Streaming keep-alive heartbeats on the Responses API no longer cause
union_tag_invalid(Python) orinvalid_union(TypeScript) validation errors in the OpenAI SDKs.
New models
Fetched June 4, 2026


