The Claude Platform shipped a major model refresh and expanded its agentic capabilities. Opus 4.7 arrived with improved reasoning and breaking API changes that require migration from earlier versions; Sonnet 4.6 landed as a faster, balanced option with extended thinking, and 1M context is now generally available at standard pricing. The platform grew its agent-building toolkit with Managed Agents (fully sandboxed autonomous operation), an advisor tool (pairing fast executors with high-intelligence guidance), and the ant CLI for streamlined local development. Automatic caching simplified request optimization, the compaction API enabled effectively unbounded conversations through server-side summarization, and capability discovery moved into the Models API itself.
Extended output and expanded context reached general availability this month. The 1M token context window graduated to standard pricing for Claude Opus 4.6 and Sonnet 4.6, and the per-request media limit rose to 600 images or PDFs. The max_tokens cap on the Batches API grew to 300k for longer single-turn outputs, while a new display field for extended thinking lets developers omit thinking blocks for faster streaming. The Models API gained capability fields exposing max_input_tokens, max_tokens, and a capabilities object for runtime model discovery. The 1M context beta for Sonnet 4.5 and Sonnet 4 retires on April 30, after which requests exceeding 200k tokens will return an error on those models.
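
One way to use the new capability fields is a pre-flight check that compares a request against a model's advertised limits before sending it. The sketch below is illustrative only: it makes no network call, it assumes a model entry is a JSON object with an "id" key plus the top-level max_input_tokens and max_tokens fields named above, and the model ids and numbers are placeholders rather than real API output. ModelLimits, parse_limits, and check_request are hypothetical helper names, not part of any SDK.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelLimits:
    """The two numeric limits named in the text, as supplied by the caller."""
    model_id: str
    max_input_tokens: int
    max_tokens: int


def parse_limits(model_info: dict) -> ModelLimits:
    # Assumption: the entry is a flat JSON object with these keys.
    # The real response shape may differ; adjust the lookups accordingly.
    return ModelLimits(
        model_id=model_info["id"],
        max_input_tokens=int(model_info["max_input_tokens"]),
        max_tokens=int(model_info["max_tokens"]),
    )


def check_request(limits: ModelLimits, prompt_tokens: int, requested_output: int) -> list[str]:
    """Return a list of problems; an empty list means the request fits."""
    problems = []
    if prompt_tokens > limits.max_input_tokens:
        problems.append(
            f"{limits.model_id}: prompt of {prompt_tokens} tokens exceeds "
            f"max_input_tokens={limits.max_input_tokens}"
        )
    if requested_output > limits.max_tokens:
        problems.append(
            f"{limits.model_id}: requested output of {requested_output} tokens "
            f"exceeds max_tokens={limits.max_tokens}"
        )
    return problems


# Placeholder payloads: ids and values are made up for illustration.
long_context = parse_limits(
    {"id": "example-long-context", "max_input_tokens": 1_000_000, "max_tokens": 64_000}
)
short_context = parse_limits(
    {"id": "example-short-context", "max_input_tokens": 200_000, "max_tokens": 64_000}
)

for limits in (long_context, short_context):
    issues = check_request(limits, prompt_tokens=350_000, requested_output=8_000)
    print(limits.model_id, "OK" if not issues else issues)
```

Reading the limits at runtime rather than hard-coding them means a change such as the 200k cutoff on the retiring beta would surface as a local validation message instead of a failed request, provided the advertised values reflect that change.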