Nemotron 3 Ultra now available on AI Gateway

Nemotron 3 Ultra from NVIDIA is now available on Vercel AI Gateway. It is an open Mixture-of-Experts reasoning model built for orchestrating long-running agent workflows, with a 1M-token context window.

Key features:

  • Targets multi-turn agent workflows: planning, tool use, sub-agent delegation, and error recovery
  • Throughput up to 350 tokens per second
  • Up to 30% lower cost on agentic tasks
  • Use the model ID nvidia/nemotron-3-ultra-550b-a55b in the AI SDK
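
As a rough illustration of where the model ID goes, here is a minimal TypeScript sketch that builds a chat request and sends it with plain `fetch`. The model ID comes from the list above; everything else is an assumption made for illustration: the endpoint URL (taken to be AI Gateway's OpenAI-compatible chat completions route), the OpenAI-style response shape, and the helper names `buildChatRequest` and `askNemotron`, which are hypothetical and not part of any Vercel API.

```typescript
// Model ID as given in the release note.
const MODEL_ID = "nvidia/nemotron-3-ultra-550b-a55b";

// Assumption: AI Gateway's OpenAI-compatible chat completions endpoint.
const GATEWAY_URL = "https://ai-gateway.vercel.sh/v1/chat/completions";

interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface ChatRequest {
  model: string;
  messages: ChatMessage[];
}

// Hypothetical helper: builds the JSON body for a single-turn request.
// Pure function, no network access.
function buildChatRequest(prompt: string, system?: string): ChatRequest {
  const messages: ChatMessage[] = [];
  if (system) {
    messages.push({ role: "system", content: system });
  }
  messages.push({ role: "user", content: prompt });
  return { model: MODEL_ID, messages };
}

// Hypothetical helper: sends the request and returns the first reply.
// The caller supplies the API key (for example, read from an environment variable).
async function askNemotron(prompt: string, apiKey: string): Promise<string> {
  const response = await fetch(GATEWAY_URL, {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${apiKey}`,
    },
    body: JSON.stringify(buildChatRequest(prompt)),
  });
  if (!response.ok) {
    throw new Error(`Gateway request failed with status ${response.status}`);
  }
  // Assumption: the response follows the OpenAI chat completions shape.
  const data = (await response.json()) as {
    choices: { message: { content: string } }[];
  };
  return data.choices[0].message.content;
}
```

When using the AI SDK rather than raw HTTP, the same model ID string would presumably be passed as the `model` argument of a call such as `generateText`; check the AI SDK documentation for the exact setup.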

AI Gateway provides a unified API for calling models, tracking usage and cost, and configuring retries, failover, and performance optimizations. It also includes custom reporting, Zero Data Retention support, and dynamic provider sorting.

Fetched June 5, 2026
