Nemotron 3 Ultra now available on AI Gateway
Nemotron 3 Ultra from NVIDIA is now available on Vercel AI Gateway. It is an open Mixture-of-Experts reasoning model built for orchestrating long-running agent workflows, with a 1M-token context window.
Key features:
- Targets multi-turn agent workflows: planning, tool use, sub-agent delegation, and error recovery
- Throughput up to 350 tokens per second
- Up to 30% lower cost on agentic tasks
To use this model, set model to nvidia/nemotron-3-ultra-550b-a55b in the AI SDK.
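As a rough illustration of calling the model through the gateway, the sketch below builds a chat request for the model ID `nvidia/nemotron-3-ultra-550b-a55b`. To stay dependency-free it uses plain `fetch` rather than the AI SDK, and it assumes the gateway's OpenAI-compatible endpoint at `https://ai-gateway.vercel.sh/v1/chat/completions` with a bearer API key. The helper names `buildChatRequest` and `ask` are hypothetical, and the response shape is assumed to follow the OpenAI chat completions format.

```typescript
// Assumption: AI Gateway's OpenAI-compatible chat completions endpoint.
const GATEWAY_URL = "https://ai-gateway.vercel.sh/v1/chat/completions";
const MODEL_ID = "nvidia/nemotron-3-ultra-550b-a55b";

interface ChatRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Hypothetical helper: builds the HTTP request without sending it.
function buildChatRequest(prompt: string, apiKey: string): ChatRequest {
  return {
    url: GATEWAY_URL,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({
        model: MODEL_ID,
        messages: [{ role: "user", content: prompt }],
      }),
    },
  };
}

// Hypothetical helper: sends the request and returns the first reply.
// Assumes an OpenAI-style response body (choices[0].message.content).
async function ask(prompt: string, apiKey: string): Promise<string> {
  const { url, init } = buildChatRequest(prompt, apiKey);
  const res = await fetch(url, init);
  if (!res.ok) throw new Error(`Gateway request failed: ${res.status}`);
  const data = await res.json();
  return data.choices[0].message.content;
}
```

With the AI SDK, the same model string would typically be passed as the `model` argument instead of being placed in a hand-built request body; check the AI Gateway documentation for the exact setup.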
AI Gateway provides a unified API for calling models, tracking usage and cost, and configuring retries, failover, and performance optimizations. It also includes custom reporting, Zero Data Retention support, and dynamic provider sorting.
Fetched June 5, 2026


