releases.shpreview
Cloudflare/Cloudflare Changelog/Spend limits cap cumulative AI costs per model, provider, or user

Spend limits cap cumulative AI costs per model, provider, or user

1 featureThis release1 featureNew capabilitiesAI-tallied from the release notes

AI Gateway now supports spend limits — cost-based budgets that track cumulative dollar spend and block requests when the budget is exceeded. Unlike rate limiting, which caps the number of requests, spend limits track actual cost based on token usage and model pricing.

You can scope limits by model, provider, or custom metadata dimensions. For example, give each user a $200/day budget, cap total gateway spend at $10,000/day, or limit a specific model to $50/day per user. Each rule uses a configurable time window with fixed or sliding enforcement.

Spend limits work with both Unified Billing and BYOK requests for models with known pricing.

For more details, refer to the Spend limits documentation.

Fetched June 5, 2026