Introduced the new Flex and Priority inference tiers, offering more options for optimizing cost or latency.
Fetched May 1, 2026
Introduced revamped Usage Tiers and Billing Account spend caps for a better user billing experience.