Langfuse now supports pricing tiers for models with context-dependent pricing, enabling accurate cost calculation. Some model providers charge different rates depending on the number of input tokens used. For example, Anthropic's Claude Sonnet 4.5 with 1M context window, Google's Gemini 2.5 Pro and Gemini 3 Pro Preview all apply higher pricing when more than 200K input tokens are used.
How it works: Pricing tiers allow multiple price points for a single model, each with conditions that determine when that tier applies. Tiers are evaluated in priority order, and the first matching tier is used for cost calculation.
Pre-configured models: The following models now have pricing tiers pre-configured:
Custom pricing tiers: You can also define pricing tiers for your own custom models via the Langfuse UI or API. Each tier includes a name, priority, conditions that determine when the tier applies, and prices for cost per usage type.

Fetched April 13, 2026