releases.shpreview
ElevenLabs/ElevenLabs Changelog

ElevenLabs Changelog

$npx -y @buildinternet/releases show elevenlabs-changelog
Mon
Wed
Fri
AprMayJunJulAugSepOctNovDecJanFebMarApr
Less
More
Releases15Avg5/mo
Jun 30, 2025

Text to Voice Voice Design: Launched new Text to Voice Design with Eleven v3 for creating custom voices from text descriptions. Speech to Text Enhanced Diarization: Added diarization_threshold parameter to the Speech to Text endpoint. Fine-tune the balance between speaker accuracy and total speaker count by adjusting the threshold between 0.1 and 0.4. Professional Voice Cloning Background Noise Removal: Added remove_background_noise to clean up voice samples using audio isolation models for better quality training data. ElevenCreative Studio Video Support Detection: Added has_video property to chapter responses to indicate whether chapters contain video content. Workspaces Service Account Groups: Service accounts can now be added to workspace groups for better permission management and access control.Workspace Authentication: Added support for workspace authentication connections, enabling secure webhook tool integrations with external services. SDKs Python SDK: Released v2.6.0 with latest API support and bug fixes.JavaScript SDK: Released v2.5.0 with latest API support and bug fixes.React Agents Platform SDK: Added WebRTC support in 0.2.0 API

Jun 23, 2025

Tools migration Agents Platform tools migration: The way tools in Agents Platform are handled is being migrated, please see the guide here to understand what's changing and how to migrate Text to Speech Audio tags automatic removal: Audio tags are now automatically removed when switching from V3 to V2 models, ensuring optimal compatibility and performance. Agents Platform Tools management UI: Added a new comprehensive tools management interface for creating, configuring, and managing tools across all agents in your workspace.Streamlined agent creation: Introduced a new agent creation flow with improved user experience and better configuration options.Agent duplication: Added the ability to duplicate existing agents, allowing you to quickly create variations of successful agent configurations. SIP Trunking Inbound media encryption: Added support for configurable inbound media encryption settings for SIP trunk phone numbers, enhancing security options. Voices Famous voice category: Added a new "famous" voice category to the voice library, expanding the available voice options for users. Dubbing CSV frame rate control: Added csv_fps parameter to control frame rate when parsing CSV files for dubbing projects, providing more precise timing control. SDKs ElevenLabs JavaScript SDK v2.4.0: Released with new Agents Platform SDK support for Node.js. View release notesElevenLabs Python SDK v2.5.0: Updated with enhanced Agents Platform capabilities. View release notes API

Jun 17, 2025

Agents Platform Dynamic variables in simulated conversations: Added support for dynamic variable population in simulated conversations, enabling more flexible and context-aware conversation testing scenarios.MCP server integration: Introduced comprehensive support for Model Context Protocol (MCP) servers, allowing agents to connect to external tools and services through standardized protocols with configurable approval policies.Burst pricing for extra concurrency: Added bursting capability for workspace call limits, automatically allowing up to 3x the configured concurrency limit during peak usage for overflow capacity. ElevenCreative Studio JSON content initialization: Added support for initializing ElevenCreative Studio projects with structured JSON content through the from_content_json parameter, enabling programmatic project creation with predefined chapters, blocks, and voice configurations. Workspaces Webhook management: Introduced workspace-level webhook management capabilities, allowing administrators to view, configure, and monitor webhook integrations across the entire workspace with detailed usage tracking and failure diagnostics. API

Jun 8, 2025

Text to Speech Eleven v3 (alpha): Released Eleven v3 (alpha), our most expressive Text to Speech model, as a research preview. Agents Platform Custom voice settings in multi-voice: Added support for configuring individual voice settings per supported voice in multi-voice agents, allowing fine-tuned control over stability, speed, similarity boost, and streaming latency for each voice.Silent transfer to human in Twilio: Added backend configuration support for silent (cold) transfer to number in the Twilio native integration, enabling seamless handoff without announcing the transfer to callers.Batch calling retry and cancel: Added support for retrying outbound calls to phone numbers that did not respond during a batch call, along with the ability to cancel ongoing batch operations for better campaign management.LLM pinning: Added support for versioned LLM models with explicit checkpoint identifiersCustom LLM headers: Added support for passing custom headers to custom LLMsFixed issue in non-latin languages: Fixed an issue causing some conversations in non latin alphabet languages to fail. SDKs Python SDK v2.3.0: Released Python SDK v2.3.0JavaScript SDK v2.2.0: Released JavaScript SDK v2.2.0 API

Jun 1, 2025

Agents Platform Multi-voice support for agents: Enable ElevenLabs agents to dynamically switch between different voices during conversations for multi-character storytelling, language tutoring, and role-playing scenarios.Claude Sonnet 4 support: Added Claude Sonnet 4 as a new LLM option for conversational agents, providing enhanced reasoning capabilities and improved performance.Genesys Cloud integration: Introduced AudioHook Protocol integration for seamless connection with Genesys Cloud contact center platform.Force delete knowledge base documents: Added force parameter to knowledge base document deletion, allowing removal of documents even when used by agents.Multimodal widget: Added text input and text-only mode defaults for better user experience with improved widget configuration. API

May 26, 2025

Forced Aligment Forced alignment improvements: Fixed a rare failure case in forced alignment processing to improve reliability. Voices Live moderated voices filter: Added include_live_moderated query parameter to the shared voices endpoint, allowing you to include or exclude voices that are live moderated. Agents Platform Secret dynamic variables: Added support for specifying dynamic variables as secrets with the secret__ prefix. Secret dynamic variables can only be used in webhook tool headers and are never sent to an LLM, enhancing security for sensitive data. Learn more.Skip turn system tool: Introduced a new system tool called skip_turn. When enabled, the agent will skip its turn if the user explicitly indicates they need a moment to think or perform an action (e.g., "just a sec", "give me a minute"). This prevents turn timeout from being triggered during intentional user pauses. See the skip turn tool docs for more information.Text input support: Added text input support in websocket connections via "user_message" event with text field. Also added "user_activity" event support to indicate typing or other UI activity, improving agent turn-taking when there's interleaved text and audio input.RAG chunk limit: Added ability to configure the maximum number of chunks collected during RAG retrieval, giving users more control over context window usage and costs.Enhanced widget configuration: Expanded widget customization options to include text input and text only mode.LLM usage calculator: Introduced tools to calculate expected LLM token usage and costs for agents, helping with cost estimation and planning. Audio Native Accessibility improvements: Enhanced accessibility for the AudioNative player with multiple improvements:Added aria-labels for all buttonsEnabled keyboard navigation for all interactive elementsMade progress bar handle focusable and keyboard-accessibleImproved focus indicator visibility for better screen reader compatibility API

May 19, 2025

SDKs SDKs V2: Released new v2 SDKs for both Python and JavaScript Speech to Text Speech to text logprobs: The Speech to Text response now includes a logprob field for word prediction confidence. Billing Improved API error messages: Enhanced API error messages for subscriptions with failed payments. This provides clearer information if a failed payment has caused a user to reach their quota threshold sooner than expected. Agents Platform Batch calls: Released new batch calling functionality, which allows you to automate groups of outbound calls.Increased evaluation criteria limit: The maximum number of evaluation criteria for agent performance evaluation has been increased from 5 to 10.Human-readable IDs: Introduced human-readable IDs for key Agents Platform entities (e.g., agents, conversations). This improves usability and makes resources easier to identify and manage through the API and UI.Unanswered call tracking: 'Not Answered' outbound calls are now reliably detected and visible in the conversation history.LLM cost visibility in dashboard: The Agents Platform dashboard now displays the total and per-minute average LLM costs.Zero retention mode (ZRM) for agents: Allowed enabling Zero Retention Mode (ZRM) per agent.Dynamic variables in headers: Added option of setting dynamic variable as a header value for toolsCustomisable tool timeouts: Shipped setting different timeout durations per tool. Workspaces Simplified secret updates: Workspace secrets can now be updated more granularly using a PATCH request via the API, simplifying the management of individual secret values. For technical details, please see the API changes section below. API

May 12, 2025

Billing Downgraded Plan Pricing Fix: Fixed an issue where customers with downgraded subscriptions were shown their current price instead of the correct future price. Agents Platform Edit Knowledge Base Document Names: You can now edit the names of knowledge base documents. See: Knowledge BaseConversation Simulation: Released a new endpoint that allows you to test an agent over text ElevenCreative Studio Export Paragraphs as Zip: Added support for exporting separated paragraphs in a zip file. See: ElevenCreative Studio SDKs Released new SDKs:ElevenLabs Python v1.58.1ElevenLabs JS v1.58.0 API

May 5, 2025

Dubbing Disable Voice Cloning: Added an option in the Dubbing Studio UI to disable voice cloning when uploading audio, aligning with the existing disable_voice_cloning API parameter. Billing Quota Exceeded Error: Improved error messaging for exceeding character limits. Users attempting to generate audio beyond their quota within a short billing window will now receive a clearer 401 unauthorized: This request exceeds your quota limit of... error message indicating the limit has been exceeded. SDKs Released new SDKs: Added ElevenLabs Python v1.58.0 and ElevenLabs JS v1.58.0 to fix a breaking change that had been mistakenly shipped

Apr 28, 2025

Agents Platform Custom Dashboard Charts: The Agents Platform dashboard can now be extended with custom charts displaying the results of evaluation criteria over time. See the new GET and PATCH endpoints for managing dashboard settings.Call History Filtering: Added the ability to filter the call history by start date using the new call_start_before_unix parameter in the List Conversations endpoint. Try it here.Server Tools: Added option of making PUT requests in server toolsTransfer to number: Added call forwarding functionality to support forwarding to operators, see docs hereLanguage detection: Fixed an issue where the language detection system tool would trigger on a user replying yes in non-English language. Usage Analytics Custom Aggregation: Added an optional aggregation_interval parameter to the Get Usage Metrics endpoint to control the interval over which to aggregate character usage (hour, day, week, month, or cumulative).New Metric Breakdowns: The Usage Analytics section now supports additional metric breakdowns including minutes_used, request_count, ttfb_avg, and ttfb_p95, selectable via the new metric parameter in the Get Usage Metrics endpoint. Furthermore, you can now get a breakdown and filter by request_queue. API

Apr 21, 2025

Professional Voice Cloning (PVC) PVC API: Introduced a comprehensive suite of API endpoints for managing Professional Voice Clones (PVC). You can now programmatically create voices, add/manage/delete audio samples, retrieve audio/waveforms, manage speaker separation, handle verification, and initiate training. For a full list of new endpoints check the API changes summary below or read the PVC API reference here. Speech to Text Enhanced Export Options: Added options to include or exclude timestamps and speaker IDs when exporting Speech to Text results in segmented JSON format via the API. Agents Platform New LLM Models: Added support for new GPT-4.1 models: gpt-4.1, gpt-4.1-mini, and gpt-4.1-nanohereVAD Score: Added a new client event which sends VAD scores to the client, see reference here API

Apr 14, 2025

Voices New PVC flow: Added new flow for Professional Voice Clone creation, try it out here Agents Platform Agent-agent transfer: Added support for agent-to-agent transfers via a new system tool, enabling more complex conversational flows. See the Agent Transfer tool documentation for details.Enhanced tool debugging: Improved how tool execution details are displayed in the conversation history for easier debugging.Language detection fix: Resolved an issue regarding the forced calling of the language detection tool. Dubbing Render endpoint: Introduced a new endpoint to regenerate audio or video renders for specific languages within a dubbing project. This automatically handles missing transcriptions or translations. See the Render Dub endpoint.Increased size limit: Raised the maximum allowed file size for dubbing projects to 1 GiB. API

Apr 7, 2025

Speech to text scribe_v1_experimental: Launched a new experimental preview of the Scribe v1 model with improvements including improved performance on audio files with multiple languages, reduced hallucinations when audio is interleaved with silence, and improved audio tags. The new model is available via the API under the model name scribe_v1_experimental Text to speech A-law format support: Added a-law format with 8kHz sample rate to enable integration with European telephony systems.Fixed quota issues: Fixed a database bug that caused some requests to be mistakenly rejected as exceeding their quota. Agents Platform Document type filtering: Added support for filtering knowledge base documents by their type (file, URL, or text).Non-audio agents: Added support for conversational agents that don't output audio but still send response transcripts and can use tools. Non-audio agents can be enabled by removing the audio client event.Improved agent templates: Updated all agent templates with enhanced configurations and prompts. See more about how to improve system prompts here.Fixed stuck exports: Fixed an issue that caused exports to be stuck for extended periods. ElevenCreative Studio Fixed volume normalization: Fixed issue with streaming project snapshots when volume normalization is enabled. New API endpoints Forced alignment: Added new forced alignment endpoint for aligning audio with text, perfect for subtitle generation.Batch calling: Added batch calling endpoint for scheduling calls to multiple recipients API

Mar 31, 2025

Text to speech Opus format support: Added support for Opus format with 48kHz sample rate across multiple bitrates (32-192 kbps).Improved websocket error handling: Updated TTS websocket API to return more accurate error codes (1011 for internal errors instead of 1008) for better error identification and SLA monitoring. Agents Platform Twilio outbound: Added ability to natively run outbound calls.Post-call webhook override: Added ability to override post-call webhook settings at the agent level, providing more flexible configurations.Large knowledge base document viewing: Enhanced the knowledge base interface to allow viewing the entire content of large RAG documents.Added call SID dynamic variable: Added system__call_sid as a system dynamic variable to allow referencing the call ID in prompts and tools. ElevenCreative Studio Actor Mode: Added Actor Mode in ElevenCreative Studio, allowing you to use your own voice recordings to direct the way speech should sound in ElevenCreative Studio projects.Improved keyboard shortcuts: Updated keyboard shortcuts for viewing settings and editor shortcuts to avoid conflicts and simplified shortcuts for locking paragraphs. Dubbing Dubbing duplication: Made dubbing duplication feature available to all users.Manual mode foreground generation: Added ability to generate foreground audio when using manual mode with a file and CSV. Voices Enhanced voice collections: Improved voice collections with visual upgrades, language-based filtering, navigation breadcrumbs, collection images, and mouse dragging for carousel navigation.Locale filtering: Added locale parameter to shared voices endpoint for more precise voice filtering. API

Mar 24, 2025

Voices List Voices V2: Added a new V2 voice search endpoint with better search and additional filtering options Agents Platform Native outbound calling: Added native outbound calling for Twilio-configured numbers, eliminating the need for complex setup configurations. Outbound calls are now visible in the Call History page.Automatic language detection: Added new system tool for automatic language detection that enables agents to switch languages based on both explicit user requests ("Let's talk in Spanish") and implicit language in user audio.Pronunciation dictionary improvements: Fixed phoneme tags in pronunciation dictionaries to work correctly with Agents Platform.Large RAG document viewing: Added ability to view the entire content of large RAG documents in the knowledge base.Customizable widget controls: Updated UI to include an optional mute microphone button and made widget icons customizable via slots. Sound Effects Fractional duration support: Fixed an issue where users couldn't enter fractional values (like 0.5 seconds) for sound effect generation duration. Speech to Text Repetition handling: Improved detection and handling of repetitions in speech-to-text processing. ElevenCreative Studio Reader publishing fixes: Added support for mp3_44100_192 output format (high quality) so users below Publisher tier can export audio to Reader. Mobile Core app signup: Added signup endpoints for the new Core mobile app. API

Mar 17, 2025

Agents Platform Default LLM update: Changed the default agent LLM from Gemini 1.5 Flash to Gemini 2.0 Flash for improved performance.Fixed incorrect conversation abandons: Improved detection of conversation continuations, preventing premature abandons when users repeat themselves.Twilio information in history: Added Twilio call details to conversation history for better tracking.Knowledge base redesign: Redesigned the knowledge base interface.System dynamic variables: Added system dynamic variables to use time, conversation id, caller id and other system values as dynamic variables in prompts and tools.Twilio client initialisation: Adds an agent level override for conversation initiation client data twilio webhook.RAG chunks in history: Added retrieved chunks by RAG to the call transcripts in the history view. Speech to Text Reduced pricing: Reduced the pricing of our Scribe model, see more here.Improved VAD detection: Enhanced Voice Activity Detection with better pause detection at segment boundaries and improved handling of silent segments.Enhanced diarization: Improved speaker clustering with a better ECAPA model, symmetric connectivity matrix, and more selective speaker embedding generation.Fixed ASR bugs: Resolved issues with VAD rounding, silence and clustering that affected transcription accuracy. ElevenCreative Studio Disable publishing UI: Added ability to disable the publishing interface for specific workspace members to support enterprise workflows.Snapshot API improvement: Modified endpoints for project and chapter snapshots to return an empty list instead of throwing errors when snapshots can't be downloaded.Disabled auto-moderation: Turned off automatic moderation based on Text to Speech generations in ElevenCreative Studio. Workspaces Fixed API key editing: Resolved an issue where editing workspace API keys would reset character limits to zero, causing the keys to stop working.Optimized free subscriptions: Fixed an issue with refreshing free subscription character limits, API

Mar 10, 2025

Agents Platform HIPAA compliance: Agents Platform is now HIPAA compliant on appropriate plans, when a BAA is signed, zero-retention mode is enabled and appropriate LLMs are used. For access please contact salesCascade LLM: Added dynamic dispatch during the LLM step to other LLMs if your default LLM fails. This results in higher latency but prevents the turn failing.Better error messages: Added better error messages for websocket failures.Audio toggling: Added ability to select only user or agent audio in the conversation playback. Scribe HIPAA compliance: Added a zero retention mode to Scribe to be HIPAA compliant.Diarization: Increased time length of audio files that can be transcribed with diarization from 8 minutes to 2 hours.Cheaper pricing: Updated Scribe's pricing to be cheaper, as low as $0.22 per hour for the Business tier.Memory usage: Shipped improvements to Scribe's memory usage.Fixed timestamps: Fixed an issue that was causing incorrect timestamps to be returned. Text to Speech Pronunciation dictionaries: Fixed pronunciation dictionary rule application for replacements that contain symbols. Dubbing Studio support: Added support for creating dubs with dubbing_studio enabled, allowing for more advanced dubbing workflows beyond one-off dubs. Voices Verification: Fixed an issue where users on probation could not verify their voice clone. API

Mar 3, 2025

Dubbing Scribe for speech recognition: Dubbing Studio now uses Scribe by default for speech recognition to improve accuracy. Speech to Text Fixes: Shipped several fixes improving the stability of Speech to Text. Agents Platform Speed control: Added speed control to an agent's settings in Agents Platform.Post call webhook: Added the option of sending post-call webhooks after conversations are completed.Improved error messages: Added better error messages to the Agents Platform websocket.Claude 3.7 Sonnet: Added Claude 3.7 Sonnet as a new LLM option in Agents Platform. API

Feb 25, 2025

Speech to Text ElevenLabs launched a new state of the art Speech to Text API available in 99 languages. Text to Speech Speed control: Added speed control to the Text to Speech API. ElevenCreative Studio Auto-assigned projects: Increased token limits for auto-assigned projects from 1 month to 3 months worth of tokens, addressing user feedback about working on longer projects.Language detection: Added automatic language detection when generating audio for the first time, with suggestions to switch to Eleven Turbo v2.5 for languages not supported by Multilingual v2 (Hungarian, Norwegian, Vietnamese).Project export: Enhanced project exporting in ElevenReader with better metadata tracking. Dubbing Clip overlap prevention: Added automatic trimming of overlapping clips in dubbing jobs to ensure clean audio tracks for each speaker and language. Voice Management Instant Voice Cloning: Improved preview generation for Instant Voice Cloning v2, making previews available immediately. Agents Platform Agent ownership: Added display of agent creators in the agent list, improving visibility and management of shared agents. Web app Dark mode: Added dark mode to the web app. API

Feb 17, 2025

Agents Platform Tool calling fix: Fixed an issue where tool calling was not working with agents using gpt-4o mini. This was due to a breaking change in the OpenAI API.Tool calling improvements: Added support for tool calling with dynamic variables inside objects and arrays.Dynamic variables: Fixed an issue where dynamic variables of a conversation were not being displayed correctly. Voice Isolator Fixed: Fixed an issue that caused the voice isolator to not work correctly temporarily. Workspace Billing: Improved billing visibility by differentiating rollover, cycle, gifted, and usage-based credits.Usage Analytics: Improved usage analytics load times and readability.Fine grained fiat billing: Added support for customizable pricing based on several factors. API

Latest
Apr 13, 2026
Tracking Since
Oct 13, 2024
Last fetched Apr 19, 2026