Pipeline stall timeouts configurable; voice pipeline hang fixed
@livekit/agents@1.4.1
Patch Changes
-
Expose
AgentSessionOptions.ttsReadIdleTimeoutandAgentSessionOptions.forwardAudioIdleTimeoutto configure the two pipeline stall guards inperformTTSInferenceandperformAudioForwarding. Useful for custom LLM/TTS backends whose first-token latency can legitimately exceed the previous 10s default. Defaults remain 10 seconds, preserving existing behavior. - #1461 (@s-hamdananwar) -
Make default user turn span start times explicit. - #1456 (@rosetta-livekit-bot)
-
Prevent voice pipeline scheduling from hanging when a pipeline task crashes after a speech handle is already marked done. - #1423 (@u9g)
-
fix(google): abort pending realtime sends during reconnect - #1415 (@u9g)
-
feat(inference): propagate STT extra to SpeechData.metadata - #1389 (@toubatbrian)
The inference STT plugin now plumbs the gateway's per-transcript
extrafield ontoSpeechData.metadata, exposing provider-specific signals (e.g. Inworld voice profile, xAIspeech_final) to consumers. -
fix(worker): use available CPU cores for numIdleProcesses in production - #1449 (@KrishnaShuk)
-
fix(transcription): rstrip punctuation from interim segments - #1447 (@KrishnaShuk)
-
Emit agent configuration updates in OTLP session logs. - #1434 (@rosetta-livekit-bot)
-
fix(agents): persist user turn start across VAD bursts - #1457 (@rosetta-livekit-bot)
-
Support OpenAI Realtime Whisper STT - #1429 (@toubatbrian)
Fetched May 13, 2026
