
Pipeline stall timeouts configurable; voice pipeline hang fixed

@livekit/agents@1.4.1

Patch Changes

  • Expose AgentSessionOptions.ttsReadIdleTimeout and AgentSessionOptions.forwardAudioIdleTimeout to configure the two pipeline stall guards in performTTSInference and performAudioForwarding. This is useful for custom LLM/TTS backends whose first-token latency can legitimately exceed the previously fixed 10-second limit. Both options default to 10 seconds, preserving existing behavior. - #1461 (@s-hamdananwar)

  • Make default user turn span start times explicit. - #1456 (@rosetta-livekit-bot)

  • Prevent voice pipeline scheduling from hanging when a pipeline task crashes after a speech handle is already marked done. - #1423 (@u9g)

  • fix(google): abort pending realtime sends during reconnect - #1415 (@u9g)

  • docs(agents): document realtime capabilities - #1419 (@u9g)

  • feat(inference): propagate STT extra to SpeechData.metadata - #1389 (@toubatbrian)

    The inference STT plugin now forwards the gateway's per-transcript extra field to SpeechData.metadata, exposing provider-specific signals (e.g. Inworld voice profile, xAI speech_final) to consumers.

  • fix(worker): use available CPU cores for numIdleProcesses in production - #1449 (@KrishnaShuk)

  • fix(transcription): rstrip punctuation from interim segments - #1447 (@KrishnaShuk)

  • Emit agent configuration updates in OTLP session logs. - #1434 (@rosetta-livekit-bot)

  • fix(agents): persist user turn start across VAD bursts - #1457 (@rosetta-livekit-bot)

  • Support OpenAI Realtime Whisper STT - #1429 (@toubatbrian)
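
To illustrate what the two stall-guard options from #1461 control, here is a minimal sketch of an idle-timeout guard around a chunk stream. Only the option names ttsReadIdleTimeout and forwardAudioIdleTimeout come from the note above. The StallTimeoutOptions interface, resolveStallTimeouts, withIdleTimeout, IdleTimeoutError, and the use of milliseconds are all assumptions made for illustration; this is not the code inside performTTSInference or performAudioForwarding.

```typescript
// Hypothetical local mirror of the two option names from the release note.
// The real AgentSessionOptions type lives in @livekit/agents; the unit used
// here (milliseconds) is an assumption, so check the package typings.
interface StallTimeoutOptions {
  ttsReadIdleTimeout?: number;
  forwardAudioIdleTimeout?: number;
}

// The release note states that both guards default to 10 seconds.
const DEFAULT_IDLE_TIMEOUT_MS = 10_000;

// Fill in the default for any option the caller left unset.
function resolveStallTimeouts(
  opts: StallTimeoutOptions = {},
): Required<StallTimeoutOptions> {
  return {
    ttsReadIdleTimeout: opts.ttsReadIdleTimeout ?? DEFAULT_IDLE_TIMEOUT_MS,
    forwardAudioIdleTimeout:
      opts.forwardAudioIdleTimeout ?? DEFAULT_IDLE_TIMEOUT_MS,
  };
}

class IdleTimeoutError extends Error {
  constructor(idleMs: number) {
    super(`no data received for ${idleMs} ms`);
    this.name = "IdleTimeoutError";
  }
}

// Re-yields items from `source` and throws IdleTimeoutError if the wait for
// any single item exceeds `idleMs`. The timer restarts for every item, so a
// slow but steady stream is never interrupted.
async function* withIdleTimeout<T>(
  source: AsyncIterable<T>,
  idleMs: number,
): AsyncGenerator<T> {
  const it = source[Symbol.asyncIterator]();
  try {
    while (true) {
      let timer: ReturnType<typeof setTimeout> | undefined;
      const timeout = new Promise<never>((_, reject) => {
        timer = setTimeout(() => reject(new IdleTimeoutError(idleMs)), idleMs);
      });
      // Whichever settles first wins; the timer is always cleared afterwards.
      const result = await Promise.race([it.next(), timeout]).finally(() =>
        clearTimeout(timer),
      );
      if (result.done) return;
      yield result.value;
    }
  } finally {
    // Best-effort cleanup of the underlying stream; errors are ignored.
    void it.return?.().catch(() => {});
  }
}
```

A backend with slow first-token latency would raise the relevant value, for example resolveStallTimeouts({ ttsReadIdleTimeout: 30_000 }), and leave the other guard at its default. With the real package, the two options would presumably be set wherever AgentSessionOptions is accepted.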

Fetched May 13, 2026