fix(tts): normalize streamed tts voice media

This commit is contained in:
Peter Steinberger
2026-04-26 04:28:08 +01:00
parent f4e6322649
commit 2c8c79de5c
4 changed files with 50 additions and 2 deletions

View File

@@ -797,6 +797,9 @@ When enabled, OpenClaw:
- skips very short replies (< 10 chars).
- summarizes long replies when enabled using `agents.defaults.model.primary` (or `summaryModel`).
- attaches the generated audio to the reply.
- in `mode: "final"`, still sends audio-only TTS for streamed final replies
after the text stream completes; the generated media goes through the same
channel media normalization as normal reply attachments.
If the reply exceeds `maxLength` and summary is off (or no API key for the
summary model), audio