fix: support parakeet-mlx output-dir transcript parsing (#9177) (thanks @mac-110)

2026-05-06 15:00:41 +00:00 · 2026-03-02 22:21:57 +00:00
parent f257818ea5
commit d89c25d69e
5 changed files with 95 additions and 1 deletions
--- a/docs/nodes/audio.md
+++ b/docs/nodes/audio.md
@@ -176,5 +176,6 @@ When `requireMention: true` is set for a group chat, OpenClaw now transcribes au

 - Scope rules use first-match wins. `chatType` is normalized to `direct`, `group`, or `room`.
 - Ensure your CLI exits 0 and prints plain text; JSON needs to be massaged via `jq -r .text`.
+- For `parakeet-mlx`, if you pass `--output-dir`, OpenClaw reads `<output-dir>/<media-basename>.txt` when `--output-format` is `txt` (or omitted); non-`txt` output formats fall back to stdout parsing.
 - Keep timeouts reasonable (`timeoutSeconds`, default 60s) to avoid blocking the reply queue.
 - Preflight transcription only processes the **first** audio attachment for mention detection. Additional audio is processed during the main media understanding phase.
--- a/docs/nodes/media-understanding.md
+++ b/docs/nodes/media-understanding.md
@@ -210,6 +210,7 @@ If you omit `capabilities`, the entry is eligible for the list it appears in.

 - `openai/gpt-4o-mini-transcribe`, `groq/whisper-large-v3-turbo`, `deepgram/nova-3`, or `mistral/voxtral-mini-latest`.
 - CLI fallback: `whisper-cli` (whisper-cpp) or `whisper`.
+- `parakeet-mlx` note: with `--output-dir`, OpenClaw reads `<output-dir>/<media-basename>.txt` when output format is `txt` (or unspecified); non-`txt` formats fall back to stdout.
 - Deepgram setup: [Deepgram (audio transcription)](/providers/deepgram).

 **Video**