SenseAudio batch speech-to-text for inbound voice notes
You want SenseAudio speech-to-text for audio attachments
You need the SenseAudio API key environment variable or the audio config path
SenseAudio
SenseAudio can transcribe inbound audio and voice-note attachments through OpenClaw's shared tools.media.audio pipeline. OpenClaw posts multipart audio to the OpenAI-compatible transcription endpoint and injects the returned text as {{Transcript}} plus an [Audio] block.
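To make the request shape concrete, here is a minimal sketch of the kind of multipart POST an OpenAI-compatible transcription endpoint expects, written with the Python standard library only. It is an illustration, not OpenClaw's actual implementation: the helper names `build_transcription_request` and `extract_transcript` are hypothetical, the base URL and model name are placeholders you would replace with the values from SenseAudio's documentation, and the `/audio/transcriptions` path, the `file` and `model` form fields, and the `{"text": ...}` response shape are assumed from the OpenAI transcription API convention.

```python
import json
import urllib.request
import uuid


def build_transcription_request(base_url, api_key, model, filename, audio_bytes,
                                content_type="audio/ogg"):
    """Build (but do not send) a multipart POST for an OpenAI-style endpoint.

    Assumption: the provider follows the OpenAI convention of
    POST {base_url}/audio/transcriptions with `model` and `file` form fields.
    """
    boundary = uuid.uuid4().hex

    # Plain text form field carrying the model name.
    model_part = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
    ).encode()

    # File form field: headers first, then the raw audio bytes.
    file_header = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        f"Content-Type: {content_type}\r\n\r\n"
    ).encode()

    closing = f"\r\n--{boundary}--\r\n".encode()
    body = model_part + file_header + audio_bytes + closing

    return urllib.request.Request(
        base_url.rstrip("/") + "/audio/transcriptions",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


def extract_transcript(response_body):
    """Pull the transcript out of an OpenAI-style JSON response body."""
    return json.loads(response_body)["text"]
```

Passing the returned request to `urllib.request.urlopen` would send it; the text returned by `extract_transcript` corresponds to the value OpenClaw injects as {{Transcript}}.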