Files
openclaw/docs/providers/senseaudio.md
Vincent Koc fcb188a41a docs(providers): tighten SenseAudio, Xiaomi, and Inworld pages
SenseAudio (docs/providers/senseaudio.md): removed the duplicate
'# SenseAudio' H1 (Mintlify renders title from frontmatter; an in-body
H1 produces a brittle anchor). Reordered the properties table to lead
with provider id, plugin, and the speechProviders/mediaUnderstanding
contract before the website/docs links, sourced from
extensions/senseaudio/openclaw.plugin.json. Lowercased the H2 to
'Getting started' for consistency.

Xiaomi (docs/providers/xiaomi.md): expanded the 4-row properties table
to include plugin, onboarding flag, direct CLI flag, and contracts
(chat completions plus speechProviders). The TTS default is surfaced
inline so readers see the dual-contract setup in one glance, sourced
from extensions/xiaomi/openclaw.plugin.json.

Inworld (docs/providers/inworld.md): renamed the table header from
'Detail' to 'Property' and added bundled-plugin status and the
speechProviders contract. Surfaced the audio output formats (MP3,
OGG_OPUS, PCM 22050 Hz) as a Property row so readers do not have to
read the Audio outputs accordion to confirm telephony support.
Verified against extensions/inworld/openclaw.plugin.json.
2026-05-05 17:33:59 -07:00

2.6 KiB

summary, read_when, title
summary read_when title
SenseAudio batch speech-to-text for inbound voice notes
You want SenseAudio speech-to-text for audio attachments
You need the SenseAudio API key env var or audio config path
SenseAudio

SenseAudio can transcribe inbound audio and voice-note attachments through OpenClaw's shared tools.media.audio pipeline. OpenClaw posts multipart audio to the OpenAI-compatible transcription endpoint and injects the returned text as {{Transcript}} plus an [Audio] block.

Property Value
Provider id senseaudio
Plugin bundled, enabledByDefault: true
Contract mediaUnderstandingProviders (audio)
Auth env var SENSEAUDIO_API_KEY
Default model senseaudio-asr-pro-1.5-260319
Default URL https://api.senseaudio.cn/v1
Website senseaudio.cn
Docs senseaudio.cn/docs

Getting started

```bash export SENSEAUDIO_API_KEY="..." ``` ```json5 { tools: { media: { audio: { enabled: true, models: [{ provider: "senseaudio", model: "senseaudio-asr-pro-1.5-260319" }], }, }, }, } ``` Send an audio message through any connected channel. OpenClaw uploads the audio to SenseAudio and uses the transcript in the reply pipeline.

Options

Option Path Description
model tools.media.audio.models[].model SenseAudio ASR model id
language tools.media.audio.models[].language Optional language hint
prompt tools.media.audio.prompt Optional transcription prompt
baseUrl tools.media.audio.baseUrl or model Override the OpenAI-compatible base
headers tools.media.audio.request.headers Extra request headers
SenseAudio is batch STT only in OpenClaw. Voice Call realtime transcription continues to use providers with streaming STT support.