openclaw/docs/providers/senseaudio.md at main

mirror of https://github.com/openclaw/openclaw.git synced 2026-05-06 05:40:44 +00:00

Files

Vincent Koc fcb188a41a docs(providers): tighten SenseAudio, Xiaomi, and Inworld pages

SenseAudio (docs/providers/senseaudio.md): removed the duplicate
'# SenseAudio' H1 (Mintlify renders title from frontmatter; an in-body
H1 produces a brittle anchor). Reordered the properties table to lead
with provider id, plugin, and the speechProviders/mediaUnderstanding
contract before the website/docs links, sourced from
extensions/senseaudio/openclaw.plugin.json. Lowercased the H2 to
'Getting started' for consistency.

Xiaomi (docs/providers/xiaomi.md): expanded the 4-row properties table
to include plugin, onboarding flag, direct CLI flag, and contracts
(chat completions plus speechProviders). The TTS default is surfaced
inline so readers see the dual-contract setup in one glance, sourced
from extensions/xiaomi/openclaw.plugin.json.

Inworld (docs/providers/inworld.md): renamed the table header from
'Detail' to 'Property' and added bundled-plugin status and the
speechProviders contract. Surfaced the audio output formats (MP3,
OGG_OPUS, PCM 22050 Hz) as a Property row so readers do not have to
read the Audio outputs accordion to confirm telephony support.
Verified against extensions/inworld/openclaw.plugin.json.

2026-05-05 17:33:59 -07:00

2.6 KiB

Raw Permalink Blame History

summary, read_when, title

summary

read_when

title

SenseAudio batch speech-to-text for inbound voice notes

You want SenseAudio speech-to-text for audio attachments

You need the SenseAudio API key env var or audio config path

SenseAudio

SenseAudio can transcribe inbound audio and voice-note attachments through OpenClaw's shared tools.media.audio pipeline. OpenClaw posts multipart audio to the OpenAI-compatible transcription endpoint and injects the returned text as {{Transcript}} plus an [Audio] block.

Property	Value
Provider id	`senseaudio`
Plugin	bundled, `enabledByDefault: true`
Contract	`mediaUnderstandingProviders` (audio)
Auth env var	`SENSEAUDIO_API_KEY`
Default model	`senseaudio-asr-pro-1.5-260319`
Default URL	`https://api.senseaudio.cn/v1`
Website	senseaudio.cn
Docs	senseaudio.cn/docs

Getting started

```bash export SENSEAUDIO_API_KEY="..." ``` ```json5 { tools: { media: { audio: { enabled: true, models: [{ provider: "senseaudio", model: "senseaudio-asr-pro-1.5-260319" }], }, }, }, } ``` Send an audio message through any connected channel. OpenClaw uploads the audio to SenseAudio and uses the transcript in the reply pipeline.

Options

Option	Path	Description
`model`	`tools.media.audio.models[].model`	SenseAudio ASR model id
`language`	`tools.media.audio.models[].language`	Optional language hint
`prompt`	`tools.media.audio.prompt`	Optional transcription prompt
`baseUrl`	`tools.media.audio.baseUrl` or model	Override the OpenAI-compatible base
`headers`	`tools.media.audio.request.headers`	Extra request headers

SenseAudio is batch STT only in OpenClaw. Voice Call realtime transcription continues to use providers with streaming STT support.

2.6 KiB Raw Permalink Blame History

Getting started

Options

2.6 KiB

Raw Permalink Blame History