From fcb188a41a4821a89e8a760581f40ebe2fdb79be Mon Sep 17 00:00:00 2001 From: Vincent Koc Date: Tue, 5 May 2026 17:33:22 -0700 Subject: [PATCH] docs(providers): tighten SenseAudio, Xiaomi, and Inworld pages SenseAudio (docs/providers/senseaudio.md): removed the duplicate '# SenseAudio' H1 (Mintlify renders title from frontmatter; an in-body H1 produces a brittle anchor). Reordered the properties table to lead with provider id, plugin, and the speechProviders/mediaUnderstanding contract before the website/docs links, sourced from extensions/senseaudio/openclaw.plugin.json. Lowercased the H2 to 'Getting started' for consistency. Xiaomi (docs/providers/xiaomi.md): expanded the 4-row properties table to include plugin, onboarding flag, direct CLI flag, and contracts (chat completions plus speechProviders). The TTS default is surfaced inline so readers see the dual-contract setup in one glance, sourced from extensions/xiaomi/openclaw.plugin.json. Inworld (docs/providers/inworld.md): renamed the table header from 'Detail' to 'Property' and added bundled-plugin status and the speechProviders contract. Surfaced the audio output formats (MP3, OGG_OPUS, PCM 22050 Hz) as a Property row so readers do not have to read the Audio outputs accordion to confirm telephony support. Verified against extensions/inworld/openclaw.plugin.json. --- docs/providers/inworld.md | 19 ++++++++++++------- docs/providers/senseaudio.md | 20 +++++++++----------- docs/providers/xiaomi.md | 21 +++++++++++++-------- 3 files changed, 34 insertions(+), 26 deletions(-) diff --git a/docs/providers/inworld.md b/docs/providers/inworld.md index d0e6e964c2f..46be662dff1 100644 --- a/docs/providers/inworld.md +++ b/docs/providers/inworld.md @@ -14,13 +14,18 @@ OpenClaw posts to Inworld's streaming TTS endpoint, concatenates the returned base64 audio chunks into a single buffer, and hands the result to the standard reply-audio pipeline. -| Detail | Value | -| ------------- | ----------------------------------------------------------- | -| Website | [inworld.ai](https://inworld.ai) | -| Docs | [docs.inworld.ai/tts/tts](https://docs.inworld.ai/tts/tts) | -| Auth | `INWORLD_API_KEY` (HTTP Basic, Base64 dashboard credential) | -| Default voice | `Sarah` | -| Default model | `inworld-tts-1.5-max` | +| Property | Value | +| ------------- | --------------------------------------------------------------- | +| Provider id | `inworld` | +| Plugin | bundled, `enabledByDefault: true` | +| Contract | `speechProviders` (TTS only) | +| Auth env var | `INWORLD_API_KEY` (HTTP Basic, Base64 dashboard credential) | +| Base URL | `https://api.inworld.ai` | +| Default voice | `Sarah` | +| Default model | `inworld-tts-1.5-max` | +| Output | MP3 (default), OGG_OPUS (voice notes), PCM 22050 Hz (telephony) | +| Website | [inworld.ai](https://inworld.ai) | +| Docs | [docs.inworld.ai/tts/tts](https://docs.inworld.ai/tts/tts) | ## Getting started diff --git a/docs/providers/senseaudio.md b/docs/providers/senseaudio.md index 24913485a4d..b90e67dc132 100644 --- a/docs/providers/senseaudio.md +++ b/docs/providers/senseaudio.md @@ -6,22 +6,20 @@ read_when: title: "SenseAudio" --- -# SenseAudio +SenseAudio can transcribe inbound audio and voice-note attachments through OpenClaw's shared `tools.media.audio` pipeline. OpenClaw posts multipart audio to the OpenAI-compatible transcription endpoint and injects the returned text as `{{Transcript}}` plus an `[Audio]` block. -SenseAudio can transcribe inbound audio/voice-note attachments through -OpenClaw's shared `tools.media.audio` pipeline. OpenClaw posts multipart audio -to the OpenAI-compatible transcription endpoint and injects the returned text -as `{{Transcript}}` plus an `[Audio]` block. - -| Detail | Value | +| Property | Value | | ------------- | ------------------------------------------------ | -| Website | [senseaudio.cn](https://senseaudio.cn) | -| Docs | [senseaudio.cn/docs](https://senseaudio.cn/docs) | -| Auth | `SENSEAUDIO_API_KEY` | +| Provider id | `senseaudio` | +| Plugin | bundled, `enabledByDefault: true` | +| Contract | `mediaUnderstandingProviders` (audio) | +| Auth env var | `SENSEAUDIO_API_KEY` | | Default model | `senseaudio-asr-pro-1.5-260319` | | Default URL | `https://api.senseaudio.cn/v1` | +| Website | [senseaudio.cn](https://senseaudio.cn) | +| Docs | [senseaudio.cn/docs](https://senseaudio.cn/docs) | -## Getting Started +## Getting started diff --git a/docs/providers/xiaomi.md b/docs/providers/xiaomi.md index 93cc0aa8e8e..91b71a1133d 100644 --- a/docs/providers/xiaomi.md +++ b/docs/providers/xiaomi.md @@ -6,15 +6,20 @@ read_when: title: "Xiaomi MiMo" --- -Xiaomi MiMo is the API platform for **MiMo** models. OpenClaw uses the Xiaomi -OpenAI-compatible endpoint with API-key authentication. +Xiaomi MiMo is the API platform for **MiMo** models. OpenClaw includes a bundled `xiaomi` plugin that registers both an OpenAI-compatible chat provider and a speech (TTS) provider against the same `XIAOMI_API_KEY`. -| Property | Value | -| -------- | ------------------------------- | -| Provider | `xiaomi` | -| Auth | `XIAOMI_API_KEY` | -| API | OpenAI-compatible | -| Base URL | `https://api.xiaomimimo.com/v1` | +| Property | Value | +| --------------- | ---------------------------------------- | +| Provider id | `xiaomi` | +| Plugin | bundled, `enabledByDefault: true` | +| Auth env var | `XIAOMI_API_KEY` | +| Onboarding flag | `--auth-choice xiaomi-api-key` | +| Direct CLI flag | `--xiaomi-api-key ` | +| Contracts | chat completions + `speechProviders` | +| API | OpenAI-compatible (`openai-completions`) | +| Base URL | `https://api.xiaomimimo.com/v1` | +| Default model | `xiaomi/mimo-v2-flash` | +| TTS default | `mimo-v2.5-tts`, voice `mimo_default` | ## Getting started