SenseAudio (docs/providers/senseaudio.md): removed the duplicate
'# SenseAudio' H1 (Mintlify renders title from frontmatter; an in-body
H1 produces a brittle anchor). Reordered the properties table to lead
with provider id, plugin, and the speechProviders/mediaUnderstanding
contract before the website/docs links, sourced from
extensions/senseaudio/openclaw.plugin.json. Lowercased the H2 to
'Getting started' for consistency.
Xiaomi (docs/providers/xiaomi.md): expanded the 4-row properties table
to include plugin, onboarding flag, direct CLI flag, and contracts
(chat completions plus speechProviders). The TTS default is surfaced
inline so readers see the dual-contract setup in one glance, sourced
from extensions/xiaomi/openclaw.plugin.json.
Inworld (docs/providers/inworld.md): renamed the table header from
'Detail' to 'Property' and added bundled-plugin status and the
speechProviders contract. Surfaced the audio output formats (MP3,
OGG_OPUS, PCM 22050 Hz) as a Property row so readers do not have to
read the Audio outputs accordion to confirm telephony support.
Verified against extensions/inworld/openclaw.plugin.json.
Xiaomi MiMo is the API platform for MiMo models. OpenClaw includes a bundled xiaomi plugin that registers both an OpenAI-compatible chat provider and a speech (TTS) provider against the same XIAOMI_API_KEY.
Or pass the key directly:
```bash
openclaw onboard --auth-choice xiaomi-api-key --xiaomi-api-key "$XIAOMI_API_KEY"
```
```bash
openclaw models list --provider xiaomi
```
Built-in catalog
Model ref
Input
Context
Max output
Reasoning
Notes
xiaomi/mimo-v2-flash
text
262,144
8,192
No
Default model
xiaomi/mimo-v2-pro
text
1,048,576
32,000
Yes
Large context
xiaomi/mimo-v2-omni
text, image
262,144
32,000
Yes
Multimodal
The default model ref is `xiaomi/mimo-v2-flash`. The provider is injected automatically when `XIAOMI_API_KEY` is set or an auth profile exists.
Text-to-speech
The bundled xiaomi plugin also registers Xiaomi MiMo as a speech provider for
messages.tts. It calls Xiaomi's chat-completions TTS contract with the text as
an assistant message and optional style guidance as a user message.
Supported built-in voices include mimo_default, default_zh, default_en,
Mia, Chloe, Milo, and Dean. mimo-v2-tts is supported for older MiMo
TTS accounts; the default uses the current MiMo-V2.5 TTS model. For voice-note
targets such as Feishu and Telegram, OpenClaw transcodes Xiaomi output to 48kHz
Opus with ffmpeg before delivery.
The `xiaomi` provider is injected automatically when `XIAOMI_API_KEY` is set in your environment or an auth profile exists. You do not need to manually configure the provider unless you want to override model metadata or the base URL.
- **mimo-v2-flash** — lightweight and fast, ideal for general-purpose text tasks. No reasoning support.
- **mimo-v2-pro** — supports reasoning with a 1M token context window for long-document workloads.
- **mimo-v2-omni** — reasoning-enabled multimodal model that accepts both text and image inputs.
<Note>
All models use the `xiaomi/` prefix (for example `xiaomi/mimo-v2-pro`).
</Note>
- If models do not appear, confirm `XIAOMI_API_KEY` is set and valid.
- When the Gateway runs as a daemon, ensure the key is available to that process (for example in `~/.openclaw/.env` or via `env.shellEnv`).
<Warning>
Keys set only in your interactive shell are not visible to daemon-managed gateway processes. Use `~/.openclaw/.env` or `env.shellEnv` config for persistent availability.
</Warning>
Related
Choosing providers, model refs, and failover behavior.
Full OpenClaw configuration reference.
Xiaomi MiMo dashboard and API key management.