mirror of
https://github.com/openclaw/openclaw.git
synced 2026-06-03 20:04:05 +00:00
* refactor(providers): catalog voice models * feat(tts): route speech through voice models * refactor(tts): rename speaker selection fields * refactor(tts): mark default speech models * test(tts): type migrated speaker config assertions * refactor(providers): avoid catalog merge map spread * fix(tts): honor voice model fallbacks * refactor(tts): move speech core into package * chore(tts): register speech core knip workspace * fix(tts): show migrated speaker voice in status * fix(tts): satisfy speech core lint * fix(tts): preserve explicit model aliases * test(tts): narrow provider config assertion * test(doctor): allow slow commitments repair check --------- Co-authored-by: Peter Steinberger <steipete@gmail.com>
5.1 KiB
5.1 KiB
summary, read_when, title
| summary | read_when | title | ||
|---|---|---|---|---|
| Inworld streaming text-to-speech for OpenClaw replies |
|
Inworld |
Inworld is a streaming text-to-speech (TTS) provider. In OpenClaw it synthesizes outbound reply audio (MP3 by default, OGG_OPUS for voice notes) and PCM audio for telephony channels such as Voice Call.
OpenClaw posts to Inworld's streaming TTS endpoint, concatenates the returned base64 audio chunks into a single buffer, and hands the result to the standard reply-audio pipeline.
| Property | Value |
|---|---|
| Provider id | inworld |
| Plugin | bundled, enabledByDefault: true |
| Contract | speechProviders (TTS only) |
| Auth env var | INWORLD_API_KEY (HTTP Basic, Base64 dashboard credential) |
| Base URL | https://api.inworld.ai |
| Default voice | Sarah |
| Default model | inworld-tts-1.5-max |
| Output | MP3 (default), OGG_OPUS (voice notes), PCM 22050 Hz (telephony) |
| Website | inworld.ai |
| Docs | docs.inworld.ai/tts/tts |
Getting started
Copy the credential from your Inworld dashboard (Workspace > API Keys) and set it as an env var. The value is sent verbatim as the HTTP Basic credential, so do not Base64-encode it again or convert it to a bearer token.```
INWORLD_API_KEY=<base64-credential-from-dashboard>
```
```json5
{
messages: {
tts: {
auto: "always",
provider: "inworld",
providers: {
inworld: {
speakerVoiceId: "Sarah",
modelId: "inworld-tts-1.5-max",
},
},
},
},
}
```
Send a reply through any connected channel. OpenClaw synthesizes the
audio with Inworld and delivers it as MP3 (or OGG_OPUS when the channel
expects a voice note).
Configuration options
| Option | Path | Description |
|---|---|---|
apiKey |
messages.tts.providers.inworld.apiKey |
Base64 dashboard credential. Falls back to INWORLD_API_KEY. |
baseUrl |
messages.tts.providers.inworld.baseUrl |
Override Inworld API base URL (default https://api.inworld.ai). |
speakerVoiceId |
messages.tts.providers.inworld.speakerVoiceId |
Voice identifier (default Sarah). |
modelId |
messages.tts.providers.inworld.modelId |
TTS model id (default inworld-tts-1.5-max). |
temperature |
messages.tts.providers.inworld.temperature |
Sampling temperature 0..2 (optional). |