mirror of
https://github.com/openclaw/openclaw.git
synced 2026-05-06 12:50:42 +00:00
66 lines
2.4 KiB
Markdown
66 lines
2.4 KiB
Markdown
---
|
|
summary: "SenseAudio batch speech-to-text for inbound voice notes"
|
|
read_when:
|
|
- You want SenseAudio speech-to-text for audio attachments
|
|
- You need the SenseAudio API key env var or audio config path
|
|
title: "SenseAudio"
|
|
---
|
|
|
|
# SenseAudio
|
|
|
|
SenseAudio can transcribe inbound audio/voice-note attachments through
|
|
OpenClaw's shared `tools.media.audio` pipeline. OpenClaw posts multipart audio
|
|
to the OpenAI-compatible transcription endpoint and injects the returned text
|
|
as `{{Transcript}}` plus an `[Audio]` block.
|
|
|
|
| Detail | Value |
|
|
| ------------- | ------------------------------------------------ |
|
|
| Website | [senseaudio.cn](https://senseaudio.cn) |
|
|
| Docs | [senseaudio.cn/docs](https://senseaudio.cn/docs) |
|
|
| Auth | `SENSEAUDIO_API_KEY` |
|
|
| Default model | `senseaudio-asr-pro-1.5-260319` |
|
|
| Default URL | `https://api.senseaudio.cn/v1` |
|
|
|
|
## Getting Started
|
|
|
|
<Steps>
|
|
<Step title="Set your API key">
|
|
```bash
|
|
export SENSEAUDIO_API_KEY="..."
|
|
```
|
|
</Step>
|
|
<Step title="Enable the audio provider">
|
|
```json5
|
|
{
|
|
tools: {
|
|
media: {
|
|
audio: {
|
|
enabled: true,
|
|
models: [{ provider: "senseaudio", model: "senseaudio-asr-pro-1.5-260319" }],
|
|
},
|
|
},
|
|
},
|
|
}
|
|
```
|
|
</Step>
|
|
<Step title="Send a voice note">
|
|
Send an audio message through any connected channel. OpenClaw uploads the
|
|
audio to SenseAudio and uses the transcript in the reply pipeline.
|
|
</Step>
|
|
</Steps>
|
|
|
|
## Options
|
|
|
|
| Option | Path | Description |
|
|
| ---------- | ------------------------------------- | ----------------------------------- |
|
|
| `model` | `tools.media.audio.models[].model` | SenseAudio ASR model id |
|
|
| `language` | `tools.media.audio.models[].language` | Optional language hint |
|
|
| `prompt` | `tools.media.audio.prompt` | Optional transcription prompt |
|
|
| `baseUrl` | `tools.media.audio.baseUrl` or model | Override the OpenAI-compatible base |
|
|
| `headers` | `tools.media.audio.request.headers` | Extra request headers |
|
|
|
|
<Note>
|
|
SenseAudio is batch STT only in OpenClaw. Voice Call realtime transcription
|
|
continues to use providers with streaming STT support.
|
|
</Note>
|