mirror of
https://github.com/openclaw/openclaw.git
synced 2026-03-12 07:20:45 +00:00
53 lines
1.1 KiB
Markdown
---
name: openai-whisper-api
description: Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
homepage: https://platform.openai.com/docs/guides/speech-to-text
metadata:
  {
    "openclaw":
      {
        "emoji": "🌐",
        "requires": { "bins": ["curl"], "env": ["OPENAI_API_KEY"] },
        "primaryEnv": "OPENAI_API_KEY",
      },
  }
---

# OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI’s `/v1/audio/transcriptions` endpoint.

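For orientation, the request behind that endpoint can be sketched as a plain `curl` invocation. This is a minimal sketch, not the contents of `transcribe.sh`: the function name `build_transcribe_args` is hypothetical, the `text` response format is an assumed default, and the multipart fields (`file`, `model`, `response_format`) follow OpenAI's public API reference. It only assembles and prints the arguments, so nothing is sent and the key is never echoed.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of the skill): assemble curl arguments for
# POST /v1/audio/transcriptions and print them one per line.
# The Authorization header is printed with a literal $OPENAI_API_KEY
# placeholder so the real key never reaches the terminal.
build_transcribe_args() {
  local file="$1"
  local model="${2:-whisper-1}"   # default model named in this document
  local format="${3:-text}"       # assumed default; "json" is another valid value
  local -a args=(
    -sS
    "https://api.openai.com/v1/audio/transcriptions"
    -H "Authorization: Bearer \$OPENAI_API_KEY"
    -F "file=@${file}"
    -F "model=${model}"
    -F "response_format=${format}"
  )
  printf '%s\n' "${args[@]}"
}
```

Calling `build_transcribe_args /path/to/audio.m4a` lists the arguments a real request would pass to `curl`, with the key substituted at send time.
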
## Quick start

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
```

Defaults:

- Model: `whisper-1`
- Output: `<input>.txt`

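The default output name can be illustrated with a small path helper. This is a sketch under a stated assumption: the text above only says `<input>.txt`, so whether the script replaces the audio extension or appends to it is not confirmed here. The hypothetical `default_out_path` below assumes the extension is replaced.

```bash
#!/usr/bin/env bash
# Hypothetical helper: derive a default output path from the input path.
# Assumption: the audio extension is replaced (audio.m4a -> audio.txt);
# the real script may behave differently.
default_out_path() {
  local input="$1"
  local ext="${2:-txt}"
  local dir base
  dir="$(dirname -- "$input")"
  base="$(basename -- "$input")"
  # Strip the last extension from the file name only, so dots in
  # directory names are left alone.
  printf '%s/%s.%s\n' "$dir" "${base%.*}" "$ext"
}
```

Passing `json` as the second argument yields a `.json` path instead, which would pair naturally with a JSON response.
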
## Useful flags

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
```

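The flags above could be handled with an ordinary `while`/`case` loop. The following is a minimal sketch of that pattern, not the actual `transcribe.sh`: the function name `parse_transcribe_args`, the variable names, and the `text` default format are assumptions made for illustration.

```bash
#!/usr/bin/env bash
# Hypothetical sketch of parsing the flags shown above; not the real script.
# Results are left in global variables so the caller can read them.
parse_transcribe_args() {
  # Defaults: whisper-1 is documented; the "text" format is an assumption.
  input="" model="whisper-1" out="" language="" prompt="" format="text"
  while [[ $# -gt 0 ]]; do
    case "$1" in
      --model|--out|--language|--prompt)
        # Each of these flags takes exactly one value.
        if [[ $# -lt 2 ]]; then
          echo "missing value for $1" >&2
          return 2
        fi
        case "$1" in
          --model)    model="$2" ;;
          --out)      out="$2" ;;
          --language) language="$2" ;;
          --prompt)   prompt="$2" ;;
        esac
        shift 2
        ;;
      --json) format="json"; shift ;;
      -*)     echo "unknown flag: $1" >&2; return 2 ;;
      *)      input="$1"; shift ;;   # positional argument: the audio file
    esac
  done
  if [[ -z "$input" ]]; then
    echo "usage: transcribe.sh <audio-file> [flags]" >&2
    return 2
  fi
}
```

After a successful call, `$input`, `$model`, `$out`, `$language`, `$prompt`, and `$format` hold the parsed values. An unknown flag or a missing value returns status 2.
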
## API key

Set `OPENAI_API_KEY`, or configure it in `~/.openclaw/openclaw.json`:

```json5
{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE",
    },
  },
}
```
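
A pre-flight check can surface a missing key before any request is attempted. This sketch assumes the configured `apiKey` reaches the script as the `OPENAI_API_KEY` environment variable, which the `primaryEnv` entry in the front matter suggests but this document does not state outright. The function name `require_api_key` is hypothetical.

```bash
#!/usr/bin/env bash
# Hypothetical pre-flight check: fail early when OPENAI_API_KEY is missing
# or empty, instead of letting the API reject the request later.
require_api_key() {
  if [[ -z "${OPENAI_API_KEY:-}" ]]; then
    echo "OPENAI_API_KEY is not set; export it or add apiKey to ~/.openclaw/openclaw.json" >&2
    return 1
  fi
}
```

A caller might run `require_api_key || exit 1` at the top of a wrapper script.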