2.7 KiB
summary, read_when, title
| summary | read_when | title | ||
|---|---|---|---|---|
| Use Vydra image, video, and speech in OpenClaw |
|
Vydra |
Vydra
The bundled Vydra plugin adds:
- image generation via
vydra/grok-imagine - video generation via
vydra/veo3andvydra/kling - speech synthesis via Vydra's ElevenLabs-backed TTS route
OpenClaw uses the same VYDRA_API_KEY for all three capabilities.
Important base URL
Use https://www.vydra.ai/api/v1.
Vydra's apex host (https://vydra.ai/api/v1) currently redirects to www. Some HTTP clients drop Authorization on that cross-host redirect, which turns a valid API key into a misleading auth failure. The bundled plugin uses the www base URL directly to avoid that.
Setup
Interactive onboarding:
openclaw onboard --auth-choice vydra-api-key
Or set the env var directly:
export VYDRA_API_KEY="vydra_live_..."
Image generation
Default image model:
vydra/grok-imagine
Set it as the default image provider:
{
agents: {
defaults: {
imageGenerationModel: {
primary: "vydra/grok-imagine",
},
},
},
}
Current bundled support is text-to-image only. Vydra's hosted edit routes expect remote image URLs, and OpenClaw does not add a Vydra-specific upload bridge in the bundled plugin yet.
See Image Generation for shared tool behavior.
Video generation
Registered video models:
vydra/veo3for text-to-videovydra/klingfor image-to-video
Set Vydra as the default video provider:
{
agents: {
defaults: {
videoGenerationModel: {
primary: "vydra/veo3",
},
},
},
}
Notes:
vydra/veo3is bundled as text-to-video only.vydra/klingcurrently requires a remote image URL reference. Local file uploads are rejected up front.- The bundled plugin stays conservative and does not forward undocumented style knobs such as aspect ratio, resolution, watermark, or generated audio.
See Video Generation for shared tool behavior.
Speech synthesis
Set Vydra as the speech provider:
{
messages: {
tts: {
provider: "vydra",
providers: {
vydra: {
apiKey: "${VYDRA_API_KEY}",
voiceId: "21m00Tcm4TlvDq8ikWAM",
},
},
},
},
}
Defaults:
- model:
elevenlabs/tts - voice id:
21m00Tcm4TlvDq8ikWAM
The bundled plugin currently exposes one known-good default voice and returns MP3 audio files.