openclaw/docs/providers/vydra.md at fa67ab2358df92d378eb75bcce2e730effa1cdae

vultr/openclaw

Fork 0

mirror of https://github.com/openclaw/openclaw.git synced 2026-04-13 18:21:27 +00:00

Files

Peter Steinberger 9b2b22f350 feat: add vydra media provider

2026-04-06 02:21:51 +01:00

2.7 KiB

Raw Blame History

summary, read_when, title

summary

read_when

title

Use Vydra image, video, and speech in OpenClaw

You want Vydra media generation in OpenClaw

You need Vydra API key setup guidance

Vydra

The bundled Vydra plugin adds:

image generation via vydra/grok-imagine
video generation via vydra/veo3 and vydra/kling
speech synthesis via Vydra's ElevenLabs-backed TTS route

OpenClaw uses the same VYDRA_API_KEY for all three capabilities.

Important base URL

Use https://www.vydra.ai/api/v1.

Vydra's apex host (https://vydra.ai/api/v1) currently redirects to www. Some HTTP clients drop Authorization on that cross-host redirect, which turns a valid API key into a misleading auth failure. The bundled plugin uses the www base URL directly to avoid that.

Setup

Interactive onboarding:

openclaw onboard --auth-choice vydra-api-key

Or set the env var directly:

export VYDRA_API_KEY="vydra_live_..."

Image generation

Default image model:

vydra/grok-imagine

Set it as the default image provider:

{
  agents: {
    defaults: {
      imageGenerationModel: {
        primary: "vydra/grok-imagine",
      },
    },
  },
}

Current bundled support is text-to-image only. Vydra's hosted edit routes expect remote image URLs, and OpenClaw does not add a Vydra-specific upload bridge in the bundled plugin yet.

See Image Generation for shared tool behavior.

Video generation

Registered video models:

vydra/veo3 for text-to-video
vydra/kling for image-to-video

Set Vydra as the default video provider:

{
  agents: {
    defaults: {
      videoGenerationModel: {
        primary: "vydra/veo3",
      },
    },
  },
}

Notes:

vydra/veo3 is bundled as text-to-video only.
vydra/kling currently requires a remote image URL reference. Local file uploads are rejected up front.
The bundled plugin stays conservative and does not forward undocumented style knobs such as aspect ratio, resolution, watermark, or generated audio.

See Video Generation for shared tool behavior.

Speech synthesis

Set Vydra as the speech provider:

{
  messages: {
    tts: {
      provider: "vydra",
      providers: {
        vydra: {
          apiKey: "${VYDRA_API_KEY}",
          voiceId: "21m00Tcm4TlvDq8ikWAM",
        },
      },
    },
  },
}

Defaults:

model: elevenlabs/tts
voice id: 21m00Tcm4TlvDq8ikWAM

The bundled plugin currently exposes one known-good default voice and returns MP3 audio files.

2.7 KiB Raw Blame History