Files
openclaw/docs/providers/deepinfra.md
Georgi Atsev 9e7c2b356b fix(deepinfra): load all DeepInfra models when user wants to browse t… (#84549)
* fix(deepinfra): load all DeepInfra models when user wants to browse them during onboarding

* docs(deepinfra): align TTS default

* fix(deepinfra): refresh video fallbacks

* fix(deepinfra): share credential-aware catalog discovery

* test(deepinfra): narrow catalog regression types

* test(deepinfra): keep catalog narrowing across callback

* fix(deepinfra): preserve default model in live catalog

* fix(deepinfra): align default model pricing

* fix(deepinfra): keep pixverse as video default

* docs(deepinfra): match video fallback default

* fix(deepinfra): honor config api keys for live catalog

* test(e2e): wait for watchdog stdio close

* test(media): align live harness provider expectation

* fix(deepinfra): always augment custom catalogs

* test(e2e): resolve watchdog commands before spawning

---------

Co-authored-by: Peter Steinberger <steipete@gmail.com>
2026-05-27 10:43:37 +01:00

4.2 KiB

summary, read_when, title
summary read_when title
Use DeepInfra's unified API to access the most popular open source and frontier models in OpenClaw
You want a single API key for the top open source LLMs
You want to run models via DeepInfra's API in OpenClaw
DeepInfra

DeepInfra provides a unified API that routes requests to the most popular open source and frontier models behind a single endpoint and API key. It is OpenAI-compatible, so most OpenAI SDKs work by switching the base URL.

Getting an API key

  1. Go to https://deepinfra.com/
  2. Sign in or create an account
  3. Navigate to Dashboard / Keys and generate a new API key or use the auto created one

CLI setup

openclaw onboard --deepinfra-api-key <key>

Or set the environment variable:

export DEEPINFRA_API_KEY="<your-deepinfra-api-key>" # pragma: allowlist secret

Config snippet

{
  env: { DEEPINFRA_API_KEY: "<your-deepinfra-api-key>" }, // pragma: allowlist secret
  agents: {
    defaults: {
      model: { primary: "deepinfra/deepseek-ai/DeepSeek-V4-Flash" },
    },
  },
}

Supported OpenClaw surfaces

The bundled plugin registers all DeepInfra surfaces that match current OpenClaw provider contracts. Chat, image generation, and video generation refresh their model catalogues live from /v1/openai/models?sort_by=openclaw&filter=with_meta when DEEPINFRA_API_KEY is configured; the other surfaces use the curated static defaults below.

Surface Default model OpenClaw config/tool
Chat / model provider first chat-tagged entry from live catalog (manifest fallback deepseek-ai/DeepSeek-V4-Flash) agents.defaults.model
Image generation/editing first image-gen-tagged entry from live catalog (static fallback black-forest-labs/FLUX-1-schnell) image_generate, agents.defaults.imageGenerationModel
Media understanding moonshotai/Kimi-K2.5 for images inbound image understanding
Speech-to-text openai/whisper-large-v3-turbo inbound audio transcription
Text-to-speech hexgrad/Kokoro-82M messages.tts.provider: "deepinfra"
Video generation first video-gen-tagged entry from live catalog (static fallback Pixverse/Pixverse-T2V) video_generate, agents.defaults.videoGenerationModel
Memory embeddings BAAI/bge-m3 agents.defaults.memorySearch.provider: "deepinfra"

DeepInfra also exposes reranking, classification, object-detection, and other native model types. OpenClaw does not currently have first-class provider contracts for those categories, so this plugin does not register them yet.

Available models

OpenClaw dynamically discovers available DeepInfra models at startup. Use /models deepinfra to see the full list of models available.

Any model available on DeepInfra.com can be used with the deepinfra/ prefix:

deepinfra/deepseek-ai/DeepSeek-V4-Flash
deepinfra/deepseek-ai/DeepSeek-V3.2
deepinfra/MiniMaxAI/MiniMax-M2.5
deepinfra/moonshotai/Kimi-K2.5
deepinfra/nvidia/NVIDIA-Nemotron-3-Super-120B-A12B
deepinfra/zai-org/GLM-5.1
...and many more

Notes

  • Model refs are deepinfra/<provider>/<model> (e.g., deepinfra/Qwen/Qwen3-Max).
  • Default model: deepinfra/deepseek-ai/DeepSeek-V4-Flash
  • Base URL: https://api.deepinfra.com/v1/openai
  • Native video generation uses https://api.deepinfra.com/v1/inference/<model>.