From 1cfc10e836328025ed8ed7b3cedaf75cadef75df Mon Sep 17 00:00:00 2001 From: Peter Steinberger Date: Sat, 4 Apr 2026 12:27:34 +0100 Subject: [PATCH] docs: refresh minimax multimodal refs --- docs/concepts/model-providers.md | 7 +++++++ docs/nodes/media-understanding.md | 17 ++++++++++++----- docs/providers/minimax.md | 28 +++++++++++++++++++++++++--- docs/tools/image-generation.md | 17 +++++++++++------ docs/tools/minimax-search.md | 4 ++++ 5 files changed, 59 insertions(+), 14 deletions(-) diff --git a/docs/concepts/model-providers.md b/docs/concepts/model-providers.md index 6c3a0cf6d34..1992bb7aa2e 100644 --- a/docs/concepts/model-providers.md +++ b/docs/concepts/model-providers.md @@ -561,6 +561,13 @@ On MiniMax's Anthropic-compatible streaming path, OpenClaw disables thinking by default unless you explicitly set it, and `/fast on` rewrites `MiniMax-M2.7` to `MiniMax-M2.7-highspeed`. +Plugin-owned capability split: + +- Text/chat defaults stay on `minimax/MiniMax-M2.7` +- Image generation is `minimax/image-01` or `minimax-portal/image-01` +- Image understanding is plugin-owned `MiniMax-VL-01` on both MiniMax auth paths +- Web search stays on provider id `minimax` + ### Ollama Ollama ships as a bundled provider plugin and uses Ollama's native API: diff --git a/docs/nodes/media-understanding.md b/docs/nodes/media-understanding.md index 186c5f6a70f..be37f8161c7 100644 --- a/docs/nodes/media-understanding.md +++ b/docs/nodes/media-understanding.md @@ -214,11 +214,18 @@ If you omit `capabilities`, the entry is eligible for the list it appears in. ## Provider support matrix (OpenClaw integrations) -| Capability | Provider integration | Notes | -| ---------- | -------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------ | -| Image | OpenAI, OpenRouter, Anthropic, Google, MiniMax, Moonshot, Z.AI, config providers | Vendor plugins register image support; image-capable config providers auto-register. | -| Audio | OpenAI, Groq, Deepgram, Google, Mistral | Provider transcription (Whisper/Deepgram/Gemini/Voxtral). | -| Video | Google, Moonshot | Provider video understanding via vendor plugins. | +| Capability | Provider integration | Notes | +| ---------- | -------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------- | +| Image | OpenAI, OpenRouter, Anthropic, Google, MiniMax, Moonshot, Z.AI, config providers | Vendor plugins register image support; MiniMax and MiniMax OAuth both use `MiniMax-VL-01`; image-capable config providers auto-register. | +| Audio | OpenAI, Groq, Deepgram, Google, Mistral | Provider transcription (Whisper/Deepgram/Gemini/Voxtral). | +| Video | Google, Moonshot | Provider video understanding via vendor plugins. | + +MiniMax note: + +- `minimax` and `minimax-portal` image understanding comes from the plugin-owned + `MiniMax-VL-01` media provider. +- The bundled MiniMax text catalog still starts text-only; explicit + `models.providers.minimax` entries materialize image-capable M2.7 chat refs. ## Model selection guidance diff --git a/docs/providers/minimax.md b/docs/providers/minimax.md index d09d046e9b2..2abbd0ab4e4 100644 --- a/docs/providers/minimax.md +++ b/docs/providers/minimax.md @@ -13,8 +13,14 @@ OpenClaw's MiniMax provider defaults to **MiniMax M2.7**. MiniMax also provides: - bundled speech synthesis via T2A v2 +- bundled image understanding via `MiniMax-VL-01` - bundled `web_search` through the MiniMax Coding Plan search API +Provider split: + +- `minimax`: API-key text provider, plus bundled image generation, image understanding, speech, and web search +- `minimax-portal`: OAuth text provider, plus bundled image generation and image understanding + ## Model lineup - `MiniMax-M2.7`: default hosted reasoning model. @@ -45,13 +51,28 @@ To use MiniMax for image generation, set it as the image generation provider: The plugin uses the same `MINIMAX_API_KEY` or OAuth auth as the text models. No additional configuration is needed if MiniMax is already set up. +Both `minimax` and `minimax-portal` register `image_generate` with the same +`image-01` model. API-key setups use `MINIMAX_API_KEY`; OAuth setups can use +the bundled `minimax-portal` auth path instead. + When onboarding or API-key setup writes explicit `models.providers.minimax` entries, OpenClaw materializes `MiniMax-M2.7` and `MiniMax-M2.7-highspeed` with `input: ["text", "image"]`. -The bundled MiniMax provider catalog also advertises image input on those M2.7 -chat refs, so image-capable routing can use MiniMax without requiring explicit -provider config first. +The built-in bundled MiniMax text catalog itself stays text-only metadata until +that explicit provider config exists. Image understanding is exposed separately +through the plugin-owned `MiniMax-VL-01` media provider. + +## Image understanding + +The MiniMax plugin registers image understanding separately from the text +catalog: + +- `minimax`: default image model `MiniMax-VL-01` +- `minimax-portal`: default image model `MiniMax-VL-01` + +That is why automatic media routing can use MiniMax image understanding even +when the bundled text-provider catalog still shows text-only M2.7 chat refs. ## Web search @@ -64,6 +85,7 @@ search API. - Accepted env alias: `MINIMAX_CODING_API_KEY` - Compatibility fallback: `MINIMAX_API_KEY` when it already points at a coding-plan token - Region reuse: `plugins.entries.minimax.config.webSearch.region`, then `MINIMAX_API_HOST`, then MiniMax provider base URLs +- Search stays on provider id `minimax`; OAuth CN/global setup can still steer region indirectly through `models.providers.minimax-portal.baseUrl` Config lives under `plugins.entries.minimax.config.webSearch.*`. See [MiniMax Search](/tools/minimax-search). diff --git a/docs/tools/image-generation.md b/docs/tools/image-generation.md index 9554bac6106..303d62f4ac4 100644 --- a/docs/tools/image-generation.md +++ b/docs/tools/image-generation.md @@ -36,12 +36,12 @@ The agent calls `image_generate` automatically. No tool allow-listing needed — ## Supported providers -| Provider | Default model | Edit support | API key | -| -------- | -------------------------------- | ----------------------- | ------------------------------------ | -| OpenAI | `gpt-image-1` | Yes (up to 5 images) | `OPENAI_API_KEY` | -| Google | `gemini-3.1-flash-image-preview` | Yes | `GEMINI_API_KEY` or `GOOGLE_API_KEY` | -| fal | `fal-ai/flux/dev` | Yes | `FAL_KEY` | -| MiniMax | `image-01` | Yes (subject reference) | `MINIMAX_API_KEY` | +| Provider | Default model | Edit support | API key | +| -------- | -------------------------------- | ----------------------- | ----------------------------------------------------- | +| OpenAI | `gpt-image-1` | Yes (up to 5 images) | `OPENAI_API_KEY` | +| Google | `gemini-3.1-flash-image-preview` | Yes | `GEMINI_API_KEY` or `GOOGLE_API_KEY` | +| fal | `fal-ai/flux/dev` | Yes | `FAL_KEY` | +| MiniMax | `image-01` | Yes (subject reference) | `MINIMAX_API_KEY` or MiniMax OAuth (`minimax-portal`) | Use `action: "list"` to inspect available providers and models at runtime: @@ -117,6 +117,11 @@ OpenAI, Google, fal, and MiniMax support editing reference images. Pass a refere OpenAI and Google support up to 5 reference images via the `images` parameter. fal and MiniMax support 1. +MiniMax image generation is available through both bundled MiniMax auth paths: + +- `minimax/image-01` for API-key setups +- `minimax-portal/image-01` for OAuth setups + ## Provider capabilities | Capability | OpenAI | Google | fal | MiniMax | diff --git a/docs/tools/minimax-search.md b/docs/tools/minimax-search.md index e16dd56e5fb..1b7f5cd6ee0 100644 --- a/docs/tools/minimax-search.md +++ b/docs/tools/minimax-search.md @@ -80,6 +80,10 @@ the region in this order: That means CN onboarding or `MINIMAX_API_HOST=https://api.minimaxi.com/...` automatically keeps MiniMax Search on the CN host too. +Even when you authenticated MiniMax through the OAuth `minimax-portal` path, +web search still registers as provider id `minimax`; the OAuth provider base URL +is only used as a region hint for CN/global host selection. + ## Supported parameters MiniMax Search supports: