fix(image): honor generation timeout config

2026-05-06 17:31:06 +00:00 · 2026-04-25 18:25:13 +01:00
parent 80739731dd
commit 0bbb0eb735
19 changed files with 264 additions and 6 deletions
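
Note on precedence: the docs changes in this commit state that a per-call `timeoutMs` tool parameter overrides `agents.defaults.imageGenerationModel.timeoutMs`. The following is a minimal sketch of that rule, not the code from this commit. The function name `resolveImageTimeoutMs`, the `ImageGenerationModelConfig` interface, and the choice to ignore non-positive or non-finite values are all assumptions made for illustration.

```typescript
// Hypothetical sketch; names and validation rules are assumptions,
// not OpenClaw's actual implementation.
interface ImageGenerationModelConfig {
  primary?: string;
  // Mirrors agents.defaults.imageGenerationModel.timeoutMs from the docs.
  timeoutMs?: number;
}

// Assumption: only positive, finite numbers count as usable timeouts.
function isUsableTimeout(value: number | undefined): value is number {
  return typeof value === "number" && Number.isFinite(value) && value > 0;
}

// Returns the timeout to apply, or undefined when neither source sets one,
// leaving the caller free to fall back to whatever default it already has.
function resolveImageTimeoutMs(
  perCallTimeoutMs: number | undefined,
  config: ImageGenerationModelConfig | undefined,
): number | undefined {
  // The per-call tool parameter takes precedence when present.
  if (isUsableTimeout(perCallTimeoutMs)) {
    return perCallTimeoutMs;
  }
  // Otherwise use the configured default, if any.
  const configured = config?.timeoutMs;
  if (isUsableTimeout(configured)) {
    return configured;
  }
  return undefined;
}
```

With a configured `timeoutMs: 180_000`, a call that passes `timeoutMs: 60_000` would resolve to the per-call value, and a call that omits it would resolve to the configured one.
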
--- a/docs/.generated/config-baseline.sha256
+++ b/docs/.generated/config-baseline.sha256
@@ -1,4 +1,4 @@
-9a012a9c87b9010683289dc7d68ba5446a4b78beedf381e2c5f9d486f25a9213  config-baseline.json
-6128d6eff8c28d17194d1ae9ee7f72abae48da1c6476ab16e6378f1898e4373a  config-baseline.core.json
+439ff58a4a54f0f4bda959239f382cc3b2f94a282680dcd89bd3f8c93e0f07d0  config-baseline.json
+6ef86147534d12aa5ac7a9cf208b4627177090c92479a71dfd1791096d20353b  config-baseline.core.json
 7cd9c908f066c143eab2a201efbc9640f483ab28bba92ddeca1d18cc2b528bc3  config-baseline.channel.json
 7825b56a5b3fcdbe2e09ef8fe5d9f12ac3598435afebe20413051e45b0d1968e  config-baseline.plugin.json
--- a/docs/cli/infer.md
+++ b/docs/cli/infer.md
@@ -156,6 +156,7 @@ Use `image` for generation, edit, and description.
 ```bash
 openclaw infer image generate --prompt "friendly lobster illustration" --json
 openclaw infer image generate --prompt "cinematic product photo of headphones" --json
+openclaw infer image generate --prompt "slow image backend" --timeout-ms 180000 --json
 openclaw infer image describe --file ./photo.jpg --json
 openclaw infer image describe --file ./ui-screenshot.png --model openai/gpt-4.1-mini --json
 openclaw infer image describe --file ./photo.jpg --model ollama/qwen2.5vl:7b --json
--- a/docs/providers/openrouter.md
+++ b/docs/providers/openrouter.md
@@ -71,13 +71,14 @@ OpenRouter can also back the `image_generate` tool. Use an OpenRouter image mode
    defaults: {
      imageGenerationModel: {
        primary: "openrouter/google/gemini-3.1-flash-image-preview",
+        timeoutMs: 180_000,
      },
    },
  },
 }
 ```

-OpenClaw sends image requests to OpenRouter's chat completions image API with `modalities: ["image", "text"]`. Gemini image models receive supported `aspectRatio` and `resolution` hints through OpenRouter's `image_config`.
+OpenClaw sends image requests to OpenRouter's chat completions image API with `modalities: ["image", "text"]`. Gemini image models receive supported `aspectRatio` and `resolution` hints through OpenRouter's `image_config`. Use `agents.defaults.imageGenerationModel.timeoutMs` for slower OpenRouter image models; the `image_generate` tool's per-call `timeoutMs` parameter still wins.

 ## Text-to-speech

--- a/docs/tools/image-generation.md
+++ b/docs/tools/image-generation.md
@@ -24,6 +24,8 @@ The tool only appears when at least one image generation provider is available.
    defaults: {
      imageGenerationModel: {
        primary: "openai/gpt-image-2",
+        // Optional default provider request timeout for image_generate.
+        timeoutMs: 180_000,
      },
    },
  },
@@ -150,6 +152,7 @@ Tool results report the applied settings. When OpenClaw remaps geometry during p
    defaults: {
      imageGenerationModel: {
        primary: "openai/gpt-image-2",
+        timeoutMs: 180_000,
        fallbacks: [
          "openrouter/google/gemini-3.1-flash-image-preview",
          "google/gemini-3.1-flash-image-preview",
@@ -185,6 +188,8 @@ Notes:
  `agents.defaults.mediaGenerationAutoProviderFallback: false` if you want image
  generation to use only the explicit `model`, `primary`, and `fallbacks`
  entries.
+- Set `agents.defaults.imageGenerationModel.timeoutMs` for slow image backends.
+  A per-call `timeoutMs` tool parameter overrides the configured default.
 - Use `action: "list"` to inspect the currently registered providers, their
  default models, and auth env-var hints.