feat(openai): default images to gpt-image-2

2026-05-06 14:30:45 +00:00 · 2026-04-21 21:49:16 +01:00
parent 0b1a35363e
commit aa94501f5f
16 changed files with 164 additions and 48 deletions
--- a/docs/tools/image-generation.md
+++ b/docs/tools/image-generation.md
@@ -25,7 +25,7 @@ The tool only appears when at least one image generation provider is available.
  agents: {
    defaults: {
      imageGenerationModel: {
-        primary: "openai/gpt-image-1",
+        primary: "openai/gpt-image-2",
      },
    },
  },
@@ -40,7 +40,7 @@ The agent calls `image_generate` automatically. No tool allow-listing needed —

 | Provider | Default model                    | Edit support                       | API key                                               |
 | -------- | -------------------------------- | ---------------------------------- | ----------------------------------------------------- |
-| OpenAI   | `gpt-image-1`                    | Yes (up to 5 images)               | `OPENAI_API_KEY`                                      |
+| OpenAI   | `gpt-image-2`                    | Yes (up to 5 images)               | `OPENAI_API_KEY`                                      |
 | Google   | `gemini-3.1-flash-image-preview` | Yes                                | `GEMINI_API_KEY` or `GOOGLE_API_KEY`                  |
 | fal      | `fal-ai/flux/dev`                | Yes                                | `FAL_KEY`                                             |
 | MiniMax  | `image-01`                       | Yes (subject reference)            | `MINIMAX_API_KEY` or MiniMax OAuth (`minimax-portal`) |
@@ -59,10 +59,10 @@ Use `action: "list"` to inspect available providers and models at runtime:
 | ------------- | -------- | ------------------------------------------------------------------------------------- |
 | `prompt`      | string   | Image generation prompt (required for `action: "generate"`)                           |
 | `action`      | string   | `"generate"` (default) or `"list"` to inspect providers                               |
-| `model`       | string   | Provider/model override, e.g. `openai/gpt-image-1`                                    |
+| `model`       | string   | Provider/model override, e.g. `openai/gpt-image-2`                                    |
 | `image`       | string   | Single reference image path or URL for edit mode                                      |
 | `images`      | string[] | Multiple reference images for edit mode (up to 5)                                     |
-| `size`        | string   | Size hint: `1024x1024`, `1536x1024`, `1024x1536`, `1024x1792`, `1792x1024`            |
+| `size`        | string   | Size hint: `1024x1024`, `1536x1024`, `1024x1536`, `2048x2048`, `3840x2160`            |
 | `aspectRatio` | string   | Aspect ratio: `1:1`, `2:3`, `3:2`, `3:4`, `4:3`, `4:5`, `5:4`, `9:16`, `16:9`, `21:9` |
 | `resolution`  | string   | Resolution hint: `1K`, `2K`, or `4K`                                                  |
 | `count`       | number   | Number of images to generate (1–4)                                                    |
@@ -81,7 +81,7 @@ Tool results report the applied settings. When OpenClaw remaps geometry during p
  agents: {
    defaults: {
      imageGenerationModel: {
-        primary: "openai/gpt-image-1",
+        primary: "openai/gpt-image-2",
        fallbacks: ["google/gemini-3.1-flash-image-preview", "fal/fal-ai/flux/dev"],
      },
    },
@@ -123,6 +123,42 @@ OpenAI, Google, fal, MiniMax, and ComfyUI support editing reference images. Pass

 OpenAI and Google support up to 5 reference images via the `images` parameter. fal, MiniMax, and ComfyUI support 1.

+### OpenAI `gpt-image-2`
+
+OpenAI image generation defaults to `openai/gpt-image-2`. The older
+`openai/gpt-image-1` model can still be selected explicitly, but new OpenAI
+image-generation and image-editing requests should use `gpt-image-2`.
+
+`gpt-image-2` supports both text-to-image generation and reference-image
+editing through the same `image_generate` tool. OpenClaw forwards `prompt`,
+`count`, `size`, and reference images to OpenAI. OpenAI does not receive
+`aspectRatio` or `resolution` directly; when possible OpenClaw maps those into a
+supported `size`, otherwise the tool reports them as ignored overrides.
+
+Generate one 4K landscape image:
+
+```
+/tool image_generate action=generate model=openai/gpt-image-2 prompt="A clean editorial poster for OpenClaw image generation" size=3840x2160 count=1
+```
+
+Generate two square images:
+
+```
+/tool image_generate action=generate model=openai/gpt-image-2 prompt="Two visual directions for a calm productivity app icon" size=1024x1024 count=2
+```
+
+Edit one local reference image:
+
+```
+/tool image_generate action=generate model=openai/gpt-image-2 prompt="Keep the subject, replace the background with a bright studio setup" image=/path/to/reference.png size=1024x1536
+```
+
+Edit with multiple references:
+
+```
+/tool image_generate action=generate model=openai/gpt-image-2 prompt="Combine the character identity from the first image with the color palette from the second" images='["/path/to/character.png","/path/to/palette.jpg"]' size=1536x1024
+```
+
 MiniMax image generation is available through both bundled MiniMax auth paths:

 - `minimax/image-01` for API-key setups
@@ -134,7 +170,7 @@ MiniMax image generation is available through both bundled MiniMax auth paths:
 | --------------------- | -------------------- | -------------------- | ------------------- | -------------------------- | ---------------------------------- | ------- |
 | Generate              | Yes (up to 4)        | Yes (up to 4)        | Yes (up to 4)       | Yes (up to 9)              | Yes (workflow-defined outputs)     | Yes (1) |
 | Edit/reference        | Yes (up to 5 images) | Yes (up to 5 images) | Yes (1 image)       | Yes (1 image, subject ref) | Yes (1 image, workflow-configured) | No      |
-| Size control          | Yes                  | Yes                  | Yes                 | No                         | No                                 | No      |
+| Size control          | Yes (up to 4K)       | Yes                  | Yes                 | No                         | No                                 | No      |
 | Aspect ratio          | No                   | Yes                  | Yes (generate only) | Yes                        | No                                 | No      |
 | Resolution (1K/2K/4K) | No                   | Yes                  | Yes                 | No                         | No                                 | No      |