Document agent runtimes and the Codex v1 contract (#71270)

* Document agent runtimes and Codex v1 contract * Document agent runtimes and Codex v1 contract * Clarify Codex runtime fallback docs * Clarify runtime and harness terminology
2026-05-06 19:10:58 +00:00 · 2026-04-24 18:46:24 -07:00
parent 0970507078
commit 42ec7a868f
8 changed files with 268 additions and 37 deletions
--- a/docs/plugins/codex-harness.md
+++ b/docs/plugins/codex-harness.md
@@ -15,6 +15,11 @@ discovery, native thread resume, native compaction, and app-server execution.
 OpenClaw still owns chat channels, session files, model selection, tools,
 approvals, media delivery, and the visible transcript mirror.

+If you are trying to orient yourself, start with
+[Agent runtimes](/concepts/agent-runtimes). The short version is:
+`openai/gpt-5.5` is the model ref, `codex` is the runtime, and Telegram,
+Discord, Slack, or another channel remains the communication surface.
+
 Native Codex turns keep OpenClaw plugin hooks as the public compatibility layer.
 These are in-process OpenClaw hooks, not Codex `hooks.json` command hooks:

@@ -124,7 +129,6 @@ Use `openai/gpt-5.5`, enable the bundled plugin, and force the `codex` harness:
      model: "openai/gpt-5.5",
      embeddedHarness: {
        runtime: "codex",
-        fallback: "none",
      },
    },
  },
@@ -150,11 +154,24 @@ Legacy configs that set `agents.defaults.model` or an agent model to
 `codex/<model>` still auto-enable the bundled `codex` plugin. New configs should
 prefer `openai/<model>` plus the explicit `embeddedHarness` entry above.

-## Add Codex without replacing other models
+## Add Codex alongside other models

-Keep `runtime: "auto"` when you want legacy `codex/*` refs to select Codex and
-PI for everything else. For new configs, prefer explicit `runtime: "codex"` on
-the agents that should use the harness.
+Do not set `runtime: "codex"` globally if the same agent should freely switch
+between Codex and non-Codex provider models. A forced runtime applies to every
+embedded turn for that agent or session. If you select an Anthropic model while
+that runtime is forced, OpenClaw still tries the Codex harness and fails closed
+instead of silently routing that turn through PI.
+
+Use one of these shapes instead:
+
+- Put Codex on a dedicated agent with `embeddedHarness.runtime: "codex"`.
+- Keep the default agent on `runtime: "auto"` and PI fallback for normal mixed
+  provider usage.
+- Use legacy `codex/*` refs only for compatibility. New configs should prefer
+  `openai/*` plus an explicit Codex runtime policy.
+
+For example, this keeps the default agent on normal automatic selection and
+adds a separate Codex agent:

 ```json5
 {
@@ -167,28 +184,36 @@ the agents that should use the harness.
  },
  agents: {
    defaults: {
-      model: {
-        primary: "openai/gpt-5.5",
-        fallbacks: ["openai/gpt-5.5", "anthropic/claude-opus-4-6"],
-      },
-      models: {
-        "openai/gpt-5.5": { alias: "gpt" },
-        "anthropic/claude-opus-4-6": { alias: "opus" },
-      },
      embeddedHarness: {
-        runtime: "codex",
+        runtime: "auto",
        fallback: "pi",
      },
    },
+    list: [
+      {
+        id: "main",
+        default: true,
+        model: "anthropic/claude-opus-4-6",
+      },
+      {
+        id: "codex",
+        name: "Codex",
+        model: "openai/gpt-5.5",
+        embeddedHarness: {
+          runtime: "codex",
+        },
+      },
+    ],
  },
 }
 ```

 With this shape:

- `/model gpt` or `/model openai/gpt-5.5` uses the Codex app-server harness for this config.
- `/model opus` uses the Anthropic provider path.
- If a non-Codex model is selected, PI remains the compatibility harness.
+- The default `main` agent uses the normal provider path and PI compatibility fallback.
+- The `codex` agent uses the Codex app-server harness.
+- If Codex is missing or unsupported for the `codex` agent, the turn fails
+  instead of quietly using PI.

 ## Codex-only deployments

@@ -418,12 +443,17 @@ Local Codex with default stdio transport:
 }
 ```

-Codex-only harness validation, with PI fallback disabled:
+Codex-only harness validation:

 ```json5
 {
-  embeddedHarness: {
-    fallback: "none",
+  agents: {
+    defaults: {
+      model: "openai/gpt-5.5",
+      embeddedHarness: {
+        runtime: "codex",
+      },
+    },
  },
  plugins: {
    entries: {
@@ -545,6 +575,40 @@ Codex native `hook/started` and `hook/completed` app-server notifications are
 projected as `codex_app_server.hook` agent events for trajectory and debugging.
 They do not invoke OpenClaw plugin hooks.

+## V1 support contract
+
+Codex mode is not PI with a different model call underneath. Codex owns more of
+the native model loop, and OpenClaw adapts its plugin and session surfaces
+around that boundary.
+
+Supported in Codex runtime v1:
+
+| Surface                                 | Support                                 | Why                                                                                                                                  |
+| --------------------------------------- | --------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------ |
+| OpenAI model loop through Codex         | Supported                               | Codex app-server owns the OpenAI turn, native thread resume, and native tool continuation.                                           |
+| OpenClaw channel routing and delivery   | Supported                               | Telegram, Discord, Slack, WhatsApp, iMessage, and other channels stay outside the model runtime.                                     |
+| OpenClaw dynamic tools                  | Supported                               | Codex asks OpenClaw to execute these tools, so OpenClaw stays in the execution path.                                                 |
+| Prompt and context plugins              | Supported                               | OpenClaw builds prompt overlays and projects context into the Codex turn before starting or resuming the thread.                     |
+| Context engine lifecycle                | Supported                               | Assemble, ingest or after-turn maintenance, and context-engine compaction coordination run for Codex turns.                          |
+| Dynamic tool hooks                      | Supported                               | `before_tool_call`, `after_tool_call`, and tool-result middleware run around OpenClaw-owned dynamic tools.                           |
+| Lifecycle hooks                         | Supported as adapter observations       | `llm_input`, `llm_output`, `agent_end`, `before_compaction`, and `after_compaction` fire with honest Codex-mode payloads.            |
+| Native shell and patch block or observe | Supported through the native hook relay | Codex `PreToolUse` and `PostToolUse` are relayed for supported Codex-native tools. Blocking is supported; argument rewriting is not. |
+| Native permission policy                | Supported through the native hook relay | Codex `PermissionRequest` can be routed through OpenClaw policy where the runtime exposes it.                                        |
+| App-server trajectory capture           | Supported                               | OpenClaw records the request it sent to app-server and the app-server notifications it receives.                                     |
+
+Not supported in Codex runtime v1:
+
+| Surface                                             | V1 Boundary                                                                                                                                     | Future Path                                                                               |
+| --------------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------------- |
+| Native tool argument mutation                       | Codex native pre-tool hooks can block, but OpenClaw does not rewrite Codex-native tool arguments.                                               | Requires Codex hook/schema support for replacement tool input.                            |
+| Editable Codex-native transcript history            | Codex owns canonical native thread history. OpenClaw owns a mirror and can project future context, but should not mutate unsupported internals. | Add explicit Codex app-server APIs if native thread surgery is needed.                    |
+| `tool_result_persist` for Codex-native tool records | That hook transforms OpenClaw-owned transcript writes, not Codex-native tool records.                                                           | Could mirror transformed records, but canonical rewrite needs Codex support.              |
+| Rich native compaction metadata                     | OpenClaw observes compaction start and completion, but does not receive a stable kept/dropped list, token delta, or summary payload.            | Needs richer Codex compaction events.                                                     |
+| Compaction intervention                             | Current OpenClaw compaction hooks are notification-level in Codex mode.                                                                         | Add Codex pre/post compaction hooks if plugins need to veto or rewrite native compaction. |
+| Stop or final-answer gating                         | Codex has native stop hooks, but OpenClaw does not expose final-answer gating as a v1 plugin contract.                                          | Future opt-in primitive with loop and timeout safeguards.                                 |
+| Native MCP hook parity                              | Codex owns MCP execution, and full pre/post hook payload parity depends on Codex MCP handler support.                                           | Add Codex MCP hook payloads, then relay them through the same native hook path.           |
+| Byte-for-byte model API request capture             | OpenClaw can capture app-server requests and notifications, but Codex core builds the final OpenAI API request internally.                      | Needs a Codex model-request tracing event or debug API.                                   |
+
 ## Tools, media, and compaction

 The Codex harness changes the low-level embedded agent executor only.
@@ -601,12 +665,15 @@ or disable discovery.
 and that the remote app-server speaks the same Codex app-server protocol version.

 **A non-Codex model uses PI:** that is expected unless you forced
-`embeddedHarness.runtime: "codex"` (or selected a legacy `codex/*` ref). Plain
-`openai/gpt-*` and other provider refs stay on their normal provider path.
+`embeddedHarness.runtime: "codex"` for that agent or selected a legacy
+`codex/*` ref. Plain `openai/gpt-*` and other provider refs stay on their normal
+provider path in `auto` mode. If you force `runtime: "codex"`, every embedded
+turn for that agent must be a Codex-supported OpenAI model.

 ## Related

 - [Agent Harness Plugins](/plugins/sdk-agent-harness)
+- [Agent runtimes](/concepts/agent-runtimes)
 - [Model Providers](/concepts/model-providers)
 - [Configuration Reference](/gateway/configuration-reference)
 - [Testing](/help/testing-live#live-codex-app-server-harness-smoke)
--- a/docs/plugins/sdk-agent-harness.md
+++ b/docs/plugins/sdk-agent-harness.md
@@ -10,6 +10,7 @@ read_when:

 An **agent harness** is the low level executor for one prepared OpenClaw agent
 turn. It is not a model provider, not a channel, and not a tool registry.
+For the user-facing mental model, see [Agent runtimes](/concepts/agent-runtimes).

 Use this surface only for bundled or trusted native plugins. The contract is
 still experimental because the parameter types intentionally mirror the current
@@ -123,9 +124,10 @@ OpenClaw. The harness then claims that provider in `supports(...)`.

 The bundled Codex plugin follows this pattern:

- provider id: `codex`
- user model refs: `openai/gpt-5.5` plus `embeddedHarness.runtime: "codex"`;
-  legacy `codex/gpt-*` refs remain accepted for compatibility
+- preferred user model refs: `openai/gpt-5.5` plus
+  `embeddedHarness.runtime: "codex"`
+- compatibility refs: legacy `codex/gpt-*` refs remain accepted, but new
+  configs should not use them as normal provider/model refs
 - harness id: `codex`
 - auth: synthetic provider availability, because the Codex harness owns the
  native Codex login/session
@@ -170,10 +172,11 @@ model refs remain compatibility aliases for the native harness.
 When this mode runs, Codex owns the native thread id, resume behavior,
 compaction, and app-server execution. OpenClaw still owns the chat channel,
 visible transcript mirror, tool policy, approvals, media delivery, and session
-selection. Use `embeddedHarness.runtime: "codex"` with
-`embeddedHarness.fallback: "none"` when you need to prove that only the Codex
-app-server path can claim the run. That config is only a selection guard:
-Codex app-server failures already fail directly instead of retrying through PI.
+selection. Use `embeddedHarness.runtime: "codex"` without a `fallback` override
+when you need to prove that only the Codex app-server path can claim the run.
+Explicit plugin runtimes already fail closed by default. Set `fallback: "pi"`
+only when you intentionally want PI to handle missing harness selection. Codex
+app-server failures already fail directly instead of retrying through PI.

 ## Disable PI fallback

@@ -182,9 +185,12 @@ set to `{ runtime: "auto", fallback: "pi" }`. In `auto` mode, registered plugin
 harnesses can claim a provider/model pair. If none match, OpenClaw falls back
 to PI.

-Set `fallback: "none"` when you need missing plugin harness selection to fail
-instead of using PI. Selected plugin harness failures already fail hard. This
-does not block an explicit `runtime: "pi"` or `OPENCLAW_AGENT_RUNTIME=pi`.
+In `auto` mode, set `fallback: "none"` when you need missing plugin harness
+selection to fail instead of using PI. Explicit plugin runtimes such as
+`runtime: "codex"` already fail closed by default, unless `fallback: "pi"` is
+set in the same config or environment override scope. Selected plugin harness
+failures always fail hard. This does not block an explicit `runtime: "pi"` or
+`OPENCLAW_AGENT_RUNTIME=pi`.

 For Codex-only embedded runs:

@@ -194,8 +200,7 @@ For Codex-only embedded runs:
    "defaults": {
      "model": "openai/gpt-5.5",
      "embeddedHarness": {
-        "runtime": "codex",
-        "fallback": "none"
+        "runtime": "codex"
      }
    }
  }