| summary | read_when | title |
|---|---|---|
| How active-run steering queues messages at runtime boundaries | | Steering queue |
When a normal prompt arrives while a session run is already streaming, OpenClaw
tries to send that prompt into the active runtime. This steer queue mode is the
default, so it needs no config entry and no queue directive. Pi and the native
Codex app-server harness implement the delivery details differently.
Runtime boundary
Steering does not interrupt a tool call that is already running. Pi checks for queued steering messages at model boundaries:
- The assistant asks for tool calls.
- Pi executes the current assistant message's tool-call batch.
- Pi emits the turn end event.
- Pi drains queued steering messages.
- Pi appends those messages as user messages before the next LLM call.
This keeps tool results paired with the assistant message that requested them, then lets the next model call see the latest user input.
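The boundary steps above can be sketched as a small loop. This is a minimal illustration, not Pi's actual implementation: the `Message` type, `runTurns`, `callModel`, and `runTool` are all hypothetical names introduced here, and the model and tools are plain synchronous callbacks to keep the sketch short.

```typescript
// Hypothetical message shapes for illustration; not Pi's real types.
type Message =
  | { role: "user"; text: string }
  | { role: "assistant"; text: string; toolCalls: string[] }
  | { role: "tool"; call: string; result: string };

type ModelStep = (history: readonly Message[]) => { text: string; toolCalls: string[] };

function runTurns(
  history: Message[],
  steeringQueue: string[],
  callModel: ModelStep,
  runTool: (call: string) => string,
  maxTurns = 10,
): Message[] {
  for (let turn = 0; turn < maxTurns; turn++) {
    // The assistant replies and may ask for tool calls.
    const reply = callModel(history);
    history.push({ role: "assistant", text: reply.text, toolCalls: reply.toolCalls });

    // Run the whole tool-call batch; queued steering never interrupts it.
    for (const call of reply.toolCalls) {
      history.push({ role: "tool", call, result: runTool(call) });
    }

    // Turn end: drain queued steering and append it as user messages,
    // so the next model call sees it after the paired tool results.
    const steered = steeringQueue.splice(0, steeringQueue.length);
    for (const text of steered) {
      history.push({ role: "user", text });
    }

    // Nothing left to do: no tools were requested and nothing was steered.
    if (reply.toolCalls.length === 0 && steered.length === 0) break;
  }
  return history;
}
```

A message pushed onto `steeringQueue` while a tool is running lands in the history only after every tool result of that batch, which is the pairing described above.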
The native Codex app-server harness exposes turn/steer instead of Pi's
internal steering queue. OpenClaw batches queued prompts for the configured
quiet window, then sends a single turn/steer request with all collected user
input in arrival order.
Codex review and manual compaction turns reject same-turn steering. When a
runtime cannot accept steering in steer mode, OpenClaw waits for the active
run to finish before starting the prompt.
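The batching and fallback described above can be modeled roughly as follows. This is a sketch under stated assumptions, not the harness code: `Arrival`, `batchByQuietWindow`, `deliverBatch`, and `trySteer` are hypothetical names, arrivals are assumed to be sorted by time, and the steering request is modeled as a synchronous callback even though a real request would be asynchronous.

```typescript
// Hypothetical record of a prompt arriving during an active run.
interface Arrival {
  atMs: number;
  text: string;
}

// Groups prompts into batches. A batch closes once no new prompt has
// arrived for at least quietMs; prompts keep their arrival order.
function batchByQuietWindow(arrivals: readonly Arrival[], quietMs: number): string[][] {
  const batches: string[][] = [];
  let current: string[] | undefined;
  let lastAt = 0;
  for (const arrival of arrivals) {
    if (current === undefined || arrival.atMs - lastAt >= quietMs) {
      current = [];
      batches.push(current);
    }
    current.push(arrival.text);
    lastAt = arrival.atMs;
  }
  return batches;
}

type Delivery =
  | { kind: "steered"; texts: string[] }
  | { kind: "wait-for-run-end"; texts: string[] };

// Sends one batch as a single steering request. If the runtime rejects it
// (for example during a review or manual compaction turn), the prompts are
// held until the active run finishes.
function deliverBatch(texts: string[], trySteer: (texts: string[]) => boolean): Delivery {
  return trySteer(texts)
    ? { kind: "steered", texts }
    : { kind: "wait-for-run-end", texts };
}
```

With a 500 ms quiet window, prompts arriving at 0, 100, and 200 ms form one batch, and a prompt at 2000 ms starts a second one.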
This page explains queue-mode steering for normal inbound messages when the mode
is steer. If the mode is followup or collect, normal messages do not enter
this steering path; they wait until the active run finishes. For the explicit
/steer <message> command, see Steer.
Modes
| Mode | Active-run behavior | Later behavior |
|---|---|---|
| `steer` | Steers the prompt into the active runtime when it can. | Waits for the active run to finish if steering is unavailable. |
| `followup` | Does not steer. | Runs queued messages later after the active run ends. |
| `collect` | Does not steer. | Coalesces compatible queued messages into one later turn after the debounce window. |
| `interrupt` | Aborts the active run instead of steering it. | Starts the newest message after aborting. |
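The mode table can also be read as a single decision function. The sketch below is illustrative only: `sketchActiveRunAction` and the action labels are hypothetical names chosen for this example, and `canSteer` stands in for whatever check tells OpenClaw that the runtime can accept steering.

```typescript
type QueueMode = "steer" | "followup" | "collect" | "interrupt";

// Hypothetical labels for the outcomes listed in the table.
type ActiveRunAction =
  | "steer-into-active-run"
  | "wait-for-run-end"
  | "run-followup-after-run-end"
  | "coalesce-after-debounce"
  | "abort-and-start-newest";

// Maps a queue mode to what happens to a prompt that arrives during an active run.
function sketchActiveRunAction(mode: QueueMode, canSteer: boolean): ActiveRunAction {
  switch (mode) {
    case "steer":
      // Steering is attempted first; if the runtime cannot accept it,
      // the prompt waits for the active run to finish.
      return canSteer ? "steer-into-active-run" : "wait-for-run-end";
    case "followup":
      return "run-followup-after-run-end";
    case "collect":
      return "coalesce-after-debounce";
    case "interrupt":
      return "abort-and-start-newest";
  }
}
```

Only `steer` depends on `canSteer`; the other three modes never attempt steering.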
Burst example
If four users send messages while the agent is executing a tool call:
- With default behavior, the active runtime receives all four messages in arrival order before its next model decision. Pi drains them at the next model boundary; Codex receives them as one batched `turn/steer`.
- With `/queue collect`, OpenClaw does not steer. It waits until the active run ends, then creates a followup turn with compatible queued messages after the debounce window.
- With `/queue interrupt`, OpenClaw aborts the active run and starts the newest message instead of steering.
Scope
Steering always targets the current active session run. It does not create a new session, change the active run's tool policy, or split messages by sender. In multi-user channels, inbound prompts already include sender and route context, so the next model call can see who sent each message.
Use followup or collect when you want messages to queue by default instead
of steering the active run. Use interrupt when the newest prompt should
replace the active run.
Debounce
messages.queue.debounceMs applies to queued followup and collect delivery.
In steer mode with the native Codex harness, it also sets the quiet window
before sending batched turn/steer. For Pi, active steering itself does not use
the debounce timer because Pi naturally batches messages until the next model
boundary.
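As a compact restatement of when the debounce value applies, the rules above could be written like this. The function name `quietWindowMs` and the `Harness` and `DebouncedMode` types are hypothetical, introduced only for this sketch; the `debounceMs` argument stands for the configured `messages.queue.debounceMs` value.

```typescript
type DebouncedMode = "steer" | "followup" | "collect";
type Harness = "pi" | "codex";

// Returns the debounce window that applies, or undefined when no timer is used.
function quietWindowMs(
  mode: DebouncedMode,
  harness: Harness,
  debounceMs: number,
): number | undefined {
  // Queued followup and collect delivery always use the debounce window.
  if (mode === "followup" || mode === "collect") return debounceMs;
  // In steer mode, only the Codex harness waits for a quiet window before
  // sending the batched steering request. Pi batches at the next model boundary.
  return harness === "codex" ? debounceMs : undefined;
}
```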