openclaw/docs/concepts/compaction.md at 7008379ff085b4fa926f6419dfa8866d3b3ec2fe


mirror of https://github.com/openclaw/openclaw.git synced 2026-04-01 04:11:03 +00:00


Vincent Koc 143b4c54ba docs: simplify sessions/memory concept pages and fix QMD experimental label

2026-03-30 07:32:20 +09:00


summary: How OpenClaw summarizes long conversations to stay within model limits

read_when:

- You want to understand auto-compaction and /compact
- You are debugging long sessions hitting context limits

Compaction

Every model has a context window: the maximum number of tokens it can process in a single request. When a conversation approaches that limit, OpenClaw compacts older messages into a summary so the chat can continue.

How it works

1. Older conversation turns are summarized into a compact entry.
2. The summary is saved in the session transcript.
3. Recent messages are kept intact.

The full conversation history stays on disk. Compaction only changes what the model sees on the next turn.
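The steps above can be sketched roughly as follows. This is a minimal illustration, not OpenClaw's actual implementation: the `Message` type, the `compact` function, the `keepRecent` parameter, and the injected `summarize` callback are all hypothetical names chosen for the example.

```typescript
// Hypothetical data model for illustration; not OpenClaw's real types.
type Message = { role: "user" | "assistant" | "summary"; text: string };

interface CompactionResult {
  transcript: Message[]; // everything kept on disk, including the new summary entry
  modelView: Message[];  // what the model would see on the next turn
}

// Summarize everything except the last `keepRecent` messages.
// `summarize` stands in for a model call and is supplied by the caller.
function compact(
  history: Message[],
  keepRecent: number,
  summarize: (older: Message[]) => string,
): CompactionResult {
  const splitAt = Math.max(0, history.length - keepRecent);
  const older = history.slice(0, splitAt);
  const recent = history.slice(splitAt);

  // Nothing old enough to summarize: leave the session unchanged.
  if (older.length === 0) {
    return { transcript: history, modelView: history };
  }

  const summary: Message = { role: "summary", text: summarize(older) };
  return {
    // The full history stays in the transcript; the summary is appended to it.
    transcript: [...history, summary],
    // The model sees the summary in place of the older turns.
    modelView: [summary, ...recent],
  };
}
```

The key property the sketch tries to capture is that the transcript only grows, while the model's view shrinks to one summary entry plus the recent messages.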

Auto-compaction

Auto-compaction is on by default. It runs when the session nears the context limit, or when the model returns a context-overflow error (in which case OpenClaw compacts and retries).

Before compacting, OpenClaw automatically reminds the agent to save important notes to [memory](/concepts/memory) files. This reduces the risk of losing important details when older turns are summarized.
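The two triggers described in this section, plus the memory reminder, could be modeled along these lines. This is a sketch under stated assumptions rather than OpenClaw's real control flow: `SessionState`, `TurnHooks`, `runTurn`, the `reserveTokens` threshold, and the `ContextOverflowError` name are all hypothetical.

```typescript
// Hypothetical shapes for illustration; none of these names come from OpenClaw.
interface SessionState {
  estimatedTokens: number; // tokens in what the model would see next turn
  contextWindow: number;   // the model's token limit
}

interface TurnHooks<T> {
  callModel: () => T;            // may throw a context-overflow error
  remindToSaveNotes: () => void; // nudge the agent to write notes to memory
  compactSession: () => void;    // summarize older turns
}

function makeOverflowError(): Error {
  const err = new Error("context window exceeded");
  err.name = "ContextOverflowError";
  return err;
}

function isContextOverflow(err: unknown): boolean {
  return err instanceof Error && err.name === "ContextOverflowError";
}

function runTurn<T>(state: SessionState, reserveTokens: number, hooks: TurnHooks<T>): T {
  // The memory reminder always runs before the summary is produced.
  const compact = (): void => {
    hooks.remindToSaveNotes();
    hooks.compactSession();
  };

  // Trigger 1: the session is close to the context limit.
  if (state.estimatedTokens + reserveTokens >= state.contextWindow) {
    compact();
  }

  try {
    return hooks.callModel();
  } catch (err) {
    // Trigger 2: the model reported an overflow; compact and retry once.
    if (!isContextOverflow(err)) throw err;
    compact();
    return hooks.callModel();
  }
}
```

In this sketch the retry happens only once, so a second overflow propagates to the caller instead of looping.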

Manual compaction

Type /compact in any chat to force a compaction. Add instructions to guide the summary:

/compact Focus on the API design decisions

Using a different model

By default, compaction uses your agent's primary model. You can use a more capable model for better summaries:

{
  agents: {
    defaults: {
      compaction: {
        model: "openrouter/anthropic/claude-sonnet-4-6",
      },
    },
  },
}
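The fallback rule described above (use `agents.defaults.compaction.model` when set, otherwise the primary model) can be expressed as a small lookup. The config path mirrors the snippet above; the `ConfigSketch` interface and the `resolveCompactionModel` helper are hypothetical and only illustrate the rule.

```typescript
// Hypothetical partial config shape; only the path shown in the config above.
interface ConfigSketch {
  agents?: {
    defaults?: {
      compaction?: { model?: string };
    };
  };
}

// Return the configured compaction model, or fall back to the primary model.
function resolveCompactionModel(config: ConfigSketch, primaryModel: string): string {
  return config.agents?.defaults?.compaction?.model ?? primaryModel;
}
```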

Compaction vs pruning

| | Compaction | Pruning |
| --- | --- | --- |
| What it does | Summarizes older conversation | Trims old tool results |
| Saved? | Yes (in session transcript) | No (in-memory only, per request) |
| Scope | Entire conversation | Tool results only |

Session pruning is a lighter-weight complement that trims tool output without summarizing.
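To make the contrast with compaction concrete, here is a rough sketch of what per-request pruning might look like. It is an assumption-laden illustration, not OpenClaw's pruning logic: the `Entry` type, `pruneToolResults`, `keepLastTools`, `maxChars`, and the `[trimmed]` marker are all invented for the example.

```typescript
// Hypothetical transcript entry; not OpenClaw's real type.
type Entry = { role: "user" | "assistant" | "tool"; text: string };

// Build a trimmed copy of the transcript for a single request.
// The input array is never modified, so nothing is saved back to the session.
function pruneToolResults(
  transcript: readonly Entry[],
  keepLastTools: number, // most recent tool results left untouched
  maxChars: number,      // older tool results are cut to this length
): Entry[] {
  const toolIndexes: number[] = [];
  transcript.forEach((entry, i) => {
    if (entry.role === "tool") toolIndexes.push(i);
  });
  // Guard against slice(-0), which would return the whole array.
  const protectedIndexes = new Set(
    keepLastTools > 0 ? toolIndexes.slice(-keepLastTools) : [],
  );

  return transcript.map((entry, i) => {
    const isOldTool = entry.role === "tool" && !protectedIndexes.has(i);
    if (!isOldTool || entry.text.length <= maxChars) return entry;
    return { ...entry, text: entry.text.slice(0, maxChars) + " [trimmed]" };
  });
}
```

Unlike the compaction case, no summary is generated and user and assistant messages pass through unchanged; only old tool output gets shorter.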

Troubleshooting

Compacting too often? The model's context window may be small, or tool outputs may be large. Try enabling session pruning.

Context feels stale after compaction? Use /compact Focus on <topic> to guide the summary, or enable the memory flush so notes survive.

Need a clean slate? /new starts a fresh session without compacting.

For advanced configuration (reserve tokens, identifier preservation, custom context engines, OpenAI server-side compaction), see the Session Management Deep Dive.
