mirror of
https://github.com/openclaw/openclaw.git
synced 2026-04-13 02:01:16 +00:00
Auto-compaction never triggered for self-hosted llama.cpp HTTP servers (used directly or behind an OpenAI-compatible shim configured with `api: "openai-completions"`), because llama.cpp's native overflow wording isn't covered by any existing pattern in `isContextOverflowError()` or `matchesProviderContextOverflow()`. When the prompt overshoots a slot's `--ctx-size`, llama.cpp returns:

    400 request (66202 tokens) exceeds the available context size (65536 tokens), try increasing it

That message uses "context size" rather than "context length", says "request (N tokens)" instead of "input/prompt is too long", and the status code is 400 (not 413), so it slips past every existing string check and every regex in `PROVIDER_CONTEXT_OVERFLOW_PATTERNS`. The generic candidate pre-check passes, but the concrete provider regexes all miss, so the agent runner reports `surface_error reason=...` and the user gets the raw upstream error instead of compaction + retry.

This commit adds a llama.cpp-shaped pattern next to the existing Bedrock / Vertex / Ollama / Cohere ones in `PROVIDER_CONTEXT_OVERFLOW_PATTERNS`, plus four test cases: three parameterised messages exercising the new regex directly, and one end-to-end assertion that `isContextOverflowError()` now returns true for the verbatim message produced by llama.cpp's slot manager. The pattern is anchored on llama.cpp's stable slot-manager wording (`(?:request|prompt) (N tokens) exceeds (the )?available context size`) so it won't accidentally swallow unrelated provider errors.

Closes #64180

AI-assisted: drafted with Claude Code (Opus 4.6, 1M context).

Testing: targeted tests pass via `pnpm vitest run src/agents/pi-embedded-helpers/provider-error-patterns.test.ts` (26/26). A broader vitest run shows 2 unrelated failures in `group-policy.fallback.contract.test.ts` that are not touched by this change.