mirror of https://github.com/openclaw/openclaw.git synced 2026-05-06 06:00:43 +00:00

Go to file

wirjo 2a15a3bb53 fix(amazon-bedrock): add known model context windows to discovery (#65952 )

* fix(amazon-bedrock): add known model context windows to discovery

Bedrock's ListFoundationModels API does not expose token limits. Discovery
was hardcoding contextWindow: 32000 for every model, causing Claude (1M),
Nova (300K), and other models to hit premature 'Context limit exceeded'
errors and unnecessary session resets.

Adds a lookup table of known context windows for Bedrock models:
- Anthropic Claude: 200K-1M
- Amazon Nova: 128K-1M
- Meta Llama: 128K
- Mistral: 32K-128K
- DeepSeek: 128K
- Cohere: 128K
- AI21 Jamba: 256K

Inference profile prefixes (us., eu., ap., global.) are stripped before
lookup, so us.anthropic.claude-opus-4-6-v1 correctly resolves to 1M.

Also raises the default fallback from 32K to 128K for unknown models —
most modern models have at least 128K context.

Single file change, no type system modifications.

Complementary to #65030 (provenance flag for warning on unknown models).

Fixes #64919
Related: #64250

* add KNOWN_MAX_TOKENS map and expand model coverage

- Add KNOWN_MAX_TOKENS lookup table with Bedrock-optimized values that
  balance response quality against quota burndown (5x rate for Claude 3.7+)
- Add missing models to KNOWN_CONTEXT_WINDOWS: Opus 4.7 (1M), Opus 4.1/4.5,
  Sonnet 4, Claude 3/3.5 Haiku, DeepSeek V3/V3.2, Google Gemma 3
- Refactor prefix-stripping into shared resolveKnownValue() helper
- Fix: use !== undefined instead of truthy check for table lookups
- Wire resolveKnownMaxTokens into toModelDefinition and resolveInferenceProfiles

Quota burndown context: Bedrock reserves input_tokens + max_tokens from
TPM at request start. For Claude 3.7+, output burns at 5x. The values
in KNOWN_MAX_TOKENS are intentionally conservative (8-16K for Claude)
to maximize concurrent throughput while still allowing useful responses.
Thinking budget is added separately by the runtime.

* remove KNOWN_MAX_TOKENS — maxTokens should be handled upstream

Remove the KNOWN_MAX_TOKENS map. Hardcoding maxTokens values in
discovery is the wrong layer to solve this — any explicit value
still gets reserved against Bedrock's TPM quota at request start.

The correct fix is upstream in pi's Bedrock provider: omit maxTokens
from inferenceConfig when not explicitly set, letting the model use
its internal default. This avoids quota waste entirely.

See: badlogic/pi-mono#3399 and badlogic/pi-mono#3400

Keep the expanded KNOWN_CONTEXT_WINDOWS (context windows ARE the
right thing to set in discovery — they affect compaction thresholds
and session management, not API-level quota reservation).

* docs: clarify why hardcoded context windows are needed

Bedrock's ListFoundationModels and GetFoundationModel APIs return no
token limit information — there is no Bedrock API to discover context
windows or max output tokens programmatically. Note that this table
should become a fallback if AWS adds token metadata in the future.

* fix: add au and apac to inference profile prefix regex

Add missing geo prefixes discovered by querying inference profiles
across multiple regions:
- au. (Australia/NZ, used in ap-southeast-2/4/6)
- apac. (Asia-Pacific, used for older models in ap-northeast-1)

Both resolveKnownContextWindow and resolveBaseModelId now handle
all known prefixes: us, eu, ap, apac, au, jp, global.

* test: port au. prefix test from #65449 by @alickgithub2, add apac. coverage

Port the Australia/NZ inference profile test from PR #65449
(credit: @alickgithub2) and extend it to also cover the apac.
prefix discovered in ap-northeast-1.

* expand model coverage: Llama 4, MiniMax, NVIDIA, Mistral 3, GLM, Qwen

Cross-referenced KNOWN_CONTEXT_WINDOWS against live
list-foundation-models API. Added missing models:
- Llama 4 Maverick (1M) and Scout (512K)
- MiniMax M2/M2.1/M2.5 (1M)
- NVIDIA Nemotron Super/Nano variants (128K)
- Mistral Large 3 675B (128K)
- GLM 4.7/4.7-flash/5 (128K)
- Qwen3 Coder/32B/VL (128-256K)

Removed deprecated deepseek.v3-v1:0 and claude-opus-4-20250514
(not in active foundation models list).

* raise default context window from 128K to 200K

200K matches the floor for all current Claude models (the most
popular on Bedrock). Every other active model with a lower actual
limit is already in the explicit table. This ensures new Claude
models get a correct default without requiring a table update.

* test: update discovery test expectations for known context window values

* test: fix remaining contextWindow expectation (default 200K)

* fix(amazon-bedrock): keep conservative context fallback

* docs(changelog): note Bedrock context window fix

* fix(amazon-bedrock): normalize known context fallback

---------

Co-authored-by: Vincent Koc <vincentkoc@ieee.org>

2026-04-22 15:53:41 -07:00

.agents/skills

test: harden parallels smoke harness

2026-04-22 22:01:04 +01:00

.github

ci: rotate stuck build-smoke queue

2026-04-22 21:59:48 +01:00

.pi

chore(pi): remove local pr prompts

2026-04-22 13:38:47 +03:00

.vscode

chore: disable makefile configure on open

2026-04-18 22:19:32 +01:00

apps

refactor: remove plugin tool display overrides from core

2026-04-22 06:43:48 +01:00

assets

…

docs

fix(telegram): isolate direct chat sandbox sessions

2026-04-22 23:46:34 +01:00

extensions

fix(amazon-bedrock): add known model context windows to discovery (#65952 )

2026-04-22 15:53:41 -07:00

git-hooks

feat: add changed-lane local gate

2026-04-20 15:48:20 +01:00

packages

test: use synthetic ui channel fixtures

2026-04-20 23:54:59 +01:00

patches

fix(whatsapp): harden Baileys media upload hotfix (#65966 )

2026-04-14 21:34:23 +08:00

test(qa): add OpenAI native web search live scenario

2026-04-22 23:06:55 +01:00

scripts

ci: rebalance extension test shards

2026-04-22 23:29:34 +01:00

skills

fix(skills): remove unused model-usage import (#67641 )

2026-04-16 19:56:34 +08:00

src

fix(amazon-bedrock-mantle): refresh IAM bearer token via resolveConfigApiKey cache lookup (#68903 )

2026-04-22 15:52:24 -07:00

Swabble

…

test

ci: rebalance extension test shards

2026-04-22 23:29:34 +01:00

test-fixtures

test: sync gateway and config expectations

2026-04-07 08:05:32 +01:00

fix(gateway): require auth for control ui bootstrap config (#70247 )

2026-04-22 16:52:08 -06:00

vendor/a2ui

…

.codex

fix(browser): block SSRF redirect bypass via real-time route interception (#58771 )

2026-04-02 09:07:57 -07:00

.detect-secrets.cfg

…

.dockerignore

Config: separate core/plugin baseline entries (#60162 )

2026-04-03 18:26:23 +09:00

.env.example

feat(tencent): add bundled Tencent Cloud provider plugin (Tokenhub + Token Plan) (#68460 )

2026-04-21 21:59:22 -07:00

.gitattributes

…

.gitignore

ci: isolate mlx from macos swift checks

2026-04-22 02:12:07 +01:00

.jscpd.json

…

.mailmap

…

.markdownlint-cli2.jsonc

…

.npmignore

…

.npmrc

…

.oxfmtrc.jsonc

build: keep a2ui bundle generated

2026-04-11 14:18:04 +01:00

.oxlintrc.json

chore(lint): enable additional cleanup rules

2026-04-18 20:37:13 +01:00

.pre-commit-config.yaml

fix(ci): replace retired pnpm audit hook

2026-04-15 01:10:07 +01:00

.prettierignore

…

.secrets.baseline

…

.shellcheckrc

…

.swiftformat

…

.swiftlint.yml

…

AGENTS.md

chore(agents): prefer local validation over testbox

2026-04-21 22:37:03 -07:00

appcast.xml

chore: update appcast for 2026.4.20

2026-04-21 21:01:19 +01:00

CHANGELOG.md

fix(amazon-bedrock): add known model context windows to discovery (#65952 )

2026-04-22 15:53:41 -07:00

CLAUDE.md

…

CONTRIBUTING.md

docs: fix stale community links in README and CONTRIBUTING (#69945 )

2026-04-21 22:47:16 -07:00

docker-compose.yml

…

docker-setup.sh

…

Dockerfile

fix(docker): verify matrix-sdk-crypto native addon without hardcoded pnpm path (#65608 ) (#67143 )

2026-04-15 11:37:14 -04:00

Dockerfile.sandbox

build(deps): bump debian sandbox image digest (#39403 )

2026-04-20 20:22:13 +01:00

Dockerfile.sandbox-browser

build(deps): bump debian sandbox image digest (#39403 )

2026-04-20 20:22:13 +01:00

Dockerfile.sandbox-common

…

docs.acp.md

…

dream-diary-preview-v2.html

style(preview): format dream diary preview files

2026-04-06 16:16:10 +01:00

dream-diary-preview-v3.html

style(preview): format dream diary preview files

2026-04-06 16:16:10 +01:00

fix2.py

fix(heartbeat): preserve HEARTBEAT.md directives in task-mode prompt

2026-04-04 15:09:48 +01:00

fly.private.toml

…

fly.toml

…

INCIDENT_RESPONSE.md

Update INCIDENT_RESPONSE.md

2026-04-11 21:22:40 +01:00

knip.config.ts

chore(deadcode): fix knip scan config

2026-04-06 16:13:26 +01:00

LICENSE

…

Makefile

…

openclaw.mjs

…

openclaw.podman.env

…

package.json

test(cron): add docker mcp cleanup e2e

2026-04-22 23:12:18 +01:00

pnpm-lock.yaml

build(deps): bump fast-xml-parser override

2026-04-22 22:45:57 +01:00

pnpm-workspace.yaml

fix(discord): avoid native opus install path (#69339 )

2026-04-20 15:25:07 +01:00

pyproject.toml

…

README.md

docs: fix stale community links in README and CONTRIBUTING (#69945 )

2026-04-21 22:47:16 -07:00

render.yaml

…

SECURITY.md

docs: clarify dependency parser advisory triage

2026-04-20 20:13:37 +01:00

setup-podman.sh

…

tsconfig.core.json

perf: parallelize local check gate

2026-04-20 13:55:55 +01:00

tsconfig.core.projects.json

perf: speed up type check gate

2026-04-20 13:17:43 +01:00

tsconfig.core.test.agents.json

refactor: enforce plugin-owned channel boundaries

2026-04-18 22:48:27 +01:00

tsconfig.core.test.json

perf: parallelize local check gate

2026-04-20 13:55:55 +01:00

tsconfig.core.test.non-agents.json

refactor: enforce plugin-owned channel boundaries

2026-04-18 22:48:27 +01:00

tsconfig.extensions.json

perf: parallelize local check gate

2026-04-20 13:55:55 +01:00

tsconfig.extensions.projects.json

perf: speed up type check gate

2026-04-20 13:17:43 +01:00

tsconfig.extensions.test.json

perf: parallelize local check gate

2026-04-20 13:55:55 +01:00

tsconfig.json

perf: trim tsgo input graph

2026-04-10 15:56:56 +01:00

tsconfig.oxlint.core.json

perf: parallelize local check gate

2026-04-20 13:55:55 +01:00

tsconfig.oxlint.extensions.json

perf: parallelize local check gate

2026-04-20 13:55:55 +01:00

tsconfig.oxlint.json

chore: update dependencies and oxc tooling

2026-04-10 19:28:42 +01:00

tsconfig.oxlint.scripts.json

perf: parallelize local check gate

2026-04-20 13:55:55 +01:00

tsconfig.plugin-sdk.dts.json

build: narrow plugin SDK declaration build

2026-04-08 20:00:51 +01:00

tsconfig.projects.json

perf: speed up type check gate

2026-04-20 13:17:43 +01:00

tsconfig.test.json

build: split tsgo prod and test graphs

2026-04-18 18:06:29 +01:00

tsconfig.test.packages.json

build: add targeted tsgo test graphs

2026-04-18 18:12:44 +01:00

tsconfig.test.src.json

build: add targeted tsgo test graphs

2026-04-18 18:12:44 +01:00

tsconfig.test.ui.json

build: add targeted tsgo test graphs

2026-04-18 18:12:44 +01:00

tsdown.config.ts

build: exclude private QA from npm package

2026-04-15 09:39:51 -07:00

VISION.md

…

vitest.config.ts

test: move Vitest configs under test

2026-04-10 13:44:51 +01:00

zizmor.yml

…

README.md

🦞 OpenClaw — Personal AI Assistant

EXFOLIATE! EXFOLIATE!

OpenClaw is a personal AI assistant you run on your own devices. It answers you on the channels you already use. It can speak and listen on macOS/iOS/Android, and can render a live Canvas you control. The Gateway is just the control plane — the product is the assistant.

If you want a personal, single-user assistant that feels local, fast, and always-on, this is it.

Supported channels include: WhatsApp, Telegram, Slack, Discord, Google Chat, Signal, iMessage, BlueBubbles, IRC, Microsoft Teams, Matrix, Feishu, LINE, Mattermost, Nextcloud Talk, Nostr, Synology Chat, Tlon, Twitch, Zalo, Zalo Personal, WeChat, QQ, WebChat.

Website · Docs · Vision · DeepWiki · Getting Started · Updating · Showcase · FAQ · Onboarding · Nix · Docker · Discord

New install? Start here: Getting started

Preferred setup: run openclaw onboard in your terminal. OpenClaw Onboard guides you step by step through setting up the gateway, workspace, channels, and skills. It is the recommended CLI setup path and works on macOS, Linux, and Windows (via WSL2; strongly recommended). Works with npm, pnpm, or bun.

Install (recommended)

Runtime: Node 24 (recommended) or Node 22.16+.

npm install -g openclaw@latest
# or: pnpm add -g openclaw@latest

openclaw onboard --install-daemon

OpenClaw Onboard installs the Gateway daemon (launchd/systemd user service) so it stays running.

Quick start (TL;DR)

Runtime: Node 24 (recommended) or Node 22.16+.

Full beginner guide (auth, pairing, channels): Getting started

openclaw onboard --install-daemon

openclaw gateway --port 18789 --verbose

# Send a message
openclaw message send --to +1234567890 --message "Hello from OpenClaw"

# Talk to the assistant (optionally deliver back to any connected channel: WhatsApp/Telegram/Slack/Discord/Google Chat/Signal/iMessage/BlueBubbles/IRC/Microsoft Teams/Matrix/Feishu/LINE/Mattermost/Nextcloud Talk/Nostr/Synology Chat/Tlon/Twitch/Zalo/Zalo Personal/WeChat/QQ/WebChat)
openclaw agent --message "Ship checklist" --thinking high

Upgrading? Updating guide (and run openclaw doctor).

Models config + CLI: Models. Auth profile rotation + fallbacks: Model failover.

Security defaults (DM access)

OpenClaw connects to real messaging surfaces. Treat inbound DMs as untrusted input.

Full security guide: Security

Default behavior on Telegram/WhatsApp/Signal/iMessage/Microsoft Teams/Discord/Google Chat/Slack:

DM pairing (dmPolicy="pairing" / channels.discord.dmPolicy="pairing" / channels.slack.dmPolicy="pairing"; legacy: channels.discord.dm.policy, channels.slack.dm.policy): unknown senders receive a short pairing code and the bot does not process their message.
Approve with: openclaw pairing approve <channel> <code> (then the sender is added to a local allowlist store).
Public inbound DMs require an explicit opt-in: set dmPolicy="open" and include "*" in the channel allowlist (allowFrom / channels.discord.allowFrom / channels.slack.allowFrom; legacy: channels.discord.dm.allowFrom, channels.slack.dm.allowFrom).

Run openclaw doctor to surface risky/misconfigured DM policies.

Highlights

Local-first Gateway — single control plane for sessions, channels, tools, and events.
Multi-channel inbox — WhatsApp, Telegram, Slack, Discord, Google Chat, Signal, BlueBubbles (iMessage), iMessage (legacy), IRC, Microsoft Teams, Matrix, Feishu, LINE, Mattermost, Nextcloud Talk, Nostr, Synology Chat, Tlon, Twitch, Zalo, Zalo Personal, WeChat, QQ, WebChat, macOS, iOS/Android.
Multi-agent routing — route inbound channels/accounts/peers to isolated agents (workspaces + per-agent sessions).
Voice Wake + Talk Mode — wake words on macOS/iOS and continuous voice on Android (ElevenLabs + system TTS fallback).
Live Canvas — agent-driven visual workspace with A2UI.
First-class tools — browser, canvas, nodes, cron, sessions, and Discord/Slack actions.
Companion apps — macOS menu bar app + iOS/Android nodes.
Onboarding + skills — onboarding-driven setup with bundled/managed/workspace skills.

Security model (important)

Default: tools run on the host for the main session, so the agent has full access when it is just you.
Group/channel safety: set agents.defaults.sandbox.mode: "non-main" to run non-main sessions inside sandboxes. Docker is the default sandbox backend; SSH and OpenShell backends are also available.
Typical sandbox default: allow bash, process, read, write, edit, sessions_list, sessions_history, sessions_send, sessions_spawn; deny browser, canvas, nodes, cron, discord, gateway.
Before exposing anything remotely, read Security, Sandboxing, and Configuration.

Operator quick refs

Chat commands: /status, /new, /reset, /compact, /think <level>, /verbose on|off, /trace on|off, /usage off|tokens|full, /restart, /activation mention|always
Session tools: sessions_list, sessions_history, sessions_send
Skills registry: ClawHub
Architecture overview: Architecture

Docs by goal

New here: Getting started, Onboarding, Updating
Channel setup: Channels index, WhatsApp, Telegram, Discord, Slack
Apps + nodes: macOS, iOS, Android, Nodes
Config + security: Configuration, Security, Sandboxing
Remote + web: Gateway, Remote access, Tailscale, Web surfaces
Tools + automation: Tools, Skills, Cron jobs, Webhooks, Gmail Pub/Sub
Internals: Architecture, Agent, Session model, Gateway protocol
Troubleshooting: Channel troubleshooting, Logging, Docs home

Apps (optional)

The Gateway alone delivers a great experience. All apps are optional and add extra features.

If you plan to build/run companion apps, follow the platform runbooks below.

macOS (OpenClaw.app) (optional)

Menu bar control for the Gateway and health.
Voice Wake + push-to-talk overlay.
WebChat + debug tools.
Remote gateway control over SSH.

Note: signed builds required for macOS permissions to stick across rebuilds (see macOS Permissions).

iOS node (optional)

Pairs as a node over the Gateway WebSocket (device pairing).
Voice trigger forwarding + Canvas surface.
Controlled via openclaw nodes ….

Runbook: iOS connect.

Android node (optional)

Pairs as a WS node via device pairing (openclaw devices ...).
Exposes Connect/Chat/Voice tabs plus Canvas, Camera, Screen capture, and Android device command families.
Runbook: Android connect.

From source (development)

Prefer pnpm for builds from source. Bun is optional for running TypeScript directly.

For the dev loop:

git clone https://github.com/openclaw/openclaw.git
cd openclaw

pnpm install

# First run only (or after resetting local OpenClaw config/workspace)
pnpm openclaw setup

# Optional: prebuild Control UI before first startup
pnpm ui:build

# Dev loop (auto-reload on source/config changes)
pnpm gateway:watch

If you need a built dist/ from the checkout (for Node, packaging, or release validation), run:

pnpm build
pnpm ui:build

pnpm openclaw setup writes the local config/workspace needed for pnpm gateway:watch. It is safe to re-run, but you normally only need it on first setup or after resetting local state. pnpm gateway:watch does not rebuild dist/control-ui, so rerun pnpm ui:build after ui/ changes or use pnpm ui:dev when iterating on the Control UI. If you want this checkout to run onboarding directly, use pnpm openclaw onboard --install-daemon.

Note: pnpm openclaw ... runs TypeScript directly (via tsx). pnpm build produces dist/ for running via Node / the packaged openclaw binary, while pnpm gateway:watch rebuilds the runtime on demand during the dev loop.

Development channels

stable: tagged releases (vYYYY.M.D or vYYYY.M.D-<patch>), npm dist-tag latest.
beta: prerelease tags (vYYYY.M.D-beta.N), npm dist-tag beta (macOS app may be missing).
dev: moving head of main, npm dist-tag dev (when published).

Switch channels (git + npm): openclaw update --channel stable|beta|dev. Details: Development channels.

Agent workspace + skills

Workspace root: ~/.openclaw/workspace (configurable via agents.defaults.workspace).
Injected prompt files: AGENTS.md, SOUL.md, TOOLS.md.
Skills: ~/.openclaw/workspace/skills/<skill>/SKILL.md.

Configuration

Minimal ~/.openclaw/openclaw.json (model + defaults):

{
  agent: {
    model: "<provider>/<model-id>",
  },
}

Full configuration reference (all keys + examples).

Star History

Molty

OpenClaw was built for Molty, a space lobster AI assistant. 🦞 by Peter Steinberger and the community.

Community

See CONTRIBUTING.md for guidelines, maintainers, and how to submit PRs. AI/vibe-coded PRs welcome! 🤖

Special thanks to Mario Zechner for his support and for pi-mono. Special thanks to Adam Doppelt for the lobster.bot domain.

Thanks to all clawtributors:

Languages

TypeScript 82.8%

JavaScript 11.1%

Swift 3.8%

Kotlin 0.9%

Shell 0.7%

Other 0.5%

README.md

🦞 OpenClaw — Personal AI Assistant

Sponsors

Install (recommended)

Quick start (TL;DR)

Security defaults (DM access)

Highlights

Security model (important)

Operator quick refs

Docs by goal

Apps (optional)

macOS (OpenClaw.app) (optional)

iOS node (optional)

Android node (optional)

From source (development)

Development channels

Agent workspace + skills

Configuration

Star History

Molty

Community