Commit Graph

16761 Commits

Author SHA1 Message Date
Peter Steinberger
deb212d3b0 fix(openai): tighten gpt chat action turns 2026-04-05 12:53:35 +01:00
Peter Steinberger
59a243e46b test: stabilize provider discovery matrix cases 2026-04-05 12:53:24 +01:00
Peter Steinberger
bc7f845714 test: speed up focused pi-tools tool tests 2026-04-05 12:53:15 +01:00
Peter Steinberger
dbcd35f6c2 test: decouple pi-tools params test imports 2026-04-05 12:53:15 +01:00
wirjo
78fe96f2d4 feat(bedrock): add inference profile discovery and region injection (#61299)
* feat(bedrock): add inference profile discovery and region injection

Inference profiles (cross-region and application) work with ConverseStream
but require the SDK client region to match the profile region. Without
this, users get "The provided model identifier is invalid" errors when
using cross-region profiles like us.anthropic.claude-sonnet-4-6.

Changes:

1. Inference profile discovery (discovery.ts):
   - Call ListInferenceProfiles alongside ListFoundationModels (parallel)
   - Inference profiles INHERIT capabilities from their underlying
     foundation model (modalities, reasoning, context window, cost)
   - resolveBaseModelId() maps profile → foundation model:
     "us.anthropic.claude-sonnet-4-6" → "anthropic.claude-sonnet-4-6"
     Application ARNs → extract model ID from models[].modelArn
   - Graceful degradation if IAM lacks bedrock:ListInferenceProfiles
   - Provider filter applies to profiles via underlying model ARNs

2. Region injection (register.sync.runtime.ts):
   - Extract region from provider baseUrl or bedrockDiscovery.region
   - Pass through to pi-ai options.region in wrapStreamFn
   - Ensures SDK client connects to correct regional endpoint

3. Inference profile model detection (anthropic-family-cache-semantics.ts):
   - isAnthropicBedrockModel() now recognizes application inference
     profile ARNs (arn:aws:bedrock:...:application-inference-profile/*)

4. Tests (discovery.test.ts):
   - New: inference profile inheritance test (4 models: 1 foundation +
     3 profiles, verifies capability inheritance, inactive filtering)
   - New: graceful AccessDeniedException handling test
   - Updated: all existing tests for dual-API discovery pattern

Fixes #55642

* fix(bedrock): preserve inference profile model lookup

---------

Co-authored-by: Vincent Koc <vincentkoc@ieee.org>
2026-04-05 12:52:03 +01:00
Peter Steinberger
996dccb19c feat(agents): add structured execution item events 2026-04-05 12:36:33 +01:00
Peter Steinberger
3b7e6152d1 fix: retry reasoning-required compaction with minimal thinking 2026-04-05 12:35:50 +01:00
Mariano
4175caee6e fix(agents): suppress commentary-phase output leaks (#61282)
Merged via squash.

Prepared head SHA: e392904f73
Co-authored-by: mbelinky <132747814+mbelinky@users.noreply.github.com>
Co-authored-by: mbelinky <132747814+mbelinky@users.noreply.github.com>
Reviewed-by: @mbelinky
2026-04-05 13:04:12 +02:00
Peter Steinberger
1bf5339b98 fix(gateway): pin startup channel registry 2026-04-05 12:01:03 +01:00
Vincent Koc
cfe66c6e02 test(contracts): guard config footprint regressions 2026-04-05 11:48:40 +01:00
Peter Steinberger
fef155cdbc fix: tighten file tool schemas for openai 2026-04-05 11:46:34 +01:00
Vincent Koc
63db3443f1 fix(plugin-sdk): prefer canonical private-network opt-in 2026-04-05 11:45:09 +01:00
Peter Steinberger
0f58cef75e fix: finalize facade re-entry landing cleanup (#61180) (thanks @adam91holt) 2026-04-05 11:42:29 +01:00
Peter Steinberger
66c047ddc3 fix: refresh facade re-entry landing delta (#61180) (thanks @adam91holt) 2026-04-05 11:40:51 +01:00
Adam Holt
3a957cfe8b fix: resolve circular re-entry in facade module loading
resolveTrackedFacadePluginId triggers config loading (plugin auto-enable,
channel discovery) which can re-enter loadBundledPluginPublicSurfaceModuleSync
for the same module. Because the sentinel was still empty at that point,
re-entrant callers saw undefined exports (e.g. shouldNormalizeGoogleProviderConfig).

Move Object.assign(sentinel, loaded) before the plugin ID resolution so any
re-entrant lookup through the cached sentinel finds the real exports.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-05 11:40:32 +01:00
Peter Steinberger
495ebd28a4 fix(ci): route zai endpoint constants through plugin sdk 2026-04-05 11:30:58 +01:00
Peter Steinberger
064f474ed7 test: isolate anthropic vertex provider env cases 2026-04-05 11:28:59 +01:00
Vincent Koc
fd0cc90427 fix(plugin-sdk): resolve facade post-load re-entry (#61286) 2026-04-05 11:25:36 +01:00
Peter Steinberger
4559ece355 fix(ci): align test fixtures with current runner types 2026-04-05 11:23:51 +01:00
Chunyue Wang
8c1ca1f245 fix(cron): remove OpenAPI 3.0 incompatible JSON Schema keywords from cron tool (#61221)
The cron tool schema used type arrays (['string','null']), the 'not'
keyword, and 'const' — all unsupported by the OpenAPI 3.0 subset that
Gemini-backed providers (e.g. GitHub Copilot) enforce. This caused
HTTP 400 for every request when cron was enabled.

Replace type arrays with scalar types, remove not/const from
CronFailureAlertSchema, and add 'not' to the Gemini unsupported
keywords list as defense-in-depth.

Fixes #61206
2026-04-05 11:21:45 +01:00
Peter Steinberger
2d7ec1b641 refactor: split zai config sdk seam 2026-04-05 11:19:05 +01:00
Peter Steinberger
be526d6423 refactor: split provider stream sdk seams 2026-04-05 11:19:05 +01:00
Peter Steinberger
af81ee9fee fix(agents): add embedded item lifecycle events 2026-04-05 11:16:28 +01:00
Peter Steinberger
fd917a471c test: scope implicit provider discovery harness 2026-04-05 11:12:41 +01:00
Vincent Koc
3b84884793 fix(agents): harden host-managed claude-cli auth path (#61276) 2026-04-05 11:02:18 +01:00
Peter Steinberger
afca9540bf fix: add openai responses phase support 2026-04-05 10:58:49 +01:00
Vincent Koc
1d736dcbbc fix(ci): drop unused google prompt cache type 2026-04-05 10:49:51 +01:00
Vincent Koc
64cf52ca20 fix(tool-display): generate swift snapshot from core config 2026-04-05 10:34:02 +01:00
Peter Steinberger
e468da1040 fix: improve gpt execution flow and visibility 2026-04-05 10:32:58 +01:00
Peter Steinberger
b3d73b648b test: fix hook-alias runtime coverage after rebase (#61234) 2026-04-05 18:19:33 +09:00
Tuyen
cd348659ce Anthropic: fix claude-cli runtime auth 2026-04-05 18:19:33 +09:00
Peter Steinberger
9d315cdf42 test: default vitest lanes to isolated forks 2026-04-05 10:11:33 +01:00
Onur
d4e06d1249 Revert "[codex] Reproduce session stall and restart drain bugs (#61225)" (#61265)
This reverts commit 83d29dae2b.
2026-04-05 11:10:20 +02:00
Vincent Koc
d5cde2171b fix(agents): surface disk full session write errors (#61264) 2026-04-05 10:09:42 +01:00
Bob
83d29dae2b [codex] Reproduce session stall and restart drain bugs (#61225)
* Tests: reproduce session stall and drain bugs

* Docs: add reply lifecycle unification plan

* Docs: lock down reply lifecycle plan

* Delete docs/experiments/plans/reply-lifecycle-unification.md

---------

Co-authored-by: Onur <2453968+osolmaz@users.noreply.github.com>
Co-authored-by: Vincent Koc <vincentkoc@ieee.org>
2026-04-05 10:05:40 +01:00
wesley
1030b498de fix(acpx): retry persisted resume ids cleanly (#52209)
* fix(acpx): store agent session ID when session/load fails

When an ACP agent (e.g. Gemini CLI) rejects the acpx-generated session
ID via session/load and falls back to session/new, the agent-returned
session ID was previously discarded. This caused identity stuck at
pending forever, multi-turn failures, lost completion events, and
persistent reconcile warnings.

- Parse ACP protocol stream in runTurn() to capture agent session IDs
- Flip resolveRuntimeResumeSessionId() to prefer agentSessionId
- Add createIdentityFromHandleEvent() for handle-sourced identity
- Layer handle event identity before status in reconcile
- Add regression tests for load fallback and restart resume

Closes #52182

* ACPX: prefer decoded session ids

* ACPX: refresh runtime handle state from status

---------

Co-authored-by: Wesley <imwyvern@users.noreply.github.com>
2026-04-05 10:01:59 +01:00
Vincent Koc
cc09171929 fix(ci): align sanitize session history tests with transcript types 2026-04-05 10:01:29 +01:00
Vincent Koc
f1f8fd5970 fix(plugins): drop dead runtime helper 2026-04-05 09:59:50 +01:00
Peter Steinberger
4a85810091 fix: migrate bedrock discovery config in doctor 2026-04-05 09:55:55 +01:00
Peter Steinberger
19de5d1b56 refactor: move provider discovery config into plugins 2026-04-05 09:55:55 +01:00
Vincent Koc
4613f121ad fix(agents): preserve native Anthropic replay tool ids (#61254)
* fix(agents): preserve native Anthropic replay tool ids

* docs(changelog): note native Anthropic replay ids

* fix(agents): preserve native Anthropic replay ids selectively
2026-04-05 09:53:52 +01:00
Peter Steinberger
a9c52dd935 test(gateway): cover claude cli bootstrap injection 2026-04-05 17:51:41 +09:00
Peter Steinberger
3d952aa35d fix(agents): preserve claude cli backend defaults 2026-04-05 17:51:41 +09:00
Daev Mithran
03be4c2489 fix(plugin-sdk): export missing context-engine types (#61251)
* fix(plugin-sdk): export missing context-engine types

Signed-off-by: DaevMithran <daevmithran1999@gmail.com>

* build(plugin-sdk): refresh api baseline hash

* docs(changelog): note context engine sdk exports

---------

Signed-off-by: DaevMithran <daevmithran1999@gmail.com>
Co-authored-by: Vincent Koc <vincentkoc@ieee.org>
2026-04-05 09:49:19 +01:00
Peter Steinberger
48653c2031 fix: recover launchd restart and restore prompt-cache gate 2026-04-05 17:47:07 +09:00
Peter Steinberger
48f0d9aaf7 fix: normalize telegram announce thread targets 2026-04-05 09:44:20 +01:00
Vincent Koc
3a9569ff38 fix(ci): keep self-hosted setup out of plugin internals 2026-04-05 09:43:35 +01:00
Peter Steinberger
647fc7bfec refactor(plugins): unify explicit provider ownership loading 2026-04-05 09:38:04 +01:00
Peter Steinberger
f9f44b9b96 fix(models): restore anthropic cli auth login 2026-04-05 09:36:47 +01:00
Vincent Koc
50ed91a589 fix(ci): wire heartbeat busy-lane test reply mock 2026-04-05 09:35:55 +01:00