Commit Graph

379 Commits

Author SHA1 Message Date
Peter Steinberger
9b2b22f350 feat: add vydra media provider 2026-04-06 02:21:51 +01:00
Peter Steinberger
a9f491310c fix: route comfy music through shared tool 2026-04-06 02:03:13 +01:00
Peter Steinberger
f6dbcf4cda docs: document music generation async flow 2026-04-06 01:49:58 +01:00
Peter Steinberger
aeb9ad52fa feat: add comfy workflow media support 2026-04-06 01:45:01 +01:00
wirjo
0793136c63 feat(bedrock-mantle): add IAM credential auth via @aws/bedrock-token-… (#61563)
* feat(bedrock-mantle): add IAM credential auth via @aws/bedrock-token-generator

Mantle previously required a manually-created API key (AWS_BEARER_TOKEN_BEDROCK).
This adds automatic bearer token generation from IAM credentials using the
official @aws/bedrock-token-generator package.

Auth priority:
1. Explicit AWS_BEARER_TOKEN_BEDROCK env var (manual API key from Console)
2. IAM credentials via getTokenProvider() → Bearer token (instance roles,
   SSO profiles, access keys, EKS IRSA, ECS task roles)

Token is cached in memory (1hr TTL, generated with 2hr validity) and in
process.env.AWS_BEARER_TOKEN_BEDROCK for downstream sync reads.

Falls back gracefully when package is not installed or credentials are
unavailable — Mantle provider simply not registered.

Closes #45152

* fix(bedrock-mantle): harden IAM auth

---------

Co-authored-by: Vincent Koc <vincentkoc@ieee.org>
2026-04-06 01:41:24 +01:00
Peter Steinberger
9b00008561 docs(openai): clarify gpt-5.4 fast mode 2026-04-06 01:20:52 +01:00
wirjo
699b2320a8 feat(memory): add Bedrock embedding provider for memory search (#61547)
* feat(memory): add Bedrock embedding provider for memory search

Add Amazon Bedrock as a native embedding provider for memory search.
Supports Titan Embed Text v1/v2 and Cohere Embed models via AWS SDK.

- New embeddings-bedrock.ts: BedrockRuntimeClient + InvokeModel
- Auth via AWS default credential chain (same as Bedrock inference)
- Auto-selected in 'auto' mode when AWS credentials are detected
- Titan V2: configurable dimensions (256/512/1024), normalization
- Cohere: native batch support with search_query/search_document types
- 16 new tests covering all model types, auth detection, edge cases

Closes #26289

* fix(memory): harden bedrock embedding selection

---------

Co-authored-by: Vincent Koc <vincentkoc@ieee.org>
2026-04-06 01:19:56 +01:00
Vincent Koc
f4cd1a3782 docs: rewrite video generation docs for readability 2026-04-06 01:19:44 +01:00
Peter Steinberger
4bb965e007 docs(providers): surface new video provider pages 2026-04-06 01:02:59 +01:00
Peter Steinberger
ad6c584ce7 fix: ignore unsupported video generation overrides 2026-04-06 01:02:10 +01:00
Peter Steinberger
379bc1c032 docs(video): document runway support 2026-04-06 00:50:32 +01:00
Peter Steinberger
6d34a1c814 fix(video): queue fal provider jobs 2026-04-06 00:12:47 +01:00
Peter Steinberger
c6d3ee70e2 docs(providers): unify qwen docs 2026-04-05 23:23:58 +01:00
Peter Steinberger
a234157337 docs(providers): link generation guides 2026-04-05 23:21:14 +01:00
Peter Steinberger
f30c087fdf docs(providers): add generation setup pages 2026-04-05 23:21:14 +01:00
Peter Steinberger
05d351c430 refactor(cli): remove bundled cli text providers 2026-04-05 18:46:36 +01:00
Peter Steinberger
5790435975 feat(agents): add video_generate tool 2026-04-05 18:44:06 +01:00
Peter Steinberger
fe93f29486 docs(anthropic): clarify api key and doctor recovery 2026-04-05 18:05:12 +01:00
Peter Steinberger
7075da59bd feat: allow occasional emoji in friendly openai overlay 2026-04-05 16:56:25 +01:00
Peter Steinberger
d25609bc06 fix: default OpenAI personality overlay to friendly 2026-04-05 16:15:08 +01:00
Peter Steinberger
dfd39a81d8 feat(openai): add opt-in GPT personality 2026-04-05 15:25:06 +01:00
Vincent Koc
fc9648b620 docs: add Bedrock inference profiles and Bedrock Mantle provider coverage, re-sort changelog 2026-04-05 13:04:47 +01:00
Vincent Koc
852e8f7a2a docs: update Claude CLI backend docs for MCP bridge, streaming, and auth changes 2026-04-05 10:54:11 +01:00
Peter Steinberger
84fb62170a docs: clarify anthropic cli fallback guidance 2026-04-05 10:06:32 +01:00
Peter Steinberger
19de5d1b56 refactor: move provider discovery config into plugins 2026-04-05 09:55:55 +01:00
Peter Steinberger
d655a8bc76 feat: add Fireworks provider and simplify plugin setup loading 2026-04-05 07:43:14 +01:00
Peter Steinberger
37301cbc3b docs: clarify anthropic extra usage billing 2026-04-05 07:14:35 +09:00
Peter Steinberger
eee868452f docs: refresh claude-cli model ref mirrors 2026-04-04 22:19:07 +01:00
Peter Steinberger
6de100d4e2 docs: refresh claude-cli naming mirrors 2026-04-04 22:11:45 +01:00
Peter Steinberger
8ea5b1ddc0 docs: refresh anthropic token compatibility mirrors 2026-04-04 22:09:21 +01:00
Peter Steinberger
5c5c82dfaa docs: refresh anthropic oauth defaults refs 2026-04-04 22:01:16 +01:00
Peter Steinberger
b83c5fb8e0 docs: refresh provider runtime fallback refs 2026-04-04 21:48:41 +01:00
Peter Steinberger
3b4bed7c38 docs: refresh model-scoped cooldown mirrors 2026-04-04 20:54:05 +01:00
Peter Steinberger
97a587ddca docs: refresh qwen auth-choice mirrors 2026-04-04 20:40:31 +01:00
Peter Steinberger
62dd299af1 docs: refresh qwen video generation refs 2026-04-04 19:55:39 +01:00
Peter Steinberger
879d45a56c docs: refresh qwen media and config refs 2026-04-04 19:42:13 +01:00
Peter Steinberger
889ddb5edf docs(qwen): refresh provider and endpoint guides 2026-04-04 19:34:56 +01:00
Peter Steinberger
5584af7ac3 docs: refresh proxy provider runtime refs 2026-04-04 15:37:20 +01:00
Peter Steinberger
1ca1ce85ee docs: refresh xai and zai provider refs 2026-04-04 15:34:57 +01:00
Peter Steinberger
3dda70a578 docs: refresh gemini cli oauth setup refs 2026-04-04 15:27:42 +01:00
Peter Steinberger
0166fd426e docs: refresh minimax auth path refs 2026-04-04 12:47:07 +01:00
Peter Steinberger
9da0feeecf docs: fix minimax usage docs merge markers 2026-04-04 12:43:44 +01:00
Peter Steinberger
a375635a9a docs: refresh status token fallback refs 2026-04-04 12:42:50 +01:00
Peter Steinberger
fb0d60d7f3 fix: resolve MiniMax portal usage auth 2026-04-04 12:42:30 +01:00
Peter Steinberger
ac254f50e8 docs: refresh minimax usage refs 2026-04-04 12:36:18 +01:00
Peter Steinberger
90af255a91 docs: refresh gemini cli usage refs 2026-04-04 12:30:55 +01:00
Peter Steinberger
1cfc10e836 docs: refresh minimax multimodal refs 2026-04-04 12:27:47 +01:00
Peter Steinberger
3ec0463da9 docs: refresh minimax thinking refs 2026-04-04 12:23:33 +01:00
Peter Steinberger
b473816afb docs: refresh native streaming compat refs 2026-04-04 12:20:31 +01:00
Peter Steinberger
00337cdde1 docs: refresh codex auth and ws refs 2026-04-04 12:11:45 +01:00