openclaw

mirror of https://github.com/openclaw/openclaw.git synced 2026-04-14 18:51:04 +00:00

Author	SHA1	Message	Date
xieyongliang	e0a2c568b2	video_generate: support url-only delivery (#61988) (thanks @xieyongliang) (#61988) Co-authored-by: George Zhang <georgezhangtj97@gmail.com>	2026-04-11 03:08:30 -07:00
xieyongliang	2c57ec7b5f	video_generate: add providerOptions, inputAudios, and imageRoles (#61987)
* video_generate: add providerOptions, inputAudios, and imageRoles
  - VideoGenerationSourceAsset gains an optional `role` field (e.g. "first_frame", "last_frame"); core treats it as opaque and forwards it to the provider unchanged.
  - VideoGenerationRequest gains `inputAudios` (reference audio assets, e.g. background music) and `providerOptions` (arbitrary provider-specific key/value pairs forwarded as-is).
  - VideoGenerationProviderCapabilities gains `maxInputAudios`.
  - video_generate tool schema adds:
    - `imageRoles` array (parallel to `images`, sets role per asset)
    - `audioRef` / `audioRefs` (single/multi reference audio inputs)
    - `providerOptions` (JSON object passed through to the provider)
  - `MAX_INPUT_IMAGES` bumped 5 → 9; `MAX_INPUT_AUDIOS` = 3
  - Capability validation extended to gate on `maxInputAudios`.
  - runtime.ts threads `inputAudios` and `providerOptions` through to `provider.generateVideo`.
  - Docs and runtime tests updated.
  Made-with: Cursor
* docs: fix BytePlus Seedance capability table - split 1.5 and 2.0 rows
  1.5 Pro supports at most 2 input images (first_frame + last_frame); 2.0 supports up to 9 reference images, 3 videos, and 3 audios. Provider notes section updated accordingly.
  Made-with: Cursor
* docs: list all Seedance 1.0 models in video-generation provider table
  - Default model updated to seedance-1-0-pro-250528 (was the T2V lite)
  - Provider notes now enumerate all five 1.0 model IDs with T2V/I2V capability notes
  Made-with: Cursor
* video_generate: address review feedback (P1/P2)
  P1: Add "adaptive" to SUPPORTED_ASPECT_RATIOS so provider-specific ratio passthrough (used by Seedance 1.5/2.0) is accepted instead of throwing. Update error message to include "adaptive" in the allowed list.
  P1: Fix audio input capability default - when a provider does not declare maxInputAudios, default to 0 (no audio support) instead of MAX_INPUT_AUDIOS. Providers must explicitly opt in via maxInputAudios to accept audio inputs.
  P2: Remove unnecessary type cast in imageRoles assignment; VideoGenerationSourceAsset already declares role?: string so a non-null assertion suffices.
  P2: Add videoRoles and audioRoles tool parameters, parallel to imageRoles, so callers can assign semantic role hints to reference video and audio assets (e.g. "reference_video", "reference_audio" for Seedance 2.0).
  Made-with: Cursor
* video_generate: fix check-docs formatting and snake_case param reading
  Made-with: Cursor
* video_generate: clarify Roles are parallel to combined input list (P2)
  Made-with: Cursor
video_generate: add missing duration import; fix corrupted docs section
  Made-with: Cursor
* video_generate: pass mode inputs to duration resolver; note plugin requirement (P2)
  Made-with: Cursor
* plugin-sdk: sync new video-gen fields - role, inputAudios, providerOptions, maxInputAudios
  Add fields introduced by core in the PR1 batch to the public plugin-sdk mirror so TypeScript provider plugins can declare and consume them without type assertions:
  - VideoGenerationSourceAsset.role?: string
  - VideoGenerationRequest.inputAudios and .providerOptions
  - VideoGenerationModeCapabilities.maxInputAudios
  The AssertAssignable bidirectional checks still pass because all new fields are optional; this change makes the SDK surface complete.
  Made-with: Cursor
* video-gen runtime: skip failover candidates lacking audio capability
  Made-with: Cursor
* video-gen: fall back to flat capabilities.maxInputAudios in failover and tool validation
  Made-with: Cursor
* video-gen: defer audio-count check to runtime, enabling fallback for audio-capable candidates
  Made-with: Cursor
* video-gen: defer maxDurationSeconds check to runtime, enabling fallback for higher-cap candidates
  Made-with: Cursor
* video-gen: add VideoGenerationAssetRole union and typed providerOptions capability
  Introduces a canonical VideoGenerationAssetRole union (first_frame, last_frame, reference_image, reference_video, reference_audio) for the source-asset role hint, and a VideoGenerationProviderOptionType tag ('number' | 'boolean' | 'string') plus a new capabilities.providerOptions schema that providers use to declare which opaque providerOptions keys they accept and with what primitive type. Types are additive and backwards compatible. The role field accepts both canonical union values and arbitrary provider-specific strings via a `VideoGenerationAssetRole | (string & {})` union, so autocomplete works for the common case without blocking provider-specific extensions. Runtime enforcement of providerOptions (skip-in-fallback, unknown key and type mismatch) lands in a follow-up commit.
  Co-authored-by: yongliang.xie <yongliang.xie@bytedance.com>
* video-gen: enforce typed providerOptions schema via skip-in-fallback
  Adds `validateProviderOptionsAgainstDeclaration` in the video-generation runtime and wires it into the `generateVideo` candidate loop alongside the existing audio-count and duration-cap skip guards. Behavior:
  - Candidates with no declared `capabilities.providerOptions` skip any non-empty providerOptions payload with a clear skip reason, so a provider that would ignore `{seed: 42}` and succeed without the caller's intent never gets reached.
  - Candidates that declare a schema reject unknown keys with the list of accepted keys in the error.
  - Candidates that declare a schema reject type mismatches (expected number/boolean/string) with the declared type in the error.
  - All skip reasons push into `attempts` so the aggregated failure message at the end of the fallback chain explains exactly why each candidate was rejected.
  Also hardens the tool boundary: `providerOptions` that is not a plain JSON object (including bogus arrays like `["seed", 42]`) now throws a `ToolInputError` up front instead of being cast to `Record` and forwarded with numeric-string keys. Consistent with the audio/duration skip-in-fallback pattern introduced by yongliang.xie in earlier commits on this branch.
  Co-authored-by: yongliang.xie <yongliang.xie@bytedance.com>
* video-gen: harden Roles parity + document canonical role values
  Replaces the inline `parseRolesArg` lambda with a dedicated `parseRoleArray` helper that throws a ToolInputError when the caller supplies more roles than assets. Off-by-one alignment mistakes in `imageRoles` / `videoRoles` / `audioRoles` now fail loudly at the tool boundary instead of silently dropping trailing roles. Also tightens the schema descriptions to document the canonical VideoGenerationAssetRole values (first_frame, last_frame, reference_) and the skip-in-fallback contract on providerOptions, and rejects non-array inputs to any `Roles` field early rather than coercing them to an empty list.
  Co-authored-by: yongliang.xie <yongliang.xie@bytedance.com>
video-gen: surface dropped aspectRatio sentinels in ignoredOverrides
  "adaptive" and other provider-specific sentinel aspect ratios are unparseable as numeric ratios, so when the active provider does not declare the sentinel in caps.aspectRatios, `resolveClosestAspectRatio` returns undefined and the previous code silently nulled out `aspectRatio` without surfacing a warning. Push the dropped value into `ignoredOverrides` so the tool result warning path ("Ignored unsupported overrides for …") picks it up, and the caller gets visible feedback that the request was dropped instead of a silent no-op. Also corrects the tool-side comment on SUPPORTED_ASPECT_RATIOS to describe actual behavior.
  Co-authored-by: yongliang.xie <yongliang.xie@bytedance.com>
* video-gen: surface declared providerOptions + maxInputAudios in action=list
  `video_generate action=list` now includes the declared providerOptions schema (key:type) per provider, so agents can discover which opaque keys each provider accepts without trial and error. Both mode-level and flat-provider providerOptions declarations are merged, matching the runtime lookup order in `generateVideo`. Also surfaces `maxInputAudios` alongside the other max-input counts for completeness; previously the list output did not expose the audio cap at all, even though the tool validates against it.
  Co-authored-by: yongliang.xie <yongliang.xie@bytedance.com>
* video-gen: warn once per request when runtime skips a fallback candidate
  The skip-in-fallback guards (audio cap, duration cap, providerOptions) all logged at debug level, which meant operators had no visible signal when the primary provider was silently passed over in favor of a fallback. Add a first-skip log.warn in the runtime loop so the reason for the first rejection is surfaced once per request, and leave the rest of the skip events at debug to avoid flooding on long chains.
  Co-authored-by: yongliang.xie <yongliang.xie@bytedance.com>
* video-gen: cover new tool-level behavior with regression tests
  Adds regression tests for:
  - providerOptions shape rejection (arrays, strings)
  - providerOptions happy-path forwarding to runtime
  - imageRoles length-parity guard
  - Roles non-array rejection
  - positional role attachment to loaded reference images
  - audio data: URL templated rejection branch
  - aspectRatio='adaptive' acceptance and forwarding
  - unsupported aspectRatio rejection (mentions 'adaptive' in the error)
  All eight new cases run in the existing video-generate-tool suite and use the same provider-mock pattern already established in the file.
  Co-authored-by: yongliang.xie <yongliang.xie@bytedance.com>
video-gen: cover runtime providerOptions skip-in-fallback branches
  Adds runtime regression tests for the new typed-providerOptions guard:
  - candidates without a declared providerOptions schema are skipped when any providerOptions is supplied (prevents silent drop)
  - candidates that declare a schema skip on unknown keys with the accepted-key list surfaced in the error
  - candidates that declare a schema skip on type mismatches with the declared type surfaced in the error
  - end-to-end fallback: openai (no providerOptions) is skipped and byteplus (declared schema) accepts the same request, with an attempt entry recording the first skip reason
  Also updates the existing 'forwards providerOptions to the provider unchanged' case so the destination provider declares the matching typed schema, and wires a `warn` stub into the hoisted logger mock so the new first-skip log.warn call path does not blow up.
  Co-authored-by: yongliang.xie <yongliang.xie@bytedance.com>
* changelog: note video_generate providerOptions / inputAudios / role hints
  Adds an Unreleased Changes entry describing the user-visible surface expansion for video_generate: typed providerOptions capability, inputAudios reference audio, per-asset role hints via the canonical VideoGenerationAssetRole union, the 'adaptive' aspect-ratio sentinel, maxInputAudios capability, and the relaxed 9-image cap. Credits the original PR author.
  Co-authored-by: yongliang.xie <yongliang.xie@bytedance.com>
* byteplus: declare providerOptions schema (seed, draft, camerafixed) and forward to API
  Made-with: Cursor
* byteplus: fix camera_fixed body field (API uses underscore, not camerafixed)
  Made-with: Cursor
* fix(byteplus): normalize resolution to lowercase before API call
  The Seedance API rejects resolution values with uppercase letters: "480P", "720P" etc return InvalidParameter, while "480p", "720p" are accepted. This was breaking the video generation live test (resolveLiveVideoResolution returns "480P"). Normalize req.resolution to lowercase at the provider layer before setting body.resolution, so any caller-supplied casing is corrected without requiring changes to the VideoGenerationResolution type or live-test helpers.
  Verified via direct API call:
  body.resolution = "480P" → HTTP 400 InvalidParameter
  body.resolution = "480p" → task created successfully
  body.resolution = "720p" → task created successfully (t2v, i2v, 1.5-pro)
  body.resolution = "1080p" → task created successfully
  Made-with: Cursor
* video-gen/byteplus: auto-select i2v model when input images provided with t2v model
  Seedance 1.0 uses separate model IDs for T2V (seedance-1-0-lite-t2v-250428) and I2V (seedance-1-0-lite-i2v-250428). When the caller requests a T2V model but also provides inputImages, the API rejects with task_type i2v not supported on t2v model. Fix: when inputImages are present and the requested model contains "-t2v-", auto-substitute "-i2v-" so the API receives the correct model. Seedance 1.5 Pro uses a single model ID for both modes and is unaffected by this substitution. Verified via live test: both mode=generate and mode=imageToVideo pass for byteplus/seedance-1-0-lite-t2v-250428 with no failures.
  Co-authored-by: odysseus0 <odysseus0@example.com>
  Made-with: Cursor
* video-gen: fix duration rounding + align BytePlus (1.0) docs (P2)
  Made-with: Cursor
* video-gen: relax providerOptions gate for undeclared-schema providers (P1)
  Distinguish undefined (not declared = backward-compat pass-through) from {} (explicitly declared empty = no options accepted) in validateProviderOptionsAgainstDeclaration. Providers without a declared schema receive providerOptions as-is; providers with an explicit empty schema still skip. Typed schemas continue to validate key names and types. Also: restore camera_fixed (underscore) in BytePlus provider schema and body key (regression from earlier rebase), remove duplicate local readBooleanToolParam definition now imported from media-tool-shared, update tests and docs accordingly.
  Made-with: Cursor
* video_generate: add landing follow-up coverage
* video_generate: finalize plugin-sdk baseline (#61987) (thanks @xieyongliang)
---------
Co-authored-by: yongliang.xie <yongliang.xie@bytedance.com>
Co-authored-by: George Zhang <georgezhangtj97@gmail.com>
Co-authored-by: odysseus0 <odysseus0@example.com>	2026-04-11 02:23:14 -07:00
Peter Steinberger	3b6fac85ea	chore: prepare 2026.4.10 release	2026-04-11 03:22:18 +01:00
Peter Steinberger	69244f837f	test: speed provider retry imports	2026-04-11 02:37:51 +01:00
Peter Steinberger	202f80792e	feat: add plugin text transforms	2026-04-11 02:17:39 +01:00
Peter Steinberger	09b1117271	agents: add strict-agentic execution contract	2026-04-10 22:56:37 +01:00
Peter Steinberger	bfc0889776	docs: document Codex harness plugin workflow	2026-04-10 21:22:16 +01:00
Peter Steinberger	bac98d4218	test: reduce media contract import cost	2026-04-10 17:31:08 +01:00
Peter Steinberger	9fd08f9d0f	refactor: remove type-only import cycles	2026-04-10 15:14:27 +01:00
Mariano	46f8c4dfd5	fix(memory-core): harden request-scoped dreaming fallback (#64156) * memory-core: harden request-scoped dreaming fallback * memory-core: tighten request-scoped fallback classification	2026-04-10 12:11:57 +02:00
Peter Steinberger	e6797bcd08	chore: refresh plugin SDK API baseline	2026-04-09 02:21:03 +01:00
Peter Steinberger	6e200f4077	fix: update command-status SDK baseline (#63174) (thanks @hxy91819)	2026-04-09 01:35:15 +01:00
Gustavo Madeira Santana	bd7801eefa	Slack: key turn-local dedupe by dispatch kind Scope Slack turn-local delivery dedupe by reply dispatch kind so identical tool and final payloads on the same thread do not collapse into one send. Expose the existing dispatcher kind on the public reply-runtime seam and cover the Slack tracker and preview-fallback paths with regression tests.	2026-04-08 18:19:34 -04:00
Agustin Rivera	dafcaf9d69	fix(browser): harden browser control override loading (#62663) * fix(browser): harden browser control overrides * fix(lint): prepare boundary artifacts for extension oxlint * docs(changelog): add browser override hardening entry * fix(lint): avoid duplicate boundary prep --------- Co-authored-by: Devin Robison <drobison@nvidia.com> Co-authored-by: Devin Robison <drobison00@users.noreply.github.com>	2026-04-08 13:24:47 -06:00
Peter Steinberger	0950bdf727	fix: resolve post-rebase boundary drift	2026-04-08 09:58:22 +01:00
Peter Steinberger	a4b9755999	chore: prepare 2026.4.7-1 npm release	2026-04-08 05:08:17 +01:00
Peter Steinberger	c33ad415df	docs: update plugin sdk api baseline	2026-04-08 02:47:43 +01:00
Gustavo Madeira Santana	cfe71e2e44	Docs: document approval adapter subpaths	2026-04-07 16:06:02 -04:00
Gustavo Madeira Santana	d78512b09d	Refactor: centralize native approval lifecycle assembly (#62135) Merged via squash. Prepared head SHA: `b7c20a7398` Co-authored-by: gumadeiras <5599352+gumadeiras@users.noreply.github.com> Co-authored-by: gumadeiras <5599352+gumadeiras@users.noreply.github.com> Reviewed-by: @gumadeiras	2026-04-07 14:40:26 -04:00
Vincent Koc	dfb6c9c920	perf(plugin-sdk): split channel secret runtime helpers	2026-04-07 13:09:12 +01:00
Vincent Koc	9e9730a55e	chore(plugin-sdk): refresh api baseline hash	2026-04-07 08:58:24 +01:00
Vincent Koc	2988203a5e	feat(context-engine): add memory prompt helper	2026-04-07 08:56:41 +01:00
Vincent Koc	49fbecbf16	perf(plugin-sdk): add web fetch contract artifacts	2026-04-07 08:35:27 +01:00
Vincent Koc	e318f48ff2	perf(secrets): narrow channel secret-ref imports	2026-04-07 07:38:34 +01:00
Vincent Koc	2a6e8dca47	fix(plugin-sdk): add web-search contract subpath	2026-04-06 23:30:56 +01:00
Vincent Koc	78639eff76	perf(secrets): narrow channel secret sdk seam	2026-04-06 20:40:11 +01:00
Peter Steinberger	de20d3a024	refactor(plugin-sdk): add simple completion runtime entrypoint	2026-04-06 16:29:43 +01:00
Peter Steinberger	380a396266	refactor: share ambient proxy agent helpers	2026-04-06 15:03:30 +01:00
Vincent Koc	e69cfc3e3b	fix(plugin-sdk): restore compat auth helper exports	2026-04-06 13:14:02 +01:00
Vincent Koc	5716d83336	feat(memory-wiki): restore llm wiki stack	2026-04-06 04:56:52 +01:00
Peter Steinberger	3e72c0352d	chore: release 2026.4.5	2026-04-06 04:04:21 +01:00
Peter Steinberger	dc0ee2e178	feat: add music generation tooling	2026-04-06 01:47:14 +01:00
Vincent Koc	94256ea1a0	revert(memory-wiki): back out llm wiki stack	2026-04-05 22:44:20 +01:00
Vincent Koc	2f72363984	feat(memory-core): bridge wiki corpus into memory tools	2026-04-05 22:34:02 +01:00
Vincent Koc	c11e7a7420	feat(memory-wiki): add prompt supplement integration	2026-04-05 22:34:01 +01:00
Peter Steinberger	9b7002ee59	refactor(reply): type reply threading policy	2026-04-05 21:40:56 +01:00
Peter Steinberger	acd78e0c2f	refactor: split browser sdk seams	2026-04-05 17:17:16 +01:00
Peter Steinberger	9a0d88a868	refactor: move talk config contract under plugin	2026-04-05 14:26:35 +01:00
Vincent Koc	63db3443f1	fix(plugin-sdk): prefer canonical private-network opt-in	2026-04-05 11:45:09 +01:00
Daev Mithran	03be4c2489	fix(plugin-sdk): export missing context-engine types (#61251) * fix(plugin-sdk): export missing context-engine types Signed-off-by: DaevMithran <daevmithran1999@gmail.com> * build(plugin-sdk): refresh api baseline hash * docs(changelog): note context engine sdk exports --------- Signed-off-by: DaevMithran <daevmithran1999@gmail.com> Co-authored-by: Vincent Koc <vincentkoc@ieee.org>	2026-04-05 09:49:19 +01:00
Peter Steinberger	8be017fae6	refactor: remove plugin sdk facade generator	2026-04-05 09:23:55 +01:00
Peter Steinberger	b57372d665	refactor: route capability runtime through channel stores	2026-04-05 09:07:33 +01:00
Altay	2ba3484d10	fix(plugin-sdk): avoid telegram config import side effects (#61061) * fix(plugin-sdk): avoid telegram config import side effects * fix(plugin-sdk): address telegram contract review * test(plugin-sdk): tighten telegram contract guards	2026-04-05 02:32:04 +03:00
Gustavo Madeira Santana	e627f53d24	core: dedupe approval not-found handling (#60932) Merged via squash. Prepared head SHA: `108221fdfe` Co-authored-by: gumadeiras <5599352+gumadeiras@users.noreply.github.com> Co-authored-by: gumadeiras <5599352+gumadeiras@users.noreply.github.com> Reviewed-by: @gumadeiras	2026-04-04 13:23:58 -04:00
Peter Steinberger	4dbc66b1ed	fix: remove bundled channel startup reentry	2026-04-04 15:39:12 +01:00
Vincent Koc	486505a54e	refactor(providers): share kilocode stream family	2026-04-04 21:05:42 +09:00
Vincent Koc	39d2a719c9	refactor(providers): add family replay and tool hooks	2026-04-04 19:33:31 +09:00
Peter Steinberger	b5265a07d7	refactor: replace 156k-line generated baselines with SHA-256 hash files Config and Plugin SDK drift detection now compares SHA-256 hashes instead of full JSON content. The .sha256 files (6 lines total) are tracked in git; the full JSON baselines are gitignored and generated locally for inspection. Same CI guarantee, zero repo churn on schema changes.	2026-04-04 16:49:21 +09:00

48 Commits
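
Commit 2c57ec7b5f describes a skip-in-fallback check for `providerOptions`: an undeclared schema passes options through, an explicitly empty schema accepts none, and a typed schema validates key names and primitive types. The sketch below illustrates that decision logic only. It is not the repository's `validateProviderOptionsAgainstDeclaration`, whose signature the log does not show; `checkProviderOptions`, `OptionSchema`, the candidate ids, and the option types in the usage example are all hypothetical.

```typescript
// Hypothetical sketch: names and shapes are illustrative, not the repository's API.
type OptionType = "number" | "boolean" | "string";
type OptionSchema = Record<string, OptionType>;

/**
 * Returns a skip reason when a fallback candidate should be passed over,
 * or undefined when the supplied options are acceptable for it.
 */
function checkProviderOptions(
  options: Record<string, unknown> | undefined,
  declared: OptionSchema | undefined,
): string | undefined {
  const opts = options ?? {};
  const keys = Object.keys(opts);
  if (keys.length === 0) return undefined; // nothing to validate
  if (declared === undefined) return undefined; // undeclared schema: pass-through

  const accepted = Object.keys(declared);
  if (accepted.length === 0) {
    return "provider declares an empty providerOptions schema";
  }
  for (const key of keys) {
    if (!accepted.includes(key)) {
      return `unknown providerOptions key "${key}" (accepted: ${accepted.join(", ")})`;
    }
    const expected = declared[key];
    const actual = typeof opts[key];
    if (actual !== expected) {
      return `providerOptions.${key} expected ${expected}, got ${actual}`;
    }
  }
  return undefined;
}

// Usage: walk a fallback chain and record why each skipped candidate was rejected.
const candidates: Array<{ id: string; schema?: OptionSchema }> = [
  { id: "strict-empty", schema: {} },
  { id: "typed", schema: { seed: "number", draft: "boolean" } },
];
const attempts: string[] = [];
let chosen: string | undefined;
for (const candidate of candidates) {
  const reason = checkProviderOptions({ seed: 42 }, candidate.schema);
  if (reason !== undefined) {
    attempts.push(`${candidate.id}: ${reason}`);
    continue;
  }
  chosen = candidate.id;
  break;
}
```

Returning a reason string rather than throwing keeps the loop free to try the next candidate, which matches the commit's description of pushing skip reasons into `attempts` for the aggregated failure message.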