mirror of
https://github.com/openclaw/openclaw.git
synced 2026-06-22 12:28:10 +00:00
fix(vllm): wire configured thinking params
Move vLLM Qwen thinking control onto configured model compat metadata and carry it through catalog/model-selection/runtime thinking contexts. Also migrate legacy provider/default request params in doctor and keep Pi/runtime model rows buildable with explicit reasoning defaults. Thanks @rendrag-git. Co-authored-by: rendrag-git <253747599+rendrag-git@users.noreply.github.com>
This commit is contained in:
@@ -818,6 +818,11 @@ canonical replacement.
|
||||
ranked level list. OpenClaw downgrades stale stored values by profile
|
||||
rank automatically.
|
||||
|
||||
The context includes `provider`, `modelId`, optional merged `reasoning`,
|
||||
and optional merged model `compat` facts. Provider plugins can use those
|
||||
catalog facts to expose a model-specific profile only when the configured
|
||||
request contract supports it.
|
||||
|
||||
Implement one hook instead of three. The legacy hooks keep working during
|
||||
the deprecation window but are not composed with the profile result.
|
||||
|
||||
|
||||
@@ -501,6 +501,7 @@ API key auth, and dynamic model resolution.
|
||||
|
||||
- `normalizeConfig` checks the matched provider first, then other hook-capable provider plugins until one actually changes the config. If no provider hook rewrites a supported Google-family config entry, the bundled Google config normalizer still applies.
|
||||
- `resolveConfigApiKey` uses the provider hook when exposed. The bundled `amazon-bedrock` path also has a built-in AWS env-marker resolver here, even though Bedrock runtime auth itself still uses the AWS SDK default chain.
|
||||
- `resolveThinkingProfile(ctx)` receives the selected `provider`, `modelId`, optional merged `reasoning` catalog hint, and optional merged model `compat` facts. Use `compat` only to select the provider's thinking UI/profile.
|
||||
- `resolveSystemPromptContribution` lets a provider inject cache-aware system-prompt guidance for a model family. Prefer it over `before_prompt_build` when the behavior belongs to one provider/model family and should preserve the stable/dynamic cache split.
|
||||
|
||||
For detailed descriptions and real-world examples, see [Internals: Provider Runtime Hooks](/plugins/architecture-internals#provider-runtime-hooks).
|
||||
|
||||
Reference in New Issue
Block a user