mirror of
https://github.com/openclaw/openclaw.git
synced 2026-05-06 12:10:42 +00:00
refactor(vllm): own qwen thinking payloads
This commit is contained in:
@@ -169,6 +169,13 @@ Availability can still vary by endpoint and billing plan even when a model is
|
||||
present in the bundled catalog.
|
||||
</Note>
|
||||
|
||||
## Thinking Controls
|
||||
|
||||
For reasoning-enabled Qwen Cloud models, the bundled provider maps OpenClaw
|
||||
thinking levels to DashScope's top-level `enable_thinking` request flag. Disabled
|
||||
thinking sends `enable_thinking: false`; other thinking levels send
|
||||
`enable_thinking: true`.
|
||||
|
||||
## Multimodal add-ons
|
||||
|
||||
The `qwen` plugin also exposes multimodal capabilities on the **Standard**
|
||||
|
||||
@@ -131,7 +131,7 @@ Use explicit config when:
|
||||
|
||||
<Accordion title="Qwen thinking controls">
|
||||
For Qwen models served through vLLM, set
|
||||
`compat.thinkingFormat: "qwen-chat-template"` on the model entry when the
|
||||
`params.qwenThinkingFormat: "chat-template"` on the model entry when the
|
||||
server expects Qwen chat-template kwargs. OpenClaw maps `/think off` to:
|
||||
|
||||
```json
|
||||
@@ -145,8 +145,8 @@ Use explicit config when:
|
||||
|
||||
Non-`off` thinking levels send `enable_thinking: true`. If your endpoint
|
||||
expects DashScope-style top-level flags instead, use
|
||||
`compat.thinkingFormat: "qwen"` to send `enable_thinking` at the request
|
||||
root.
|
||||
`params.qwenThinkingFormat: "top-level"` to send `enable_thinking` at the
|
||||
request root. Snake-case `params.qwen_thinking_format` is also accepted.
|
||||
|
||||
</Accordion>
|
||||
|
||||
|
||||
Reference in New Issue
Block a user