fix: resolve providerless image model refs

This commit is contained in:
Peter Steinberger
2026-04-28 10:18:00 +01:00
parent 5741e40c14
commit fac116cfa4
10 changed files with 241 additions and 7 deletions

View File

@@ -283,6 +283,8 @@ To make Ollama the default image-understanding model for inbound media, configur
}
```
Prefer the full `ollama/<model>` ref. If the same model is listed under `models.providers.ollama.models` with `input: ["text", "image"]` and no other configured image provider exposes that bare model ID, OpenClaw also normalizes a bare `imageModel` ref such as `qwen2.5vl:7b` to `ollama/qwen2.5vl:7b`. If more than one configured image provider has the same bare ID, use the provider prefix explicitly.
Slow local vision models can need a longer image-understanding timeout than cloud models. They can also crash or stop when Ollama tries to allocate the full advertised vision context on constrained hardware. Set a capability timeout, and cap `num_ctx` on the model entry when you only need a normal image-description turn:
```json5