From 19017bad96c0a3548cc544ee83360428147c3382 Mon Sep 17 00:00:00 2001 From: Peter Steinberger Date: Sat, 25 Apr 2026 09:51:21 +0100 Subject: [PATCH] docs(browser): explain actionable aria snapshot refs --- docs/tools/browser-control.md | 15 +++++++++++++-- .../browser/skills/browser-automation/SKILL.md | 2 +- 2 files changed, 14 insertions(+), 3 deletions(-) diff --git a/docs/tools/browser-control.md b/docs/tools/browser-control.md index 31164ec16ae..78abfee9d8c 100644 --- a/docs/tools/browser-control.md +++ b/docs/tools/browser-control.md @@ -213,14 +213,14 @@ openclaw browser set device "iPhone 14" Notes: - `upload` and `dialog` are **arming** calls; run them before the click/press that triggers the chooser/dialog. -- `click`/`type`/etc require a `ref` from `snapshot` (numeric `12` or role ref `e12`). CSS selectors are intentionally not supported for actions. Use `click-coords` when the visible viewport position is the only reliable target. +- `click`/`type`/etc require a `ref` from `snapshot` (numeric `12`, role ref `e12`, or actionable ARIA ref `ax12`). CSS selectors are intentionally not supported for actions. Use `click-coords` when the visible viewport position is the only reliable target. - Download, trace, and upload paths are constrained to OpenClaw temp roots: `/tmp/openclaw{,/downloads,/uploads}` (fallback: `${os.tmpdir()}/openclaw/...`). - `upload` can also set file inputs directly via `--input-ref` or `--element`. Snapshot flags at a glance: - `--format ai` (default with Playwright): AI snapshot with numeric refs (`aria-ref=""`). -- `--format aria`: accessibility tree, no refs; inspection only. +- `--format aria`: accessibility tree with `axN` refs. When Playwright is available, OpenClaw binds refs with backend DOM ids to the live page so follow-up actions can use them; otherwise treat the output as inspection-only. - `--efficient` (or `--mode efficient`): compact role snapshot preset. Set `browser.snapshotDefaults.mode: "efficient"` to make this the default (see [Gateway configuration](/gateway/configuration-reference#browser)). - `--interactive`, `--compact`, `--depth`, `--selector` force a role snapshot with `ref=e12` refs. `--frame "