openclaw/docs/concepts/markdown-formatting.md at ff7beea3da8928c54e5984f542c7b18d664e65d7

mirror of https://github.com/openclaw/openclaw.git synced 2026-05-18 16:14:44 +00:00


kinjitakabe 09a2d6e571 fix(matrix): default markdown tables to bullets across Matrix clients

Matrix outbound markdown was hitting the shared `resolveMarkdownTableMode`
"code" fallback for every send because the Matrix channel plugin never
declared `defaultMarkdownTableMode` in its `messaging` block at
`extensions/matrix/src/channel.ts:452`. Tables were emitted as
`<pre><code>` fenced blocks across every Matrix client.

This change declares `defaultMarkdownTableMode: "bullets"`, matching the
Signal and WhatsApp precedent at `extensions/signal/src/shared.ts:111`
and `extensions/whatsapp/src/shared.ts:261`. The choice matches the
cross-client compatibility profile the issue filer surveyed:
Element X iOS squashes HTML `<table>` and Element X Android drops cell
text entirely (element-hq/element-x-android#1551), while bullet lists
render cleanly across every Matrix client. Operators wanting the
previous fenced-code rendering can set `channels.matrix.markdown.tables:
"code"` explicitly; clients that do render real tables can opt in with
`channels.matrix.markdown.tables: "off"` (markdown-it's `table` rule is
already enabled by default through the markdown-it default preset, so
raw markdown tables flow through to native HTML tables on that path).

Docs and the changelog entry list Matrix alongside Signal and WhatsApp
as a bullet-default channel.

Fixes #78990.

2026-05-12 15:15:05 +05:30

5.0 KiB


Summary: Markdown formatting pipeline for outbound channels

Read when:
- You are changing markdown formatting or chunking for outbound channels
- You are adding a new channel formatter or style mapping
- You are debugging formatting regressions across channels

Markdown formatting

OpenClaw formats outbound Markdown by converting it into a shared intermediate representation (IR) before rendering channel-specific output. The IR keeps the source text intact while carrying style/link spans so chunking and rendering can stay consistent across channels.

Goals

Consistency: one parse step, multiple renderers.
Safe chunking: split text before rendering so inline formatting never breaks across chunks.
Channel fit: map the same IR to Slack mrkdwn, Telegram HTML, and Signal style ranges without re-parsing Markdown.

Pipeline

1. Parse Markdown -> IR
   - IR is plain text plus style spans (bold/italic/strike/code/spoiler) and link spans.
   - Offsets are UTF-16 code units so Signal style ranges align with its API.
   - Tables are parsed only when a channel opts into table conversion.
2. Chunk IR (format-first)
   - Chunking happens on the IR text before rendering.
   - Inline formatting does not split across chunks; spans are sliced per chunk.
3. Render per channel
   - Slack: mrkdwn tokens (bold/italic/strike/code), links as <url|label>.
   - Telegram: HTML tags (<b>, <i>, <s>, <code>, <pre><code>, <a href>).
   - Signal: plain text + text-style ranges; links become label (url) when label differs.
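The render step can be sketched as one marker-insertion routine driven by a per-channel marker map, plus a separate range export for Signal. This is a minimal illustration, not the shipped renderer: the type names, `renderWithMarkers`, `toSignalRanges`, and the `{ start, length, style }` range shape are all hypothetical, and spans are assumed to be non-overlapping or properly nested.

```typescript
// Hypothetical IR shapes for illustration; the real OpenClaw types may differ.
type StyleName = "bold" | "italic" | "strike" | "code";
interface StyleSpan { start: number; end: number; style: StyleName }
interface StyledText { text: string; styles: StyleSpan[] }
type MarkerMap = Record<StyleName, { open: string; close: string }>;

const slackMarkers: MarkerMap = {
  bold: { open: "*", close: "*" },
  italic: { open: "_", close: "_" },
  strike: { open: "~", close: "~" },
  code: { open: "`", close: "`" },
};

const telegramMarkers: MarkerMap = {
  bold: { open: "<b>", close: "</b>" },
  italic: { open: "<i>", close: "</i>" },
  strike: { open: "<s>", close: "</s>" },
  code: { open: "<code>", close: "</code>" },
};

// Telegram HTML needs &, <, > escaped in text outside tags.
function escapeHtmlText(s: string): string {
  return s.replace(/&/g, "&amp;").replace(/</g, "&lt;").replace(/>/g, "&gt;");
}

function renderWithMarkers(
  ir: StyledText,
  markers: MarkerMap,
  escapeText: (s: string) => string = (s) => s,
): string {
  const n = ir.text.length;
  const opens: string[] = new Array<string>(n + 1).fill("");
  const closes: string[] = new Array<string>(n + 1).fill("");
  // Outer spans first, so nested markers open outermost-first.
  const spans = ir.styles.slice().sort((a, b) => a.start - b.start || b.end - a.end);
  for (const s of spans) {
    opens[s.start] = opens[s.start] + markers[s.style].open;
    // Prepend so inner spans close before outer ones at the same offset.
    closes[s.end] = markers[s.style].close + closes[s.end];
  }
  let out = "";
  for (let i = 0; i <= n; i++) {
    out += closes[i] + opens[i];
    if (i < n) out += escapeText(ir.text.charAt(i));
  }
  return out;
}

// Signal keeps the text untouched and sends style ranges alongside it.
// The range shape and style names below are assumptions for this sketch.
const signalStyleNames: Record<StyleName, string> = {
  bold: "BOLD",
  italic: "ITALIC",
  strike: "STRIKETHROUGH",
  code: "MONOSPACE",
};

function toSignalRanges(ir: StyledText): { start: number; length: number; style: string }[] {
  return ir.styles.map((s) => ({
    start: s.start,
    length: s.end - s.start,
    style: signalStyleNames[s.style],
  }));
}
```

Because every renderer consumes the same spans, a new channel only needs a marker map (or a range export) rather than its own Markdown parser.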

IR example

Input Markdown:

Hello **world** - see [docs](https://docs.openclaw.ai).

IR (schematic):

{
  "text": "Hello world - see docs.",
  "styles": [{ "start": 6, "end": 11, "style": "bold" }],
  "links": [{ "start": 18, "end": 22, "href": "https://docs.openclaw.ai" }]
}
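To make the offset convention concrete, the sketch below builds the same IR by locating substrings instead of hand-counting, treating `end` as exclusive. The interface names and the `spanOf` helper are hypothetical and only mirror the schematic shape above.

```typescript
// Hypothetical shapes mirroring the schematic IR; the real types may differ.
interface StyleSpan { start: number; end: number; style: string }
interface LinkSpan { start: number; end: number; href: string }
interface MarkdownIR { text: string; styles: StyleSpan[]; links: LinkSpan[] }

// Locate a substring and return its [start, end) range in UTF-16 code units.
function spanOf(text: string, needle: string): { start: number; end: number } {
  const start = text.indexOf(needle);
  if (start < 0) throw new Error(`"${needle}" not found`);
  return { start, end: start + needle.length };
}

const sampleText = "Hello world - see docs.";
const boldSpan: StyleSpan = { ...spanOf(sampleText, "world"), style: "bold" };
const linkSpan: LinkSpan = { ...spanOf(sampleText, "docs"), href: "https://docs.openclaw.ai" };
const sampleIR: MarkdownIR = { text: sampleText, styles: [boldSpan], links: [linkSpan] };

// String.prototype.slice also works in UTF-16 code units, so spans slice directly.
const boldText = sampleIR.text.slice(boldSpan.start, boldSpan.end);
const linkText = sampleIR.text.slice(linkSpan.start, linkSpan.end);
```

With end-exclusive offsets, slicing the IR text by a span returns exactly the styled or linked label.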

Where it is used

Slack, Telegram, and Signal outbound adapters render from the IR.
Other channels (WhatsApp, iMessage, Microsoft Teams, Discord) still use plain text or their own formatting rules, with Markdown table conversion applied before chunking when enabled.

Table handling

Markdown tables are not consistently supported across chat clients. Use markdown.tables to control conversion per channel (and per account).

code: render tables as code blocks (default for most channels).
bullets: convert each row into bullet points (default for Matrix, Signal, and WhatsApp).
off: disable table parsing and conversion; raw table text passes through.
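A rough sketch of what the three modes do to a table follows. The bullet layout (first cell as the parent bullet, remaining cells as `header: value` sub-bullets) is an assumption chosen for illustration, not necessarily the layout OpenClaw emits, and `convertTable` is a hypothetical name. The parser is deliberately naive and ignores escaped pipes.

```typescript
type TableMode = "off" | "bullets" | "code";

// Built with repeat() so this example does not embed a literal fence.
const CODE_FENCE = "`".repeat(3);

// Split one table line into trimmed cells, dropping the outer pipes.
function splitCells(line: string): string[] {
  return line
    .trim()
    .replace(/^\|/, "")
    .replace(/\|$/, "")
    .split("|")
    .map((cell) => cell.trim());
}

function convertTable(md: string, mode: TableMode): string {
  // "off": raw table text passes through untouched.
  if (mode === "off") return md;
  // "code": wrap the raw table in a fenced block so columns stay aligned.
  if (mode === "code") return CODE_FENCE + "\n" + md.trim() + "\n" + CODE_FENCE;

  // "bullets": one parent bullet per body row.
  const lines = md.trim().split("\n");
  const header = splitCells(lines[0]);
  // lines[1] is the delimiter row (|---|---|); body rows follow it.
  return lines
    .slice(2)
    .map((line) => {
      const [first, ...rest] = splitCells(line);
      const details = rest.map((cell, i) => `  - ${header[i + 1]}: ${cell}`);
      return [`- ${first}`, ...details].join("\n");
    })
    .join("\n");
}

const sampleTable = "| Name | Role |\n|---|---|\n| Ada | Eng |\n| Bob | Ops |";
const asBullets = convertTable(sampleTable, "bullets");
```

Bullets trade column alignment for portability, which is why they suit clients that render neither HTML tables nor monospace blocks reliably.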

Config keys:

channels:
  discord:
    markdown:
      tables: code
    accounts:
      work:
        markdown:
          tables: off
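One way to read the config above is as a lookup with fallbacks: account setting, then channel setting, then the channel's declared default, then `code`. The sketch below assumes that precedence (the account-over-channel order is inferred from the config layout, not confirmed), and `resolveTableModeSketch` and the config types are hypothetical stand-ins rather than the shared resolver itself.

```typescript
type TableMode = "off" | "bullets" | "code";

interface MarkdownSettings { tables?: TableMode }
interface ChannelSettings {
  markdown?: MarkdownSettings;
  accounts?: Record<string, { markdown?: MarkdownSettings }>;
}
interface SketchConfig { channels?: Record<string, ChannelSettings> }

// Assumed precedence: account override > channel setting > channel default > "code".
function resolveTableModeSketch(
  cfg: SketchConfig,
  channel: string,
  accountId?: string,
  channelDefault: TableMode = "code",
): TableMode {
  const channelCfg = cfg.channels?.[channel];
  const accountCfg = accountId ? channelCfg?.accounts?.[accountId] : undefined;
  return accountCfg?.markdown?.tables ?? channelCfg?.markdown?.tables ?? channelDefault;
}

// Mirrors the config keys shown above.
const sampleConfig: SketchConfig = {
  channels: {
    discord: {
      markdown: { tables: "code" },
      accounts: { work: { markdown: { tables: "off" } } },
    },
  },
};

const workMode = resolveTableModeSketch(sampleConfig, "discord", "work");
const discordMode = resolveTableModeSketch(sampleConfig, "discord");
// A channel with no explicit config falls back to its declared default.
const matrixMode = resolveTableModeSketch(sampleConfig, "matrix", undefined, "bullets");
```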

Chunking rules

Chunk limits come from channel adapters/config and are applied to the IR text.
Code fences are preserved as a single block with a trailing newline so channels render them correctly.
List prefixes and blockquote prefixes are part of the IR text, so chunking does not split mid-prefix.
Inline style markers (bold/italic/strike/inline-code/spoiler) are never left unbalanced across chunks; a span that crosses a chunk boundary is sliced, and the renderer closes and reopens the style inside each chunk.
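Span slicing can be sketched as follows: split the IR text at a limit, then clip each span to the chunk window and rebase it to chunk-local offsets. This is a simplified illustration under stated assumptions. `chunkStyledText` is a hypothetical name, the break rule is just "last space inside the window", and code fences, list prefixes, and surrogate pairs are not handled.

```typescript
interface ChunkSpan { start: number; end: number; style: string }
interface ChunkableIR { text: string; styles: ChunkSpan[] }

function chunkStyledText(ir: ChunkableIR, limit: number): ChunkableIR[] {
  if (limit <= 0) throw new Error("limit must be positive");
  const chunks: ChunkableIR[] = [];
  let offset = 0;
  while (offset < ir.text.length) {
    let end = Math.min(offset + limit, ir.text.length);
    if (end < ir.text.length) {
      // Prefer breaking just after the last space inside the window.
      const space = ir.text.lastIndexOf(" ", end - 1);
      if (space > offset) end = space + 1;
    }
    const styles = ir.styles
      // Keep only spans that intersect [offset, end).
      .filter((s) => s.start < end && s.end > offset)
      // Clip to the window and rebase to chunk-local offsets.
      .map((s) => ({
        style: s.style,
        start: Math.max(s.start, offset) - offset,
        end: Math.min(s.end, end) - offset,
      }));
    chunks.push({ text: ir.text.slice(offset, end), styles });
    offset = end;
  }
  return chunks;
}

// "big world" is bold and straddles the chunk boundary at offset 10.
const chunked = chunkStyledText(
  { text: "Hello big world", styles: [{ start: 6, end: 15, style: "bold" }] },
  10,
);
```

Each chunk carries its own complete spans, so a renderer can emit balanced markers per chunk without knowing about its neighbours.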

If you need more on chunking behavior across channels, see Streaming + chunking.

Link policy

Slack: [label](url) -> <url|label>; bare URLs remain bare. Autolink is disabled during parse to avoid double-linking.
Telegram: [label](url) -> <a href="url">label</a> (HTML parse mode).
Signal: [label](url) -> label (url) unless label matches the URL.
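The three link rules can be expressed as one small function. This is an illustrative sketch with a hypothetical `renderLink` helper; it escapes label and URL for Telegram HTML but does not attempt Slack-side escaping of `&`, `<`, or `>` in labels.

```typescript
type LinkChannel = "slack" | "telegram" | "signal";

// Escape for HTML text and double-quoted attribute values.
function escapeHtmlAttr(s: string): string {
  return s
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;");
}

function renderLink(channel: LinkChannel, label: string, url: string): string {
  switch (channel) {
    case "slack":
      return `<${url}|${label}>`;
    case "telegram":
      return `<a href="${escapeHtmlAttr(url)}">${escapeHtmlAttr(label)}</a>`;
    case "signal":
      // Avoid "url (url)" when the label is just the URL itself.
      return label === url ? url : `${label} (${url})`;
  }
}

const docsUrl = "https://docs.openclaw.ai";
const slackLink = renderLink("slack", "docs", docsUrl);
const telegramLink = renderLink("telegram", "docs", docsUrl);
const signalLink = renderLink("signal", "docs", docsUrl);
```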

Spoilers

Spoiler markers (||spoiler||) are parsed only for Signal, where they map to SPOILER style ranges. Other channels treat them as plain text.
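As an illustration of the Signal path, spoiler markers can be stripped from the text while their positions are recorded as ranges. The regex-based `extractSpoilers` below is a hypothetical sketch, not the parser's actual rule; it ignores nesting and would misread `||` inside table rows.

```typescript
interface SpoilerRange { start: number; end: number }

function extractSpoilers(input: string): { text: string; spoilers: SpoilerRange[] } {
  const pattern = /\|\|(.+?)\|\|/g;
  const spoilers: SpoilerRange[] = [];
  let text = "";
  let last = 0;
  let match: RegExpExecArray | null;
  while ((match = pattern.exec(input)) !== null) {
    // Copy the text before the marker, then the spoiler body without its pipes.
    text += input.slice(last, match.index);
    const start = text.length;
    text += match[1];
    spoilers.push({ start, end: text.length });
    last = match.index + match[0].length;
  }
  text += input.slice(last);
  return { text, spoilers };
}

const spoilerResult = extractSpoilers("The answer is ||42||!");
```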

How to add or update a channel formatter

Parse once: use the shared markdownToIR(...) helper with channel-appropriate options (autolink, heading style, blockquote prefix).
Render: implement a renderer with renderMarkdownWithMarkers(...) and a style marker map (or Signal style ranges).
Chunk: call chunkMarkdownIR(...) before rendering; render each chunk.
Wire adapter: update the channel outbound adapter to use the new chunker and renderer.
Test: add or update format tests and an outbound delivery test if the channel uses chunking.

Common gotchas

Slack angle-bracket tokens (<@U123>, <#C123>, <https://...>) must be preserved; escape raw HTML safely.
Telegram HTML requires escaping text outside tags to avoid broken markup.
Signal style ranges depend on UTF-16 offsets; do not use code point offsets.
Preserve trailing newlines for fenced code blocks so closing markers land on their own line.
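The UTF-16 gotcha is easy to reproduce with any character outside the Basic Multilingual Plane. The sketch below uses a hypothetical `codePointToUtf16Offset` helper to show how the two offset systems diverge after an emoji.

```typescript
// Convert an offset counted in code points into UTF-16 code units.
function codePointToUtf16Offset(text: string, codePointOffset: number): number {
  // Array.from iterates by code point; joined string length counts UTF-16 units.
  return Array.from(text).slice(0, codePointOffset).join("").length;
}

// U+1F600 (grinning face) is one code point but two UTF-16 code units.
const emojiText = "\uD83D\uDE00 bold";

const utf16Start = emojiText.indexOf("bold");
const codePointStart = Array.from(emojiText.slice(0, utf16Start)).length;
const converted = codePointToUtf16Offset(emojiText, codePointStart);
```

Here the word starts at code point 2 but at UTF-16 offset 3, so a range built from code point offsets would style the wrong characters.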

