openclaw

mirror of https://github.com/openclaw/openclaw.git synced 2026-06-30 17:33:36 +00:00

Author	SHA1	Message	Date
Wynne668	d7dff3cbf4	fix(document-extract): render PDF image fallback per page so multi-page scans don't starve later pages (#96390 ) * fix(document-extract): render PDF image fallback per page so multi-page scans don't starve later pages clawpdf's mode:"images" extract applies a single maxPixels budget across every page, so the first page consumes it and later pages collapse to ~1x1 PNGs that vision OCR models reject. Render each selected page in its own extract() call so the pixel budget resets per page and every page yields a usable image. * fix(document-extract): preserve aggregate PDF render budget --------- Co-authored-by: Vincent Koc <vincentkoc@ieee.org>	2026-06-25 16:37:47 +08:00
Peter Steinberger	4fa5092cdc	docs: document small extension sources	2026-06-04 21:02:07 -04:00
Peter Steinberger	73168d37ac	feat: support encrypted PDF extraction (#87751 )	2026-05-28 20:19:49 +01:00
Peter Steinberger	75c3b53038	[codex] Use clawpdf for PDF extraction (#87670 ) * feat: use clawpdf for PDF extraction * fix: align approval action prompt typing * chore: use clawpdf 0.2.0 * fix: lazily load clawpdf backend	2026-05-28 17:35:39 +01:00
Peter Steinberger	225a374d5e	test: guard document extract mock calls	2026-05-12 00:19:06 +01:00
Shakker	b779bc1dc6	test: tighten document extractor assertions	2026-05-11 07:22:49 +01:00
Shakker	2faf2303a1	test: tighten pdf extraction image assertion	2026-05-08 18:09:09 +01:00
Peter Steinberger	1ef85c7d4c	test: make suites safe without isolation (#78834 ) * test: make suites safe without isolation * fix: narrow auth profile credential types * test: inject channel module loader factory locally	2026-05-07 08:43:29 +01:00
JC	83753535eb	fix(pdf): resolve standard fonts from pdfjs package root (#70936 ) * fix(pdf): resolve standard fonts from pdfjs package root Resolve PDF.js standard fonts via pdfjs-dist/package.json instead of a relative ../../node_modules path so the fallback renderer does not depend on emitted dist chunk layout. Add focused regression coverage that asserts the forwarded standardFontDataUrl matches the installed pdfjs-dist package root and exists on disk. * fix(pdf): resolve pdfjs standard fonts from package root * fix(pdf): use PDF.js font URL separator --------- Co-authored-by: Dr JCai <jingxiao.cai@gmail.com> Co-authored-by: vincentkoc <25068+vincentkoc@users.noreply.github.com> Co-authored-by: Vincent Koc <vincentkoc@ieee.org>	2026-04-30 00:38:48 -07:00
Vincent Koc	e3cba98f39	refactor(pdf): move document extraction to plugin * refactor(pdf): move document extraction to plugin * fix(deps): sync document extract lockfile * fix(pdf): harden document extraction plugin	2026-04-24 17:15:05 -07:00

10 Commits