Commit Graph

78 Commits

Author SHA1 Message Date
Peter Steinberger
c0aed59fca refactor: move qa suite definitions into markdown 2026-04-07 23:39:50 +01:00
Peter Steinberger
e0ad3e79e6 refactor: dedupe normalization lowercase helpers 2026-04-07 22:57:52 +01:00
Peter Steinberger
83bdba2bae fix: resolve rebase regressions for ci landing 2026-04-07 21:02:04 +01:00
Peter Steinberger
5b090561fb refactor: dedupe browser whatsapp qa lowercase helpers 2026-04-07 20:58:01 +01:00
Peter Steinberger
a00b01f5ed fix: harden complex qa suite scenarios 2026-04-07 20:35:39 +01:00
Peter Steinberger
b5d2bd6f41 fix(qa): tighten frontier scope evals 2026-04-07 20:32:42 +01:00
Peter Steinberger
4e69a9b329 fix(qa): restore safe no-fork gateway runtime 2026-04-07 20:32:42 +01:00
Vincent Koc
cde12e63e7 perf(qa): lazy-load runner catalog for lab ui 2026-04-07 20:32:42 +01:00
Vincent Koc
f312d6c106 fix(qa): preserve gateway cli auth in no-fork rpc path 2026-04-07 20:32:42 +01:00
Vincent Koc
e7538b4499 perf(qa): drop per-rpc gateway cli forks 2026-04-07 20:32:42 +01:00
Vincent Koc
02bd9e8c10 perf(qa): trim frontier direct-agent waits 2026-04-07 20:32:42 +01:00
Vincent Koc
35eb70f1f5 test(qa): retry flaky local fetches in lab server tests 2026-04-07 20:32:42 +01:00
Vincent Koc
986536ff6b fix(qa): keep direct self-check outputs under repo root 2026-04-07 20:32:42 +01:00
Vincent Koc
f6544a0a3b fix(qa): anchor runner artifacts to repo root 2026-04-07 20:32:42 +01:00
Vincent Koc
daeff2fa89 fix(qa): default docker artifacts from repo root 2026-04-07 20:32:42 +01:00
Vincent Koc
76bde3d42b fix(qa): support neutral-cwd docker commands 2026-04-07 20:32:42 +01:00
Vincent Koc
816a3eae8a chore(qa): align qa cli provider input types 2026-04-07 20:32:42 +01:00
Vincent Koc
5aa4fd3216 fix(qa): normalize qa cli lane inputs 2026-04-07 20:32:42 +01:00
Vincent Koc
7d18b145f8 fix(qa): keep manual alternate model aligned 2026-04-07 20:32:42 +01:00
Vincent Koc
cdf18c16b4 fix(qa): default manual lanes by provider mode 2026-04-07 20:32:42 +01:00
Vincent Koc
3182588ad4 fix(qa): allow random qa-lab control-ui origins 2026-04-07 20:32:42 +01:00
Vincent Koc
82535771cd fix(qa): pin gateway child control ui root 2026-04-07 20:32:42 +01:00
Vincent Koc
f9f38a48e6 fix(qa): align mock model-switch continuity 2026-04-07 20:32:42 +01:00
Vincent Koc
9a106f7e3c fix(qa): support neutral-cwd suite runs 2026-04-07 20:32:42 +01:00
Vincent Koc
f93b217834 feat(qa): add manual harness lane 2026-04-07 20:32:42 +01:00
Vincent Koc
63e6bb026c fix(qa): isolate gateway child runtime 2026-04-07 20:32:42 +01:00
Vincent Koc
4f421fa0f1 fix(qa): harden frontier claude bakeoffs 2026-04-07 20:32:42 +01:00
Vincent Koc
18fb171179 feat(qa): add frontier harness bakeoff loop 2026-04-07 20:32:41 +01:00
Peter Steinberger
a3d5630232 test: stabilize scoped runners and qa ports 2026-04-07 15:28:46 +01:00
Peter Steinberger
ad605052bf refactor: dedupe provider lowercase helpers 2026-04-07 15:12:31 +01:00
Peter Steinberger
1cec37184c fix: harden qa memory dreaming sweep 2026-04-07 12:57:33 +01:00
Peter Steinberger
c541a9c110 Tests: fix flaky shard expectations 2026-04-07 12:22:51 +01:00
Peter Steinberger
524951e124 fix(ci): route qa-lab web imports through package barrels 2026-04-07 10:24:02 +01:00
Peter Steinberger
f2494aa33f feat: streamline qa lab live runs 2026-04-07 10:05:49 +01:00
Peter Steinberger
124cd5e307 feat: auto-reload qa lab fast refresh 2026-04-07 09:54:59 +01:00
Peter Steinberger
54a884865e feat: add fast qa lab ui refresh mode 2026-04-07 09:45:11 +01:00
Peter Steinberger
36aeef30c2 style: add padding to qa lab scenario list 2026-04-07 09:45:11 +01:00
Peter Steinberger
1baf5533aa feat(qa-lab): add Clawfather/Claw avatars and live-watch mode for scenario runs 2026-04-07 09:24:26 +01:00
Peter Steinberger
282188a326 fix(qa-lab): widen sidebar to 360px and allow scenario titles to wrap 2026-04-07 09:21:25 +01:00
Peter Steinberger
0e51f2f2ab fix(qa-lab): set themed background on app-shell so light mode covers dark body 2026-04-07 09:04:37 +01:00
Peter Steinberger
17085ec1a4 fix: make qa lab docker boot resilient 2026-04-07 09:04:18 +01:00
Vincent Koc
d5ed6d26e9 chore(plugins): bulk add package boundary tsconfig rollout 2026-04-07 08:48:23 +01:00
Peter Steinberger
7a2abb1c50 fix: stabilize qa lab docker builds 2026-04-07 06:06:29 +01:00
Peter Steinberger
a03e430248 refactor: dedupe core error helpers 2026-04-07 05:06:54 +01:00
Peter Steinberger
e169fcd263 refactor: dedupe qa and diff error formatting 2026-04-07 05:06:54 +01:00
Peter Steinberger
f6c3474342 fix(qa-lab): bump health timeout to 360s, add settle delay, show compose path in error 2026-04-06 22:51:06 +01:00
Peter Steinberger
80c8567f9d fix: resolve merge conflicts and preserve runtime test fixes 2026-04-06 22:46:33 +01:00
Peter Steinberger
cfebdee073 refactor: dedupe qa cli shutdown handling 2026-04-06 22:21:01 +01:00
Peter Steinberger
1a893132f6 refactor: dedupe qa mock input text extraction 2026-04-06 22:21:00 +01:00
Peter Steinberger
27dc1bd0fc fix(qa-lab): improve health timeout error message and fix port-free check 2026-04-06 19:35:18 +01:00