Commit Graph

268 Commits

Author SHA1 Message Date
Peter Steinberger
feb3cc70fb perf: narrow qa bus test imports 2026-04-24 03:44:56 +01:00
Peter Steinberger
903308dbf2 fix: stabilize qa lab mock suite 2026-04-24 02:46:33 +01:00
Peter Steinberger
f523bbfcd1 fix: align claude cli permissions with exec policy
Derive Claude CLI bypass mode from OpenClaw exec YOLO policy, preserve raw Claude permission-mode overrides, update docs/changelog, and cover global/per-agent policy behavior.
2026-04-23 23:11:34 +01:00
Peter Steinberger
f5042adf27 feat: add forked subagent context 2026-04-23 21:28:58 +01:00
Peter Steinberger
ff56a9d41b test(openai): prefer canonical GPT refs 2026-04-23 20:47:39 +01:00
Peter Steinberger
cd5bc2fc93 test(openai): cover GPT-5.5 defaults 2026-04-23 20:19:15 +01:00
Peter Steinberger
a36903b94c feat(openai): default to GPT-5.5 2026-04-23 20:19:15 +01:00
Ayaan Zaidi
1bd8c5f362 fix(qa): timestamp telegram update batches 2026-04-23 14:33:17 +05:30
Ayaan Zaidi
c22a21759b test(qa): report telegram reply rtt 2026-04-23 14:33:17 +05:30
joshavant
d3a8480093 qa-live: simplify telegram pass progress log lines 2026-04-23 00:37:03 -05:00
joshavant
e6d0342629 qa-live: stream telegram scenario progress logs in realtime 2026-04-23 00:19:43 -05:00
Peter Steinberger
86ace805b7 fix(qa): align telegram commands live assertion 2026-04-23 06:04:06 +01:00
joshavant
c2f0559829 qa-live: tag telegram observed messages with scenario context 2026-04-22 23:52:06 -05:00
joshavant
754577b43e qa-live: keep telegram failure details with public metadata redaction 2026-04-22 23:36:07 -05:00
Peter Steinberger
0b0662b1c9 chore: apply extension lint cleanups 2026-04-23 05:30:49 +01:00
Josh Avant
01e18b6e3b Add maintainer-gated Telegram live QA workflow with Convex hardening (#70427) 2026-04-22 23:17:09 -05:00
Peter Steinberger
9937452405 build: enable more zero-baseline oxlint rules 2026-04-23 05:03:58 +01:00
Peter Steinberger
d0d018bdad fix(qa): restore agentic parity tool runtime 2026-04-23 04:22:03 +01:00
Peter Steinberger
5e172b3888 fix(qa): preserve image parity plugin allowlist 2026-04-23 03:49:04 +01:00
Peter Steinberger
74dfeaae0d fix(qa): preserve image generation plugin allowlist 2026-04-23 02:55:22 +01:00
Peter Steinberger
f4c4e940a6 test(qa): stabilize lab catalog abort fixture 2026-04-22 23:36:34 +01:00
Peter Steinberger
03c1fff8f6 test(qa): add OpenAI native web search live scenario 2026-04-22 23:06:55 +01:00
Vincent Koc
4c675216f1 fix(qa): deflake parity approval preflight 2026-04-22 13:43:29 -07:00
Val Alexander
dab46a7e98 qa: harden parity gate execution (#70045) 2026-04-22 03:08:25 -05:00
Peter Steinberger
dd9adc57c2 test: harden qa parity runtime staging 2026-04-22 08:52:55 +01:00
Peter Steinberger
8bfb4024f6 test: harden qa parity config cleanup 2026-04-22 08:52:55 +01:00
Peter Steinberger
764bb310f7 ci: pin qa parity tool profile 2026-04-22 08:52:55 +01:00
Peter Steinberger
66576f3355 test(extensions): fix lint-clean test assertions 2026-04-22 01:43:18 +01:00
Peter Steinberger
bcfa781a1b fix: remap thinking levels on model switch 2026-04-21 18:53:49 +01:00
Peter Steinberger
05ba1335d9 fix: tolerate qa cli json startup logs 2026-04-21 07:37:48 +01:00
Peter Steinberger
6ce17db11a fix: gate max thinking by model support 2026-04-21 07:02:43 +01:00
Peter Steinberger
f5be489266 test: add gpt-5.4 thinking visibility QA 2026-04-21 06:13:39 +01:00
Peter Steinberger
f1a544ef6d perf: avoid sort-for-single selection 2026-04-20 23:20:31 +01:00
Peter Steinberger
dc4e90bbd2 fix(qa-lab): restore transport helper contracts 2026-04-20 21:03:34 +01:00
Peter Steinberger
fbba29319f refactor: share qa credential helpers 2026-04-20 21:00:42 +01:00
Peter Steinberger
0882b85d5a refactor: share qa runtime helpers 2026-04-20 20:58:28 +01:00
Peter Steinberger
6fbfb8b7a3 test: share character eval fixtures 2026-04-20 20:56:30 +01:00
Peter Steinberger
f587887122 test: share qa temp dir harness 2026-04-20 20:54:45 +01:00
Peter Steinberger
b97f993b0c test: share qa docker healthy deps 2026-04-20 18:07:04 +01:00
Peter Steinberger
98a5f737d7 perf(test): streamline qa gateway child tests 2026-04-20 16:14:15 +01:00
Peter Steinberger
1148f245c8 build(deps): declare extension runtime dependencies 2026-04-20 16:07:14 +01:00
Peter Steinberger
09171eee8d perf(test): shorten qa channel readiness polling 2026-04-20 16:00:59 +01:00
Peter Steinberger
e50b8e3e99 perf(test): mock qa lab capture store 2026-04-20 15:55:08 +01:00
Peter Steinberger
372ca5e81e perf(test): skip manual lane settle delay in unit tests 2026-04-20 15:51:28 +01:00
Peter Steinberger
44030ac4fd perf(test): cache qa scenario catalog 2026-04-20 15:17:07 +01:00
Peter Steinberger
beff874340 perf(test): trim active memory and qa lab hotspots 2026-04-20 15:04:38 +01:00
Peter Steinberger
869950564f build: update dependencies 2026-04-20 13:18:32 +01:00
Peter Steinberger
8116e638f3 chore: release 2026.4.20 2026-04-20 13:16:40 +01:00
Gustavo Madeira Santana
ea37a833dc test(matrix): add stale sync replay dedupe scenario 2026-04-20 05:51:04 -04:00
Josh Avant
d5b326523f qa-lab: make live lanes CI-ready for v1 E2E automation (#69122)
* qa-lab: harden CI defaults and failure semantics for live lanes

* qa-lab: add unit tests for suite progress logging defaults

* qa-lab: cover malformed multipass summary edge cases

* qa-lab: share suite summary failure counting helper

* qa-lab: test allow-failures parse wiring and sanitize progress ids

* fix: note qa CI live-lane defaults in changelog (#69122) (thanks @joshavant)
2026-04-19 21:13:27 -05:00