Commit Graph

487 Commits

Author SHA1 Message Date
Trevin Chow 06aa4e00a4 fix(ce-code-review): close walk-through Apply gap and align Step 3 ordering
Address PR #685 review feedback:

- walkthrough.md: add a "No suggested_fix (Apply suppressed)" adaptation
  so the entry-point menu omits Apply for findings without a concrete
  suggested_fix. Stage 5 step 6b already maps these to a Defer
  recommendation; this matches the suppression applied during the
  post-run "Walk through these one at a time" re-entry path so the
  same handling applies regardless of which entry path the user came
  in through.
- SKILL.md Step 3: extend the homogeneous queue contract with a
  defensive backstop. If a no-suggested_fix finding ever slips into
  the walk-through Apply set, the fixer routes it to failed with
  reason "no fix proposed by reviewer", mirroring the heterogeneous
  queue's apply-or-fail handling. Autofix and headless callers are
  unaffected (they only ever process safe_auto items).
- SKILL.md Step 3: update the "Best-judgment path is single-pass"
  paragraph to reference Step 2 option B's two-branch ordering
  (failed empty -> emit report directly; non-empty -> question first,
  then execute, then report). Removes a contradiction the prior
  reconciliation commit (530560a3) didn't catch.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-25 15:45:30 -07:00
Trevin Chow 33e00c4047 fix(ce-code-review): rename LFG path to best-judgment to avoid /lfg collision
The interactive routing option B was labeled "LFG" in user-facing prose
and "the LFG path" in skill internals. That name collides with the /lfg
macro skill in this repo, creating confusion when an agent reads the
ce-code-review skill while also having /lfg in scope.

Rename:

- User-facing menu (option B): "LFG. Apply the agent's best-judgment
  action per finding" -> "Auto-resolve with best judgment -- apply
  per-finding fixes the agent can defend, surface the rest"
- User-facing per-finding option D: "LFG the rest" -> "Auto-resolve
  with best judgment on the rest"
- Internal naming: "LFG path" / "LFG routing (option B)" ->
  "best-judgment path" / "best-judgment routing (option B)"
- Routing tables, Step 3 fixer queue prose, completion-report bucket
  list, and other internal references updated to match
- Test assertions updated against the new labels

The user-facing label leads with "Auto-resolve" so the action is clear
on scan; "with best judgment" qualifies how decisions are made. The
internal name uses "best-judgment" (not "auto-resolve") to avoid
lexical collision with the existing mode:autofix flag, since the two
do different things and live at different structural levels --
autofix is a top-level mode while best-judgment is a routing option
within interactive mode.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-25 15:30:46 -07:00
Trevin Chow 530560a3eb fix(ce-code-review): close LFG fixer gap and reconcile walkthrough
Address PR #685 review feedback:

- Step 3 fixer queue contract previously defined behavior for
  gated_auto/manual WITH suggested_fix and manual WITHOUT, but left
  gated_auto WITHOUT suggested_fix undefined. Extend the no-fix branch
  to cover both gated_auto and manual so they route to failed instead
  of being silently skipped, preserving the apply-or-fail contract.

- walkthrough.md still documented LFG-the-rest -> Proceed/Cancel and
  end-of-loop dispatch semantics from the removed bulk-preview path.
  Reconcile: scope end-of-walk-through dispatch to the run-to-completion
  path and explicitly point to the LFG-the-rest bullet for that path's
  single-dispatch semantics. Remove stale Proceed/Cancel framings and
  the bulk-preview reference in the no-sink adaptation.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-25 10:24:09 -07:00
Trevin Chow f1e36e990b fix(ce-code-review): drop LFG defer-bias and approval gates
LFG previously routed too many findings to ticket-filing when the
agent could propose concrete fixes from review context. Stage 5b
validation and bulk-preview approval ran before any action, requiring
upfront research the user already opted out of by picking LFG.

Push personas to commit to a `suggested_fix` whenever any defensible
code change is reachable from the diff -- imperfect information is
not grounds for omission. `manual` findings with `suggested_fix` now
recommend Apply (was Defer). Drop Stage 5b and bulk-preview on the
LFG path; dispatch the fixer immediately on the full pending set.
Items that fail to apply or lack a proposed fix surface in a `failed`
bucket with a one-line reason; one post-run question (file tickets /
walk through / ignore) handles them only when the bucket is
non-empty, fired before the completion report so it reflects final
state.

Walk-through option A inherits the action-derivation change
automatically -- per-finding recommendations flip from Defer to Apply
when the persona proposed a fix. `LFG the rest` from inside the
walk-through dispatches one fixer pass on the union of accumulated
Apply set and remaining undecided findings, preserving the
"one fixer, consistent tree" contract.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-25 01:59:04 -07:00
Trevin Chow 1284290af2 fix(ce-debug): delegate commit/PR and add branch check (#683) 2026-04-24 20:54:06 -07:00
github-actions[bot] ea8721eb21 chore: release main (#680)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-04-24 15:16:49 -07:00
Trevin Chow 304a975d02 feat(ce-brainstorm): probe rigor gaps with prose before Phase 2 (#677)
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-24 15:13:35 -07:00
Trevin Chow bc8ae1a6b5 fix(main): recover version drift, fix stale test, document learnings (#678)
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-24 15:07:49 -07:00
Kieran Klaassen 47350c3e4e fix(ce-test-browser): skip headed/headless question in pipeline mode
Agents spawned from LFG were blocking forever at the AskUserQuestion
prompt with no user present to respond. In mode:pipeline, default to
headless and skip step 2 entirely.

Bump 3.0.6 -> 3.0.7

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-24 13:17:07 -07:00
Kieran Klaassen 22d493b192 feat(ce-test-browser): gate port scan and auto-start on pipeline mode
- Port scan (find_free_port) only runs when PIPELINE_MODE=1
- Dev server auto-start only runs in pipeline mode; manual invocations
  print a help message and stop
- LFG step 6 now passes mode:pipeline to ce-test-browser so parallel
  agents claim non-colliding ports automatically
- Bump version 3.0.5 -> 3.0.6

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-24 12:59:50 -07:00
Kieran Klaassen f8720da3d1 feat(ce-test-browser): free-port scan and auto-server start
- Always verify preferred port is free; scan upward until finding one
- Auto-start dev server (bin/dev / rails server / npm run dev) on the
  claimed port if nothing is listening — no more "please start your server"
- Pass PORT= explicitly so parallel agents on the same machine never
  collide on 3000

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-24 12:56:31 -07:00
Kieran Klaassen 1f20c3842d feat(lfg): add ce-commit-push-pr step and remove ralph-loop
- Add ce-commit-push-pr as step 7 so LFG ends with a pushed branch and open PR
- Remove optional ralph-loop step (step 1) -- simplifies the pipeline
- Renumber all steps and fix cross-references accordingly
- Bump version 3.0.3 -> 3.0.4

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-24 12:18:26 -07:00
github-actions[bot] bc3709fc53 chore: release main (#675)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-04-24 07:24:56 -07:00
Trevin Chow f0433d9150 fix(ce-ideate): sharpen bug intent, surprise-me dispatch, and drop authoring refs (#672) 2026-04-24 02:21:21 -07:00
github-actions[bot] 6b5da46ccd chore: release main (#661)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-04-24 02:06:18 -07:00
Trevin Chow 6514b1fce5 feat(ce-ideate): subject gate, surprise-me, and warrant contract (#671)
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-24 02:04:21 -07:00
Trevin Chow 494313e8eb fix(ce-brainstorm): enforce Interaction Rules in universal flow (#669) 2026-04-23 23:56:35 -07:00
Trevin Chow c33bf70f46 fix(skills): plan is a decision artifact; progress comes from git (#666) 2026-04-23 23:12:12 -07:00
Trevin Chow 9ddcd22aee fix(ce-demo-reel): prevent secrets in recorded demos (#664)
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-23 21:52:52 -07:00
Trevin Chow 75cf4d603d feat(ce-commit-push-pr): skip evidence prompt when judgment allows (#663) 2026-04-23 16:59:53 -07:00
Trevin Chow 351d12ec5b fix(ce-update): compare against main plugin.json, not release tags (#660) 2026-04-23 14:36:58 -07:00
github-actions[bot] 5e6ec41b95 chore: release main (#657)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-04-22 20:15:57 -07:00
Trevin Chow a9fd8421f4 fix(ce-proof): correct op shapes and add retry/batch discipline (#658) 2026-04-22 19:52:02 -07:00
Trevin Chow b9ae6b758d fix(ce-update): replace cache sweep with claude plugin update (#656) 2026-04-22 18:23:15 -07:00
github-actions[bot] 7e83755acb chore: release main (#596)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-04-22 14:21:19 -07:00
Trevin Chow 5eb62a7d0e refactor(agents): restrict tools allowlist on research agents (#650)
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 12:23:27 -07:00
Trevin Chow 23dc11b95a feat(ce-setup): check for ast-grep CLI and agent skill (#653) 2026-04-22 11:29:20 -07:00
Luca Henn fdf5fe4af5 feat(ce-demo-reel): add local save as alternative to catbox upload (#647) 2026-04-22 11:28:44 -07:00
Trevin Chow 7ddfbed33b feat(pi): first-class support via pi-subagents + pi-ask-user (#651)
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 10:26:29 -07:00
Trevin Chow cce95fb814 feat(ce-debug): environment sanity, assumption audit, more techniques (#649)
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 09:08:24 -07:00
Trevin Chow 6155b9de3c fix(ce-update): derive cache dir from CLAUDE_PLUGIN_ROOT parent (#645)
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 08:42:28 -07:00
Trevin Chow 86d9a2c55f fix(ce-debug): stop hanging handoffs and read full issue thread (#646) 2026-04-22 01:51:53 -07:00
Trevin Chow 01d5e8fb73 docs(readme): focus readmes on compound engineering (#644) 2026-04-21 23:33:21 -07:00
Trevin Chow 13f95ba639 fix(skills): cap skill descriptions at harness limit (#643) 2026-04-21 21:56:15 -07:00
Trevin Chow 5a26a8fbd3 refactor(ce-code-review): anchored confidence, staged validation, and model tiering (#641)
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-21 21:04:29 -07:00
Trevin Chow b104ce46be fix(lfg): use platform-neutral skill references (#642) 2026-04-21 20:08:48 -07:00
Trevin Chow accbd2adcf refactor(todos): remove internal file-based todo system (#635)
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-21 18:16:13 -07:00
Trevin Chow 19bbb60e90 refactor(skills): remove ce-onboarding skill (#639)
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-21 18:15:36 -07:00
Trevin Chow 701ae10c2d feat(ce-code-review): add Swift/iOS stack-specific reviewer persona (#638)
Co-authored-by: Joshua Martens <joshua@every.to>
Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
2026-04-21 17:30:04 -07:00
Trevin Chow 271b1a4458 refactor(skills): remove 5 unused skills and clean references (#634) 2026-04-21 18:52:29 -05:00
Trevin Chow ff0eee391e refactor(ce-brainstorm): make doc review opt-in in Phase 4 handoff (#633) 2026-04-21 16:28:20 -07:00
Trevin Chow 44ce9dd127 feat(ce-plan): add U-IDs and origin trace to plan template (#632) 2026-04-21 16:07:28 -07:00
Trevin Chow 21666f0b67 refactor(ce-worktree): trim to creation-only and fix latent bugs (#631) 2026-04-21 15:15:09 -07:00
Trevin Chow 6caf330363 refactor(ce-doc-review): anchor-based confidence scoring (#622)
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-21 14:54:03 -07:00
Trevin Chow bd77d5550a feat(ce-brainstorm): product-tier with end-to-end ID traceability (#629) 2026-04-21 14:07:20 -07:00
Trevin Chow 05ea109bdb fix(ce-learnings-researcher): drop unreadable schema path reference (#630) 2026-04-21 14:00:27 -07:00
Trevin Chow 4c57508c1a refactor(agents): flatten agents directory (#621) 2026-04-21 02:35:21 -07:00
Trevin Chow d359cc7e2f fix(question-tool): stop silent skips when tool looks unavailable (#620) 2026-04-21 01:27:52 -07:00
Trevin Chow cd4af86e5e refactor(session-history): move extraction scripts behind skills (#619) 2026-04-21 00:12:11 -07:00
Trevin Chow e0f2a4f9d7 feat(ce-proof): broaden triggers and surface markdown viewing (#618)
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-20 21:11:07 -07:00