Commit Graph

601 Commits

Author SHA1 Message Date
Trevin Chow 8e0a61e602 Address PR review feedback (#900)
- Scope the orchestrator-direct-verification substitution to anchor-100 P2/P3
  findings only (verifiable from code alone, where an independent re-read adds
  nothing). P2/P3 at anchor 75 (judgment calls) and all P0/P1 still require the
  independent validator wave -- the orchestrator synthesized these findings, so
  it cannot supply the fresh second opinion that filters their false positives.
  Direct verification catches a wrong fact, not the orchestrator's own bias.
2026-06-03 09:15:41 -07:00
Trevin Chow 0d6a0f96d6 Address PR review feedback (#900)
- Make the subagent and validator templates path-aware: when a large review
  stages the diff/file-list to disk and passes paths (per the Stage 4 nudge),
  the child now Reads the staged path instead of treating the filename as the
  content to review. Without this, the path-staging mode the nudge introduced
  would feed reviewers/validators a bare filename in the Diff block and they'd
  analyze no hunks.
- Point the Stage 4 nudge at the diff / changed-files slots and note that the
  templates instruct the child to Read a staged path.
2026-06-03 01:03:58 -07:00
Trevin Chow 76637365ca fix(ce-code-review): let direct verification stand in for the validator wave on P2/P3-only sets
Both dogfood runs validated a P2/P3-only survivor set by orchestrator direct
first-party verification instead of the per-finding subagent wave, reading
"complement, not a skip / never replaces the wave for P0/P1" as "may replace
it for non-P0/P1." That read is reasonable -- each P2 was source-checkable and
the full suite ran -- so make it intended rather than emergent:

- Any P0/P1 survivor: the per-finding validator wave is required; direct
  verification only complements it.
- P2/P3-only survivors: direct first-party verification per finding may stand
  in for the wave when each finding is source-checkable; note the method in
  Coverage.

The stage still runs whenever findings survive (validation is never skipped);
only the method varies by severity. Contract 29/0.
2026-06-02 23:40:31 -07:00
Trevin Chow 331ce43c9e fix(ce-code-review): enforce pipe-table format on the Applied section
Two dogfood runs rendered the Applied section as Field:-prefixed blocks +
box-drawing separators instead of the mandated `| # | File | Fix | Reviewer |`
pipe table -- reproducibly, even on a faithful run that otherwise followed the
skill (loaded refs, used path-passing, applied + committed-on-clean, never
pushed, adapted commit scope to fix(cli):).

Root cause: the "never produce these shapes" rules and the format-verification
gate were scoped to "a finding's index" / "across severities" -- they never
named the Applied table, so the no-field-blocks rule read as findings-only.
(The review-output-template's Applied example was already a correct pipe table;
the always-loaded rules just didn't cover Applied.)

- Broaden the forbidden-shapes rules to every tabular section, Applied
  included, naming the #:/Fix: field-block labels the agent actually emitted.
- Harden the Applied item: pipe table, never Field:-blocks or box-drawing.
- Restate the concrete forbidden shapes at the format gate and call out the
  Applied table as the most common offender -- restating at the point of
  action for the proven-fragile format spot.

Contract 29/0.
2026-06-02 23:37:34 -07:00
Trevin Chow e3ed7b63c6 fix(ce-code-review): persist rendered report, harden artifact contract, soften commit scope
Three gaps surfaced by a dogfood review on a large PR (none from the lean
refactor itself):

- Persist the rendered report (report.md, default mode) in the run dir, so
  output format and numbering stay auditable after a run.
- State at the orchestrator level that each reviewer's artifact file must
  carry the detail-tier fields (why_it_matters, evidence) -- writing the
  compact return to the artifact strips detail that Coverage and the keyed
  detail lines depend on -- and that staging context to disk for a large
  diff never licenses a thinner reviewer contract.
- Soften the hardcoded fix(review): commit scope to "fix(review):, or the
  repo's nearest convention" -- target repos whose commit-lint has no
  "review" scope reject the literal form.

Contract 29/0; full suite unchanged (same pre-existing CLI failures).
2026-06-02 23:01:20 -07:00
Trevin Chow 588cd9d229 fix(ce-code-review): nudge large diffs to disk-staged paths, not inlined contents
The "pass file paths, not contents, for large material" rule lived only in
AGENTS.md (authoring context, invisible at runtime), while the runtime
SKILL.md told reviewers to receive the diff inline. So orchestrators that
staged a big diff to disk and passed reviewers the path were exercising
good judgment the skill never encoded -- emergent and unreliable.

Add a brief Stage 4 nudge: when inlining the diff/file-list into every
reviewer and validator prompt would be wasteful (many files / a big diff),
write them once into the run dir (full.diff, files.txt) and pass the paths;
inline a small diff directly. The trigger is the qualitative cost test, not
a magic number, and it's a nudge (the failure mode is wasted tokens, not a
broken review) -- so it stays lightweight.

Mode-safe: the staged diff is an artifact, not a workspace file, so passing
its path is valid even in pr-remote/branch-remote scope. Contract 29/0.
2026-06-02 21:21:11 -07:00
Trevin Chow f05d0d6815 refactor(ce-code-review): lean SKILL.md prose and de-duplicate references
Reduce the always-loaded SKILL.md body without changing behavior:

- Dedup the apply/commit/never-push model to a single home (Stage 5c);
  Operating principles, Output format, Action Routing, After Review, and
  the deprecated-mode note are now terse pointers.
- Consolidate the Stage 6 findings-format failure signatures, which were
  stated twice (the "never produce these shapes" list and the
  format-verification gate).
- Trim behavior-neutral rationale across Stage 5/5b/5c, keeping the
  reasons that guard named failures (gate-runs-late ordering, scope-mode
  inspection, per-finding validator independence).
- Collapse the "## Reviewers" tables to a compact roster with terse
  triggers; de-triplicate the data-migration spawn gate and the
  maintainability-owns-structure rule.
- Drop the redundant @-inline of review-output-template: Stage 6 already
  loads it on demand and keeps an inline fallback skeleton, so inlining
  it again was duplicate always-loaded weight.

persona-catalog and subagent-template stay @-inlined -- they are needed
in every full review, so deferring them would save little while adding
load-reliability risk on the dispatch contract.

SKILL.md 703 -> 671 lines, 9,205 -> 8,561 words. Contract tests 29/0;
full suite unchanged (same pre-existing CLI failures, zero new).
2026-06-02 20:37:10 -07:00
Trevin Chow 3eedade76f refactor(ce-code-review): lean apply model — safe self-apply interactive, report-only for callers (#881)
Co-authored-by: Cursor <cursoragent@cursor.com>
2026-06-02 20:08:44 -07:00
Trevin Chow 7c4bb16123 feat(skill): introduce CONCEPTS.md as shared vocabulary substrate (#838) 2026-06-02 00:38:03 -07:00
Trevin Chow 3e77a7bd84 fix(ce-resolve-pr-feedback): drop clustering, default to merit-based fixing (#893) 2026-06-01 17:24:36 -07:00
github-actions[bot] 422ffc77d1 chore: release main (#887) 2026-05-31 10:35:31 -07:00
Trevin Chow ece9fa1f1f fix(ce-plan): add answer-seeking disposition to universal planning (#886) 2026-05-31 10:34:34 -07:00
github-actions[bot] 85987d496f chore: release main (#874)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-05-28 12:23:52 -07:00
Trevin Chow b3e396d0bf fix(ce-plan): honor explicit external-research requests and route them by intent (#875) 2026-05-28 12:22:49 -07:00
Trevin Chow 253dba80dd fix(ce-sessions): emit repo root path instead of basename subshell (#873) 2026-05-27 15:21:06 -07:00
github-actions[bot] 2bab351c81 chore: release main (#872)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-05-27 11:17:53 -07:00
Trevin Chow e5e3fc3630 fix(ce-brainstorm,ce-plan): add conceptual-diagram affordance to brainstorm docs (#871) 2026-05-27 11:12:39 -07:00
github-actions[bot] 48cd9140b2 chore: release main (#869)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-05-27 09:54:04 -07:00
Trevin Chow 1051132d04 fix(html-rendering): constrain measure and surface execution notes (#870) 2026-05-27 09:52:22 -07:00
Trevin Chow 5c88212c1f fix(ce-brainstorm,ce-plan): restore default-on requirements grouping (#868) 2026-05-27 09:47:08 -07:00
github-actions[bot] e2c9cd2352 chore: release main (#852)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-05-26 10:02:33 -07:00
Trevin Chow 11e12e5739 feat(ce-plan,ce-brainstorm): contract-driven sections + optional HTML output (#826) 2026-05-26 09:53:14 -07:00
Trevin Chow 673dcfacb8 fix(simplify-code): guard against over-simplification and behavior drift (#859) 2026-05-24 22:03:03 -07:00
Trevin Chow 26a802551e fix(commit): auto-create feature branch on default branch (#856) 2026-05-22 21:53:59 -07:00
Trevin Chow 5297a9440f refactor(review): consolidate migration personas and trim stack reviewers (#854)
Co-authored-by: Cursor <cursoragent@cursor.com>
2026-05-21 22:37:42 -07:00
Trevin Chow 67d273622e fix(ce-commit-push-pr): require user-visible bug summaries (#853) 2026-05-21 18:42:02 -07:00
Kieran Klaassen 0aa6b55a80 feat(ce-dogfood-beta): add diff-scoped browser QA dogfood skill (#848)
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-21 12:14:02 -05:00
github-actions[bot] 08bb589903 chore: release main (#850)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-05-21 00:21:38 -07:00
Trevin Chow ac1c6d9a99 fix(ce-compound): drop date suffix from generated doc filenames (#849) 2026-05-21 00:20:43 -07:00
Brian Paden 796bea75b7 fix(agents): rename .agent.md to .md for VS Code Copilot tool access (#846)
Co-authored-by: Trevin Chow <trevin@trevinchow.com>
2026-05-21 00:18:17 -07:00
Trevin Chow 2a46670958 fix(ce-proof): update HITL flow for Proof v2 (#847) 2026-05-21 00:16:52 -07:00
github-actions[bot] f61d1b33ea chore: release main (#834)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-05-18 09:34:16 -07:00
Trevin Chow fd88fd8fd7 fix(ce-commit-push-pr): rewrite pr-description around a core principle (#841) 2026-05-17 13:30:00 -07:00
Trevin Chow 82b8af415d fix(ce-coherence-reviewer): remove Bash from tool allowlist (#837) 2026-05-16 01:45:10 -07:00
Trevin Chow 6fa1277e57 fix(ce-web-researcher): use any web tool, not just Claude built-ins (#836) 2026-05-16 00:30:10 -07:00
Trevin Chow 39cb9da3a1 fix(ce-plan): inline synthesis gate output into SKILL.md (#822) 2026-05-14 20:42:40 -07:00
github-actions[bot] 705f28a7a9 chore: release main (#831)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-05-14 10:54:13 -07:00
Trevin Chow 6df3f96580 fix(ce-brainstorm): scoping synthesis and Q&A interaction cleanup (#829) 2026-05-13 22:29:40 -07:00
github-actions[bot] b07aac55b5 chore: release main (#817)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-05-11 16:45:52 -07:00
Trevin Chow 60c1c938d5 fix(ce-plan): compress synthesis confirmation to prose + call-outs (#819) 2026-05-11 12:09:52 -07:00
Trevin Chow d090bde0ff fix(ce-code-review): replace resolve-base.sh with prose-driven base detection (#815) 2026-05-10 16:23:19 -07:00
github-actions[bot] 85548a5cff chore: release main (#814)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-05-10 12:53:45 -07:00
Trevin Chow 9b45a83d7e feat(ce-compound): add mode:headless for non-interactive use (#813) 2026-05-10 12:50:12 -07:00
github-actions[bot] 834ca4e58a chore: release main (#808)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-05-08 16:50:44 -07:00
Trevin Chow 07a6d52879 fix(ce-resolve-pr-feedback): paginate GraphQL connections (#807) 2026-05-08 15:18:12 -07:00
github-actions[bot] dbac0245b5 chore: release main (#806)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-05-08 14:43:50 -07:00
Trevin Chow 62279b0583 refactor(skills): extract conditional content to references (#804) 2026-05-08 13:51:56 -07:00
Trevin Chow 81710efad5 fix(ce-sessions): unblock session-history on Claude Code (#800) 2026-05-08 13:51:33 -07:00
github-actions[bot] 888c98a38d chore: release main (#797)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-05-08 09:41:58 -07:00
Trevin Chow 0e49506bf0 refactor(agent-descriptions): trim top 7 by ~25% (#803) 2026-05-08 09:31:03 -07:00