Skip to content

Commit fe8e717

Browse files
authored
Merge pull request #416 from igerber/ai-review-redo
Revert AI review CI to Codex + gpt-5.4 (reverts #404, #415)
2 parents 2f77f55 + 15f6a3d commit fe8e717

6 files changed

Lines changed: 334 additions & 994 deletions

File tree

.claude/commands/ai-review-local.md

Lines changed: 16 additions & 22 deletions
Original file line numberDiff line numberDiff line change
@@ -22,14 +22,14 @@ pre-PR use. Designed for iterative review/revision cycles before submitting a PR
2222
files (default: 200000). Changed source files are always included regardless of budget.
2323
- `--force-fresh`: Skip delta-diff mode, run a full fresh review even if previous state exists
2424
- `--full-registry`: Include the entire REGISTRY.md instead of selective sections
25-
- `--model <name>`: Override the OpenAI model (default: `gpt-5.5`)
26-
- `--timeout <seconds>`: HTTP request timeout. If omitted, defaults to 900 for reasoning models (gpt-5.4, gpt-5.5, *-pro, o1/o3/o4) and 300 otherwise.
25+
- `--model <name>`: Override the OpenAI model (default: `gpt-5.4`)
26+
- `--timeout <seconds>`: HTTP request timeout. If omitted, defaults to 900 for reasoning models (gpt-5.4, *-pro, o1/o3/o4) and 300 otherwise.
2727
- `--dry-run`: Print the compiled prompt without calling the API
2828

29-
**Reasoning models** (`gpt-5.5`, `gpt-5.5-pro`, `o3`, `o4-mini`, etc.): Reviews may take 10-15
29+
**Reasoning models** (`gpt-5.4-pro`, `o3`, `o4-mini`, etc.): Reviews may take 10-15
3030
minutes. For deep reviews with reasoning models, combine `--token-budget` with `--model`:
3131
```
32-
/ai-review-local --model gpt-5.5-pro --token-budget 500000 --context deep
32+
/ai-review-local --model gpt-5.4-pro --token-budget 500000 --context deep
3333
```
3434

3535
## Constraints
@@ -47,7 +47,7 @@ before any data is sent externally.
4747
### Step 1: Parse Arguments
4848

4949
Parse `$ARGUMENTS` for the optional flags listed above. All flags are optional —
50-
the default behavior (standard context, selective registry, gpt-5.5, live API call)
50+
the default behavior (standard context, selective registry, gpt-5.4, live API call)
5151
requires no arguments.
5252

5353
### Step 2: Validate Prerequisites
@@ -334,15 +334,10 @@ python3 .claude/scripts/openai_review.py \
334334
Note: `--force-fresh` is a skill-only flag — it controls whether delta diffs are
335335
generated in Step 4 and is NOT passed to the script.
336336

337-
**Reasoning model handling:** Resolve the effective model first — `effective_model` is
338-
the value of `--model` if the user provided one, otherwise the script default `gpt-5.5`.
339-
The `--model`, `--timeout`, and `--dry-run` flags pass through to the script when provided.
340-
341-
If `effective_model` contains `-pro`, starts with `o1`/`o3`/`o4`, or starts with
342-
`gpt-5.4`/`gpt-5.5` (e.g., `gpt-5.5`, `gpt-5.5-pro`, `o3`, `o4-mini`):
343-
- The script's `_resolve_timeout()` already auto-selects 900s for these models when
344-
`--timeout` is omitted, so no wrapper timeout pass-through is required. (Passing
345-
`--timeout 900` explicitly remains harmless and is fine for backward compatibility.)
337+
**Reasoning model handling:** If the model is `gpt-5.4`, contains `-pro`, or starts with
338+
`o1`/`o3`/`o4` (e.g., `gpt-5.4`, `gpt-5.4-pro`, `o3`, `o4-mini`):
339+
- The script auto-resolves `--timeout` to 900s for reasoning models when omitted, so
340+
no extra flag is required unless overriding
346341
- Run the Bash command with `run_in_background: true` (bypasses the 600s Bash tool timeout cap)
347342
- After the background command completes, continue to Step 6
348343

@@ -397,15 +392,15 @@ Review passed with no findings. Suggested next steps:
397392
- /submit-pr — commit and open a pull request
398393
```
399394

400-
**For ⛔ or ⚠️ (P0/P1/P2 findings)**:
395+
**For ⛔ or ⚠️ (P0/P1 findings)**:
401396
```
402397
Options:
403398
1. Enter plan mode to address findings (Recommended)
404399
2. Re-run with --full-registry for deeper methodology context
405400
3. Skip — I'll address these manually
406401
```
407402

408-
**For ✅ with P3 findings only**:
403+
**For ✅ with P2/P3 findings only**:
409404
```
410405
Options:
411406
1. Address findings before submitting
@@ -414,8 +409,8 @@ Options:
414409

415410
**If user chooses to address findings**: Parse the findings from the review output.
416411
The review context is already in the conversation. Start addressing the findings
417-
directly — for P0/P1/P2 issues use `EnterPlanMode` for a structured approach; for
418-
P3 issues, fix them directly since they are minor.
412+
directly — for P0/P1 issues use `EnterPlanMode` for a structured approach; for P2/P3
413+
issues, fix them directly since they are minor.
419414

420415
After fixes are committed, the user re-runs `/ai-review-local` for a follow-up review.
421416
On re-review, the script automatically activates delta-diff mode (comparing only
@@ -472,7 +467,7 @@ runs `--force-fresh` or when a rebase invalidates the tracked commit.
472467
/ai-review-local --model gpt-4.1 --full-registry
473468

474469
# Deep review with reasoning model (may take 10-15 minutes)
475-
/ai-review-local --model gpt-5.5-pro --token-budget 500000 --context deep
470+
/ai-review-local --model gpt-5.4-pro --token-budget 500000 --context deep
476471

477472
# Limit token budget for faster/cheaper reviews
478473
/ai-review-local --token-budget 100000
@@ -502,9 +497,8 @@ runs `--force-fresh` or when a rebase invalidates the tracked commit.
502497
- The review criteria are adapted from `.github/codex/prompts/pr_review.md` (same
503498
methodology axes, severity levels, and anti-patterns) but framed for local
504499
code-change review rather than PR review
505-
- The CI review (single-shot Responses API, same architecture as local but with
506-
`--ci-mode` and `--full-registry`) remains the authoritative final check — local
507-
review is a fast first pass to catch most issues early
500+
- The CI review (Codex action with full repo access) remains the authoritative final
501+
check — local review is a fast first pass to catch most issues early
508502
- **Data transmission**: In non-dry-run mode, this skill transmits the unified diff,
509503
changed-file metadata, full source file contents (in standard/deep mode),
510504
import-context files (in deep mode), selected methodology registry text, and

0 commit comments

Comments
 (0)