@@ -22,14 +22,14 @@ pre-PR use. Designed for iterative review/revision cycles before submitting a PR
2222 files (default: 200000). Changed source files are always included regardless of budget.
2323- ` --force-fresh ` : Skip delta-diff mode, run a full fresh review even if previous state exists
2424- ` --full-registry ` : Include the entire REGISTRY.md instead of selective sections
25- - ` --model <name> ` : Override the OpenAI model (default: ` gpt-5.5 ` )
26- - ` --timeout <seconds> ` : HTTP request timeout. If omitted, defaults to 900 for reasoning models (gpt-5.4, gpt-5.5, * -pro, o1/o3/o4) and 300 otherwise.
25+ - ` --model <name> ` : Override the OpenAI model (default: ` gpt-5.4 ` )
26+ - ` --timeout <seconds> ` : HTTP request timeout. If omitted, defaults to 900 for reasoning models (gpt-5.4, * -pro, o1/o3/o4) and 300 otherwise.
2727- ` --dry-run ` : Print the compiled prompt without calling the API
2828
29- ** Reasoning models** (` gpt-5.5 ` , ` gpt-5.5 -pro` , ` o3 ` , ` o4-mini ` , etc.): Reviews may take 10-15
29+ ** Reasoning models** (` gpt-5.4 -pro ` , ` o3 ` , ` o4-mini ` , etc.): Reviews may take 10-15
3030minutes. For deep reviews with reasoning models, combine ` --token-budget ` with ` --model ` :
3131```
32- /ai-review-local --model gpt-5.5 -pro --token-budget 500000 --context deep
32+ /ai-review-local --model gpt-5.4 -pro --token-budget 500000 --context deep
3333```
3434
3535## Constraints
@@ -47,7 +47,7 @@ before any data is sent externally.
4747### Step 1: Parse Arguments
4848
4949Parse ` $ARGUMENTS ` for the optional flags listed above. All flags are optional —
50- the default behavior (standard context, selective registry, gpt-5.5 , live API call)
50+ the default behavior (standard context, selective registry, gpt-5.4 , live API call)
5151requires no arguments.
5252
5353### Step 2: Validate Prerequisites
@@ -334,15 +334,10 @@ python3 .claude/scripts/openai_review.py \
334334Note: ` --force-fresh ` is a skill-only flag — it controls whether delta diffs are
335335generated in Step 4 and is NOT passed to the script.
336336
337- ** Reasoning model handling:** Resolve the effective model first — ` effective_model ` is
338- the value of ` --model ` if the user provided one, otherwise the script default ` gpt-5.5 ` .
339- The ` --model ` , ` --timeout ` , and ` --dry-run ` flags pass through to the script when provided.
340-
341- If ` effective_model ` contains ` -pro ` , starts with ` o1 ` /` o3 ` /` o4 ` , or starts with
342- ` gpt-5.4 ` /` gpt-5.5 ` (e.g., ` gpt-5.5 ` , ` gpt-5.5-pro ` , ` o3 ` , ` o4-mini ` ):
343- - The script's ` _resolve_timeout() ` already auto-selects 900s for these models when
344- ` --timeout ` is omitted, so no wrapper timeout pass-through is required. (Passing
345- ` --timeout 900 ` explicitly remains harmless and is fine for backward compatibility.)
337+ ** Reasoning model handling:** If the model is ` gpt-5.4 ` , contains ` -pro ` , or starts with
338+ ` o1 ` /` o3 ` /` o4 ` (e.g., ` gpt-5.4 ` , ` gpt-5.4-pro ` , ` o3 ` , ` o4-mini ` ):
339+ - The script auto-resolves ` --timeout ` to 900s for reasoning models when omitted, so
340+ no extra flag is required unless overriding
346341- Run the Bash command with ` run_in_background: true ` (bypasses the 600s Bash tool timeout cap)
347342- After the background command completes, continue to Step 6
348343
@@ -397,15 +392,15 @@ Review passed with no findings. Suggested next steps:
397392- /submit-pr — commit and open a pull request
398393```
399394
400- ** For ⛔ or ⚠️ (P0/P1/P2 findings)** :
395+ ** For ⛔ or ⚠️ (P0/P1 findings)** :
401396```
402397Options:
4033981. Enter plan mode to address findings (Recommended)
4043992. Re-run with --full-registry for deeper methodology context
4054003. Skip — I'll address these manually
406401```
407402
408- ** For ✅ with P3 findings only** :
403+ ** For ✅ with P2/ P3 findings only** :
409404```
410405Options:
4114061. Address findings before submitting
@@ -414,8 +409,8 @@ Options:
414409
415410** If user chooses to address findings** : Parse the findings from the review output.
416411The review context is already in the conversation. Start addressing the findings
417- directly — for P0/P1/P2 issues use ` EnterPlanMode ` for a structured approach; for
418- P3 issues, fix them directly since they are minor.
412+ directly — for P0/P1 issues use ` EnterPlanMode ` for a structured approach; for P2/P3
413+ issues, fix them directly since they are minor.
419414
420415After fixes are committed, the user re-runs ` /ai-review-local ` for a follow-up review.
421416On re-review, the script automatically activates delta-diff mode (comparing only
@@ -472,7 +467,7 @@ runs `--force-fresh` or when a rebase invalidates the tracked commit.
472467/ai-review-local --model gpt-4.1 --full-registry
473468
474469# Deep review with reasoning model (may take 10-15 minutes)
475- /ai-review-local --model gpt-5.5 -pro --token-budget 500000 --context deep
470+ /ai-review-local --model gpt-5.4 -pro --token-budget 500000 --context deep
476471
477472# Limit token budget for faster/cheaper reviews
478473/ai-review-local --token-budget 100000
@@ -502,9 +497,8 @@ runs `--force-fresh` or when a rebase invalidates the tracked commit.
502497- The review criteria are adapted from ` .github/codex/prompts/pr_review.md ` (same
503498 methodology axes, severity levels, and anti-patterns) but framed for local
504499 code-change review rather than PR review
505- - The CI review (single-shot Responses API, same architecture as local but with
506- ` --ci-mode ` and ` --full-registry ` ) remains the authoritative final check — local
507- review is a fast first pass to catch most issues early
500+ - The CI review (Codex action with full repo access) remains the authoritative final
501+ check — local review is a fast first pass to catch most issues early
508502- ** Data transmission** : In non-dry-run mode, this skill transmits the unified diff,
509503 changed-file metadata, full source file contents (in standard/deep mode),
510504 import-context files (in deep mode), selected methodology registry text, and
0 commit comments