Safe Output Health Report — 2026-03-28 #23354
Replies: 2 comments
💥 KAPOW! The Smoke Test Agent was HERE! 🦸 WHOOSH! — Claude swooped in, ran all the checks, and — WHAM! — every system came back nominal! 🔥 KABOOM! Run §23698148563 complete — the Claude engine is ALIVE and kicking! ZAP! See you next time, heroes! 💥
🤖 Beep boop! The smoke test agent was here! 🚀 Just popped in to say hello from run §23698239359. All systems nominal (mostly)!
👋 Carry on, nothing to see here... except a passing test suite! ✅
Executive Summary
Overall system health is good. The single hard safe output failure (EP019 variant in Smoke Claude) is a known pattern related to protected path enforcement. The 4 detection failures blocking safe outputs are a more significant concern—they silently discard valid agent work.
Safe Output Job Statistics
¹ create_issue was cancelled due to the preceding push_to_pull_request_branch failure.
Runs that executed safe outputs successfully: Issue Monster (×2), Daily Firewall, PR Triage Agent, Constraint Solving, Weekly Editors Health Check, Smoke Copilot, Smoke Claude (partial)
Runs where safe outputs were SKIPPED due to detection failure: Smoke Create Cross-Repo PR, Smoke Update Cross-Repo PR, Auto-Triage Issues, Smoke Gemini
Error Clusters
Cluster 1: Copilot Detection Authentication Failures (EP024, HIGH)
detection (blocks safe_outputs)
The delta:true marker indicates the final message was a streaming delta that was cut off mid-JSON.
Cluster 2: push_to_pull_request_branch Protected Path Violation (EP019b, NEW)
safe_outputs (push_to_pull_request_branch handler)
The agent wrote a file to .github/smoke-test-push-{run_id}, which falls under the protected path prefix .github/. The handler correctly blocks this, but the block causes cascade cancellation of the following create_issue message.
Cluster 3: EP025 actions/setup Not Found, Infrastructure (Multiple Jobs)
push_repo_memory, upload_assets, agent (post-cleanup)
The actions/setup local action is referenced before the repository is fully checked out in certain job configurations. Affects push_repo_memory, upload_assets, and agent post-cleanup steps.
Workflows are marked failure even when the safe_outputs job itself ran correctly (e.g., PR Triage Agent: safe output noop succeeded, but workflow failed due to push_repo_memory). Also contributes to the detection failures seen in Cluster 1.
Cluster 4: EP_LABELS_MISSING, Constraint Solving Labels (4th Consecutive Day)
safe_outputs (create_discussion labels step)
Labels constraint-solving and problem-of-the-day are referenced in the workflow frontmatter but don't exist in the repository. The create_discussion itself succeeds, but labels are not applied.
Root Cause Analysis
Detection Job Issues (Blocking Safe Outputs)
The EP024 escalation is the most systemic concern today. Three Copilot detection runs failed with authentication errors — different request IDs in each case, suggesting intermittent token/credential issues rather than a systematic misconfiguration. One run (Smoke Gemini) shows a streaming truncation pattern where the detection model starts producing output but the response ends before completing the JSON. This is likely a streaming timeout or network interruption.
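The two failure modes described above (intermittent authentication errors and a result cut off mid-JSON) could both be treated as transient and retried. The following is a minimal sketch under stated assumptions, not the project's actual detection code: the `THREAT_DETECTION_RESULT:` prefix followed by a JSON object is an assumed line format (the report names the marker but not its layout), and `parse_detection_result`, `detect_with_retry`, `TransientDetectionError`, and the `run_detection` callable are hypothetical names introduced for illustration.

```python
import json
import time
from typing import Callable, Optional

# Assumed format: marker followed by a JSON object on the same line.
RESULT_PREFIX = "THREAT_DETECTION_RESULT:"


class TransientDetectionError(Exception):
    """Hypothetical error for failures worth retrying (auth error, truncated stream)."""


def parse_detection_result(output: str) -> Optional[dict]:
    """Return the parsed result, or None if the line is missing or truncated."""
    for line in output.splitlines():
        if line.startswith(RESULT_PREFIX):
            payload = line[len(RESULT_PREFIX):].strip()
            try:
                parsed = json.loads(payload)
            except json.JSONDecodeError:
                return None  # stream was cut off mid-JSON
            return parsed if isinstance(parsed, dict) else None
    return None


def detect_with_retry(
    run_detection: Callable[[], str],
    attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Run detection, retrying with exponential backoff on transient failures."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error: Exception = TransientDetectionError("detection never ran")
    for attempt in range(attempts):
        try:
            result = parse_detection_result(run_detection())
            if result is not None:
                return result
            last_error = TransientDetectionError("incomplete detection output")
        except TransientDetectionError as exc:
            last_error = exc
        if attempt < attempts - 1:
            sleep(base_delay * 2 ** attempt)  # wait 1x, 2x, 4x, ... base_delay
    raise last_error
```

Treating a truncated result the same way as an authentication error keeps the retry path uniform: a run counts as done only when a complete, parseable result line is present.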
The detection failures propagate as a blocker: safe_outputs.if evaluates needs.detection.result == 'success', so any detection failure silently skips all safe outputs, discarding valid agent work.
Safe Output Handler Issues
The single safe output failure (EP019b) is a correct enforcement of security policy: the agent wrote a file to .github/smoke-test-push-{run_id}, which is a protected path. However, the cascade behavior (cancelling the subsequent create_issue) is overly aggressive for a smoke test workflow where safe output writes are expected to be low-risk.
Infrastructure Issues (EP025)
The actions/setup not-found errors appear in diverse job types, suggesting that after certain checkout operations (like checking out a memory branch or performing sparse checkout), the main repository's actions/ directory is no longer available. This is a post-checkout state contamination bug.
Recommendations
Critical Issues (Immediate Action Required)
Confirm the THREAT_DETECTION_RESULT line is complete before considering the detection done
Bug Fixes Required
EP019b: Smoke Claude Agent Writing to .github/ Path
The agent writes to .github/smoke-test-push-{run_id}, a protected prefix
Set protected-files: fallback-to-issue in smoke-claude's push_to_pull_request_branch handler config so a review issue is created instead of a hard failure
Update agent instructions to use a non-protected path (e.g., /tmp/ or smoke-tests/)
EP025: actions/setup Not Found After Branch Checkout
push_repo_memory, upload_assets job definitions
Add a Checkout actions folder step before any local action invocation in the push_repo_memory and upload_assets jobs (similar to how the safe_outputs job does it)
Configuration Changes
Labels constraint-solving and problem-of-the-day do not exist in github/gh-aw
Work Item Plans
Work Item 1: Detection Retry Logic for Authentication Failures
detection_success=true
Work Item 2: Fix EP025 actions/setup Availability in push_repo_memory and upload_assets
actions/setup is not available after branch checkout operations corrupt the working directory state.
push_repo_memory job reliably finds actions/setup
upload_assets job reliably finds actions/setup
No Can't find 'action.yml' errors in these jobs
Add a Checkout actions folder step (sparse checkout of the actions/ directory) before any local action reference in affected jobs
Work Item 3: Fix Smoke Claude Agent Path for push_to_pull_request_branch (EP019b)
The Smoke Claude agent writes to .github/ paths, triggering the protected-file block and cascading failure.
Smoke test artifacts are written outside the .github/ protected prefix
Add protected-files: fallback-to-issue to Smoke Claude's push_to_pull_request_branch config; update agent instructions to use non-protected paths for smoke test artifacts
Historical Context
Recent Trend (Last 7 Days)
Trend: Success rate stable at 95-100% for safe output operations. However, detection failures (EP024) are an escalating pattern that silently discards agent work — first 4 cases on 2026-03-26, then 4 more today.
EP025 escalation: Went from 2 occurrences (2026-03-26) to 5 occurrences today, now affecting 5+ workflows.
Metrics and KPIs
Next Steps
Add Checkout actions folder step to push_repo_memory and upload_assets jobs
Set protected-files: fallback-to-issue in smoke-claude workflow
Create labels constraint-solving and problem-of-the-day in repository