[observability] Agentic Observability Report — 2026-03-26 #23082
Closed
Replies: 1 comment
-
|
This discussion has been marked as outdated by Agentic Observability Kit. A newer discussion is available at Discussion #23527. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Executive Summary
This report covers the last 14 days of agentic workflow activity in
github/gh-aw. Due to high run volume, the analysis window is bounded to the 100 most recent runs, all of which fall within 2026-03-26T07:13–10:23 UTC (~3 hours). No escalation-eligible episodes were detected. All 100 episodes are standalone with high confidence, no orchestration chains, and no MCP failures. The primary operational signal is a cluster of single-run failures for newer workflows that lack baselines, and one high-severity resource-heavy run for Go Fan.Key Metrics
Highest Risk Episodes
No episodes crossed the escalation threshold. The one high-severity signal is a single run of Go Fan:
Go Fan —
resource_heavy_for_domain(high severity)Episode Regressions
No regressions detected. All episodes with baselines were classified stable:
stable— the failure run shows identical posture/turns/blocked-requests vs baseline. Likely an upstream trigger failure, not agent misbehavior.stable.The following 5 workflows failed with no baseline found — first-run or low-frequency workflows where regression detection is not yet possible:
Recommended Actions
countto 300+ for future reports, or running the observability kit on a longer cadence with date-bounded queries to ensure multi-day coverage.Optimization Candidates (overkill_for_agentic, low severity)
The following 15 workflows consistently produce zero turns, zero tool types, and read-only posture. They are assessed as potentially replaceable with deterministic automation. All are low severity and no action is required — listed for portfolio awareness only.
The high skip rate (74 of 100 runs) is likely by design — workflows activate only on matching triggers. No action recommended unless persistent skipping on intended triggers is suspected.
Resource-Heavy Runs Detail
Go Fan is the only run with actual cost logged ($1.19). All other runs show $0 estimated cost — cost attribution may be missing for Claude/Codex engines or the runs did not use billable tokens.
Episode Model Observations
standalonekind withno_shared_lineage_markersreasonIf orchestrator→worker patterns exist in this repository (e.g., a scheduling workflow that triggers others), they would only appear if both the parent and child runs fell within the same 100-run window. A wider time window or larger count is needed for DAG lineage analysis.
References:
Beta Was this translation helpful? Give feedback.
All reactions