📊 Lockfile Statistics Report - November 11, 2025 #3599
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it was created by an agentic workflow more than 1 week ago. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
📊 Agentic Workflow Lock File Statistics - November 11, 2025
This comprehensive analysis examines all
.lock.ymlfiles in the gh-aw repository to understand the structure, patterns, and characteristics of agentic workflows. The analysis reveals interesting insights about how workflows are configured, what triggers they use, and their structural complexity.Executive Summary
Key Findings:
workflow_dispatch(manual) is most common (59 workflows)Full Report Details
File Size Analysis
Distribution Statistics
Size Distribution by Range
Key Observation: The vast majority (84%) of workflow files are over 100 KB, indicating substantial complexity with extensive configuration and embedded instructions.
Extremes:
opencode.lock.yml(23 KB) - located in shared directorypoem-bot.lock.yml(390 KB) - contains 101 steps and 1,177 jobsTrigger Analysis
Workflows use various trigger mechanisms to respond to GitHub events. Many workflows combine multiple triggers for flexible activation.
Most Popular Triggers
workflow_dispatchscheduleissue_commentissuespull_requestpull_request_review_commentdiscussion_commentdiscussionworkflow_runpushworkflow_callTotal Triggers: 140 trigger configurations across 77 workflows (average 1.8 triggers per workflow)
Common Trigger Combinations
Based on the frequency data, common patterns include:
Schedule Patterns
Most Common Cron Schedules:
0 9 * * *0 0,6,12,18 * * *0 6 * * 00 2 * * 1-50 15 * * 10 0 * * *Insight: Workflows predominantly run during morning hours (UTC) or on specific days (Mondays, Sundays), suggesting batch processing patterns for reports, summaries, and maintenance tasks.
Safe Outputs Analysis
Safe outputs are mechanisms for workflows to create persistent artifacts like issues, discussions, or comments.
Safe Output Types Distribution
add-commentcreate-issueTotal Workflows Using Safe Outputs: 5 out of 77 (6.5%)
Key Finding: Safe output adoption is very limited. Only 6.5% of workflows use these features, suggesting either:
Example Workflows Using Safe Outputs:
Discussion Categories
Analysis Result: No workflows currently use
create-discussionsafe outputs, despite this being a common pattern in the repository's history.Structural Characteristics
This section examines the internal complexity of workflow files.
Job Complexity
poem-bot.lock.yml)poem-bot.lock.yml)Note: The extremely high job counts likely reflect the internal structure of compiled workflow files where agents and conditional logic expand into multiple job definitions.
Top 10 Most Complex Workflows (by step count)
poem-bot.lock.ymltechnical-doc-writer.lock.ymlunbloat-docs.lock.ymlq.lock.ymlmcp-inspector.lock.ymltidy.lock.ymlInsight: The most complex workflows are document-focused (technical writing, cleanup, inspection), suggesting these tasks require extensive conditional logic and multi-stage processing.
Typical Lock File Structure
Based on median values, a typical
.lock.ymlfile has:ubuntu-slimorubuntu-latestPermission Patterns
Permissions define what GitHub API operations workflows can perform.
Most Common Permissions
contentspull-requestsissuesactionsdiscussionssecurity-eventsrepository-projectsattestationschecksdeploymentsmodelspackagespagesstatusesTotal Permission Grants: 527 across all workflows
Permission Distribution
contents,pull-requests,issuesform the standard trioSecurity Observation: Workflows follow the principle of least privilege, requesting primarily read access to repository resources. This is excellent security hygiene for analytical and reporting workflows.
Runner & Infrastructure Patterns
Runner Types
ubuntu-slimubuntu-latestTotal Runner Allocations: 416 across all job definitions
Insight: The repository strongly prefers
ubuntu-slim(65% of allocations), suggesting optimization for faster startup times and resource efficiency.Timeout Configurations
Average Timeout: ~13 minutes
Most Common: 10 minutes (44% of configurations)
Insight: Timeout values cluster around 10-20 minutes, appropriate for AI agent workflows that need time for LLM API calls and multi-step reasoning.
MCP Server & Tool Patterns
MCP Server Usage
Analysis Result: No explicit MCP server declarations found in the lockfiles using standard patterns. This suggests:
Tool Allowlists
Common tools available to workflows (based on gh-aw platform capabilities):
Interesting Findings
High Trigger Flexibility: 76.6% of workflows support manual dispatch, enabling on-demand agent execution for debugging and testing.
Morning-Biased Scheduling: Scheduled workflows predominantly run during morning hours (UTC), suggesting alignment with working hours or daily reporting cycles.
Document-Focused Complexity: The most complex workflows (by step count) focus on documentation tasks - writing, cleanup, and inspection. This suggests documentation workflows require more conditional logic and multi-stage processing than other workflow types.
Structural Bloat: Lock files are surprisingly large (average 214 KB) compared to typical GitHub Actions workflows. This reflects the embedded agent instructions, extensive conditional logic, and compiled nature of .lock.yml files.
Low Safe Output Adoption: Only 6.5% of workflows use safe output features. This is unexpected given the repository's focus on agentic workflows and suggests an opportunity for increased adoption.
Job Count Explosion: Average 662 jobs per workflow seems anomalous - this likely reflects how the workflow compiler expands agent logic into multiple conditional job paths rather than actual parallel job execution.
Lightweight Infrastructure Preference: 65% of runner allocations use
ubuntu-slim, indicating a focus on efficiency and fast startup times rather than requiring heavyweight build environments.Security-Conscious Permission Model: Workflows overwhelmingly use read-only permissions, with write access granted sparingly and specifically. This demonstrates mature security practices.
No Discussion Output Usage: Despite discussions being a rich communication medium, no current workflows use
create-discussionas a safe output, suggesting this pattern may have been deprecated or replaced.Timeout Clustering: Sharp clustering around 10 and 20-minute timeouts suggests these are platform defaults or recommended values for typical agentic workflows.
Historical Trends
Current Analysis Date: November 11, 2025
Note: This is a snapshot analysis. Future runs will compare:
Previous Data Available: Cache memory contains analysis artifacts from October 28-29 and November 4, 2025, enabling trend analysis in future reports.
Recommendations
Based on this comprehensive analysis, we recommend:
1. Investigate Safe Output Underutilization
Only 6.5% of workflows use safe outputs. Consider:
2. Standardize Timeout Values
With clear clustering at 10 and 20 minutes, consider:
3. Optimize Lock File Sizes
Average 214 KB per lock file suggests potential optimization:
4. Document Job Count Patterns
The average 662 jobs per workflow needs clarification:
5. Expand Trigger Diversity
Consider additional trigger types:
pull_request_targetfor safe fork handlingrepository_dispatchfor external integrationswebhookevents for third-party service integration6. Monitor Permission Creep
Current security posture is excellent:
Methodology
Analysis Approach
Data Collection:
Lock Files Analyzed: 77 primary files in
.github/workflows/(150 total including subdirectories)Cache Memory: Used
/tmp/gh-aw/cache-memory/for:/scripts/)/data/)/history/)Data Sources:
.lock.ymlfiles in.github/workflows/shared/subdirectoryAnalysis Scripts
Stored in
/tmp/gh-aw/cache-memory/scripts/:analyze_lockfiles.sh- Primary extraction scriptextract_detailed_stats.sh- Detailed pattern extractionanalyze_yaml.py- Python-based YAML parsingData Files
Stored in
/tmp/gh-aw/cache-memory/data/:file_sizes.txt- File size measurementstriggers_detailed.txt- Trigger extraction by filesafe_outputs_detailed.txt- Safe output patternspermissions_python.txt- Permission analysisjobs_steps_python.txt- Complexity metricsLimitations
Generated by Lockfile Statistics Analysis Agent
Analysis Date: November 11, 2025
Repository: githubnext/gh-aw
Cache Location:
/tmp/gh-aw/cache-memory/Analysis Scripts: Available in cache for future reuse
Beta Was this translation helpful? Give feedback.
All reactions