Skip to content

feat(eval): improve eval reporting, add agents-v2-py skill, and docs improvements#108

Merged
thegovind merged 1 commit intomainfrom
improve-eval-reporting
Feb 7, 2026
Merged

feat(eval): improve eval reporting, add agents-v2-py skill, and docs improvements#108
thegovind merged 1 commit intomainfrom
improve-eval-reporting

Conversation

@thegovind
Copy link
Collaborator

Summary

  • Skill evaluation workflow: Add verbose CLI reporting and CI failure annotations so evaluation results are easier to read in terminal output and GitHub Actions
  • New skill: Add agents-v2-py skill for building container-based Foundry Agents using ImageBasedHostedAgentDefinition, with acceptance criteria and test scenarios
  • Docs site improvements: Add OG image for social sharing, fix skills grid COLLAPSED_HEIGHT constant, shorten tagline, correct frontend-ui-dark-ts category placement (Foundry → Frontend), and rebuild static assets

Changes

Skill Evaluation (f9ce4c3)

  • Verbose CLI reporting with pass/fail indicators
  • CI failure annotations for GitHub Actions integration

agents-v2-py Skill (216520c)

  • SKILL.md with patterns for ImageBasedHostedAgentDefinition, ProtocolVersionRecord, AgentProtocol.RESPONSES
  • Acceptance criteria covering correct and incorrect usage patterns
  • Test scenarios in tests/scenarios/agents-v2-py/scenarios.yaml

Docs Site (216520c)

  • Added og-image.png (1200×630) and Open Graph / Twitter Card meta tags
  • Fixed skills grid: use COLLAPSED_HEIGHT constant instead of inline calc()
  • Shortened tagline to "Domain knowledge for coding agents"
  • Moved frontend-ui-dark-ts to correct category (Monitoring & Frontend)
  • Added skills/typescript/frontend/frontend-ui-dark symlink
  • Rebuilt docs/ static assets

…ments

Add new agents-v2-py skill with acceptance criteria and test scenarios.
Add OG/Twitter image meta tags for social sharing previews.
Rebuild docs site assets, fix skill grid collapse height constant,
move frontend-ui-dark-ts to Monitoring & Frontend category in README,
fix CI workflow YAML indentation, and add frontend-ui-dark symlink.

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-opencode)

Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>
@thegovind thegovind merged commit b7493bc into main Feb 7, 2026
2 checks passed
@thegovind thegovind deleted the improve-eval-reporting branch February 7, 2026 01:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant