Problem Statement
Now that the agent can implement changes by itself, a worthwhile experiment / investigation is how to use strands-evals to evaluate its output and iterate on the results.
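To make the "evaluate and iterate" idea concrete, here is a rough sketch of the loop, under stated assumptions. It is plain Python and deliberately does not use the strands-evals API. `EvalResult`, `evaluate_and_iterate`, `generate`, `evaluate`, `threshold` and `max_rounds` are all hypothetical names for illustration. The `evaluate` callable marks the place where a strands-evals evaluation could plug in.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class EvalResult:
    """Hypothetical result type, not part of strands-evals."""
    score: float   # assumed range 0.0 to 1.0, higher is better
    feedback: str  # passed back to the agent on the next attempt


def evaluate_and_iterate(
    generate: Callable[[str], str],
    evaluate: Callable[[str], EvalResult],
    threshold: float = 0.8,
    max_rounds: int = 3,
) -> List[EvalResult]:
    """Run generate -> evaluate until the score passes or rounds run out."""
    history: List[EvalResult] = []
    feedback = ""  # the first attempt gets no feedback
    for _ in range(max_rounds):
        candidate = generate(feedback)   # e.g. the agent produces a change
        result = evaluate(candidate)     # e.g. an evaluation run scores it
        history.append(result)
        if result.score >= threshold:
            break                        # good enough, stop iterating
        feedback = result.feedback       # otherwise retry with the feedback
    return history
```

The returned history keeps every round's score, so the experiment can also report whether iterating actually improves the result.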
Proposed Solution
No response
Use Case
Because the agent can write code and raise PRs by itself, the team wants to assure the quality of agent-generated PRs and be confident in them.
Alternative Solutions
No response
Additional Context
No response