symbol_context followed by Read 64% of the time — output may lack sufficient source code #130
Open
Description
Problem
In our SWE-bench-verified evaluation, 64% of symbol_context calls are immediately followed by a Read tool call, suggesting the agent needs to see actual source code that symbol_context doesn't provide (or doesn't provide enough of).
Data
- 64% of symbol_context calls are followed by Read on the same or related file
- This pattern wastes a turn: the agent calls symbol_context to find what it needs, then calls Read to see the actual code
- The full render mode does include source code, but either:
  - The agent is using brief mode (which omits source code)
  - The source code snippet is too narrow (just the definition, not surrounding context)
  - The agent needs to see a broader section of the file
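For the re-measurement, the follow-up rate could be computed from per-task tool-call sequences along these lines. This is a minimal sketch: the trajectory format (a list of tool names per task) and the function name are assumptions, and it only checks the immediately following call, not whether the Read targets the same or a related file.

```python
from typing import Iterable, Sequence


def follow_up_rate(
    trajectories: Iterable[Sequence[str]],
    first: str = "symbol_context",
    second: str = "Read",
) -> float:
    """Fraction of `first` calls whose very next tool call is `second`."""
    total = 0
    followed = 0
    for calls in trajectories:
        for i, name in enumerate(calls):
            if name != first:
                continue
            total += 1
            # A call at the end of a trajectory has no follow-up.
            if i + 1 < len(calls) and calls[i + 1] == second:
                followed += 1
    return followed / total if total else 0.0


# Hypothetical trajectories for illustration, not real evaluation data.
example = [
    ["symbol_context", "Read", "Edit"],
    ["Grep", "symbol_context", "Edit"],
    ["symbol_context", "Read", "symbol_context"],
]
print(follow_up_rate(example))
```

Running the same function over runs before and after PR #122 would give directly comparable numbers.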
Context
PR #122 partially addressed this by adding source code to the full render and creating a brief mode. However, the 64% follow-up-Read rate was measured on a run before those changes. A new evaluation is needed to measure whether the changes help.
Recommended Investigation
- Re-evaluate after the changes in PR #122 (feat: Batch symbol_context, inject overview into instructions, remove overview tool) are deployed: does the Read-follow-up rate decrease?
- Expand source context: Include more surrounding lines (e.g., ±20 lines around the definition rather than just the definition)
- Include file path prominently: Make sure the agent knows exactly where to look if it does need to Read more
- Suggest related regions: If the symbol is a method, include the class definition header and any closely-related methods
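The "expand source context" and "file path prominently" suggestions could look roughly like the sketch below. The function name, its parameters, and the header format are hypothetical and do not reflect the current symbol_context implementation; the only detail taken from the issue is the ±20-line default.

```python
def render_with_context(
    path: str, source: str, start: int, end: int, context: int = 20
) -> str:
    """Render lines start..end (1-indexed, inclusive) plus `context` lines
    on each side, led by a header with the file path and line range."""
    lines = source.splitlines()
    # Clamp the window so it never runs past either end of the file.
    lo = max(1, start - context)
    hi = min(len(lines), end + context)
    header = f"{path}:{lo}-{hi} (definition at {start}-{end})"
    body = [f"{n:>5} | {lines[n - 1]}" for n in range(lo, hi + 1)]
    return "\n".join([header, *body])
```

Putting the path and the line range in the header means that, if the agent still needs a Read, it can request an exact region instead of the whole file.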
Impact
If symbol_context returned sufficient code to avoid the follow-up Read 50% of the time, that would save ~0.3 turns per task across all tasks, a meaningful efficiency gain.
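One way to reproduce the ~0.3 figure is sketched below. It assumes roughly one symbol_context call per task, which the issue does not state, so calls_per_task should be replaced with the measured average.

```python
def turns_saved_per_task(
    follow_up_rate: float, avoided_fraction: float, calls_per_task: float
) -> float:
    """Expected Read turns avoided per task."""
    return follow_up_rate * avoided_fraction * calls_per_task


saved = turns_saved_per_task(
    follow_up_rate=0.64,    # from the issue: share of calls followed by Read
    avoided_fraction=0.50,  # from the issue: share of those Reads avoided
    calls_per_task=1.0,     # ASSUMPTION: not stated in the issue
)
print(round(saved, 2))
```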
Labels
enhancement, swe-bench