Commit cd8efdb
fix: Lower resolved_threshold default from 0.8 to 0.0 for dead code benchmarks
The 80% precision AND recall gate meant every task showed "Resolved: False"
for both MCP and baseline agents. No dead code detection approach achieves
80% on both metrics simultaneously. Setting it to 0.0 means any task with
non-zero precision and recall counts as resolved. Still configurable via config YAML.

1 parent e35a179 commit cd8efdb
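The gate described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the code from the repository: the function name `is_resolved` and its signature are hypothetical, and the exact comparison (non-zero check plus `>=` against the threshold) is an assumption inferred from the message. Only the name `resolved_threshold` and the old and new defaults (0.8 and 0.0) come from the commit.

```python
def is_resolved(precision: float, recall: float, resolved_threshold: float = 0.0) -> bool:
    """Hypothetical resolution gate for a dead code benchmark task.

    Assumption: a task is resolved only when precision AND recall are both
    non-zero and both reach `resolved_threshold`.
    """
    # A task with no true positives has zero precision or recall and never resolves.
    if precision <= 0.0 or recall <= 0.0:
        return False
    # Both metrics must clear the same threshold.
    return precision >= resolved_threshold and recall >= resolved_threshold


# With the old default of 0.8, a task with modest scores is not resolved.
old_default = is_resolved(0.3, 0.5, resolved_threshold=0.8)
# With the new default of 0.0, the same task counts as resolved.
new_default = is_resolved(0.3, 0.5)
```

Under these assumptions, raising the threshold back to 0.8 in the config would restore the stricter behavior without any code change.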
File tree
3 files changed: +3 -3 lines changed
- src/mcpbr
- benchmarks
- supermodel
Diff: one line replaced in each changed file, at lines 148, 53, and 904 respectively; the changed line contents are not shown.