Generalize never-worse guard to all filters (curl emits more tokens than raw on tiny responses)

## Problem

The benchmark CI gate (`BENCHMARK FAILED: 1 filter(s) produced more tokens than raw output`) flags the `curl` filter as negative:

```
🔴 curl text │ curl -s https://httpbin.org/robots.txt │ rtk curl ... │ 8 → 40 (-400%)
```

`rtk curl` wraps the response with status/header metadata. On a tiny payload (httpbin's `robots.txt` is ~8 tokens) that wrapper dominates, so the filtered output is **larger** than the raw command output.

## Root cause

The never-worse guard added for `grep` in #2550 (fall back to the raw form when filtering doesn't shrink the output) is **grep-only**. Other filters — `curl` here — have no such cap, so they can emit more tokens than the underlying command on small inputs.

## Proposal

Extract the never-worse comparison into a shared `core` helper and apply it across filters (start with `curl`, then audit the rest), so **no filter ever emits more tokens than the raw command**. This makes the benchmark gate hold structurally rather than per-filter.

## Evidence

- Benchmark run: https://github.com/rtk-ai/rtk/actions/runs/28024206815/job/82948000842
- Failing line: `🔴 curl text   8 → 40 (-400%)`
- **Not** introduced by #2550 (which touches only `grep`); the same run shows every grep shape positive: `fn +94%`, `-l +99%`, `-c +95%`.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generalize never-worse guard to all filters (curl emits more tokens than raw on tiny responses) #2551

Problem

Root cause

Proposal

Evidence

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Generalize never-worse guard to all filters (curl emits more tokens than raw on tiny responses) #2551

Description

Problem

Root cause

Proposal

Evidence

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions