Skip to content

feat(eval): add zero-shot text classification evaluator#386

Closed
jeon185 wants to merge 1 commit into
mainfrom
feat/eval-zero-shot-classification
Closed

feat(eval): add zero-shot text classification evaluator#386
jeon185 wants to merge 1 commit into
mainfrom
feat/eval-zero-shot-classification

Conversation

@jeon185

@jeon185 jeon185 commented Apr 23, 2026

Copy link
Copy Markdown
Contributor

Resolves #325.

Adds WinMLZeroShotClassificationEvaluator, registered under the zero-shot-classification task, with a pipeline subclass that pads to max_length for static-shape ONNX.

Accuracy and macro-F1 are computed via a new ClassificationMetric, since HF evaluate has no wrapper for this task.

Default dataset is AG News; an E2E entry for MoritzLaurer/DeBERTa-v3-base-mnli-fever-anli is included.

Unit tests cover the evaluator and the metric. An integration test (slow/network) runs end-to-end on DistilBERT, RoBERTa, and DeBERTa NLI checkpoints.

@jeon185 jeon185 requested a review from a team as a code owner April 23, 2026 21:47
@jeon185 jeon185 closed this Apr 23, 2026
@jeon185 jeon185 deleted the feat/eval-zero-shot-classification branch April 23, 2026 21:55
@jeon185

jeon185 commented Apr 23, 2026

Copy link
Copy Markdown
Contributor Author

Closed by mistake — replaced by #387.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

feat: support zero-shot-classification model evaluation (DeBERTa, RoBERTa, DistilBERT)

1 participant