Feature/chrf score #2221

kauabh · 2025-08-25T07:39:52Z

CHRF Score Metric Using sacrebleu

Issue Link / Problem Description

This PR introduces a new ChrfScore metric based on sacrebleu.corpus_chrf for evaluating the similarity between a generated response and a reference.
CHRF is better suited for morphologically rich languages and provides a character-level F-score.

Changes Made

Added ChrfScore class to implement character F-score (CHRF) metric using sacrebleu.corpus_chrf.

Testing

How to Test

Manual testing steps:
1. Install sacrebleu if not already installed: pip install sacrebleu
2. Import and instantiate ChrfScore
3. Pass a SingleTurnSample object with reference and response
4. Run the metric and verify output is a float between 0.0 and 1.0

References

sacrebleu documentation:

Screenshots/Examples (if applicable)

from sacrebleu import corpus_chrf

hypotheses = ["The cat is on the mat."]
references = [["The cat is sitting on the mat."]]

score = corpus_chrf(hypotheses, references).score / 100
print(score)  # e.g., 0.67

Added _chrf_score

Removed Typo

Added ChrfScore

Added CHRF docs

kauabh added 4 commits August 25, 2025 12:21

Create _chrf_score.py

f36b148

Added _chrf_score

Update _chrf_score.py

d0b4b3e

Removed Typo

Update __init__.py

80662f8

Added ChrfScore

Update traditional.md

6a8db09

Added CHRF docs

dosubot bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Aug 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature/chrf score #2221

Feature/chrf score #2221

Uh oh!

kauabh commented Aug 25, 2025

Uh oh!

Uh oh!

Feature/chrf score #2221

Are you sure you want to change the base?

Feature/chrf score #2221

Uh oh!

Conversation

kauabh commented Aug 25, 2025

CHRF Score Metric Using sacrebleu

Issue Link / Problem Description

Changes Made

Testing

How to Test

References

Screenshots/Examples (if applicable)

Uh oh!

Uh oh!