Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create workshop: Write Your Own Domain-Specific Benchmark #43

Open
deanwampler opened this issue Mar 4, 2025 · 0 comments
Open

Create workshop: Write Your Own Domain-Specific Benchmark #43

deanwampler opened this issue Mar 4, 2025 · 0 comments

Comments

@deanwampler
Copy link
Contributor

deanwampler commented Mar 4, 2025

As part of the TSEI promotion plan, a Write Your Own Domain-Specific Benchmark workshop drills into and generalizes the RAG benchmark workshop #42.

How does someone create their own benchmark for their use cases or domain? The session will use the TSEI reference stack, including lm-evaluation-harness and unitxt, with candidate benchmark data, either hand-curated Q&A pairs or synthetic data generated with a teacher model. The session will demonstrate the basics of running the benchmark and interpreting the results.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Todo
Development

No branches or pull requests

1 participant