-
Notifications
You must be signed in to change notification settings - Fork 116
add post on initial vitals cran release #741
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
* [x] `usethis::use_tidy_thanks()` | ||
--> | ||
|
||
We're bear-y excited to announce the release of [vitals](https::vitals.tidyverse.org) on CRAN. vitals is a framework for large language model evaluation in R. It’s specifically aimed at ellmer users who want to measure the effectiveness of their LLM products like [custom chat apps](https://posit.co/blog/custom-chat-app/) and [querychat](https://github.com/posit-dev/querychat) apps. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🤣
#| label: save-tsk-claude | ||
#| include: false | ||
if (should_eval) { | ||
save(tsk_claude, file = here::here("content/blog/vitals-0-1-0/results/tsk_claude.rda")) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I wonder if there's anything we could learn from ellmer's batch interface here. Since evaluations are slow, it'd be nice to have some built-in caching tools. (Just a thought; doesn't need to impact this blog post)
* update title * use one-liner to clone and eval
A pared back version of the package's introductory vignette.
Deploy preview here: https://deploy-preview-741--tidyverse-org.netlify.app/blog/2025/06/vitals-0-1-0/