Skip to content

add post on initial vitals cran release #741

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 3 commits into
base: main
Choose a base branch
from
Open

Conversation

simonpcouch
Copy link
Collaborator

@simonpcouch simonpcouch commented Jun 20, 2025

A pared back version of the package's introductory vignette.

Deploy preview here: https://deploy-preview-741--tidyverse-org.netlify.app/blog/2025/06/vitals-0-1-0/

@simonpcouch simonpcouch requested a review from hadley June 20, 2025 18:16
* [x] `usethis::use_tidy_thanks()`
-->

We're bear-y excited to announce the release of [vitals](https::vitals.tidyverse.org) on CRAN. vitals is a framework for large language model evaluation in R. It’s specifically aimed at ellmer users who want to measure the effectiveness of their LLM products like [custom chat apps](https://posit.co/blog/custom-chat-app/) and [querychat](https://github.com/posit-dev/querychat) apps.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🤣

#| label: save-tsk-claude
#| include: false
if (should_eval) {
save(tsk_claude, file = here::here("content/blog/vitals-0-1-0/results/tsk_claude.rda"))
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder if there's anything we could learn from ellmer's batch interface here. Since evaluations are slow, it'd be nice to have some built-in caching tools. (Just a thought; doesn't need to impact this blog post)

* update title
* use one-liner to clone and eval
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants