Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix uninitialized subset loading #34

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

sumukshashidhar
Copy link
Collaborator

When using the concat option, users will often want to run models one after the other (to test different prompts, approaches, etc), without having to regenerate everything or use a new split.

This is why the concat option exists. However, right now, on the first run, the concat_if_exists option will fail, because that particular config doesn't exist. Therefore, we handle this, by specifically looking for the issue, where the dataset itself exists, but the split doesn't yet exist (on the first run)

Copy link
Member

@clefourrier clefourrier left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm OK with this but there needs to be a strong documentation of the concat option because it's not obvious behavior imo

@alozowski
Copy link
Collaborator

Should we only implement this behavior when concat_dataset is set to true in the config?

@sumukshashidhar
Copy link
Collaborator Author

could be - but imo this is a sane default

@sumukshashidhar
Copy link
Collaborator Author

we would rather want it to not error out

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants