Skip to content

Conversation

@venkatsai2004
Copy link

Adds an integration test for the HuggingFaceM4/InterleavedWebDocuments dataset.

  • Gracefully skips if the dataset is not yet available on the Hub
  • Checks basic loading and structure once it becomes available

Closes #7394

First-time contributor to datasets — really excited about this! Happy to make any adjustments needed. 🙂

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Using load_dataset with data_files and split arguments yields an error

1 participant