Commit

Fix deadlink to the-eye
Pile is no longer hosted at the-eye. Remove.
sdake committed May 4, 2024
1 parent 76f2d08 commit 9f266cc
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion workloads/alexandria/data/datasets/README.md
@@ -144,7 +144,7 @@ The resulting `data.spec.json` can now be fed as `--training-dataset-spec` or `-

**Note**: Substantial computational and storage resources are required to prepare MassiveOpenText.

- 1. Download the uncompressed Pile data file tree from https://mystic.the-eye.eu/public/AI/pile/ and put it in `Pile/`.
+ 1. Download the uncompressed Pile data file and put it in `Pile/`.
2. Request and download the RealNews dataset, and put the `realnews.tar.gz` in `RealNews/`.
3. Tokenize, shard and chunk the data:
