Skip to content

Continue the pre-training step #1546

Closed Answered by ZQ-Dev8
miguelwon asked this question in Q&A
Apr 19, 2024 · 1 comments · 3 replies
Discussion options

You must be logged in to vote

In your config.yml, what you'll do is put "completion" as the type: in the datasets: section, meaning you're intending to train on a dataset containing a column called "text" where the raw text completion data is. If your column is named something else, you can use the field: parameter in the dataset: to override the default.

For the local dataset, just ensure it's a .jsonl file with a column entitled "text", unless you override it as described above. All other columns will be ignored.

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@miguelwon
Comment options

@mtisz
Comment options

@NanoCode012
Comment options

Answer selected by NanoCode012
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants