Conversation

@gkielian
Collaborator

This is a draft PR. We should modify this section to allow setting any number of layers.

Then we can use these as checkpoints for finetuning explorations.

karpathy and others added 6 commits September 13, 2024 19:04
This is a documentation-only change; hoping it is OK to merge. For more context on why we made this change, see this tweet: https://x.com/jeremyphoward/status/1838341110344880637
Adding Softplus, ReLU, and trying a SparseReLU.
We'll create an array of different checkpoints and load them to HF.

3 participants