Pinned
-
microrlhf.py
An atomic way to RLHF a language model in pure, dependency-free Python. Uses a discriminative n-gram classifier P(LOTR | name) + REINFORCE to steer microgpt toward Tolkienian names. Inspired by @karpathy's microgpt.py.
-
PufferLib (Public, forked from PufferAI/PufferLib)
Simplifying reinforcement learning for complex game environments
C
-
kubernetes (Public, forked from kubernetes/kubernetes)
Production-Grade Container Scheduling and Management
Go 1
-
transformers (Public, forked from huggingface/transformers)
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Python
-
PathOfBuildingCommunity/PathOfBuilding (Public)
Offline build planner for Path of Exile.
-
magic-modules (Public, forked from GoogleCloudPlatform/magic-modules)
Add Google Cloud Platform support to Terraform
HTML




