Skip to content

MLX example code to build the simplest language model, minGPT, including a bitlinear-mingpt impl

Notifications You must be signed in to change notification settings

adhulipa/mlx-bitnet-mingpt

Repository files navigation

mlx-bitnet-mingpt

This repo contains example code to demonstrate the following:

  1. A port of Karpathy's minGPT in MLX.
  2. An extension of mlx-mingpt to use BitLinear instead of nn.Linear to demonstrate training language models using 2-bits as reported in https://arxiv.org/abs/2402.17764

Reports

Take a peek at the ipynb pdfs under demo-results folder

About

MLX example code to build the simplest language model, minGPT, including a bitlinear-mingpt impl

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published