This repo contains example code to demonstrate the following:
- A port of Karpathy's minGPT in MLX.
- An extension of mlx-mingpt to use
BitLinear
instead ofnn.Linear
to demonstrate training language models using 2-bits as reported in https://arxiv.org/abs/2402.17764
Take a peek at the ipynb pdfs under demo-results
folder
- minGPT in MLX: https://github.com/adhulipa/mlx-mingpt/blob/main/demo-results/mingptmlx.pdf
- bitlinear minGPT in MLX: https://github.com/adhulipa/mlx-mingpt/blob/main/demo-results/bitnet-mingptmlx.pdf