Skip to content

ruchir321/transformer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

transformer

Andrej Karpathy follow along

Objective

Recreate the decoder-only transformer from bottom up

Generate coherent WikiHow articles

Data

Wikihow corpus

Results

Sub par text generation results because of compute constraints (my potato laptop)

future improvements

Use GPU

Use BPE, Wordpiece etc for tokenization. The character level tokenization method is simplistic and fails to capture statistics of the corpus

About

Andrej Karpathy follow along

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published