Latte: Latent Attention for Linear Time Transformers

This repository implements latent attention (Latte) for language modelling.

Get started

Get started by cloning the repository.

We use pdm to manage Python packages and dependencies, and the project uses Python 3.11 (consider pyenv as a simple way to manage Python versions). pdm is a modern alternative to pip and poetry. It manages a virtualenv locally, in a .venv directory inside the project directory itself.

pdm keeps the virtual environment in sync with the dependencies declared in pyproject.toml. This makes it easy to have a Python environment specific to the project and ensures we all run the same dependencies, similar to how npm and yarn work for JS/TS or cargo for Rust.
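Before installing, it can help to confirm the two prerequisites mentioned above: a Python 3.11 interpreter and a pdm executable on the PATH. The snippet below is a minimal sketch of such a check and is not part of this repository. The function name check_prerequisites and the exact-match comparison against 3.11 are assumptions made for illustration; the project may also work on other Python versions.

```python
import shutil
import sys

# Python version named in this README; treating it as an exact
# requirement is an assumption made for this sketch.
REQUIRED_PYTHON = (3, 11)


def check_prerequisites(version_info=sys.version_info, which=shutil.which):
    """Return a list of human-readable problems; an empty list means ready.

    `version_info` and `which` are injectable so the check can be
    exercised without changing the local environment.
    """
    problems = []
    if tuple(version_info[:2]) != REQUIRED_PYTHON:
        problems.append(
            f"Python {REQUIRED_PYTHON[0]}.{REQUIRED_PYTHON[1]} expected, "
            f"found {version_info[0]}.{version_info[1]}"
        )
    # shutil.which returns None when the executable is not on PATH.
    if which("pdm") is None:
        problems.append("pdm not found on PATH")
    return problems


if __name__ == "__main__":
    issues = check_prerequisites()
    for issue in issues:
        print(issue)
    print("ready for `pdm install`" if not issues else "fix the above first")
```

Run it with the interpreter you intend to use for the project. If it reports a version mismatch, pyenv is one way to install and select Python 3.11.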

Installation

Once you have pdm and the right Python version installed, run:

pdm install

Scripts

pdm run train will train a model
pdm run preprocess-tiny-stories will download and preprocess the TinyStories dataset locally
