An unofficial PyTorch re-implementation of ViR: Vision Retention Networks by Ali Hatamizadeh, Michael Ranzinger, and Jan Kautz.
```python
import torch

from vir import ViR, ViRModes

model = ViR(
    out_dim=10,
    patch_size=14,
    depth=12,
    heads=12,
    embed_dim=768,
    max_len=257,
)

x = torch.randn(16, 257, 768)

# All forward modes (parallel, recurrent, chunkwise) give the same output

# Parallel
y_parallel = model(x, mode=ViRModes.PARALLEL)

# Recurrent
y_recurrent = model(x, mode=ViRModes.RECURRENT)

# Chunkwise
y_chunkwise = model(x, mode=ViRModes.CHUNKWISE, chunk_size=20)
```
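As a quick sanity check, this equivalence can be verified numerically. The sketch below is illustrative: it assumes `ViR` is a standard `nn.Module` (so `model.eval()` disables any dropout) and the tolerance is a guess, since the three formulations accumulate floating-point error differently.

```python
# Minimal sketch of an equivalence check; atol is an assumption, as the
# three formulations accumulate floating-point rounding error differently.
model.eval()  # disable dropout so every mode is deterministic

with torch.no_grad():
    y_parallel = model(x, mode=ViRModes.PARALLEL)
    y_recurrent = model(x, mode=ViRModes.RECURRENT)
    y_chunkwise = model(x, mode=ViRModes.CHUNKWISE, chunk_size=20)

assert torch.allclose(y_parallel, y_recurrent, atol=1e-5)
assert torch.allclose(y_parallel, y_chunkwise, atol=1e-5)
```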
A tiny Vision Retention Network (3 heads, 12 layers, embedding dimension 192) achieves 100% accuracy on the Imagenette dataset after roughly 40 epochs with a batch size of 64.
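For reference, the tiny configuration mentioned above could be instantiated as follows. This is a sketch: `heads`, `depth`, and `embed_dim` follow the numbers quoted above and `out_dim=10` matches Imagenette's ten classes, while `patch_size` and `max_len` are assumptions carried over from the earlier example.

```python
# Hypothetical tiny configuration for Imagenette; patch_size and max_len
# are assumptions carried over from the example above.
vir_tiny = ViR(
    out_dim=10,     # Imagenette has 10 classes
    patch_size=14,
    depth=12,       # 12 layers
    heads=3,        # 3 heads
    embed_dim=192,  # 192 embedding dimension
    max_len=257,
)
```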
If you find this code useful for your research, please cite the repo:
```bibtex
@software{Pulfer_ViR_2023,
  author = {Pulfer, Brian},
  month = {November},
  title = {{Vision Retention Networks (unofficial re-implementation)}},
  url = {https://github.com/BrianPulfer/vision-retention-networks},
  year = {2023}
}
```
The code is released under the Apache 2.0 license.