- [2021 ArXiv] An Attention Free Transformer, [paper], [bibtex], sources: [rish-16/aft-pytorch].
- [2021 ICML] Perceiver: General Perception with Iterative Attention, [paper], [bibtex], sources: [lucidrains/perceiver-pytorch].
- [2021 ICML] Evolving Attention with Residual Convolutions, [paper], [bibtex].
- [2022 ArXiv] data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language, [paper], [bibtex], sources: [pytorch/fairseq/data2vec].