Transformers

Minimal PyTorch implementation of the Transformers architecture from Vaswani et al.'s 2017 paper, Attention Is All You Need. This repository serves as a deep dive into understanding key architectures and their nuances, including architecture design and training techniques.

Components

Project Structure

Tokenizer.py
Modules/
- Attention.py
  - MultiHeadAttentionBase
  - MultiHeadSelfAttention
  - MultiHeadCrossAttention
- AddNorm.py
- MLP.py
- Encoder.py
  - EncoderLayer
  - Encoder
- Decoder.py
  - DecoderLayer
  - Decoder
- Transformer.py
  - Transformer
Config/
- Config.py
BERT/
- BERT.py
  - Bert

Name	Name	Last commit message	Last commit date
Latest commit yonas-g bert impl Oct 6, 2023 cbee8f1 · Oct 6, 2023 History 14 Commits
BERT	BERT	bert impl	Oct 6, 2023
config	config	restructured the code. Transformer impl	Oct 6, 2023
modules	modules	restructured the code. Transformer impl	Oct 6, 2023
.gitignore	.gitignore	.gitignore	Oct 5, 2023
Dataset.py	Dataset.py	restructured the code. Transformer impl	Oct 6, 2023
README.md	README.md	bert impl	Oct 6, 2023
Tokenizer.py	Tokenizer.py	MultiHeadSelfAttention Implemented	Oct 5, 2023
Trainer.py	Trainer.py	restructured the code. Transformer impl	Oct 6, 2023
utils.py	utils.py	Encoder impl done. MLP implemented	Oct 5, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transformers

Components

Project Structure

About

Languages

yonas-g/Transformer

Folders and files

Latest commit

History

Repository files navigation

Transformers

Components

Project Structure

About

Resources

Stars

Watchers

Forks

Languages