Attention Is All You Need Pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".
Annotated_deep_learning_paper_implementations | 41,877 | 2 | 4 months ago | 79 | November 05, 2023 | 30 | mit | Jupyter Notebook
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Vit Pytorch | 16,298 | 6 | 5 months ago | 184 | November 15, 2023 | 114 | mit | Python
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Nlp Tutorial | 13,226 | 6 months ago | 35 | mit | Jupyter Notebook
Natural Language Processing Tutorial for Deep Learning Researchers
External Attention Pytorch | 10,361 | 4 months ago | 2 | September 27, 2022 | 63 | mit | Python
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
Attention Is All You Need Pytorch | 7,910 | 8 months ago | 74 | mit | Python
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Espnet | 7,563 | 5 | 4 months ago | 33 | October 25, 2023 | 270 | apache-2.0 | Python
End-to-End Speech Processing Toolkit
Bertviz | 5,547 | 3 | 9 months ago | 5 | April 02, 2022 | 8 | apache-2.0 | Python
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Pytorch Seq2seq | 5,024 | 4 months ago | mit | Jupyter Notebook
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
4 months ago | 128 | apache-2.0 | Python
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
a year ago | Python
Pytorch implementation of U-Net, R2U-Net, Attention U-Net, and Attention R2U-Net.
