labmlai

    labmlai/annotated_deep_learning_paper_implementations

    #607 this week

    🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

    deep-learning
    machine-learning
    attention
    deep-learning-tutorial
    gan
    literate-programming
    lora
    Python
    MIT
    66.5K stars
    6.7K forks
    66.5K GitHub watchers
    Updated 5/4/2026

    Use Cases & Benefits

    • This repository provides 60+ PyTorch implementations of deep learning papers with detailed side-by-side notes for better understanding.
    • Key features include implementations of transformers, GANs, reinforcement learning, optimizers, diffusion models, and normalization layers using Python and PyTorch.
    • Strengths are comprehensive coverage, active maintenance, and educational documentation; limitations may include complexity for beginners and dependency on PyTorch.
    • Organizations can use it for research, prototyping, and training deep learning models in production with well-documented, modular code and state-of-the-art algorithms.
    • Ideal use cases include academic learning, deep learning research, model benchmarking, and developing advanced neural network architectures in NLP, vision, and RL.

    About annotated_deep_learning_paper_implementations

    labml.ai Deep Learning Paper Implementations

    This is a collection of simple PyTorch implementations of neural networks and related algorithms. These implementations are documented with explanations, and the website renders them as side-by-side formatted notes. We believe these would help you understand these algorithms better.

    We are actively maintaining this repo and adding new implementations almost weekly. Follow the project on Twitter for updates.

    Paper Implementations

    Transformers

    Low-Rank Adaptation (LoRA)

    Eleuther GPT-NeoX

    Diffusion models

    Generative Adversarial Networks

    Recurrent Highway Networks

    LSTM

    HyperNetworks - HyperLSTM

    ResNet

    ConvMixer

    Capsule Networks

    U-Net

    Sketch RNN

    ✨ Graph Neural Networks

    Counterfactual Regret Minimization (CFR)

    Solving games with incomplete information such as poker with CFR.
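
    As a rough illustration of the idea behind CFR, the sketch below runs regret matching, the update rule CFR applies at each decision point, in self-play on rock-paper-scissors. It is not the repository's implementation and it is not full CFR (there is no game tree and there are no information sets). The names payoff, strategy_from_regrets and train are hypothetical, and the sketch uses only the Python standard library rather than PyTorch.

```python
import random

ACTIONS = 3  # 0 = rock, 1 = paper, 2 = scissors


def payoff(a, b):
    """Payoff to the player choosing a against b: +1 win, 0 tie, -1 loss."""
    return [0, 1, -1][(a - b) % 3]


def strategy_from_regrets(regrets):
    """Regret matching: play each action in proportion to its positive regret."""
    positive = [max(r, 0.0) for r in regrets]
    total = sum(positive)
    if total > 0:
        return [p / total for p in positive]
    # No positive regret yet, so fall back to the uniform strategy.
    return [1.0 / len(regrets)] * len(regrets)


def train(iterations, seed=0):
    """Self-play for a number of rounds; return each player's average strategy."""
    rng = random.Random(seed)
    regrets = [[0.0] * ACTIONS for _ in range(2)]
    strategy_sums = [[0.0] * ACTIONS for _ in range(2)]
    for _ in range(iterations):
        strategies = [strategy_from_regrets(r) for r in regrets]
        moves = [rng.choices(range(ACTIONS), weights=s)[0] for s in strategies]
        for p in range(2):
            opponent_move = moves[1 - p]
            actual = payoff(moves[p], opponent_move)
            for a in range(ACTIONS):
                # Regret: how much better action a would have done this round.
                regrets[p][a] += payoff(a, opponent_move) - actual
                strategy_sums[p][a] += strategies[p][a]
    return [[s / iterations for s in sums] for sums in strategy_sums]


if __name__ == "__main__":
    for player, avg in enumerate(train(20000)):
        print(player, [round(x, 3) for x in avg])
```

    The average strategy, not the final one, is what approaches equilibrium; in rock-paper-scissors that equilibrium is the uniform mix over the three actions. Poker-scale games additionally need the counterfactual weighting over a game tree that gives CFR its name.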

    Reinforcement Learning

    Optimizers

    Normalization Layers

    Distillation

    Adaptive Computation

    Uncertainty

    Activations

    Language Model Sampling Techniques

    Scalable Training/Inference

    Installation

    pip install labml-nn
    
