TBM 2024 — Tutorial 5: Full Decoder-Only Transformer Architecture

Assembling all transformer modules and training a decoder-only model