TBM 2024 — Tutorial 4: Attention Mechanism & Transformer Components

Enhancing the bigram model with self-attention and transformer building blocks