Similar Tracks
Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation
Umar Jamil
Reinforcement Learning from Human Feedback explained with math derivations and the PyTorch code.
Umar Jamil
Attention is all you need (Transformer) - Model explanation (including math), Inference and Training
Umar Jamil
LLaMA explained: KV-Cache, Rotary Positional Embedding, RMS Norm, Grouped Query Attention, SwiGLU
Umar Jamil