Similar Tracks
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Paper Explained)
Yannic Kilcher
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows (paper illustrated)
AI Bites
Vision Transformer (ViT) - An image is worth 16x16 words | Paper Explained
Aleksa Gordić - The AI Epiphany