Lecture 25: Speaking Composable Kernel (CK)