GRPO's new variants and implementation secrets Share: Download MP3 Similar Tracks Early stages of the reinforcement learning era of language models Nathan Lambert How to approach post-training for AI applications Nathan Lambert Jensen Huang on GPUs - Computerphile Computerphile We Finally Figured Out How AI Actually Works… (not what we thought!) Matthew Berman MCP + PydanticAI - Build powerful AI agents Riza Group Relative Policy Optimization (GRPO) - Formula and Code Deep Learning with Yacine Music for Work — Deep Focus Mix for Programming, Coding Chill Music Lab Olympiad level counting (Generating functions) 3Blue1Brown Everything You Wanted to Know About LLM Post-Training, with Nathan Lambert of Allen Institute for AI Cognitive Revolution \ But what is a convolution? 3Blue1Brown [Talk] Cornell Robotics Seminar: MPC in MBRL Nathan Lambert Anthropic found a "terrifying" consequence of adding reasoning to AI bycloud You’re in a War (and You Don’t Even Know It) | Eric Weinstein [ARC 2025] Alliance for Responsible Citizenship How might LLMs store facts | DL7 3Blue1Brown Python RAG Tutorial (with Local LLMs): AI For Your PDFs pixegami this MP3 file is malware John Hammond Transformers (how LLMs work) explained visually | DL5 3Blue1Brown RAG vs. CAG: Solving Knowledge Gaps in AI Models IBM Technology Understanding GD&T The Efficient Engineer