What Do LLM Benchmarks Actually Tell Us? (+ How to Run Your Own) Share: Download MP3 Similar Tracks Quantizing LLMs - How & Why (8-Bit, 4-Bit, GGUF & More) Adam Lucek The BEST Way to Chunk Text for RAG Adam Lucek Trump Thanks Qatar for Their Generous Jet Bribe & Accidentally Does a Socialism | The Daily Show The Daily Show Model Distillation: Same LLM Power but 3240x Smaller Adam Lucek Find the BEST RAG Strategy with Domain Specific Evals Adam Lucek Why Every AI Developer Should Learn Model Context Protocol (MCP) Adam Lucek Andrew Ng Explores The Rise Of AI Agents And Agentic Reasoning | BUILD 2024 Keynote Snowflake Inc. I Trained an LLM to Think Deeper (Here's How) Adam Lucek Does LLM Size Matter? How Many Billions of Parameters do you REALLY Need? Gary Explains Improving RAG Retrieval by 60% with Fine-Tuned Embeddings Adam Lucek Knowledge Graph or Vector Database… Which is Better? Adam Lucek Evaluating LLM-based Applications Databricks Which Quantization Method is Right for You? (GPTQ vs. GGUF vs. AWQ) Maarten Grootendorst Exploring the Latency/Throughput & Cost Space for LLM Inference // Timothée Lacroix // CTO Mistral MLOps.community Compressing Large Language Models (LLMs) | w/ Python Code Shaw Talebi Scaling Test Time Compute: How o3-Style Reasoning Works (+ Open Source Implementation) Adam Lucek Unleash a SWARM of AI Agents: Reliable Multi-Agent Orchestration Adam Lucek Building Brain-Like Memory for AI | LLM Agent Memory Systems Adam Lucek Model Context Protocol (MCP), clearly explained (why it matters) Greg Isenberg AI Agents Fundamentals In 21 Minutes Tina Huang