Exploring the Latency/Throughput & Cost Space for LLM Inference // Timothée Lacroix // CTO Mistral