Similar Tracks
[SPCL_Bcast #51] Neural Network Quantization with Brevitas
Scalable Parallel Computing Lab, SPCL @ ETH Zurich
Hardware-aware Algorithms for Sequence Modeling - Tri Dao | Stanford MLSys #87
Stanford MLSys Seminars
Hardware-Aware Efficient Primitives for Machine Learning – Dan Fu
Johns Hopkins Whiting School of Engineering
[SPCL_Bcast #49] Programming Groq LPUs without IEEE Floating Point
Scalable Parallel Computing Lab, SPCL @ ETH Zurich
From Large Language Models to Reasoning Language Models - Three Eras in The Age of Computation.
Scalable Parallel Computing Lab, SPCL @ ETH Zurich
[SPCL_Bcast #53] The evolution of accelerator-centric GPU services - past, present, future
Scalable Parallel Computing Lab, SPCL @ ETH Zurich
Exascale Cloud Computing – A Foggy Tale of Networks, AI, Containers, and Ultra Ethernet
Scalable Parallel Computing Lab, SPCL @ ETH Zurich