[SPCL_Bcast #50] Hardware-aware Algorithms for Language Modeling

[SPCL_Bcast #50] Hardware-aware Algorithms for Language Modeling

Share:

Similar Tracks

[SPCL_Bcast #51] Neural Network Quantization with Brevitas Scalable Parallel Computing Lab, SPCL @ ETH Zurich

Hardware-aware Algorithms for Sequence Modeling - Tri Dao | Stanford MLSys #87 Stanford MLSys Seminars

Computer Architecture - Lecture 30: SIMD and GPU Architectures (Fall 2024) Onur Mutlu Lectures

Context Is The Next Frontier by Jacob Buckman, CEO of Manifest AI Democratize Intelligence

Hardware-Aware Efficient Primitives for Machine Learning – Dan Fu Johns Hopkins Whiting School of Engineering

[SPCL_Bcast #49] Programming Groq LPUs without IEEE Floating Point Scalable Parallel Computing Lab, SPCL @ ETH Zurich

From Large Language Models to Reasoning Language Models - Three Eras in The Age of Computation. Scalable Parallel Computing Lab, SPCL @ ETH Zurich

[SPCL_Bcast #53] The evolution of accelerator-centric GPU services - past, present, future Scalable Parallel Computing Lab, SPCL @ ETH Zurich

Test-Time Adaptation: A New Frontier in AI Machine Learning Street Talk

Computer Architecture - Lecture 29: SIMD & GPU Architectures (Fall 2023) Onur Mutlu Lectures

Exascale Cloud Computing – A Foggy Tale of Networks, AI, Containers, and Ultra Ethernet Scalable Parallel Computing Lab, SPCL @ ETH Zurich

[SPCL_Bcast] Merging and MoErging for compositional generalization Scalable Parallel Computing Lab, SPCL @ ETH Zurich

Transformers (how LLMs work) explained visually | DL5 3Blue1Brown

HetSys Course: Lecture 5: GPU Performance Considerations (Fall 2022) Onur Mutlu Lectures

NVIDIA Spectrum-X Network Platform Architecture Open Compute Project

Cybersecurity Architecture: Five Principles to Follow (and One to Avoid) IBM Technology

Digital Design and Computer Arch. - L14: Out-of-Order Execution (Spring 2025) Onur Mutlu Lectures

ETH Zürich AISE: Course Introduction CAMLab, ETH Zürich

VENOM: A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores Scalable Parallel Computing Lab, SPCL @ ETH Zurich