Paper Reading & Discussion: TorchTitan: One-stop PyTorch native solution for production ready LLM...