Scaling Laws for Linear Complexity Language Models
Jun 24, 2024 · Xuyang Shen, Dong Li, Ruitao Leng, Zhen Qin, Weigao Sun, Yiran Zhong
Last updated on Jun 24, 2024