MiniMax-01: Scaling Foundation Models with Lightning Attention · Jan 14, 2025 · Weigao Sun, et al.