LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid

Feb 11, 2025 · Weigao Sun, Disen Lan, Yiran Zhong, Xiaoye Qu, Yu Cheng