About Me
Papers
News

Recent & Upcoming Talks
- Example Talk
Papers
Projects
Projects
Experience
Teaching
- Learn JavaScript
- Learn Python
Blog

Linear Attention Sequence Parallelism

Apr 3, 2024·

Weigao Sun

Weigao Sun

,

Zhen Qin

,

Dong Li

,

Xuyang Shen

,

Yu Qiao

,

Yiran Zhong

· 0 min read

Last updated on Apr 3, 2024

Weigao Sun

Authors

Young Scientist

← HGRN2: Gated Linear RNNs with State Expansion Apr 11, 2024

CO2: Efficient Distributed Training with Full Communication-Computation Overlap Jan 29, 2024 →

© 2025 Me. This work is licensed under CC BY NC ND 4.0

Published with Hugo Blox Builder — the free, open source website builder that empowers creators.