I am a Young Scientist at Shanghai AI Laboratory. I currently collaborate closely with Prof. Yu Cheng on rethinking foundation model architectures, including algorithm and system co-innovations for Linear Sequence Modeling (Linear Attention) and Mixture-of-Experts.
See our research projects Linear-MoE, Linearization, and MoM for technical details.
🔥 I am looking for talented interns to work with me on the above projects and beyond. If you are interested, please feel free to reach out with your CV or any questions. I am also always open to other collaborations.
From 2020 to 2022, I was a Research Staff Member at Linx Lab, Turing Architecture and Design Department, 2012 Lab, Huawei, supervised by Jiashu Lin and Heng Liao. I earned my PhD from Huazhong University of Science and Technology (HUST) in April 2020, jointly trained at the School of Artificial Intelligence and Automation (AIA) and the HUST Innovation Institute (with the First-class Grant).