About meI'm a Ph.D student in School of Data Science at The Chinese University of Hong Kong, Shenzhen, China. I'm very proud to be advised by Prof. Zhi-Quan (Tom) Luo. I'm also very fortunate to work closely with Prof. Ruoyu Sun. Previously, I did my undergraduate study in the Department of Mathematics at Southern University of Science and Technology (SUSTech). My research focuses on optimization, deep learning, and especially, large language models. I am interested in important and practical problems with optimization flavor. Biography
Major Research Projects(My full publication list can be seen in Google Scholar) Towards Quantifying the Hessian Structure of Neural Networks XX^t Can Be Faster Finite Horizon Optimization: Framework and Applications Adam-mini: Use Fewer Learning Rates To Gain More Why Transformers Need Adam: A Hessian Perspective Adam Can Converge Without Any Modification on Update Rules Does Adam Converge and When? When Expressivity Meets Trainability: Fewer than n Neurons Can Work Invited TalksJune 2025: I gave a talk at ICTCAS and FAI Seminar, hosted by Bohan Wang and Jiaye Teng. Thanks Bohan and Jiaye for the invitation!
Dec 2024: I gave a talk at Tsinghua University, hosted by Kaifeng Lyu. Thanks Kaifeng for the invitation!
Oct 2024: I gave a talk at University of Minnesota, hosted by Prof. Mingyi Hong. Thanks Prof. Hong for the invitation!
Oct 2024: I gave a talk at INFORMS Anneal Meeting, Seattle, hosted by Jianhao Ma. Thanks Jianhao for the invitation!
Sep 2023: I gave a talk at Tsinghua University, hosted by Prof. Jian Li. Thanks Prof. Li for the invitation!
Jan 2023: I gave a talk at Google Brain, hosted by Dr. Diederik P. Kingma. Thanks Dr. Kingma for the invitation!
AwardsDec 2023: Duan Yongping Outstanding Resesearch Award (1st place) Dec 2023: Teaching Assistant Award, School of Data Science Aug 2022: Best Paper Presentation Award (1st place), 2nd Doctoral and Postdoctoral Daoyuan Academic Forum
Jul 2021: Best Paper Presentation Award (1st place), 3rd Tsinghua-Berkeley workshop on Learning Theory
Jun 2019: Magna cum laude of SUSTech Jun 2019: Outstanding graduation thesis, SUSTech Sep 2018: Scholarship Award for Excellence, Mathematics department, SUSTech (Top 10 students) ServicesReviewerI serve as a reviewer for machine learning conferences including NeurIPS, ICLR, ICML, COLT, AISTATS, as well as journals including JMLR and TMLR. Social ActivitiesI hosted a session named “Optimization Issues in Recent AI Models” at INFORMS Anneal Meeting, Oct, 2024. Teaching Assistant (by time)
Experiences2009 - 2012: I spent the best three years at the Shenzhen Foreign Language School, branch. |