Wei Yao


Hi there, welcome! I’m currently a third-year Ph.D. student at the Gaoling School of Artificial Intelligence, Renmin University of China. I am very fortunate to be advised by Prof. Yong Liu. I earned my B.E. degree in Software Engineering from Huazhong University of Science and Technology in June 2022.

My previous research focused on trustworthy AI (TMLR 2024, ACL 2024, CVPR 2023, arXiv:2410.06851). Currently, I work on the theoretical analysis of superalignment, especially weak-to-strong generalization (arXiv:2502.01458, arXiv:2502.11107).

Preprints


* indicates equal contribution

Revisiting Weak-to-Strong Generalization in Theory and Practice: Reverse KL vs. Forward KL

Wei Yao*, Wenkai Yang*, Ziqiao Wang, Yankai Lin, Yong Liu
arXiv preprint arXiv:2502.11107

Understanding the Capabilities and Limitations of Weak-to-Strong Generalization

Wei Yao*, Wenkai Yang*, Ziqiao Wang, Yankai Lin, Yong Liu
arXiv preprint arXiv:2502.01458

Understanding Model Ensemble in Transferable Adversarial Attack

Wei Yao*, Zeliang Zhang*, Huayi Tang, Yong Liu
arXiv preprint arXiv:2410.06851

Selected Publications


* indicates equal contribution

Understanding Fairness Surrogate Functions in Algorithmic Fairness

Wei Yao*, Zhanke Zhou*, Zhicong Li, Bo Han, Yong Liu
TMLR 2024 (TMLR-to-ICLR pilot program)

Revisiting Pre-training Period of Large Language Models

Chen Qian*, Jie Zhang*, Wei Yao*, Dongrui Liu, Zhenfei Yin, Yu Qiao, Yong Liu, Jing Shao
ACL 2024

Random Smooth-based Certified Defense against Text Adversarial Attack

Zeliang Zhang*, Wei Yao*, Susan Liang, Chenliang Xu
EACL 2024

Fair Scratch Tickets: Finding Fair Sparse Networks without Weight Training

Pengwei Tang*, Wei Yao*, Zhicong Li, Yong Liu
CVPR 2023

Research Internship


  • 2023.10-2024.3: Shanghai AI Laboratory, research intern, mentor: Dr. Jing Shao.

Honors and Awards


National Scholarship, 2019

Service


Conference Reviewer: ICLR 2025, AISTATS 2025

Journal Reviewer: TMLR