|
Zhongwei Wan 万中威
I am a final-year Ph.D. student (expected graduation in May 2026) at The Ohio State University🌰 in Computer Science and Engineering, advised by Prof. Mi Zhang. My research focuses on Foundation Model Reasoning (Test-time Scaling, RL for LLMs/MLLMs/Agents/VLAs), Efficient Foundation Models (Long-context LLMs, MLLMs, VLAs), and Domain-specific Foundation Models. My work has been published at NeurIPS, ICLR, ICML, EMNLP, ACL, NAACL, TMLR, ACM TOIS, and ICASSP, and I am the recipient of the ECCV CADL Workshop Best Paper Award 🏆 and the IEEE Internet Computing Magazine Best Paper Award 🏆.
Previously, I worked as a research scientist intern at ByteDance Seed (Multimodal Pre-training Team) in San Jose, Tencent AI Lab (NLP Group), and Noah’s Ark Lab (Speech and Language Group). I received my M.S. degree from the University of Chinese Academy of Sciences and my B.S. degree from Southern University of Science and Technology. Feel free to contact me if you are interested in my work or potential collaborations :P
Google Scholar /
LinkedIn /
X /
GitHub
|
Education
|
The Ohio State University, United States
Ph.D. Student in Computer & Information Science
Advised by Prof. Mi Zhang
|
|
University of Chinese Academy of Sciences, China
MPhil in Control Science & Engineering
(2020.9 - 2023.6)
|
|
Southern University of Science and Technology, China
B.S. in Computer Science
(2016.9 - 2020.6)
|
|
Services
- Reviewer: ICML, ICLR, NeurIPS, ACL, EMNLP, NAACL, TKDE, TMLR
|
|
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning
Zhongwei Wan,
Zhihao Dou, Che Liu, Yu Zhang, Dongfei Cui, Qinjian Zhao, Hui Shen, Jing Xiong, Yi Xin, Yifan Jiang, Yangfan He, Mi Zhang, Shen Yan
NeurIPS 2025
Project Page
/
Data
/
Code
/
Paper
|
|
Plan Then Action: High-level Planning Guidance Reinforcement Learning for LLM Reasoning
Zhihao Dou*, Qinjian Zhao*, Zhongwei Wan*, Dinggen Zhang, Weida Wang, Towsif Raiyan, Benteng Chen, Qingtao Pan, Yang Ouyang, Zhiqiang Gao, Shufei Zhang, Sumon Biswas
Under Review 2025, *Co-first Author
Code
/
Paper
|
|
A1: Asynchronous Test-Time Scaling via Conformal Prediction
Jing Xiong, Qiujiang Chen, Fanghua Ye, Zhongwei Wan, Chuanyang Zheng, Chenyang Zhao, Hui Shen, Alexander Hanbo Li, Chaofan Tao, Haochen Tan, Haoli Bai, Lifeng Shang, Lingpeng Kong, Ngai Wong
Under Review 2025
Code
/
Paper
|
|
PhyX: Does Your Model Have the "Wits" for Physical Reasoning?
Hui Shen, Taiqiang Wu, Qi Han, Yunta Hsieh, Jizhou Wang, Yuyue Zhang, Yuxin Cheng, Zijian Hao, Yuansheng Ni, Xin Wang, Zhongwei Wan, Kai Zhang, Wendong Xu, Jing Xiong, Ping Luo, Wenhu Chen, Chaofan Tao, Zhuoqing Mao, Ngai Wong
Under Review 2025
Code
/
Paper
|
|
Enhancing Code LLMs with Reinforcement Learning in Code Generation
Junqiao Wang, Zeng Zhang, Yangfan He, Zihao Zhang, Yuyang Song, Tianyu Shi, Yuchen Li, Hengyuan Xu, Kunyu Wu, Xin Yi, Zhongwei Wan, Xinhang Yuan, Kuan Lu, Menghao Huo, Tang Jingqun, Guangwu Qian, Keqin Li, Qiuwu Chen, Lewei He
Technical Report 2025
Paper
|
|
UNComp: Can Matrix Entropy Uncover Sparsity? -- A Compressor Design from an Uncertainty-Aware Perspective
Jing Xiong, Jianghan Shen, Fanghua Ye, Chaofan Tao, Zhongwei Wan, Jianqiao Lu, Xun Wu, Chuanyang Zheng, Zhijiang Guo, Min Yang, Lingpeng Kong, Ngai Wong
EMNLP 2025
Code
/
Paper
|
|
Recent Advances in Large Language Model Benchmarks against Data Contamination: From Static to Dynamic Evaluation
Simin Chen, Yiming Chen, Zexin Li, Yifan Jiang, Zhongwei Wan, Yixin He, Dezhi Ran, Tianle Gu, Haizhou Li, Tao Xie, Baishakhi Ray
EMNLP 2025
Code
/
Paper
|
|
SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving
Wendong Xu, Jing Xiong, Chenyang Zhao, Qiujiang Chen, Haoran Wang, Hui Shen, Zhongwei Wan, Jianbo Dai, Taiqiang Wu, He Xiao, Chaofan Tao, Z Morley Mao, Ying Sheng, Zhijiang Guo, Hongxia Yang, Bei Yu, Lingpeng Kong, Quanquan Gu, Ngai Wong
Under Review 2025
Code
/
Paper
|
|
LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction
Weichu Liu, Jing Xiong, Yuxuan Hu, Zixuan Li, Minghuan Tan, Ningning Mao, Chenyang Zhao, Zhongwei Wan, Chaofan Tao, Wendong Xu, Hui Shen, Chengming Li, Lingpeng Kong, Ngai Wong
Under Review 2025
Code
/
Paper
|
|
Autoregressive Models in Vision: A Survey
Jing Xiong, Gongye Liu, Lun Huang, Chengyue Wu, Taiqiang Wu, Yao Mu, Yuan Yao, Hui Shen, Zhongwei Wan, Jinfa Huang, Chaofan Tao, Shen Yan, Huaxiu Yao, Lingpeng Kong, Hongxia Yang, Mi Zhang, Guillermo Sapiro, Jiebo Luo, Ping Luo, Ngai Wong
TMLR 2025
Code
/
Paper
|
|
Efficient Diffusion Models: A Survey
Hui Shen, Jingxuan Zhang, Boning Xiong, Rui Hu, Shoufa Chen, Zhongwei Wan, Xin Wang, Yu Zhang, Zixuan Gong, Guangyin Bao, Chaofan Tao, Yongfeng Huang, Ye Yuan, Mi Zhang
TMLR 2025
Code
/
Paper
|
|
UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation
Zixuan Li, Jing Xiong, Fanghua Ye, Chuanyang Zheng, Xun Wu, Jianqiao Lu, Zhongwei Wan, Xiaodan Liang, Chengming Li, Zhenan Sun, Lingpeng Kong, Ngai Wong
Under Review 2024
Code
/
Paper
|
|
DGLF: A Dual Graph-based Learning Framework for Multi-modal Sarcasm Detection
Zhihong Zhu, Kefan Shen, Zhaorun Chen, Yunyan Zhang, Yuyan Chen, Xiaoqi Jiao, Zhongwei Wan, Shaorong Xie, Wei Liu, Xian Wu, Yefeng Zheng
EMNLP 2024
Paper
|
|
NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction
Zixuan Gong, Guangyin Bao, Qi Zhang, Zhongwei Wan, Duoqian Miao, Shoujin Wang, Lei Zhu, Changwei Wang, Rongtao Xu, Liang Hu, Ke Liu, Yu Zhang
NeurIPS 2024 Oral
Code
/
Paper
|
|
V-PETL Bench: A Unified Visual Parameter-efficient Transfer Learning Benchmark
Yi Xin, Siqi Luo, Xuyang Liu, Haodi Zhou, Xinyu Cheng, Christina E Lee, Junlong Du, Haozhe Wang, MingCai Chen, Ting Liu, Guimin Hu, Zhongwei Wan, Aoxue Li, Mingyang Yi, Xiaohong Liu
NeurIPS 2024
Code
/
Paper
|
|
Efficient Large Language Models: A Survey
Zhongwei Wan, Xin Wang, Che Liu, Samiul Alam, Yu Zheng, Jiachen Liu, Zhongnan Qu, Shen Yan, Yi Zhu, Quanlu Zhang, Mosharaf Chowdhury, Mi Zhang
TMLR 2024
Code
/
Paper
|
Contact
Email: wan.512 [at] osu [dot] edu
|
|