My research interests broadly include Natural Language Processing and Multimodal Models. Now, I am also committed to the frontier exploration and practical application of Large Language Models (LLM) and Artificial Intelligence Generated Content (AIGC).
(* co-first author, † corresponding author or project lead, ¡ intern)
Uncertainty-Aware Routing for Principled Alignment with MoE Dynamics []
Yilong Chen, Junyuan Shang, Yuchen Feng, Zhenyu Zhang, Naibin Gu, Ziqi Wang, Tingwen Liu, Shuohuan Wang, Yu Sun, Hua Wu, Haifeng Wang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Sparse Growing Transformer: Training-Time Sparse Depth Allocation via Progressive Attention Looping []
Yao Chen, Yilong Chen, Yinqi Yang, Junyuan Shang, Zhenyu Zhang, Zefeng Zhang, Shuaiyi Nie, Shuohuan Wang, Yu Sun, Hua Wu, Haifeng Wang, Tingwen Liu
Findings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026, Findings)
Blink: Dynamic Visual Token Resolution for Enhanced Multimodal Understanding []
Yuchen Feng¡*, Zhenyu Zhang*, Naibin Gu, Yilong Chen, Peng Fu, Zheng Lin, Shuohuan Wang, Yu Sun, Hua Wu, Weiping Wang, Haifeng Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026 (CVPR 2026)
COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence []
Zefeng Zhang¡, Xiangzhao Hao¡, Hengzhu Tang, Zhenyu Zhang†, Jiawei Sheng, Xiaodong Li, Zhenyang Li, Li Gao, Daiting Shi, Dawei Yin, Tingwen Liu
Findings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026 (CVPR 2026, Findings)
Inner Thinking Transformer: Leveraging Dynamic Depth Scaling to Foster Adaptive Internal Thinking []
Yilong Chen, Junyuan Shang, Zhenyu Zhang, Yanxi Xie, Jiawei Sheng, Tingwen Liu, Shuohuan Wang, Yu Sun, Hua Wu, Haifeng Wang
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)
Debiasing Multimodal Large Language Models via Noise-Aware Preference Optimization []
Zefeng Zhang¡, Hengzhu Tang, Jiawei Sheng, Zhenyu Zhang†, Yiming Ren, Zhenyang Li, Dawei Yin, Duohe Ma, Tingwen Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025 (CVPR 2025)
NACL: A General and Effective KV Cache Eviction Framework for LLM at Inference Time []
Yilong Chen, Guoxia Wang, Junyuan Shang, Shiyao Cui, Zhenyu Zhang, Tingwen Liu, Shuohuan Wang, Yu Sun, Dianhai Yu, Hua Wu
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)
ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts []
Zhida Feng*¡, Zhenyu Zhang*, Xintong Yu*, Yewei Fang, Lanxin Li, Xuyi Chen, Yuxiang Lu, Jiaxiang Liu, Weichong Yin, Shikun Feng, Yu Sun, Li Chen, Hao Tian, Hua Wu, Haifeng Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023 (CVPR 2023, Highlight)
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding []
Qiming Peng*, Yinxu Pan*, Wenjin Wang*, Bin Luo, Zhenyu Zhang, Zhengjie Huang, Teng Hu, Weichong Yin, Yongfeng Chen, Yin Zhang, Shikun Feng, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang
Findings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022, Findings)