Zhenyu ZHANG

My research interests broadly include Natural Language Processing and Multimodal Models. Now, I am also committed to the frontier exploration and practical application of Large Language Models (LLM) and Artificial Intelligence Generated Content (AIGC).

Google Scholar / DBLP

(* co-first author, † corresponding author, ¡ intern)


— Preprint —


Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging []
Tingfeng Hui¡, Zhenyu Zhang, Shuohuan Wang, Yu Sun, Hua Wu, Sen Su

E-Bench: Towards Evaluating the Ease-of-Use of Large Language Models []
Zhenyu Zhang*, Bingguang Hao, Jinpeng Li, Zekai Zhang, Dongyan Zhao

HFT: Half Fine-Tuning for Large Language Models []
Tingfeng Hui¡, Zhenyu Zhang, Shuohuan Wang, Yu Sun, Weiran Xu, Hua Wu


— Publications —


DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion []
Yilong Chen¡, Linhao Zhang, Junyuan Shang, Zhenyu Zhang, Tingwen Liu, Shuohuan Wang, Yu Sun
Proceedings of the 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024)

FFT: Towards Evaluating Large Language Models with Factuality, Fairness, Toxicity []
Shiyao Cui*, Zhenyu Zhang*, Yilong Chen, Wenyuan Zhang, Tianyun Liu, Siqi Wang, Tingwen Liu
Proceedings of the KDD 2024 workshop on Evaluation and Trustworthiness of Generative AI Models (GenAI Eval @ KDD 2024)

LoginMEA: Local-to-Global Interaction Network for Multi-modal Entity Alignment []
Taoyu Su, Xinghua Zhang, Jiawei Sheng, Zhenyu Zhang, Tingwen Liu
Proceedings of the 27th European Conference on Artificial Intelligence (ECAI 2024)

LEMON: Reviving Stronger and Smaller LMs from Larger LMs with Linear Parameter Fusion []
Yilong Chen¡, Junyuan Shang, Zhenyu Zhang, Shiyao Cui, Tingwen Liu, Shuohuan Wang, Yu Sun, Hua Wu
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

NACL: A General and Effective KV Cache Eviction Framework for LLM at Inference Time []
Yilong Chen¡, Guoxia Wang, Junyuan Shang, Shiyao Cui, Zhenyu Zhang, Tingwen Liu, Shuohuan Wang, Yu Sun, Dianhai Yu, Hua Wu
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)


ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts []
Zhida Feng, Zhenyu Zhang*, Xintong Yu*, Yewei Fang, Lanxin Li, Xuyi Chen, Yuxiang Lu, Jiaxiang Liu, Weichong Yin, Shikun Feng, Yu Sun, Li Chen, Hao Tian, Hua Wu, Haifeng Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023 (CVPR 2023, Highlight)

Learning Structural Co-occurrences for Structured Web Data Extraction in Low-Resource Settings []
Zhenyu Zhang, Bowen Yu, Tingwen Liu, Tianyun Liu, Yubin Wang, Li Guo
Proceedings of the Web Conference 2023 (WWW 2023)

Enhancing Table Retrieval with Dual Graph Representations []
Tianyun Liu, Xinghua Zhang, Zhenyu Zhang, Yubin Wang, Quangang Li, Shuai Zhang, Tingwen Liu
Proceedings of the 2023 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD 2023)


Towards Generalized Open Information Extraction []
Bowen Yu, Zhenyu Zhang, Jingyang Li, Haiyang Yu, Tingwen Liu, Jian Sun, Yongbin Li, Bin Wang
Findings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022, Findings)

ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding []
Qiming Peng*, Yinxu Pan*, Wenjin Wang*, Bin Luo, Zhenyu Zhang, Zhengjie Huang, Teng Hu, Weichong Yin, Yongfeng Chen, Yin Zhang, Shikun Feng, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang
Findings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022, Findings)

Layout-Aware Information Extraction for Document-Grounded Dialogue: Dataset, Method and Demonstration []
Zhenyu Zhang*, Bowen Yu*, Haiyang Yu, Tingwen Liu, Cheng Fu, Jingyang Li, Chengguang Tang, Jian Sun, Yongbin Li
Proceedings of the 30th ACM International Conference on Multimedia (MM 2022)

Enhancing Chinese Pre-trained Language Model via Heterogeneous Linguistics Graph []
Yanzeng Li, Jiangxia Cao, Xin Cong, Zhenyu Zhang, Bowen Yu, Hongsong Zhu, Tingwen Liu
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022)


Relation Extraction Based on Data Partition and Representation Integration []
Jiapeng Zhao, Panpan Zhang, Tingwen Liu, Zhenyu Zhang, Yanzeng Li, Jinqiao Shi
Proceedings of the 6th IEEE International Conference on Data Science in Cyberspace (DSC 2021, Best Paper Award Nominee)

Improving Distantly-Supervised Named Entity Recognition with Self-Collaborative Denoising Learning []
Xinghua Zhang, Bowen Yu, Tingwen Liu, Zhenyu Zhang, Jiawei Sheng, Xue Mengge, Hongbo Xu
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

NA-Aware Machine Reading Comprehension for Document-Level Relation Extraction []
Zhenyu Zhang, Bowen Yu, Xiaobo Shu, Tingwen Liu
Proceedings of the 2021 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD 2021)

From What to Why: Improving Relation Extraction with Rationale Graph []
Zhenyu Zhang, Bowen Yu, Xiaobo Shu, Mengge Xue, Tingwen Liu, Li Guo
Findings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021, Findings)

Multi-Granularity Heterogeneous Graph for Document-Level Relation Extraction []
Hengzhu Tang, Yanan Cao, Zhenyu Zhang, Ruipeng Jia, Fang Fang, Shi Wang
Proceedings of the 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)

Semi-Open Information Extraction []
Bowen Yu, Zhenyu Zhang, Jiawei Sheng, Tingwen Liu, Yubin Wang, Yucheng Wang, Bin Wang
Proceedings of the Web Conference 2021 (WWW 2021)


Document-level Relation Extraction with Dual-tier Heterogeneous Graph []
Zhenyu Zhang, Bowen Yu, Xiaobo Shu, Tingwen Liu, Hengzhu Tang, Yubin Wang, Li Guo
Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020)

Learning to Prune Dependency Trees with Rethinking for Neural Relation Extraction []
Bowen Yu, Mengge Xue, Zhenyu Zhang, Tingwen Liu, Yubin Wang, Bin Wang
Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020)

Coarse-to-Fine Pre-training for Named Entity Recognition []
Mengge Xue, Bowen Yu, Zhenyu Zhang, Tingwen Liu, Yue Zhang, Bin Wang
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020)

Edge-Enhanced Graph Convolution Networks for Event Detection with Syntactic Relation []
Shiyao Cui, Bowen Yu, Tingwen Liu, Zhenyu Zhang, Xuebin Wang, Jinqiao Shi
Findings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020, Findings)

Fine-Grained Semantics-Aware Heterogeneous Graph Neural Networks []
Yubin Wang, Zhenyu Zhang, Tingwen Liu, Hongbo Xu, Jingjing Wang, Li Guo
Proceedings of the 21st International Conference on Web Information Systems Engineering (WISE 2020)

Joint Entity Linking and Relation Extraction with Neural Networks for Knowledge Base Population []
Zhenyu Zhang, Xiaobo Shu, Tingwen Liu, Zheng Fang, Quangang Li
Proceedings of the 2020 International Joint Conference on Neural Networks (IJCNN 2020)

DRG2vec: Learning Word Representations from Definition Relational Graph []
Xiaobo Shu, Bowen Yu, Zhenyu Zhang, Tingwen Liu
Proceedings of the 2020 International Joint Conference on Neural Networks (IJCNN 2020)

BiG-Transformer: Integrating Hierarchical Features for Transformer via Bipartite Graph []
Xiaobo Shu, Mengge Xue, Yanzeng Li, Zhenyu Zhang, Tingwen Liu
Proceedings of the 2020 International Joint Conference on Neural Networks (IJCNN 2020)

Strong Baselines for Author Name Disambiguation with and without Neural Networks []
Zhenyu Zhang, Bowen Yu, Tingwen Liu, Dong Wang
Proceedings of the 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2020)

SLGAT: Soft Labels Guided Graph Attention Networks
Yubin Wang, Zhenyu Zhang, Tingwen Liu, Li Guo []
Proceedings of the 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2020)

HIN: Hierarchical Inference Network for Document-Level Relation Extraction []
Hengzhu Tang, Yanan Cao, Zhenyu Zhang, Jiangxia Cao, Fang Fang, Shi Wang, Pengfei Yin
Proceedings of the 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2020, Best Paper Award)

Joint Extraction of Entities and Relations Based on a Novel Decomposition Strategy []
Bowen Yu, Zhenyu Zhang, Xiaobo Shu, Tingwen Liu, Yubin Wang, Bin Wang, Sujian Li
Proceedings of the 24th European Conference on Artificial Intelligence (ECAI 2020)

High Quality Candidate Generation and Sequential Graph Attention Network for Entity Linking []
Zheng Fang, Yanan Cao, Ren Li, Zhenyu Zhang, Yanbing Liu, Shi Wang
Proceedings of the Web Conference 2020 (WWW 2020)

Distilling Knowledge from Well-informed Soft Labels for Neural Relation Extraction []
Zhenyu Zhang, Xiaobo Shu, Bowen Yu, Tingwen Liu, Jiapeng Zhao, Quangang Li, Li Guo
Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI 2020)


Beyond Word Attention: Using Segment Attention in Neural Relation Extraction []
Bowen Yu, Zhenyu Zhang, Tingwen Liu, Bin Wang, Sujian Li, Quangang Li
Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI 2019)

ICNet: Incorporating Indicator Words and Contexts to Identify Functional Description Information []
Qu Liu, Zhenyu Zhang, Yanzeng Li, Tingwen Liu, Diying Li, Jinqiao Shi
Proceedings of the 2019 International Joint Conference on Neural Networks (IJCNN 2019)

Joint Entity Linking with Deep Reinforcement Learning []
Zheng Fang, Yanan Cao, Qian Li, Dongjie Zhang, Zhenyu Zhang, Yanbing Liu
Proceedings of the Web Conference 2019 (WWW 2019)