Leyang
profile photo

Leyang Cui (崔乐阳)

I am a senior researcher at Tencent AI lab. I obtained my Ph.D. degree at Zhejiang University and Westlake University, advised by Prof. Yue Zhang. My research interests are focused on large language models, specifically on reward models, preference alignment algorithms, and methods for alleviating hallucinations.

Looking for internship or collaboration? drop me an email.

Email: nealcly.nlp AT gmail.com

Google Scholar  /  DBLP

News

  • Aug. 2024: Invited to serve as Session Chair at ACL 2024.

  • Sep. 2023: Invited to serve as Action Editor/Area Chair at EACL, NAACL and ACL 2024.

  • Dec. 2022: Invited to serve as Area Chair (Syntax: Tagging, Chunking and Parsing) at ACL 2023.

  • Jun. 2022: Invited to serve as Area Chair (Dialogue and Interactive Systems) at EMNLP 2022.

  • Experience

  • Jun. 2021 - Dec. 2021, Research Intern, Pattern Recognition Center, Wechat, Tencent. Advised by Fandong Meng
  • Nov. 2020 - May. 2021, Research Intern, NLC Group, MSRA. Advised by Yu Wu and Shujie Liu
  • Jun. 2019 - Sep. 2019, Research Intern, NLC Group, MSRA. Advised by Yu Wu and Shujie Liu
  • Sep. 2017 - Apr. 2018, Research Intern, NLP Department, I2R, ASTAR. Advised by Anh Tuan Luu
  • Publications

    (*: Equal contribution, †: Corresponding author)

    Preprints:

    1. Do Reasoning Models Show Better Verbalized Calibration? [PDF]
    2. Qingcheng Zeng, Weihao Xuan, Leyang Cui, Rob Voigt

    3. Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs [PDF]
    4. Sen Yang, Xin Li, Leyang Cui, Lidong Bing, Wai Lam

    5. Towards Robust Online Dialogue Response Generation [PDF]
      Leyang Cui, Fandong Meng, Yijin Liu, Jie Zhou, Yue Zhang

    Conference Papers:

    1. Lost in Literalism: How Supervised Training Shapes Translationese in LLMs [PDF]
    2. Yafu Li, Ronghao Zhang, Zhilin Wang, Huajian Zhang, Leyang Cui, Yongjing Yin, Tong Xiao, Yue Zhang
      ACL 2025

    3. Alleviating Hallucinations of Large Language Models through Induced Hallucinations [PDF]
    4. Yue Zhang, Leyang Cui†, Wei Bi, Shuming Shi
      Findings of NAACL 2025

    5. Collaborative Evaluation: Exploring the Synergy of Large Language Models and Humans for Open-ended Generation Evaluation [PDF]
    6. Qintong Li, Leyang Cui, Lingpeng Kong, Wei Bi
      COLING 2025

    7. Gated Slot Attention for Efficient Linear-Time Sequence Modeling
    8. Yu Zhang, Songlin Yang, Rui-Jie Zhu, Yue Zhang, Leyang Cui, Yiqiao Wang, Bolun Wang, Freda Shi, Bailin Wang, Wei Bi, Peng Zhou, Guohong Fu
      NeurIPS 2024

    9. Knowledge Verification to Nip Hallucination in the Bud
    10. Fanqi Wan, Xinting Huang, Leyang Cui, Xiaojun Quan, Wei Bi, Shuming Shi
      EMNLP 2024

    11. Not All Preference Pairs Are Created Equal: A Recipe for Annotation-Efficient Iterative Preference Learning
    12. Sen Yang, Leyang Cui†, Deng Cai, Xinting Huang, Shuming Shi, Wai Lam
      Findings of EMNLP 2024

    13. Selection-p: Self-Supervised Task-Agnostic Prompt Compression for Faithfulness and Transferability
    14. Tsz Ting Chung, Leyang Cui, Lemao Liu, Xinting Huang, Shuming Shi, Dit-Yan Yeung
      Findings of EMNLP 2024

    15. DETect: Deepfake Text Detection in the Wild
    16. Yafu Li, Qintong Li, Leyang Cui†, Wei Bi, Zhilin Wang, Longyue Wang, Linyi Yang, Shuming Shi, Yue Zhang†
      ACL 2024

    17. Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal
    18. Jianheng Huang, Leyang Cui, Ante Wang, Chengyi Yang, Xinting Liao, Linfeng Song, Junfeng Yao, Jinsong Su
      ACL 2024

    19. GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
    20. Qintong Li, Leyang Cui, Xueliang Zhao, Lingpeng Kong, Wei Bi
      ACL 2024

    21. Spotting AI’s Touch: Identifying LLM-Paraphrased Spans in Text
    22. Yafu Li, Zhilin Wang, Leyang Cui†, Wei Bi, Shuming Shi, Yue Zhang†
      Findings of ACL 2024

    23. Benchmarking and Improving Long-Text Translation with Large Language Models
    24. Zefeng Du, Wenxiang Jiao, Longyue Wang, Chenyang Lyu, Jianhui Pang, Leyang Cui, Kaiqiang Song, Derek F. Wong, Shuming Shi, Zhaopeng Tu
      Findings of ACL 2024

    25. Retrieval is Accurate Generation
    26. Bowen Cao, Deng Cai, Leyang Cui, Xuxin Cheng, Wei Bi, Yuexian Zou, Shuming Shi
      ICLR 2024

    27. NaRuto: Automatically Acquiring Planning Models from Narrative Texts
    28. Ruiqi Li, Leyang Cui, Songtuan Lin, Patrik Haslum
      AAAI 2024

    29. EDeR: Towards Understanding Dependency Relations Between Events
    30. Ruiqi Li, Patrik Haslum, Leyang Cui
      EMNLP 2023

    31. RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation
      Yue Zhang, Leyang Cui†, Enbo Zhao, Wei Bi, Shuming Shi
      EMNLP 2023

    32. Non-autoregressive Text Editing with Copy-aware Latent Alignments
      Yu Zhang*, Yue Zhang*, Leyang Cui, Guohong Fu
      EMNLP 2023

    33. LogiCoT: Logical Chain-of-Thought Instruction Tuning
      Hanmeng Liu, Zhiyang Teng, Leyang Cui, Chaoli Zhang, Qiji Zhou, Yue Zhang
      Findings of EMNLP 2023

    34. Explicit Syntactic Guidance for Neural Text Generation
      Yafu Li, Leyang Cui†, Jianhao Yan, Yongjing Yin, Wei Bi, Shuming Shi, Yue Zhang†
      ACL 2023, Best Paper Nomination (1.6%)

    35. Enhancing Grammatical Error Correction Systems with Explanations
      Yuejiao Fei, Leyang Cui†, Sen Yang, Wai Lam, Zhenzhong Lan, Shuming Shi
      ACL 2023

    36. Uni-Encoder: A Fast and Accurate Response Selection Paradigm for Generation-Based Dialogue Systems
      Chiyu Song, Hongliang He, Haofei Yu, Pengfei Fang, Leyang Cui, Zhenzhong Lan
      Findings of ACL 2023

    37. Cross-domain Generalization for AMR Parsing
      Xuefeng Bai, Sen Yang, Leyang Cui, Linfeng Song, Yue Zhang
      EMNLP 2022

    38. Multi-Granularity Optimization for Non-Autoregressive Translation
      Yafu Li, Leyang Cui†, Yongjing Yin, Yue Zhang†
      EMNLP 2022

    39. FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition
      Linyi Yang, Lifan Yuan, Leyang Cui, Wenyang Gao, Yue Zhang
      COLING 2022

    40. Investigating Non-local Features for Neural Constituency Parsing
      Leyang Cui*, Sen Yang*, Yue Zhang
      ACL 2022

    41. Challenges to Open-Domain Constituency Parsing
      Sen Yang, Leyang Cui, Ruoxi Ning, Di Wu, Yue Zhang
      Findings of ACL 2022

    42. Knowledge Enhanced Fine-tuning for Better Handling Unseen Entities in Dialogue Generation
      Leyang Cui, Yu Wu, Shujie Liu, Yue Zhang
      EMNLP 2021

    43. Solving Aspect Category Sentiment Analysis as a Text Generation Task
      Jian Liu, Zhiyang Teng, Leyang Cui, Hanmeng Liu, Yue Zhang
      EMNLP 2021

    44. Template-based Named Entity Recognition Using BART
      Leyang Cui, Yu Wu, Jian Liu, Sen Yang, Yue Zhang
      Findings of ACL 2021

    45. On Commonsense Cues in BERT for Solving Commonsense Tasks
      Leyang Cui, Sijie Cheng, Yu Wu, Yue Zhang
      Findings of ACL 2021

    46. Natural Language Inference in Context Investigating Contextual Reasoning over Long Texts
      Hangmeng Liu, Leyang Cui, Jian Liu, Yue Zhang
      AAAI 2021

    47. MuTual: A Dataset for Multi-Turn Dialogue Reasoning
      Leyang Cui, Yu Wu, Shujie Liu, Yue Zhang, Ming Zhou
      ACL 2020

    48. What Have We Achieved on Text Summarization?
      Dandan Huang*, Leyang Cui*, Sen Yang*, Guangsheng Bao, Kun Wang, Jun Xie, Yue Zhang
      EMNLP 2020

    49. Making the Best Use of Review Summary for Sentiment Analysis
      Sen Yang*, Leyang Cui*, Jun Xie, Yue Zhang
      COLING 2020

    50. Does Chinese BERT Encode Word Structure?
      Yile Wang, Leyang Cui, Yue Zhang
      COLING 2020

    51. LogiQA: A Challenge Dataset for Machine Reading Comprehension with Logical Reasoning
      Jian Liu, Leyang Cui, Hanmeng Liu, Dandan Huang, Yile Wang, Yue Zhang
      IJCAI 2020

    52. Evaluating Commonsense in Pre-trained Language Models
      Xuhui Zhou, Yue Zhang, Leyang Cui, Dandan Huang
      AAAI 2020

    53. Hierarchically-Refined Label Attention Network for Sequence Labeling
      Leyang Cui, Yue Zhang
      EMNLP 2019

    Journal Papers:

  • Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models [PDF]
  • Yue Zhang, Yafu Li, Leyang Cui†, Deng Cai, Lemao Liu, Tingchen Fu, Xinting Huang, Enbo Zhao, Yu Zhang, Yulong Chen, Longyue Wang, Anh Tuan Luu, Wei Bi, Freda Shi, Shuming Shi
    Computational Linguistics

    1. LogiQA 2.0—An Improved Dataset for Logical Reasoning in Natural Language Understanding
      Hanmeng Liu, Jian Liu, Leyang Cui, Zhiyang Teng, Nan Duan, Ming Zhou, Yue Zhang
      IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023

    2. Label Attention Network for Structured Prediction
      Leyang Cui*, Yafu Li*, Yue Zhang
      IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022

    3. Improving Skip-gram Embeddings Using BERT
      Yile Wang, Leyang Cui, Yue Zhang
      IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021

    Services

  • Senior Program Committee Members: IJCAI 2021
  • Area Chair: EMNLP 2022, ACL 2023, EACL 2024, NAACL 2024, ACL 2024
  • Reviewer: TASLP, TACL
  • Program Committee Members: NAACL 2022, ACL 2022, AAAI 2021, ACL 2021, EMNLP 2021, AAAI 2020, EMNLP 2020, COLING 2020, IJCAI 2020.
  • Awards

  • Westlake Presidential Award, 2022

  • Stars of Tomorrow, MSRA, 2021

  • Outstanding Students, Zhejiang University, 2019, 2020 and 2021

  • National Scholarship, 2019