Shuzheng Si

Ph.D. Student, Tsinghua University

Hi, I’m Shuzheng Si, a second-year CS Ph.D. candidate at Tsinghua University. I am fortunate to be advised by Prof. Maosong Sun from the TsinghuaNLP Lab. Before this, I obtained my master’s degree from Peking University, where I was supervised by Prof. Baobao Chang at the Institute of Computational Linguistics. My research interests lie in Natural Language Processing and Large Language Models. Recently, I have been focusing on a fundamental challenge for LLMs: hallucinations.

  • 📖 Hallucination Attribution — Understanding Hallucinations at the Source: My first line of research aims to understand how training data contributes to hallucinations (NOVA, GATEAU, NUGGETS). It combines qualitative and quantitative analyses of training data to guide data collection and thereby reduce LLM hallucinations.
  • 🔧 Hallucination Mitigation — Reducing Hallucinations in LLMs: This line of research focuses on designing effective strategies to reduce hallucinations, covering text generation (CANOE, LingoEDU), multi-modal scenarios (MMICL, LACING), and agentic tasks (SpokenWOZ, EAGLET, RhinoInsight). These methods provide practical solutions to reduce hallucinations in real-world applications.
  • 🔎 Hallucination Detection — Identifying Hallucinations in the Wild: This line of research aims to identify hallucinated responses generated by LLMs and prevent them from being served to users (FaithLens, InFi-Check). By enabling timely hallucination detection and practical interventions, LLM-based systems can be more reliable and trustworthy in real-world deployment.
  • 🌏 Building Information-Seeking Tools with Low Hallucination Rates: I also apply my research to build real-world information-seeking applications with low hallucination rates, e.g., LingoWhale, RhinoInsight, and Zhiliao News, released by DeepLang AI and the TsinghuaNLP Lab. To date, these applications have provided trustworthy text-processing services to hundreds of thousands of Chinese users.
I’m always happy to discuss potential collaborations. Feel free to drop me an email if you are interested in connecting. 🕊🕊🕊


Education
  • Tsinghua University

    Ph.D. in Computer Science and Technology, Sep. 2024 - Jul. 2028 (expected)

  • Peking University

    M.S. in Software Engineering, Sep. 2021 - Jul. 2024

  • Yunnan University

    B.S. at the School of Software (Rank: 1/300+), Sep. 2017 - Jul. 2021

Honors & Awards
  • CAST’s Young Talents Support Project, Ph.D. Program 2025
  • EMNLP SAC Highlights Paper Award 2025
  • Comprehensive Excellence Scholarship, THU 2025
  • Merit Student, PKU 2022
  • Student of the Year Nominee Award (Ranked 1st, YNU) 2020
  • National Scholarship 2019
  • Provincial Government Scholarship 2018
Experience
  • DeepLang AI

    Research Staff, Apr. 2024 - Present

  • Alibaba DAMO Academy

    Research Intern, Jun. 2022 - Jun. 2023

  • SenseTime Research

    Research Intern, Jul. 2021 - Feb. 2022

Service
  • NLP Communities: Reviewer for ACL, EMNLP, NAACL, COLING, and TASLP
  • ML Communities: Reviewer for NeurIPS, ICLR, ICML, and AAAI
  • CV Communities: Reviewer for ICCV
  • I am also a member of the BIRD team, led by the talented researcher Jinyang Li, which drives the development of text-to-SQL for real-world database applications.
News
2025
▪ 🚀 Honored to receive the China Association for Science and Technology (CAST) Young Talents Support Project for Doctoral Students.
Dec 15
β–ͺ πŸ† My first-authored paper GATEAU wins the SAC Highlights Award (top 35 out of 8,000+ submissions) at EMNLP 2025!
Nov 09
▪ 🎉 Four papers accepted by EMNLP 2025, congrats to all co-authors!
Aug 25
▪ 🎉 Four papers accepted by ACL 2025, congrats to all co-authors!
May 15
2024
▪ 🧑🏻‍💻 Started my Ph.D. journey.
Sep 01
First-Authored Papers (view all papers on Google Scholar)
FaithLens: Detecting and Explaining Faithfulness Hallucination

Shuzheng Si, Qingyi Wang, Haozhe Zhao, Yuzhuo Bai, Guanqiao Chen, Kangyang Luo, Gang Chen, Fanchao Qi, Minjia Zhang, Baobao Chang, Maosong Sun

Preprint 2025

A Goal Without a Plan Is Just a Wish: Efficient and Effective Global Planner Training for Long-Horizon Agent Tasks

Shuzheng Si, Haozhe Zhao, Kangyang Luo, Gang Chen, Fanchao Qi, Minjia Zhang, Baobao Chang, Maosong Sun

Preprint 2025

From Context to EDUs: Faithful and Structured Context Compression via Elementary Discourse Unit Decomposition

Yiqing Zhou*, Yu Lei*, Shuzheng Si*, Qingyan Sun*, Wei Wang, Yifei Wu, Hao Wen, Gang Chen, Fanchao Qi, Maosong Sun

Technical Report 2025 (* indicates co-first authors)

RhinoInsight: Improving Deep Research through Control Mechanisms for Model Behavior and Context

Yu Lei*, Shuzheng Si*, Wei Wang*, Yifei Wu, Gang Chen, Fanchao Qi, Maosong Sun

Technical Report 2025 (* indicates co-first authors)

Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning

Shuzheng Si, Haozhe Zhao, Cheng Gao, Yuzhuo Bai, Zhitong Wang, Bofei Gao, Kangyang Luo, Wenhao Li, Yufei Huang, Gang Chen, Fanchao Qi, Minjia Zhang, Baobao Chang, Maosong Sun

AAAI 2026 (Oral), KnowFM@ACL 2025 Workshop (Oral)

GATEAU: Selecting Influential Samples for Long Context Alignment

Shuzheng Si, Haozhe Zhao, Gang Chen, Yunshui Li, Kangyang Luo, Chuancheng Lv, Kaikai An, Fanchao Qi, Baobao Chang, Maosong Sun

EMNLP 2025 (SAC Highlights Award 🏆)

InFi-Check: Interpretable and Fine-Grained Fact-Checking of LLMs

Yuzhuo Bai*, Shuzheng Si*, Kangyang Luo, Qingyi Wang, Wenhao Li, Gang Chen, Fanchao Qi, Maosong Sun

KnowFM@ACL 2025 Workshop (Oral, * indicates co-first authors)

Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering

Shuzheng Si, Haozhe Zhao, Gang Chen, Cheng Gao, Yuzhuo Bai, Zhitong Wang, Kaikai An, Kangyang Luo, Chen Qian, Fanchao Qi, Baobao Chang, Maosong Sun

ACL 2025

Looking Beyond Text: Reducing Language Bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance

Haozhe Zhao*, Shuzheng Si*, Liang Chen, Yichi Zhang, Maosong Sun, Minjia Zhang, Baobao Chang

EMNLP 2025 (* indicates co-first authors)

Rethinking Semantic Parsing for Large Language Models: Enhancing LLM Performance with Semantic Hints

Kaikai An*, Shuzheng Si*, Helan Hu, Haozhe Zhao, Yuchi Wang, Qingyan Guo, Baobao Chang

ACL 2025 (* indicates co-first authors)

Improving the Robustness of Distantly-Supervised Named Entity Recognition via Uncertainty-Aware Teacher Learning and Student-Student Collaborative Learning

Shuzheng Si, Helan Hu, Haozhe Zhao, Shuang Zeng, Kaikai An, Zefan Cai, Baobao Chang

ACL (Findings) 2024

Mitigating Language-Level Performance Disparity in mPLMs via Teacher Language Selection and Cross-lingual Self-Distillation

Haozhe Zhao*, Zefan Cai*, Shuzheng Si*, Liang Chen, Yufeng He, Kaikai An, Baobao Chang

NAACL 2024 (* indicates co-first authors)

MMICL: Empowering Vision-Language Model with Multi-Modal In-Context Learning

Haozhe Zhao*, Zefan Cai*, Shuzheng Si*, Xiaojian Ma, Kaikai An, Liang Chen, Zixuan Liu, Sheng Wang, Wenjuan Han, Baobao Chang

ICLR 2024 (* indicates co-first authors)

SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents

Shuzheng Si, Wentao Ma, Haoyu Gao, Yuchuan Wu, Ting-En Lin, Yinpei Dai, Hangyu Li, Rui Yan, Fei Huang, Yongbin Li

NeurIPS 2023 (reviewer scores: 9/9/7/6/5)

SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recognition

Shuzheng Si, Zefan Cai, Shuang Zeng, Guoqiang Feng, Jiaxing Lin, Baobao Chang

ACL (Findings) 2023

Mining Clues from Incomplete Utterance: A Query-Enhanced Network for Incomplete Utterance Rewriting

Shuzheng Si, Shuang Zeng, Baobao Chang

NAACL 2022

SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER

Shuzheng Si, Shuang Zeng, Jiaxing Lin, Baobao Chang

COLING 2022
