FIT Building
Tsinghua University
Beijing, P.R. China, 100084
DebateQA: Evaluating Question Answering on Debatable Knowledge
Rongwu Xu*, Xuan Qi*, Zehan Qi, Wei Xu, Zhijiang Guo
arXiv Preprint
[Paper][Code]
Course-Correction: Safety Alignment Using Synthetic Preferences
Rongwu Xu*, Yishuo Cai*, Zhenhong Zhou, Renjie Gu, Haiqin Wang, Yan Liu, Tianwei Zhang, Wei Xu, Han Qiu
arXiv Preprint
[Paper][Code]
MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models
Zhongshen Zeng, Yinhong Liu, Yingjia Wan, Jingyao Li, Pengguang Chen, Jianbo Dai, Yuxuan Yao, Rongwu Xu, Zehan Qi, Wanru Zhao, Linling Shen, Jianqiao Lu, Haochen Tan, Yukang Chen, Hao Zhang, Zhan Shi, Bailin Wang, Zhijiang Guo, Jiaya Jia
arXiv Preprint
[Paper][Code][Project Page]
How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States
Zhenhong Zhou, Haiyang Yu, Xinghua Zhang, Rongwu Xu, Fei Huang, Yongbin Li
arXiv Preprint
[Paper][Code]
Knowledge Conflicts for LLMs: A Survey
Rongwu Xu*, Zehan Qi*, Zhijiang Guo, Cunxiang Wang, Hongru Wang, Yue Zhang, Wei Xu
arXiv Preprint
[Paper][Code][机器之心][Talk (Chinese)][Slide]
Preemptive Answer "Attacks" on Chain-of-Thought Reasoning
Rongwu Xu*, Zehan Qi*, Wei Xu
ACL 2024 (Findings) Bangkok, Thailand
[Paper][Code][Poster]
The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation
Rongwu Xu, Brian S. Lin, Shujian Yang, Tianqi Zhang, Weiyan Shi, Tianwei Zhang, Zhixuan Fang, Wei Xu, Han Qiu
ACL 2024 (Oral, Main) Bangkok, Thailand
🏆 Outstanding Paper Award [Certificate]
[Paper][Code][机器之心][Project Page][Video][Poster]
Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias
Rongwu Xu, Zi'an Zhou, Tianwei Zhang, Zehan Qi, Su Yao, Ke Xu, Wei Xu, Han Qiu
arXiv Preprint
[Paper]
Exploring Chinese Humor Generation: A Study on Two-Part Allegorical Sayings
Rongwu Xu
IJCNN 2024 Yokohama, Japan
[Paper]
Tempo: Confidentiality Preservation in Cloud-Based Neural Network Training
Rongwu Xu and Zhixuan Fang
IJCNN 2024 Yokohama, Japan
[Paper]
LSync: A Universal Timeline-synchronizing Solution for Live Streaming
Fan Dang*, Yifan Xu*, Rongwu Xu, Xinlei Chen, Yunhao Liu
IEEE/ACM ToN
[Paper]
MISO: Legacy-compatible Privacy-preserving Single Sign-on using Trusted Execution Environments
Rongwu Xu, Sen Yang, Fan Zhang, Zhixuan Fang
IEEE EuroS&P 2023 Delft, The Netherlands
[Paper][Project Page]
LSync: A Universal Event-synchronizing Solution for Live Streaming
Yifan Xu, Fan Dang, Rongwu Xu, Xinlei Chen, Yunhao Liu
IEEE INFOCOM 2022 Virtual
[Paper]
LifeRec: A Mobile App for Lifelog Recording and Ubiquitous Recommendation
Jiayu Li, Hantian Zhang*, Zhiyu He*, Rongwu Xu*, Pingfei Wu*, Min Zhang, Yiqun Liu, Shaoping Ma
ACM CHIIR 2022 Regensburg, Germany
[Paper][Code]
* denotes equal contribution.
2024 Outstanding Paper Award at ACL 2024 (<0.79% of submitted papers)
2023 Tsinghua-Yangtze River Delta International R&D Community Talent Scholarship
2023 Tsinghua University Overall Excellence Scholarship
2020 Tsinghua University Technological Innovation Scholarship
2019 Tsinghua-Panasonic Scholarship
2018 Outstanding Volunteers in Beijing
MS in Computer Science, Tsinghua University, 2025 (Expected)
BEng in Computer Science, Tsinghua University, 2022