Yuen Chen
Home
Publications
CV
Research
Publications
* denotes equal contribution. See also
Google Scholar
.
2025
UAI
2025
Moment Alignment: Unifying Gradient and Hessian Matching for Domain Generalization
Yuen Chen
, Haozhi Si, Guojun Zhang, Han Zhao
Paper
EMNLP
2025
Breaking Bad Tokens: Detoxification of LLMs Using Sparse Autoencoders
Agam Goyal, Vedant Rathi, William Yeh, Yian Wang,
Yuen Chen
, Hari Sundaram
Paper
NAACL
2025
Causally Testing Gender Bias in LLMs: A Case Study on Occupational Bias
Yuen Chen
*, Vethavikashini Chithrra Raghuram*, Justus Mattern*, Rada Mihalcea, Zhijing Jin
Paper
2024
NAACL
2024
Analyzing the Role of Semantic Representations in the Era of Large Language Models
Zhijing Jin*,
Yuen Chen
*, Fernando Gonzalez Adauto*, Jiarui Liu, Jiayi Zhang, Julian Michael, Bernhard Schölkopf, Mona Diab
PDF
ACL
2024
CausalCite: A Causal Formulation of Paper Citations
Ishan Kumar Agrawal*, Zhijing Jin*, Ehsan Mokhtarian, Siyuan Guo,
Yuen Chen
, Negar Kiyavash, Mrinmaya Sachan, Bernhard Schölkopf
PDF
arXiv
2023
NeurIPS
2023
CLadder: Assessing Causal Reasoning in Language Models
Zhijing Jin*,
Yuen Chen
*, Felix Leeb*, Luigi Gresele*, Ojasv Kamal, Zhiheng Lyu, Kevin Blin, Fernando Gonzalez, Max Kleiman-Weiner, Mrinmaya Sachan, Bernhard Schölkopf
arXiv