Yue Xu

ASPIRE LAB, ShanghaiTech University


Hi 👋, this is Yue Xu (Savannah). I’m a Ph.D. student in Computer Science at ShanghaiTech University, advised by Prof. Wenjie Wang. My research explores AI alignment, including the safety, fairness, and robustness of large language and multimodal models, with the aim of building intelligent systems that are both trustworthy and adaptive.

Recently, I’ve been focusing on the personalization of LLMs and LLM-powered agents, exploring how memory, preference modeling, and adaptive reasoning can enable human-aligned, self-evolving agents.

If you’re interested in collaboration or discussion, feel free to reach out at xuyue2022 [at] shanghaitech.edu.cn!

news

Sep 22, 2025 FaIRMaker accepted to NeurIPS 2025! ✨ See you in San Diego! 😄
Dec 14, 2024 MMJ-Bench accepted to AAAI 2025! ✨
Sep 20, 2024 CIDER accepted to EMNLP 2024! ✨
May 30, 2024 I will be continuing my Ph.D. journey at ShanghaiTech University, advised by Prof. Wenjie Wang! 🎉
Mar 30, 2024 LinkPrompt accepted to NAACL 2024! ✨ See you in Mexico! 😄

selected publications

  1. CIDER
    Cross-modality information check for detecting jailbreaking in multimodal large language models
    Yue Xu, Xiuyuan Qi, Zhan Qin, and 1 more author
    EMNLP Findings, 2024
  2. FaIRMaker
    Auto-Search and Refinement: An Automated Framework for Gender Bias Mitigation in Large Language Models
    Yue Xu, Chengyan Fu, Li Xiong, and 2 more authors
    NeurIPS, 2025
  3. LinkPrompt
    LinkPrompt: Natural and universal adversarial attacks on prompt-based language models
    Yue Xu and Wenjie Wang
    NAACL, 2024
  4. MMJ-Bench
    MMJ-Bench: A comprehensive study on jailbreak attacks and defenses for vision language models
    Fenghua Weng, Yue Xu, Chengyan Fu, and 1 more author
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2025