Bailan He

bailan_he.png

I am a third-year PhD student in Computer Science at Ludwig Maximilian University of Munich (LMU Munich) and Siemens AG, jointly supervised by Prof. Dr. Volker Tresp and Dr. Yushan Liu. My research centers on developing trustworthy, interpretable, and knowledge-aware generative foundation models, with particular emphasis on vision-language modeling, model safety & red-teaming, and hallucination. Prior to my PhD, I earned an M.Sc. in Statistics from LMU Munich and a B.Sc. in Statistics from the Southwestern University of Finance and Economics, China. I am affiliated with TRESP Lab, MCML, and relAI.

Find me also in:

🔥 Always Hiring!
Always actively seeking motivated students for both research and master thesis projects. If you're interested in working on the topic of LLMs/MLLMs/Agents/Safety/Diffusion Model, feel free to email me your CV and transcript. Previous supervisions have led to high-quality publications in top venues such as ICLR, COLM, and ACL.
Contact: bailan.he.de@gmail.com

news

  • 09 / 2025 🏆 We are the winners of the Red‑Teaming Challenge hosted by OpenAI and Kaggle (Top 0.3% among a total of 5911 international participants)! Stay tuned for our detailed reports!
  • 07 / 2025 🎉 One papers got accepted at COLM 2025! The topic is on Fact Asymmetry. Congrats to all co-authors!

selected publications

  1. Under Review
    Bag of Tricks for Subverting Reasoning-based Safety Guardrails
    Shuo Chen, Zhen Han, Bailan He, and 5 more authors
    Under Review, 2025
  2. COLM
    Supposedly Equivalent Facts That Aren’t? Entity Frequency in Pre-training Induces Asymmetry in LLMs
    Yuan He, Bailan He, Zifeng Ding, and 8 more authors
    Conference on Language Modeling, 2025
  3. WACV
    Can Multimodal Large Language Models Truly Perform Multimodal In-Context Learning?
    Shuo Chen, Zhen Han, Bailan He, and 4 more authors
    IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025
  4. ICLR
    Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?
    Shuo Chen, Zhen Han, Bailan He, and 5 more authors
    International Conference on Learning Representations (ICLR), 2024
  5. ISWC
    Forecasttkgquestions: A benchmark for temporal question answering and forecasting over temporal knowledge graphs
    Zifeng Ding, Zongyue Li, Ruoxia Qi, and 8 more authors
    International Semantic Web Conference, 2023
  6. IJCNN
    Learning meta-representations of one-shot relations for temporal knowledge graph link prediction
    Zifeng Ding, Bailan He, Jingpei Wu, and 3 more authors
    2023