Sean Xuefeng Du
@xuefeng_du
Incoming Assistant Professor @NTUsg | Ph.D. @WisconsinCS, fellow @JaneStreetGroup | reliable machine learning 🤖️ ⛑️
🚨 We’re hiring! The Radio Lab @ NTU Singapore is looking for PhD, master, undergrads, RAs, and interns to build responsible AI & LLMs. Remote/onsite from 2025. Interested? Email us: [email protected] 🔗 d12306.github.io/recru.html Please spread the word if you can!


Sean @xuefeng_du successfully defended his PhD thesis on “Foundations of Unknown-Aware Machine Learning” today. His PhD work laid the theoretical and algorithmic groundwork for building AI systems that can recognize and reason about the unknown—shaping the field in significant…
🚨New Paper!🚨 We trained reasoning LLMs to reason about what they don't know. o1-style reasoning training improves accuracy but produces overconfident models that hallucinate more. Meet RLCR: a simple RL method that trains LLMs to reason and reflect on their uncertainty --…
How could we characterize the performance gap of MLLMs under distribution shifts? Please drop by our poster at #ICML2025 !! 🕒Jul 16 (Tomorrow) 11:00-13:30 📍#2707 East Exhibition Hall A-B Happy to introduce a new information-theoretic quantification of MLLM's robustness😋
In May, the latest class of CS PhDs walked across the stage toward their futures—and officially became alumni. Now off to begin careers in academia and industry, we congratulate all who successfully defended their dissertations in the last year: cs.wisc.edu/2025/06/30/cel…
🌍 GeoArena is live! Evaluate how well large vision-language models (LVLMs) understand the world through image geolocalization. Help us compare models via human preference — your feedback matters! 🔗 Try it now: huggingface.co/spaces/garena2… #GeoArena #Geolocation #LVLM #AI
🎉Our survey on how OOD detection & related tasks have evolved in the VLM and Large VLM era is accepted to #TMLR! The field is finally coming together, and OOD detection & anomaly detection are now at the center in the VLM era. In the LVLM era, UPD (Unsolvable Problem…
Amazing work!
A good language model should say “I don’t know” by reasoning about the limits of its knowledge. Our new work AbstentionBench carefully measures this overlooked skill in leading models in an open-codebase others can build on! We find frontier reasoning degrades models’ ability to…
Excited to share that I have received the NSF GRFP!!😀 I'm really grateful to my advisor @SharonYixuanLi for all her support, to @YilunZhou and @jacobandreas, and to everyone else who has guided me through my research journey! #nsfgrfp
Please check out our work on steering internal representations to detect LLM hallucinations!
🚨 If you care about reliable, low-cost LLM hallucination detection, our #ICML2025 paper offers a powerful and data-efficient solution. 💡We introduce TSV: Truthfulness Separator Vector — a single vector injected into a frozen LLM that reshapes its hidden space to better…
🚨 If you care about reliable, low-cost LLM hallucination detection, our #ICML2025 paper offers a powerful and data-efficient solution. 💡We introduce TSV: Truthfulness Separator Vector — a single vector injected into a frozen LLM that reshapes its hidden space to better…
🛟 Reliable & reliability researchers @CVPR! Join our workshop on Uncertainty Quantification for Computer Vision next week! We have a super lineup of speakers (from self-driving to LLMs) and cool posters. 🗓️ Day: Wed, Jun 11 📌Room: 102 B #CVPR2025 #UNCV2025
Things are happening over at Morgridge Hall, the future home of CDIS, Computer Sciences, Statistics, and the iSchool. As of Tuesday, 6/10, the building's official name is emblazoned above the main entrance at Orchard & Johnson.
Large Language Models Often Know When They Are Being Evaluated Joe Needham, Giles Edkins (@gdedkins), Govind Pimpale (@GovindPimpale), Henning Bartsch, @MariusHobbhahn @apolloaievals @matsprogram
Dear MAGA friends, I have been worrying about STEM in the US a lot, because right now the Senate is writing new laws that cut 75% of the STEM budget in the US. Sorry for the long post, but the issue is really important, and I want to share what I know about it. The entire…
As I reflect on my time working with @xuefeng_du these years, what stood out wasn’t just his intellectual achievements—but also his kindness and thoughtfulness, as a person. These characteristics touch me and leave a lasting impression. This moment was a perfect example of…
Please check @xuanmingzh36352 ’s work on building socially intelligent LLM agents! 🎉🎉
MetaMind: Equip LLM with a "Social Brain" via Metacognitive Multi-agent Framework “What is meant often goes far beyond what is said, and that is what makes conversation possible.” ——H. P. Grice Huggingface: huggingface.co/papers/2505.18…