Anh Totti Nguyen
@anh_ng8
ISO a trustworthy and explainable AI. Deep Learning, HCI, human-machine interface, and Javascript. Associate Professor @AuburnEngineers. Hometown: Hanoi 🇻🇳
How do the best AI image editors 🤖 GPT-4o, Gemini 2.0, SeedEdit, HF 🤗 fare ⚔️ human Photoshop wizards 🧙♀️ on text-based 🏞️ image editing? Logan @septisum and Brandon @brandon_co92810 shared some answers at our poster today! #CVPR2025 psrdataset.github.io A few insights 👇
The opportunity gap in AI is more striking than ever. We talk way too much about those receiving $100M or whatever for their jobs, but not enough about those asking for <$1k to present their work. For the 3rd year in a row, @ml_collective is raising funds to support @DeepIndaba attendees.
We've published a position paper, with many across the industry, calling for work on chain-of-thought faithfulness. This is an opportunity to train models to be interpretable. We're investing in this area at OpenAI, and this perspective is reflected in our products:
I am extremely excited about the potential of chain-of-thought faithfulness & interpretability. It has significantly influenced the design of our reasoning models, starting with o1-preview. As AI systems spend more compute working e.g. on long term research problems, it is…
Supported by one of our grants, @an_vo12, Mohammad Reza Taesiri, and @anh_ng8 from @kaist_ai, tackled bias in LLMs. Their research shows that LLMs exhibit fewer biases when they can see their previous answers, leading to the development of the B-score metric.
🚨 Our latest work shows that SOTA VLMs (o3, o4-mini, Sonnet, Gemini Pro) fail at counting legs due to bias⁉️ See simple cases where VLMs get it wrong, no matter how you prompt them. 🧪 Think your VLM can do better? Try it yourself here: vlmsarebiased.github.io/#example-galle… 1/n #ICML2025
This isn't hallucination in the traditional sense. Grok's math was nearly correct. But it confidently applied PhD-level theory to explain pure fantasy, while making a MATH error. Like a surgeon giving faulty, detailed instructions for transplanting wings onto humans.
Today I finished my PhD at @AuburnEngineers with @anh_ng8. What’s next? Off to @guidelabsai to build foundation AI models that are interpretable-by-design with @juliusadml and a crew of amazing people 🌁
That @cursor_ai silently downgrades the working model from Claude4 to Claude3.5 during an active coding session is borderline criminal. A preview of what happens when AI model monopolists in the near future silently downgrade the intelligence of entire nations?
Asking GPT-4o to generate images in the style of @DiegoCusano_ shows an existing gap between the real Cusano vs. GPT-4o in creativity / wittiness. (random samples) Sometimes it signs "Cusano" at the bottom.




I got fired today. I'm not sure why, I personally don't think there is a reason, or that it's important. When I joined twitter, I joined because of the engineers I met in SF. They seemed happy. They were having fun. Engineers at play. Engineers that were enabled. It was good!
2015 in Boston: I attended my first #CVPR (1 paper in the main conference). 2025 in Nashville: my 3rd grader has his first #CVPR (no workshop or conference papers though 😜). #CVPR2025


