lukas helff
@lukas_helff
PhD student in the AI/ML lab at @TUDarmstadt | AI Safety | Vision Safeguarding | Synthetic Data | Visual and Logical Reasoning #AI #ML
I'll be at #ICML2025 next week presenting our recent work on VLMs and Bongard Problems! Feel free to reach out, happy to have a chat ☺️
Excited to share that our paper got accepted at #ICML2025!! 🎉 We challenge Vision-Language Models like OpenAI’s o1 with Bongard problems, classic visual reasoning challenges, and uncover surprising shortcomings. Check out the paper: arxiv.org/abs/2410.19546 & read more below 👇
📢 Update: We've deepened our exploration of VLMs on Bongard Problems with more rigorous evaluations! The best-performing model (o1) we tested solved 43 out of 100 problems - progress, but still plenty of room for improvement!
Current #AI models fail AI benchmarks from the 1960s 😤 Great collaboration with @toniwuest @philosotim @lukas_helff @devendratweetin @c_rothkopf heise.de/news/Vision-La…