Dayoon Ko
@dayoon12161
M.S./Ph.D. integrated student in CSE @SeoulNatlUni | Research Intern @LG_AI_Research
🚨 Excited to share that our paper was accepted to #ACL2025 Findings 🎉 "When Should Dense Retrievers Be Updated in Evolving Corpora? Detecting Out-of-Distribution Corpora Using GradNormIR" Huge thanks to my amazing collaborators! 🙌 @jinyoung__kim @ohmyksh We propose…
🔥 GUI agents struggle with real-world mobile tasks. We present MONDAY—a diverse, large-scale dataset built via an automatic pipeline that transforms internet videos into GUI agent data. ✅ VLMs trained on MONDAY show strong generalization ✅ Open data (313K steps) (1/7) 🧵 #CVPR
🎸 First time in Nashville—and it’s for #CVPR2025! Excited to present our poster: HalLoc: Token-level Localization of Hallucinations for Vision Language Models 📅 Sunday, June 15 |🕓 4:00–6:00 p.m. CDT 📍Poster #358, Exhibit Hall D We introduce HalLoc, the first dataset for…
🚨New Paper Alert🚨 Excited to share our new video game benchmark, "Orak"! 🕹️ It was a thrilling experience to test whether LLM/VLM agents can solve real video games 🎮 Looking forward to continuing my research on LLM/VLM-based game agents with @Krafton_AI !
As a video gaming company, @Krafton_AI has secretly been cooking something big with @NVIDIAAI for a while! 🥳 We introduce Orak, the first comprehensive video gaming benchmark for LLMs! arxiv.org/abs/2506.03610
"When Should Dense Retrievers Be Updated in Evolving Corpora? Detecting Out-of-Distribution Corpora Using GradNormIR": @dayoon12161 et al. introduce an unsupervised approach to detect when dense retrievers need updates. 📝 arxiv.org/abs/2506.01877 👨🏽‍💻 github.com/dayoon-ko/grad…
🙁 LLMs are overconfident even when they are dead wrong. 🧐 What about reasoning models? Can they actually tell us “My answer is only 60% likely to be correct”? ❗Our paper suggests that they can! Through extensive analysis, we investigate what enables this emergent ability.
🎉Our paper "Can LLMs Deceive CLIP? Benchmarking Adversarial Compositionality of Pre-trained Multimodal Representation via Text Updates" is accepted to #ACL2025 Main!🎉 We introduce a benchmark for multimodal "deception" + LLM-based diversified attack. 🚀 Preprint coming soon!
Fantastic Paper from @GoogleDeepMind. Astute RAG enhances LLM performance by resolving conflicts between internal and external knowledge sources. Original Problem 🔍: RAG systems face challenges from imperfect retrieval, introducing irrelevant or misleading information.…
After going to NAACL, ACL and #EMNLP2024 this year, here are a few tips I’ve picked up about attending *ACL conferences. Would love to hear any other tips if you have them! 🙂 1. This might be obvious, but I suggest showing everyone the same respect and interest regardless of…
All set in Miami for #EMNLP2024 #EMNLP✈️ I'll be presenting DynamicER today at: 📅 November 12 (Tue) 🕓 16:00–17:30 📍 Poster Session C, Riverfront Hall I'm also applying to PhD programs this year—looking forward to connecting and chatting with everyone at #EMNLP2024!