Jeffrey (Young-Min) Cho
@jeffrey_ch0
CS PhD Student @ UPenn - NLP/AI
We investigated how people interpret the Cantril Ladder, the measure used to claim that Finland is the happiest country in the world in the @HappinessRpt. The title is also the take home: "The Cantril Ladder elicits thoughts about power and wealth" (1/7) nature.com/articles/s4159…
🚨COLM 2025 Workshop on AI Agents: Capabilities and Safety @COLM_conf This workshop explores AI agents’ capabilities—including reasoning and planning, interaction and embodiment, and real-world applications—as well as critical safety challenges related to reliability, ethics,…
😵💫 Long-context human-AI planning with LLMs struggles when users have to manually manage all the context in messy chats (e.g. with ChatGPT). Meet 💡JumpStarter: task-structured context curation for better, collaborative planning with LLMs on complex tasks. 🧵 (1/n)
🤖💬 Herding instincts… in AIs? Yes, even LLMs can follow the crowd! • 📉 Conformity ↑ when agents lack confidence but trust peers • 🧠 Presentation format shapes peer influence • 🎯 Controlled herding can boost collaboration outcomes 👉 Read more: arxiv.org/abs/2505.21588

Thrilled to share that our work has received the Outstanding Paper Award @c3_nlp #NAACL2025! 🎉 Huge congratulations to @snyrai_ & @khushi_shelat for leading this collaborative effort, and thanks to @jeffrey_ch0 for delivering an amazing talk on behalf of his co-authors! 🙌🏽
#NAACL25 #C3NLP How do cultures differ in their mental health struggles? Do they seek professional help or emotional comfort? Check out our study on cross-cultural differences in mental health expressions on Reddit. aclanthology.org/2025.c3nlp-1.1… #mentalhealth #depression #llm
#NAACL2025 How to compare cultural differences with social media data in scale? Our work uses lexica to annotate X 🇺🇸 & Weibo 🇨🇳 posts with valence (😄☹️) & arousal (🔥❄️) scores, revealing cross-cultural differences in emotional expression. aclanthology.org/2025.findings-…
🚨 New preprint on AI persuasion and public health 🚨 A 3-min conversation with GPT-4o nudged HPV-vax-hesitant parents (who obv knew it was AI & consented!)—BUT reading standard public-health material still outperformed chatbots in impact and longevity. Details below 👇
#NAACL25 #C3NLP How do cultures differ in their mental health struggles? Do they seek professional help or emotional comfort? Check out our study on cross-cultural differences in mental health expressions on Reddit. aclanthology.org/2025.c3nlp-1.1… #mentalhealth #depression #llm
🚀 How well can LLMs know you and personalize your response? Turns out, not so much! Introducing the PersonaMem Benchmark -- 👩🏻💻Evaluate LLM's ability to understand evolving persona from 180+ multi-session user-chatbot conversation history 🎯Latest models (GPT-4.1, GPT-4.5,…
Does #shame manifest differently across #cultures? Yes. Can LLMs identify #norms behind shame? Yes. Are women shamed more than men? Yes!!! Can #LLMs identify when someone is shamed? arxiv.org/abs/2402.11333 #NAACL2025
🚨 LLMs must grasp implied language to reason about emotions, social cues, etc. Our @GoogleDeepMind paper presents the Implied NLI dataset. Targeting social norms 🌎 and conversational dynamics 💬, we enhance LLM understanding of real-world implication! arxiv.org/abs/2501.07719
Can Text-to-Image models understand common sense? 🤔 Can they generate images that fit everyday common sense? 🤔 tldr; NO, they are far less intelligent than us 💁🏻♀️ Introducing Commonsense-T2I 💡 zeyofu.github.io/CommonsenseT2I/, a novel evaluation and benchmark designed to measure…
Belated update: honored to be an AWS-AI ASSET Fellow for 2024!
Penn Engineering's @PennASSET Center is dedicated to ensuring responsible development of AI tools. Amazon Web Services (@AmazonScience) has gifted $700,000 to fund 10 Ph.D. student research projects advancing safe and trustworthy AI. bit.ly/3w62ZjJ
We had a Q&A for our work recently: • How do people from different races discuss their depression? 🗣️ PNAS Paper: pnas.org/doi/full/10.10… • How do CS and Med work on mental health chatbots? 🤖 EMNLP Paper: aclanthology.org/2023.emnlp-mai…
In our recent Q&A with LDI Senior Fellow @SharathGuntuku and collaborators Sunny Rai (@sny_rai) and Jeffrey Cho (@jeffrey_ch0), we explored two studies that show shortcomings of mental health AI technologies. Specifically, depression screening and chatbot tools.
An exciting talk!
Gave a talk at @upennnlp! Thanks so much for inviting me and for all the great insightful questions! I was very happy to see that many people are excited in research for Theory of Mind & LLMs 😍
🚨 Announcement: DLATK is now available as a colab notebook! github.com/dlatk/dlatk/bl… DLATK is a suite for end-to-end human language analysis, used in 100+ AI/NLP/Psych papers. This was lead by @digitalyonko, joint with @sal_giorgi, @adigan8, @JEichstaedt, and @WWBProject 🧵1/4
Ten months ago, we launched the Vesuvius Challenge to solve the ancient problem of the Herculaneum Papyri, a library of scrolls that were flash-fried by the eruption of Mount Vesuvius in 79 AD. Today we are overjoyed to announce that our crazy project has succeeded. After 2000…
We are pleased to announce that the first Conference on Language Modeling will be held at the University of Pennsylvania in Philadelphia at the Zellerbach Theatre. Thanks so much to UPenn CS as well as Mark Yatskar and Zachary Ives for facilitating the amazing venue.
Only Bert is sold out 😂 #EMNLP2023 #bert #UniversalStudios #Singapore

if you trust @GoogleDeepMind Gemini about itself, it has 1.56 trillion parameters and cost Google $1-2billion (as opposed to GPT-4 which cost OpenAI $500M.) there were more than 100 engineers in the team who worked on Gemini. jailbreak by 한국어 🤣🤣🤣 g.co/bard/share/09c…