Siyan Sylvia Li 🦋🌸
@Sylvia_Sparkle
2nd year PhD @columbianlp • Incoming Intern @Adobe • Prev @stanfordnlp @GeorgiaTech • NLP, Dialogue Systems, Education • Caffeine Gremlin 🩷💜💙
Concerned about sending private data to LLM providers, but local LLMs aren't as good? Introducing PAPILLON 🦋, a system that uses local LLMs to create privacy-preserving LLM queries for you. 🦋 prevents leakage of PII (93%) while retaining response quality on 85% of queries! 🧵

WHY do you prefer something over another? Reward models treat preference as a black box 😶‍🌫️ but human brains 🧠 decompose decisions into hidden attributes. We built the first system to mirror how people really make decisions in our #COLM2025 paper 🎨PrefPalette✨ Why it matters 👉🏻🧵
bro is cheating on this too💀
in the past week at @cluely, we've been kicking off our most ambitious project ever. the models of today are great at answering questions. the models at @cluely will be really good at predicting which questions you have. this is a fundamentally different user experience than…
I'll be at ICML this year! Reach out if: - you want to chat -- great! -- sign up here calendar.app.google/qtDkRmS1uV3pLz… and/or DM me. - you want to fund my lab @ Columbia -- also great! -- research into deeply understanding language models for alignment, safety, performance. email me.
CMU is hosting a workshop on Human-AI Complementarity for Decision Making this September! Abstract submissions due July 15, travel will be covered for accepted presenters. cmu.edu/ai-sdm/researc…
✨Hi everyone! We are running a user study on a physical activity coaching chatbot to help boost your STEPCOUNT 💪💪💪! This study will span 4 - 6 weeks. Please sign up if you are interested! forms.gle/K9wgvcL8jBsKTf…
Hi everyone. I'm excited to announce that I will be organizing a 2nd Summit on Responsible Computing, AI, and Society, October 27-29. We will explore the future of computing for health, sustainability, human-centered AI, and policy. Please consider submitting a 1-page abstract
Are AI scientists already better than human researchers? We recruited 43 PhD students to spend 3 months executing research ideas proposed by an LLM agent vs human experts. Main finding: LLM ideas result in worse projects than human ideas.
🪄We made a 1B Llama BEAT GPT-4o by... making it MORE private?! LoCoMo results: 🔓GPT-4o: 80.6% 🔐1B Llama + GPT-4o (privacy): 87.7% (+7.1!⏫) 💡How? GPT-4o provides reasoning ("If X then Y"), the local model fills in the blanks with your private data to get the answer!
We had to downsize due to NIH funding cuts and lay off a junior software engineer who is proficient in Python coding, crawling, LLMs, RAG, and other related areas. He is currently on OPT (24 months) and will need an H1B sponsor. If any startups are interested, pls DM. RT for…
A bit late to announce, but I’m excited to share that I'll be starting as an assistant professor at the University of Maryland @umdcs this August. I'll be recruiting PhD students this upcoming cycle for fall 2026. (And if you're a UMD grad student, sign up for my fall seminar!)
New #ACL2025NLP Paper! 🎉 Curious what AI thinks about YOU? We interact with AI every day, offering all kinds of feedback, both implicit ✏️ and explicit 👍. What if we used this feedback to personalize your AI assistant to you? Introducing SynthesizeMe! An approach for…
😵‍💫 Long-context human-AI planning with LLMs struggles when users have to manually manage all the context in messy chats (e.g. with ChatGPT). Meet 💡JumpStarter: task-structured context curation for better, collaborative planning with LLMs on complex tasks. 🧵 (1/n)