Valerie Chen
@valeriechen_
phd student @mldcmu @SCSatCMU + intern @allhands_ai | building @CopilotArena | previously @NYUDataScience @MSFTResearch @yale @CMU_Robotics @IBMResearch
🚨Let’s rethink how we’re evaluating LLMs for code using @CopilotArena 👩🏻‍💻. Get started with the extension with a one-click download from the VSCode marketplace!👇
Introducing Copilot Arena - Interactive coding evaluation in the wild. Our extension lets you test top models for free, right in VSCode. Let's vote and build the Copilot leaderboard! Download here: marketplace.visualstudio.com/items?itemName… Led by @iamwaynechi and @valeriechen_ at CMU. 1/🧵
After 3 conferences in the last 3 months 🇯🇵🇳🇴🇨🇦, I won’t be at ACL… but @JanePan_ will be! Go talk to her about our paper on interactive code eval👇
I'll be at ACL Vienna 🇦🇹 next week presenting this work! If you're around, come say hi on Monday (7/28) from 18:00–19:30 in Hall 4/5. Would love to chat about code model benchmarks 🧠, simulating user interactions 🤝, and human-centered NLP in general!
Correction! The oral presentation at the R2-FM workshop this afternoon is at 🕝 2:40pm in 🏛️ West Ballroom C. See you there!
Heading to Vancouver for ICML✈️🇨🇦Let’s chat about coding agents, evals, and human-AI collab. I’ll also be on the job market this upcoming cycle, looking for TT faculty roles + post-docs. Here's where you'll be able to find me this week👇
Stop by the poster sessions today at ICML Workshop on Computer Use Agents to chat about OpenHands-Versa!
Can we design AI Agents that achieve generalizability across diverse task domains? Our new paper introduces OpenHands-Versa, a generalist agent with strong performance on three challenging agent benchmarks, ranking #1 on SWE-Bench Multimodal and The Agent Company leaderboards 🚀
We’ll be at the MoFA workshop today! Stop by the talk or poster session.
Come check us out tomorrow at 9:55am at our first workshop oral! #ICML2025
What are the differences between developer productivity and satisfaction when using coding assistance through autocomplete versus autonomous coding agents? @valeriechen_ did the first controlled academic study answering this question. Check out the results!
Excited to be hanging out today at @WiMLworkshop 👩🏻‍💻 Come say hi during the poster session: 🕝 2:45–3:30pm, 📍 West Meeting Room 211–214. Let’s chat about how coding agents are changing developer workflows! 🤖💻🔧✨
Come say hi to me and @iamwaynechi in 2 hours to chat about better code evals!
Accepted to #ICML2025! We’ll see you in Vancouver 🥳
📢 Last call for submissions! The deadline to submit an abstract is tomorrow 📢
🤖🧠 Join us for the 2025 Workshop on Human-AI Complementarity for Decision Making at CMU! 📅 Sept 25-26, 2025 | 💰 Travel to Pittsburgh & lodging covered | 📝 Abstract deadline: July 15. We welcome abstract submissions, which will be presented as talks or posters. Details below!
I’ll be at ICML next week! 🇨🇦 Longer tweet coming soon about our paper / workshops I’m going to. Looking forward to catching up with friends and meeting new ones 👋🏼
blog.ml.cmu.edu/2025/07/08/car… Check out our latest post on CMU @ ICML 2025!
a random valerie may or may not appear in this video 👀
The method is simple: in any Slack channel, you just write @ OpenHands and the agent will start working based on the context you provided.