Sheryl Hsu
@SherylHsu02
@openai | bs/ms @Stanford 👩🏻‍💻
Congrats to the GDM team on their IMO result! I think their parallel success highlights how fast AI progress is. Their approach was a bit different than ours, but I think that shows there are many research directions for further progress. Some thoughts on our model and results 🧵
This would be the dream
It’s crazy how we’ve gone from 12% on AIME (GPT-4o) → IMO gold in ~15 months. We have come very far very quickly. I wouldn’t be surprised if by next year models will be deriving new theorems and contributing to original math research!
Sheryl (@sherylhsu02) was our first hire onto the multi-agent team. Within a few months of joining, she helped to make this possible. We're so lucky to have her on the team!
Watching the model solve these IMO problems and achieve gold-level performance was magical. A few thoughts 🧵
Presenting this @iclr_conf Saturday 3-5:30 Hall 2B/3 poster 540. Come say hi!!
Feeling spooked👻🎃? Get grounded...introducing "Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval." Meet LeReT (Learning to Retrieve by Trying), an RL-based framework that improves LLMs’ ability to use retrieval tools by up to 29%. sherylhsu.com/LeReT/
🧬 Meet Lyra, a new paradigm for accessible, powerful modeling of biological sequences. Lyra is a lightweight SSM achieving SOTA performance across DNA, RNA, and protein tasks—yet up to 120,000x smaller than foundation models (ESM, Evo). Bonus: you can train it on your Mac. read…
Personalization is what makes apps like tiktok, instagram & x so addictive. What if LLMs could be customized to your preferences and needs?
Personalization in LLMs is crucial for meeting diverse user needs, yet collecting real-world preferences at scale remains a significant challenge. Introducing FSPO, a simple framework leveraging synthetic preference data to adapt to new users with meta-learning for open-ended QA! 🧵
I am thrilled to announce that I have joined Delve as a Founding AI Engineer, where we are building the future of AI compliance. After two wonderful years of AI Research at Stanford, I'm excited to take this leap into the startup world to help secure AI systems at scale.
Today, we're thrilled to announce our $3.3M seed round at Delve. AI makes millions of decisions every second. But it’s still regulated with spreadsheets and screenshots. When AI moves at the speed of thought, protecting it with yesterday's tools isn't just inefficient, it's…
4o can create calendar events...actually so useful (and wasn't expecting this to work)

