Sean Hendryx

@SeanHendryx

AI research & engineering at Scale

Joined January 2021

157Following

375Followers

Pinned

Sean Hendryx@SeanHendryx · Jun 23

What will the learning environments of the future look like that train artificial super intelligence? In recent work at @scale_AI , we show that training systems that combine verifiable rewards with multi-agent interaction accelerate learning.

SeanHendryx's tweet image. What will the learning environments of the future look like that train artificial super intelligence? In recent work at @scale_AI , we show that training systems that combine verifiable rewards with multi-agent interaction accelerate learning.

128

100

22.0K

Pinned

Sean Hendryx Retweeted

Miles Turpin@milesaturpin · Jul 14

New @Scale_AI paper! 🌟 LLMs trained with RL can exploit reward hacks but not mention this in their CoT. We introduce verbalization fine-tuning (VFT)—teaching models to say when they're reward hacking—dramatically reducing the rate of undetected hacks (6% vs. baseline of 88%).

277

136

22.0K

Sean Hendryx Retweeted

Anisha Gunjal@anisha_gunjal · 5 h

🤔 How do we train LLMs on real-world tasks where it’s hard to define a single verifiable answer? Our work at @scale_AI introduces Rubrics as Rewards (RaR) — a framework for on-policy post-training that uses structured, checklist-style rubrics as interpretable reward signals. 🧵

105

7.0K

Sean Hendryx Retweeted

Jacob Phillips@jacob_dphillips · Apr 25

We’re entering a new era in robotics where generalized systems are starting to work in the real world, but researchers still don’t have good tools for understanding their data. That’s why I built ARES, an open-source platform for ingesting, annotating, and curating robotics data.

163

58.0K

Sean Hendryx Retweeted

Summer Yue@summeryue0 · Apr 17

🤖 AI agents are crossing into the real world. But when they act independently—who’s watching? At Scale, we’re building Agent Oversight: a platform to monitor, intervene, and align autonomous AI. We’re hiring engineers (SF/NYC) to tackle one of the most urgent problems in AI.…

6.0K

Sean Hendryx Retweeted

Summer Yue@summeryue0 · Apr 9

If a model lies when pressured—it’s not ready for AGI. The new MASK leaderboard is live. Built on the private split of our open-source honesty benchmark (w/ @ai_risks), it tests whether models lie under pressure—even when they know better. 📊 Leaderboard:…

7.0K

Sean Hendryx Retweeted

Scale AI@scale_AI · Apr 8

Do safety-aligned LLMs remain safe when used as browser agents? Our paper "Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents", accepted to ICLR, shows they don't. This reveals a critical gap in current alignment approaches.

3.0K

Sean Hendryx Retweeted

Colin Fraser@colin_fraser · Mar 4

“If basic research is so valuable then the market will fund it” is an argument that is directly addressed in like AP economics. When you make that argument without acknowledging its standard rebuttal, you reveal that you stopped paying attention after the fifth lecture.

618

11.0K

1.0K

290.0K