Scale AI

@scale_AI

To make the best models, you need the best data.

Joined July 2016

485Following

68KFollowers

Pinned

Scale AI Retweeted

Miles Turpin@milesaturpin · Jul 14

New @Scale_AI paper! 🌟 LLMs trained with RL can exploit reward hacks but not mention this in their CoT. We introduce verbalization fine-tuning (VFT)—teaching models to say when they're reward hacking—dramatically reducing the rate of undetected hacks (6% vs. baseline of 88%).

282

136

23.0K

Scale AI Retweeted

Anisha Gunjal@anisha_gunjal · Jul 24

🤔 How do we train LLMs on real-world tasks where it’s hard to define a single verifiable answer? Our work at @scale_AI introduces Rubrics as Rewards (RaR) — a framework for on-policy post-training that uses structured, checklist-style rubrics as interpretable reward signals. 🧵

221

181

22.0K

Scale AI Retweeted

Chen Bo Calvin Zhang@calvincbzhang · Jul 9

New @scale_AI research in collaboration with @AnthropicAI introduces SHADE-Arena, a benchmark to test for AI sabotage. SHADE-Arena evaluates an AI agent's ability to complete a task while secretly pursuing a harmful objective, all while being watched by an AI monitor. 🧵

8.0K

Scale AI@scale_AI · Jul 4

Proud to support American innovation, today and every day. 🇺🇸

139

12.0K