Alexander Ku
@alex_y_ku
Cognitive scientist and AI researcher at @GoogleDeepMind and @Princeton
(1/11) Evolutionary biology offers a powerful lens into Transformer learning dynamics! Two learning modes in Transformers (in-weights & in-context) mirror adaptive strategies in evolution. Crucially, environmental predictability shapes both systems similarly.

Claude Opus 4 and Sonnet 4 are the best coding models, setting new records across the board. 🚀 We are pushing the limits (80.2% on SWE-Bench!!), advancing the frontier while keeping up the momentum. The benchmarks may soon become saturated but the capabilities will not!
Introducing the next generation: Claude Opus 4 and Claude Sonnet 4. Claude Opus 4 is our most powerful model yet, and the world’s best coding model. Claude Sonnet 4 is a significant upgrade from its predecessor, delivering superior coding and reasoning.
So excited our paper is now out in @CognitionJourn! Huge thanks to our editor and reviewers 🧠 Their thoughtful suggestions inspired Experiments 3 & 4, which revealed a striking inverse correlation between idleness judgments and speed-up predictions.
“People Evaluate Idle Collaborators Based on their Impact on Task Efficiency” 📢 New from: Elizabeth Mieczkowski, Cameron Rouse Turner, Natalia Vélez, & Tom Griffiths sciencedirect.com/science/articl… TL;DR: Sometimes it's acceptable not to help with group work 🧵👇
Excited to share our new paper on neural networks learning base addition. We found that, if they use the right symmetries, even simple neural networks can achieve radical generalization, and that learnability is closely correlated with the symmetry used. 🧵
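A minimal sketch of the idea (my own toy construction, not the paper's architecture or training setup): sharing one per-digit adder across digit positions builds in a translation symmetry over digits, so a tiny network trained only on single-digit sums generalizes to additions far longer than anything it saw.

```python
# Toy illustration of symmetry-enabled generalization (assumptions mine, not the
# paper's setup): one small MLP handles a single digit position, and weight sharing
# across positions (a translation symmetry over digits) does the rest.
import random
import torch
import torch.nn as nn
import torch.nn.functional as F

BASE = 10
torch.manual_seed(0)
random.seed(0)

class DigitAdder(nn.Module):
    """Maps (digit_a, digit_b, carry_in) to (sum-digit logits, carry-out logit)."""
    def __init__(self, hidden=64):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(2 * BASE + 1, hidden), nn.ReLU(),
                                 nn.Linear(hidden, BASE + 1))

    def forward(self, a, b, carry):
        x = torch.cat([F.one_hot(a, BASE).float(),
                       F.one_hot(b, BASE).float(),
                       carry.float().unsqueeze(-1)], dim=-1)
        out = self.net(x)
        return out[:, :BASE], out[:, BASE]

# Train on every single-digit case: just 10 * 10 * 2 = 200 examples.
grid = torch.cartesian_prod(torch.arange(BASE), torch.arange(BASE), torch.arange(2))
a, b, cin = grid[:, 0], grid[:, 1], grid[:, 2]
digit_target = (a + b + cin) % BASE
carry_target = ((a + b + cin) >= BASE).float()

model = DigitAdder()
opt = torch.optim.Adam(model.parameters(), lr=1e-2)
for _ in range(2000):
    digit_logits, carry_logit = model(a, b, cin)
    loss = (F.cross_entropy(digit_logits, digit_target)
            + F.binary_cross_entropy_with_logits(carry_logit, carry_target))
    opt.zero_grad(); loss.backward(); opt.step()

def add(model, xs, ys):
    """Apply the shared module over digit positions, least significant first."""
    carry = torch.zeros(1)
    out = []
    for xa, xb in zip(xs, ys):
        dl, cl = model(torch.tensor([xa]), torch.tensor([xb]), carry)
        out.append(dl.argmax(-1).item())
        carry = (cl > 0).float()
    return out  # final carry-out dropped for simplicity

# Numbers far longer than anything seen in training (digits, least significant first).
xs = [random.randrange(BASE) for _ in range(20)]
ys = [random.randrange(BASE) for _ in range(20)]
total = (sum(d * BASE**i for i, d in enumerate(xs))
         + sum(d * BASE**i for i, d in enumerate(ys)))
print(add(model, xs, ys))
print([(total // BASE**i) % BASE for i in range(20)])
```

The point of the sketch is the design choice, not the numbers: the generalization comes from the shared per-digit module, i.e. from putting the symmetry into the architecture rather than hoping the network discovers it.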
New review on computational approaches to studying human planning out now in @TrendsCognSci! Really enjoyed having the opportunity to write something broader about the field with the help of @evanrussek @marcelomattar @weijima01 and @cocosci_lab cell.com/trends/cogniti…
🤖 Household robots are becoming physically viable. But interacting with people in the home requires handling unseen, unconstrained, dynamic preferences, not just a complex physical domain. We introduce ROSETTA: a method to cheaply generate rewards for such preferences. 🧵⬇️
Transformers employ different strategies throughout training to minimize loss, but how do these strategies trade off, and why? Excited to share our newest work, where we show remarkably rich competitive and cooperative interactions (termed "coopetition") as a transformer learns. Read on 🔎⏬
How does in-context learning emerge in attention models during gradient descent training? Sharing our new Spotlight paper @icmlconf: Training Dynamics of In-Context Learning in Linear Attention arxiv.org/abs/2501.16265 Led by Yedi Zhang with @Aaditya6284 and Peter Latham
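For intuition, here is a sketch under my own assumptions (not the paper's exact construction): with the softmax removed, a single linear-attention readout over in-context (x_i, y_i) pairs can implement one gradient-descent step on the in-context regression problem, which is the kind of mechanism these training-dynamics analyses characterize.

```python
# Linear attention on an in-context regression prompt: output = sum_i value_i * (key_i . query),
# with keys/queries the x's and values the y's. The matrix W below is hand-set to (1/n) I
# purely for illustration; the paper studies how such weights evolve during training.
import numpy as np

rng = np.random.default_rng(0)
d, n_ctx = 8, 32

# A fresh linear-regression task per "prompt": context pairs plus one query point.
w_true = rng.normal(size=d)
X = rng.normal(size=(n_ctx, d))
y = X @ w_true
x_query = rng.normal(size=d)

# Linear-attention readout (no softmax).
W = np.eye(d) / n_ctx
y_hat = sum(y_i * (x_i @ W @ x_query) for x_i, y_i in zip(X, y))

# Identical (up to floating point) to predicting with one GD step on the in-context
# least-squares loss, starting from w = 0 with unit learning rate.
w_one_step = (X.T @ y) / n_ctx
print(y_hat, x_query @ w_one_step)
```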
🧵What if emergence could be explained by learning a specific circuit: sparse attention? Our new work explores this bold hypothesis, showing a link between emergence and sparse attention that reveals how data properties influence when emergence occurs during training.
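A toy way to make "sparse attention" concrete (my own illustration, not the paper's method): sparse here means an attention row concentrated on a few tokens, and the entropy of that row is one simple diagnostic for when such a pattern forms during training.

```python
# Dense vs. sparse attention rows, and entropy as a sparsity diagnostic.
import numpy as np

def attention_weights(scores):
    """Softmax over key positions for one query's attention scores."""
    e = np.exp(scores - scores.max())
    return e / e.sum()

def entropy(p):
    return -(p * np.log(p + 1e-12)).sum()

diffuse = attention_weights(np.zeros(16))                  # early training: near-uniform row
sparse = attention_weights(np.array([8.0] + [0.0] * 15))   # later: one token dominates
print(entropy(diffuse), entropy(sparse))                   # ~log(16) vs. near zero
```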
Are you considering attending @cogsci_soc this year? Come to our workshop, 'Reasoning Across Minds and Machines', which features an exciting lineup of interdisciplinary research in AI and CogSci about reasoning. The workshop is W2 and runs alongside the main conference.