Daniel Wurgaft
@danielwurgaft
PhD @Stanford working w/ @noahdgoodman. Studying in-context learning and reasoning in humans and machines. Prev. @UofT CS & Psych.
🚨New paper! We know models learn distinct in-context learning strategies, but *why*? Why generalize instead of memorize to lower loss? And why is generalization transient? Our work explains this & *predicts Transformer behavior throughout training* without its weights! 🧵 1/
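The loss-complexity tradeoff behind this thread can be illustrated with a toy Bayesian model comparison. This is a minimal sketch, not the paper's actual model: the function names, numbers, and the specific form of the posterior are all illustrative assumptions. The idea it demonstrates is that a simplicity prior favors the generalizing strategy early in training, while the loss term favors memorization once enough data has been seen, making generalization transient.

```python
import math

def posterior_log_odds(loss_gen, loss_mem, complexity_gen, complexity_mem,
                       n_samples, beta=1.0):
    """Log posterior odds of the generalizing vs. memorizing strategy.

    Hypothetical rational-analysis sketch: each strategy's log posterior
    is -(samples seen) * (its loss) - beta * (its description-length
    complexity). With little data the complexity prior dominates; with
    lots of data the loss term dominates.
    """
    log_post_gen = -n_samples * loss_gen - beta * complexity_gen
    log_post_mem = -n_samples * loss_mem - beta * complexity_mem
    return log_post_gen - log_post_mem

def p_generalize(n_samples):
    # Illustrative numbers (assumptions, not from the paper): the
    # generalizing rule is simpler, but memorization eventually
    # achieves lower loss on the training distribution.
    odds = posterior_log_odds(loss_gen=0.10, loss_mem=0.05,
                              complexity_gen=5.0, complexity_mem=50.0,
                              n_samples=n_samples)
    return 1.0 / (1.0 + math.exp(-odds))
```

Under these toy numbers, `p_generalize` is near 1 early in training and falls toward 0 late in training, reproducing the rise-and-fall pattern the thread describes.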
Submit to our workshop on contextualizing CogSci approaches for understanding neural networks: "Cognitive Interpretability"!
We’re excited to announce the first workshop on CogInterp: Interpreting Cognition in Deep Learning Models @ NeurIPS 2025! 📣 How can we interpret the algorithms and representations underlying complex behavior in deep learning models? 🌐 coginterp.github.io/neurips2025/ 1/
Check out my boy @dmkrash presenting our Outstanding Paper Award winner at the Actionable Interpretability workshop today!
Check out my posters today if you're at ICML!
1) Detecting high-stakes interactions with activation probes — Outstanding paper @ Actionable Interp workshop, 10:40-11:40
2) LLMs' activations linearly encode training-order recency — Best paper runner-up @ MemFM workshop, 2:30-3:45
Don't forget to tune in tomorrow, July 10th, for a session with @EkdeepL on "Rational Analysis of In-Context Learning Elicits a Loss-Complexity Tradeoff." Learn more: cohere.com/events/Cohere-…
How do LLMs learn new tasks from just a few examples? What’s happening inside during in-context learning? 🤔 Join us July 10 for a talk by @EkdeepL on how LLMs adapt like cognitive maps—and how we can predict their behavior without accessing weights.
So proud! Go work with Gabriel, he’ll be the best advisor
Thrilled to join the UMich faculty in 2026! I'll also be recruiting PhD students this upcoming cycle. If you're interested in AI and formal reasoning, consider applying!
We’re happy to announce that @GabrielPoesia will be joining our faculty as an assistant professor in Fall 2026. Welcome to CSE! ▶️Learn more about Gabriel here: gpoesia.com #UMichCSE #GoBlue
I’m recruiting PhD students to join the Computational Minds and Machines Lab at the University of Washington in Seattle! Join us to work at the intersection of computational cognitive science and AI with a broad focus on social intelligence. (Please reshare!)
Watch @EkdeepL talk about our recent paper explaining and predicting Transformer in-context learning behavior throughout training!
I'll be giving an online talk at Cohere Labs. Join us!
Bayesian models as the ultimate normative theories; neural networks as the ultimate task-performing models.
It turns out that a lot of the most interesting behavior of LLMs can be explained without knowing anything about architecture or learning algorithms. Here we predict the rise (and fall) of in-context learning using hierarchical Bayesian methods.
It’s like chain-of-thought for humans!
Can we record and study human chains of thought? The think-aloud method, where participants voice their thoughts as they solve a task, offers a way! In our #CogSci2025 paper co-led with Ben Prystawski, we introduce a method to automate analysis of human reasoning traces! (1/8)🧵
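A pipeline that automates analysis of think-aloud traces ultimately has to validate the arithmetic in each segmented reasoning step. Here is a minimal sketch for Game of 24 traces; the one-step-per-line format and the function name are assumptions for illustration, not the paper's actual pipeline.

```python
import re

def check_game24_trace(trace, numbers):
    """Check a segmented Game of 24 reasoning trace.

    Assumes each arithmetic step appears on its own line as "a OP b = c"
    (the kind of segmentation an automated think-aloud pipeline might
    emit). Verifies each step's arithmetic, that operands come from the
    current pool of available numbers, and that the trace ends at 24.
    """
    pool = list(numbers)
    for line in trace.strip().splitlines():
        m = re.match(r"\s*(-?\d+)\s*([+*/-])\s*(-?\d+)\s*=\s*(-?\d+)\s*$", line)
        if not m:
            continue  # skip narration lines that are not arithmetic steps
        a, op, b, claimed = int(m[1]), m[2], int(m[3]), int(m[4])
        if a not in pool:
            return False
        pool.remove(a)
        if b not in pool:
            return False
        pool.remove(b)
        value = {"+": a + b, "-": a - b, "*": a * b,
                 "/": a / b if b else None}[op]
        if value is None or value != claimed:
            return False
        pool.append(claimed)  # the result becomes available for later steps
    return pool == [24]
```

For example, with the numbers [4, 7, 8, 8], the trace "7 - 8 = -1", "-1 + 4 = 3", "3 * 8 = 24" checks out, while a trace with a miscalculated step or leftover numbers does not.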
[LG] In-Context Learning Strategies Emerge Rationally. D. Wurgaft, E. S. Lubana, C. F. Park, H. Tanaka... [Stanford University & Harvard University] (2025). arxiv.org/abs/2506.17859
Amazing! I was wondering why there was no good curated dataset of humans playing the Game of 24. Here it is now :)