Taylor Webb
@TaylorWWebb
Studying cognition in humans and machines.
Excited to announce that I'll be starting a lab at the University of Montreal (psychology) and Mila (Montreal Institute for Learning Algorithms) in summer 2025. More info to come soon, but I'll be recruiting grad students. Please share / get in touch if you're interested!

Led by postdoc Doyeon Lee and grad student Joseph Pruitt, our lab has a new Perspectives piece in PNAS Nexus: "Metacognitive sensitivity: The key to calibrating trust and optimal decision-making with AI" academic.oup.com/pnasnexus/arti… With co-authors Tianyu Zhou and Eric Du 1/
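For context, metacognitive sensitivity refers to how well confidence ratings discriminate correct from incorrect decisions; one common way to quantify it is a type-2 ROC analysis. A minimal sketch with made-up trial data (not from the paper):

```python
import numpy as np
from sklearn.metrics import roc_auc_score

# Hypothetical example: one row per decision, with binary correctness and a confidence rating.
correct    = np.array([1, 1, 0, 1, 0, 0, 1, 1, 0, 1])
confidence = np.array([0.9, 0.8, 0.6, 0.7, 0.4, 0.3, 0.95, 0.85, 0.5, 0.6])

# Type-2 AUROC: how well confidence discriminates correct from incorrect decisions.
# 0.5 = no metacognitive sensitivity; higher = confidence tracks accuracy better.
meta_sensitivity = roc_auc_score(correct, confidence)
print(f"Type-2 AUROC (metacognitive sensitivity): {meta_sensitivity:.2f}")
```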
🚨 Discover the Science of LLMs! We uncover how LLMs (Llama3-70B) achieve abstract reasoning through emergent symbolic mechanisms: 1️⃣ Symbol Abstraction Heads: Early layers convert input tokens into abstract variables based on their relationships. 2️⃣ Symbolic Induction Heads:…
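Work in this vein typically probes abstract reasoning with in-context identity-rule tasks (e.g., an ABA pattern), where the correct completion depends on an abstract variable binding rather than on specific token identities. A minimal sketch of constructing such a prompt, purely illustrative and not drawn from the paper:

```python
import random

# Illustrative ABA identity-rule task: the model must infer the abstract pattern
# (first symbol, second symbol, first symbol again) from in-context examples,
# then complete a new instance built from previously unseen tokens.
vocab = ["blik", "dax", "wug", "fep", "zorp", "kiki", "toma", "gazzer"]

def aba_example(rng):
    a, b = rng.sample(vocab, 2)
    return f"{a} {b} {a}"

rng = random.Random(0)
examples = [aba_example(rng) for _ in range(3)]
a, b = rng.sample(vocab, 2)
prompt = "\n".join(examples) + f"\n{a} {b}"  # correct continuation: a
print(prompt)
```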
New work led by @Aaditya6284: "Strategy coopetition explains the emergence and transience of in-context learning in transformers." We find some surprising things!! E.g. that circuits can simultaneously compete AND cooperate ("coopetition") 😯 🧵👇
Why do pre-o3 LLMs struggle with generalization tasks like @arcprize? It's not what you might think. OpenAI o3 shattered the ARC-AGI benchmark, but the puzzles that did stump it didn't do so because of reasoning failures, and this has implications for the benchmark as a whole. Analysis below🧵
Truly incredible results. I have been impressed with o1’s capabilities but certainly didn’t expect this leap.
Today OpenAI announced o3, its next-gen reasoning model. We've worked with OpenAI to test it on ARC-AGI, and we believe it represents a significant breakthrough in getting AI to adapt to novel tasks. It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task…
Given a high-quality verifier, language model accuracy can be improved by scaling inference-time compute (e.g., w/ repeated sampling). When can we expect similar gains without an external verifier? New paper: Self-Improvement in Language Models: The Sharpening Mechanism
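For reference, the verifier-based baseline works by repeated sampling plus best-of-N selection: sample many candidate answers and keep the one the verifier scores highest. A minimal sketch of that loop, where generate() and verify() are hypothetical stand-ins for a sampling-enabled LM call and an external verifier:

```python
from typing import Callable

def best_of_n(prompt: str,
              generate: Callable[[str], str],
              verify: Callable[[str, str], float],
              n: int = 16) -> str:
    """Sample n candidate answers and return the one the verifier scores highest.

    `generate` and `verify` are hypothetical stand-ins for an LM call and an
    external verifier (e.g., unit tests, a checker, or a reward model)."""
    candidates = [generate(prompt) for _ in range(n)]
    return max(candidates, key=lambda ans: verify(prompt, ans))

# Toy usage with dummy stand-ins (a real setup would call an LM and a verifier):
answers = iter(["4", "5", "4", "22"])
print(best_of_n("2 + 2 = ?", lambda p: next(answers), lambda p, a: float(a == "4"), n=4))
```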
Introducing our new work on mechanistic interpretability of LLM cognition🤖🧠: why do Transformer-based LLMs have limited working memory capacity, as measured by N-back tasks? (1/7) openreview.net/pdf?id=dXjQgm9…
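Background on the task: in an N-back test, each item must be judged against the item presented N positions earlier, so performance indexes working memory capacity. A minimal sketch of posing it to an LLM as a text prompt (my construction, not necessarily the paper's exact setup):

```python
import random

def make_n_back_trial(n: int = 2, length: int = 12, seed: int = 0):
    """Generate a letter stream and ground-truth match/no-match labels for an
    N-back task (label i is True iff letters[i] == letters[i - n])."""
    rng = random.Random(seed)
    letters = [rng.choice("ABCDEF") for _ in range(length)]
    labels = [i >= n and letters[i] == letters[i - n] for i in range(length)]
    prompt = ("You will see a sequence of letters. After each letter, say 'match' "
              f"if it is the same as the letter {n} positions back, else 'no match'.\n"
              + " ".join(letters))
    return prompt, labels

prompt, labels = make_n_back_trial()
print(prompt)
print(labels)
```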
New paper with @alexanderdfung, @PramodRT9 , @jessica__chomik , @Nancy_Kanwisher, @ev_fedorenko on the representations that underlie our intuitive physical reasoning about the world. Thread 🧵about our new preprint 📄✨linked here: tinyurl.com/intphyslang 1/10
🚨 New paper at @NeurIPSConf w/ @Michael_Lepori! Most work on interpreting vision models focuses on concrete visual features (edges, objects). But how do models represent abstract visual relations between objects? We adapt NLP interpretability techniques for ViTs to find out! 🔍
Even ducklings 🐣 can represent abstract visual relations. Can your favorite ViT? In our new @NeurIPSConf paper, we use mechanistic interpretability to find out!
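One standard way to adapt NLP interpretability techniques here is linear probing: train a simple classifier on intermediate ViT activations to test whether a relational property (e.g., same vs. different) is linearly decodable. A sketch of that idea with random stand-in features, assumed for illustration rather than taken from the paper:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical setup: each row is an intermediate-layer [CLS] activation from a ViT
# for an image containing two objects; the label says whether the objects are the same.
rng = np.random.default_rng(0)
activations = rng.normal(size=(200, 768))   # stand-in for real ViT features
labels = rng.integers(0, 2, size=200)       # 1 = "same", 0 = "different"

# Linear probe: if the relation is decodable from this layer, the probe should beat
# chance on held-out images (here the features are random, so it will not).
probe = LogisticRegression(max_iter=1000).fit(activations[:150], labels[:150])
print("held-out probe accuracy:", probe.score(activations[150:], labels[150:]))
```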
In this new preprint @smfleming and I present a theory of the functions and evolution of conscious vision. This is a big project: osf.io/preprints/psya…. We'd love to get your comments!
Open Post-Training recipes! Some of my personal highlights: 💡 We significantly scaled up our preference data! (using more than 330k preference pairs for our 70b model!) 💡 We used RL with Verifiable Rewards to improve targeted skills like math and precise instruction following…
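For context, "RL with Verifiable Rewards" means the reward signal comes from a programmatic check (e.g., does the final answer match a known reference?) rather than from a learned preference model. A minimal sketch of such a reward function, my illustration rather than the recipe's actual implementation:

```python
import re

def verifiable_math_reward(model_output: str, reference_answer: str) -> float:
    """Binary reward from a programmatic check: 1.0 if the final number in the
    model's output matches the reference answer, else 0.0."""
    numbers = re.findall(r"-?\d+(?:\.\d+)?", model_output)
    if not numbers:
        return 0.0
    return 1.0 if numbers[-1] == reference_answer.strip() else 0.0

# Example: reward is 1.0 because the last number in the output matches "42".
print(verifiable_math_reward("Step 1: 6 * 7 = 42. The answer is 42", "42"))
```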
It would be great to have a precise enough formulation of ‘approximate retrieval’ for this hypothesis to be rigorously tested. There is a concern that virtually any task can be characterized in this way, by appealing to a vague notion of similarity with other tasks.
On the fallacy of the "If it ain't strictly retrieval, it must be reasoning" argument… #SundayHarangue (on Wednesday) There is a tendency among some LLM researchers to claim that LLMs must be somehow capable of doing some sort of reasoning since they are after all not doing the…
This looks like a very useful and important contribution!
How do LLMs learn to reason from data? Are they ~retrieving the answers from parametric knowledge🦜? In our new preprint, we look at the pretraining data and find evidence against this: Procedural knowledge in pretraining drives LLM reasoning ⚙️🔢 🧵⬇️
More evidence that working memory is not maintained by persistent activity, but by dynamic on/off states with short-term synaptic plasticity: "Intermittent rate coding and cue-specific ensembles support working memory" nature.com/articles/s4158… #neuroscience
Fascinating paper from Paul Smolensky et al illustrating how transformers can implement a form of compositional symbol processing, and arguing that an emergent form of this may account for in-context learning in LLMs: arxiv.org/abs/2410.17498
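For readers new to this framework: the classic route to compositional symbol processing in vector spaces is tensor product (role-filler) binding, where a structure is encoded as a sum of outer products of filler and role vectors, and fillers are recovered by unbinding with a role. A toy sketch of that general idea (not the specific emergent mechanism the paper argues for):

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 64

# Random filler vectors (symbols) and role vectors (structural positions).
fillers = {s: rng.normal(size=dim) for s in ["john", "mary", "loves"]}
roles   = {r: rng.normal(size=dim) for r in ["agent", "verb", "patient"]}

# Bind fillers to roles with outer products and sum: one tensor encodes
# the structured proposition loves(john, mary).
structure = (np.outer(fillers["john"], roles["agent"])
             + np.outer(fillers["loves"], roles["verb"])
             + np.outer(fillers["mary"], roles["patient"]))

# Unbinding: query the tensor with a role vector to (approximately) recover its filler.
recovered = structure @ roles["agent"] / (roles["agent"] @ roles["agent"])
best = max(fillers, key=lambda s: fillers[s] @ recovered)
print("filler bound to 'agent' role:", best)  # -> john (approximately, up to noise)
```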