Thomas Heap
@ThomasEHeap
Bristol ML PhD Student

I talked to a lot of people at ICLR about "a weight decay paper from Wang and Aitchison", which has now officially been accepted at #ICML2025. Laurence summarized the paper itself in the post below; here I'll talk about its connection to a *broad* collection of existing works 1/
1/ Super proud of our recent work on how to change the AdamW weight decay as you scale model + dataset size. Or how μP is broken and how to fix it. arxiv.org/abs/2405.13698…
Our position paper on LLM eval error bars has just been accepted to ICML 2025 as a spotlight poster!
Our paper on the best way to add error bars to LLM evals is on arXiv! TL;DR: Avoid the Central Limit Theorem -- there are better, simple Bayesian (and frequentist!) methods you should be using instead. Super lightweight library: github.com/sambowyer/baye… 🧵👇
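(Not the paper's library, which is at the link above, but a minimal sketch of the kind of interval the TL;DR is pointing at: treat the number of correct answers as Binomial and report a Beta-posterior credible interval rather than a CLT/Wald error bar. The eval numbers here are made up.)

```python
# Hedged sketch: Beta-Binomial credible interval for an LLM eval score,
# compared with the usual CLT (normal-approximation) error bar.
# The numbers below are illustrative, not from the paper.
import numpy as np
from scipy import stats

n_questions = 200          # eval size
n_correct = 157            # observed correct answers

# CLT / Wald interval: p_hat +/- 1.96 * sqrt(p_hat * (1 - p_hat) / n)
p_hat = n_correct / n_questions
se = np.sqrt(p_hat * (1 - p_hat) / n_questions)
clt_interval = (p_hat - 1.96 * se, p_hat + 1.96 * se)

# Bayesian alternative: Beta(1, 1) prior -> Beta posterior over the true accuracy.
posterior = stats.beta(1 + n_correct, 1 + n_questions - n_correct)
bayes_interval = posterior.interval(0.95)   # central 95% credible interval

print(f"point estimate:    {p_hat:.3f}")
print(f"CLT 95% interval:  ({clt_interval[0]:.3f}, {clt_interval[1]:.3f})")
print(f"Beta 95% interval: ({bayes_interval[0]:.3f}, {bayes_interval[1]:.3f})")
```

Unlike the CLT interval, the Beta interval stays inside [0, 1] and doesn't collapse to zero width when the model gets every question right (or wrong); see the library above for the methods the paper actually recommends.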
Here we are! #519 is in Hall 2B, opposite the D E Shaw & Co stand.
I'm at #ICLR, presenting our work on multi-layer SAEs for language-model interpretability tomorrow (Sat 26 Apr) from 10AM at Hall 3 + Hall 2B #519: iclr.cc/virtual/2025/p…
#ICLR2025 I will give two talks on KBLaM (my internship project at MSR Cambridge w/ @jameshensman) at Microsoft's booth, Thursday and Saturday 4-4:30, as well as a poster at Poster Session 5 on Saturday morning!
Introducing KBLaM, an approach that encodes and stores structured knowledge within an LLM itself. By integrating knowledge without retraining, it offers a scalable alternative to traditional methods. msft.it/6011qniy9
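(A toy sketch of the general idea as I read the announcement, not the KBLaM code: each knowledge-base fact is encoded offline into a key/value vector pair, and attention attends over those knowledge vectors alongside the ordinary token keys/values, so new facts can be added without retraining the base model. Names and shapes below are hypothetical.)

```python
# Toy sketch (not the KBLaM implementation): attend over encoded knowledge
# vectors alongside ordinary token keys/values. Shapes and names are hypothetical.
import torch
import torch.nn.functional as F

def attention_with_knowledge(q, k, v, kb_k, kb_v):
    """q, k, v: (batch, seq, d); kb_k, kb_v: (batch, n_facts, d).

    Each knowledge-base fact is assumed to have been encoded offline into a
    (key, value) vector pair, so adding facts just means adding rows here --
    no retraining of the base model is implied by this sketch.
    """
    d = q.shape[-1]
    k_all = torch.cat([kb_k, k], dim=1)          # prepend knowledge keys
    v_all = torch.cat([kb_v, v], dim=1)          # prepend knowledge values
    scores = q @ k_all.transpose(-2, -1) / d ** 0.5
    return F.softmax(scores, dim=-1) @ v_all

# Usage with random stand-in tensors:
B, T, N, D = 2, 16, 8, 64
out = attention_with_knowledge(torch.randn(B, T, D), torch.randn(B, T, D),
                               torch.randn(B, T, D), torch.randn(B, N, D),
                               torch.randn(B, N, D))
print(out.shape)  # torch.Size([2, 16, 64])
```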
Second, we trained SAEs on transformers with randomized parameters, finding that auto-interpretability scores do not always distinguish them from trained models. This underscores the difficulty of automating feature interpretation and the importance of appropriate baselines! 3/
There's a lot to process here, but I was pleased to see that Anthropic's 'Circuit Tracing' paper cites three of our recent contributions to the interpretability literature! 1/
For more, read our papers: On the Biology of a Large Language Model contains an interactive explanation of each case study: transformer-circuits.pub/2025/attributi… Circuit Tracing explains our technical approach in more depth: transformer-circuits.pub/2025/attributi…
Really happy to have this paper out on arXiv! Scalable GPU-based Bayesian inference for hierarchical models without requiring gradients wrt model parameters (unlike e.g. VI). arxiv.org/abs/2503.08264
Our paper Massively Parallel Expectation Maximization For Approximate Posteriors is now on arXiv! In this work we introduce the QEM method for fast approximate posterior estimation in Hierarchical Bayesian models. 🧵👇
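(Not the paper's QEM updates, just a toy illustration of the gradient-free flavour the abstract describes: draw a large batch of samples in parallel, importance-weight them under the model, and refit a Gaussian approximate posterior by matching the weighted moments. The one-dimensional model below is made up.)

```python
# Toy sketch (not the paper's QEM code): gradient-free moment-matching updates
# for a Gaussian approximate posterior, using many parallel importance samples.
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
x_obs = 1.3                          # single observation from a made-up model
prior = stats.norm(0.0, 1.0)         # z ~ N(0, 1)
lik_sd = 0.5                         # x | z ~ N(z, 0.5^2)

mu, sd = 0.0, 2.0                    # initial approximate posterior q(z) = N(mu, sd^2)
K = 10_000                           # number of parallel samples per iteration

for step in range(20):
    z = rng.normal(mu, sd, size=K)                      # K samples drawn in parallel
    log_w = (prior.logpdf(z)                            # log p(z)
             + stats.norm(z, lik_sd).logpdf(x_obs)      # log p(x | z)
             - stats.norm(mu, sd).logpdf(z))            # minus log q(z)
    w = np.exp(log_w - log_w.max())
    w /= w.sum()                                        # self-normalised weights
    mu = np.sum(w * z)                                  # match first moment
    sd = np.sqrt(np.sum(w * (z - mu) ** 2))             # match second moment

print(mu, sd)
```

For a conjugate toy like this the exact posterior is N(1.04, 0.447²), which the moment-matched q recovers in a few iterations; the point of the sketch is just that no gradients of anything are needed.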
🚨NEW PAPER ALERT 🚨 SAEs can give us insight into the representations of LLMs. But what about the LLMs' computations? If we want to understand LLMs, we don't just need sparse SAE activations, but also a sparse computational graph connecting them. So how do we get them? A 🧵
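(I won't second-guess the thread's actual construction, but one generic way to score a candidate edge between an upstream SAE latent and a downstream one is gradient-times-activation attribution. Here's a toy sketch in which a tiny MLP and untrained encoders stand in for the transformer and its SAEs.)

```python
# Toy sketch (not the thread's method): score candidate edges between upstream
# SAE latents and one downstream latent with gradient-times-activation.
# Everything here -- the tiny MLP, the SAE encoders -- is a stand-in.
import torch

torch.manual_seed(0)
d_model, d_sae = 32, 128

mlp = torch.nn.Sequential(torch.nn.Linear(d_model, d_model), torch.nn.GELU(),
                          torch.nn.Linear(d_model, d_model))   # stand-in for a block
enc_up = torch.nn.Linear(d_model, d_sae)     # upstream SAE encoder (pretend it's trained)
enc_down = torch.nn.Linear(d_model, d_sae)   # downstream SAE encoder

resid = torch.randn(d_model)                                 # residual stream at one token
a_up = torch.relu(enc_up(resid)).detach().requires_grad_()   # sparse-ish upstream latents

# Reconstruct the stream from upstream latents, run the block, read a downstream latent.
resid_hat = a_up @ enc_up.weight                 # crude decoder: reuse the encoder weights
target = enc_down(mlp(resid_hat))[7]             # pre-activation of one downstream latent

target.backward()
edge_scores = (a_up.grad * a_up).detach()        # gradient x activation per upstream latent
top = edge_scores.abs().topk(5)
print(top.indices, top.values)                   # strongest candidate edges into latent 7
```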
Our paper "Function-Space Learning Rates" is on arXiv! We give an efficient way to estimate the magnitude of changes to NN outputs caused by a particular weight update. We analyse optimiser dynamics in function space, and enable hyperparameter transfer with our scheme FLeRM! 🧵👇
Very pleased to confirm that our paper "Residual Stream Analysis with Multi-Layer SAEs" has been accepted to ICLR 2025! openreview.net/forum?id=XAjfj…
Whoever decided to have nothing but Northern soul floor fillers playing between talks at ICML... respect.