Suraj Srinivas @ ICML
@Suuraj
ml researcher / trying to understand why deep learning works
One of my favourite (and most thought-provoking) ML papers!
# A new type of information theory

this paper is not super well-known, but it has changed my opinion of how deep learning works more than almost anything else. it says that we should measure the amount of information available in some representation based on how *extractable* it is,…
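The tweet is truncated and the paper isn't named here, but the description matches the "usable information" (V-information) framework of Xu et al. (ICLR 2020). Assuming that's the paper, a minimal sketch of the central definition:

```latex
% Predictive V-information (Xu et al., ICLR 2020), assuming this is the
% framework being described. \mathcal{V} is a restricted class of predictors
% (e.g., linear probes): information only counts if some f in \mathcal{V}
% can actually extract it.
H_{\mathcal{V}}(Y \mid X) = \inf_{f \in \mathcal{V}} \mathbb{E}_{x,y}\!\left[ -\log f[x](y) \right]
\qquad
I_{\mathcal{V}}(X \to Y) = H_{\mathcal{V}}(Y \mid \varnothing) - H_{\mathcal{V}}(Y \mid X)
```

With V = all functions this recovers Shannon mutual information; with V = linear probes, information that is present but not linearly extractable simply doesn't count, which is what "measure information by extractability" means.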
Here's how my recent papers & reviews are going:
* To solve a vision problem today, the sensible thing is to leverage a pre-trained VLM or video diffusion model. Such models implicitly represent a tremendous amount about the visual world that we can exploit.
* Figure out how to…
‼️🕚New paper alert with @ushabhalla_: Leveraging the Sequential Nature of Language for Interpretability (openreview.net/pdf?id=hgPf1ki…)! 1/n
Have you ever wondered whether a few instances of data contamination really lead to benchmark overfitting?🤔 Then our latest paper about the effect of data contamination on LLM evals might be for you!🚀 "How Much Can We Forget about Data Contamination?" (accepted at #ICML2025) shows…
## The case for more ambition

i wrote about how AI researchers should ask bigger and simpler questions, and publish fewer papers:
Why does Chain of Thought prompting actually work? @bohang_zhang will be talking about it today. Join us! @Suuraj @tverven
⏰⏰ Theory of Interpretable AI Seminar ⏰⏰ Chain-of-Thought: Why does explaining to LLMs using CoT prompting work? Join us on June 3, when @bohang_zhang will dive into the mechanisms behind chain-of-thought prompting — and what makes it so effective @tverven @Suuraj
We created a canvas that plugs into an image model’s brain. You can use it to generate images in real-time by painting with the latent concepts the model has learned. Try out Paint with Ember for yourself 👇
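The thread doesn't spell out the mechanism, so the following is only a plausible toy sketch of the idea (all names hypothetical, not the real implementation): nudge the model's intermediate activations along a learned concept direction, but only where the user painted.

```python
# Toy sketch of "painting with latent concepts" -- a guess at the rough idea,
# not the actual system. All names (acts, concept_vec, strength) are hypothetical.
import numpy as np

H, W, C = 64, 64, 16                       # spatial activation grid + channels
acts = np.random.randn(H, W, C)            # stand-in for intermediate activations
concept_vec = np.random.randn(C)           # direction for one learned concept
concept_vec /= np.linalg.norm(concept_vec)

mask = np.zeros((H, W))                    # the user's brush strokes
mask[20:40, 10:30] = 1.0                   # "paint" a rectangle

strength = 3.0                             # how hard to push toward the concept
steered = acts + strength * mask[..., None] * concept_vec
# A real system would run the rest of the image model on `steered` and
# re-render in real time as the mask changes.
```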
we live in a world where "verification is easier than generation" is no longer true
#NLProc AI Co-Scientists 🤖 can generate ideas, but can they spot mistakes? (not yet! 🚫) In our recent paper, we introduce SPOT, a dataset of STEM manuscripts (math, materials science, chemistry, physics, etc.), annotated with real errors. SOTA models like o3, gemini-2.5-pro…
data attribution is the most neglected thing in interpretability and people should join me in working on it
Curious about feature attribution? SHAP & LIME treat features independently—but features interact! Come hear how to "Disentangle Interactions and Dependencies in Feature Attribution" Tuesday (tomorrow!) 4pm CET, 10am ET @Suuraj @tverven
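A toy illustration of the interaction point (mine, not from the talk): on a function that is a pure interaction between two features, any attribution that perturbs one feature at a time sees essentially no signal.

```python
# f(x1, x2) = x1 * x2 is a pure interaction with no main effects: flipping one
# feature alone has ~zero *average* effect, even though the two features
# jointly determine the output entirely.
import numpy as np

rng = np.random.default_rng(0)
x = rng.choice([-1.0, 1.0], size=(100_000, 2))   # independent +/-1 features
f = lambda a: a[:, 0] * a[:, 1]

for i in range(2):                               # flip one feature at a time
    x_flip = x.copy()
    x_flip[:, i] *= -1
    print(f"mean marginal effect of x{i+1}: {np.mean(f(x) - f(x_flip)):+.4f}")
# Both means are ~0: one-feature-at-a-time perturbation misses the interaction.
```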
⏰⏰Theory of Interpretable AI Seminar ⏰⏰ Interested in Feature Attribution Explanations? In two weeks, May 6, Gunnar König @gcskoenig will talk about "Disentangling Interactions and Dependencies in Feature Attribution" @tverven @Suuraj
In April 2024, we launched the Theory of Interpretable AI seminar, aiming to build a community and unsure if we'd even have enough speakers. A year later, we're still growing. New to the seminar? Join us in building the foundations of XAI together @tverven @Suuraj 1/n
Today in **two hours** @mirco_mutti will talk about interpretable bandits Zoom link: uva-live.zoom.us/j/87120549999 @Suuraj @tverven
Can we get a *short* and *interpretable* policy for multi-armed bandits that is guaranteed to perform well? @mirco_mutti will present our (w/ @shiemannor and Jeongyeol Kwon) recent work on this cool new problem in the Theory of Interpretable AI today! (zoom link below)
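For intuition about what a "short and interpretable" policy can look like, here's the textbook explore-then-commit policy in a few lines; this is a standard illustrative example, not necessarily the construction in the paper.

```python
# Explore-then-commit: pull each arm k times, then commit to the best
# empirical mean. The whole policy *is* its explanation ("tried each arm
# 50 times, arm 2 looked best, played it from then on").
import random

def explore_then_commit(arms, k, horizon):
    total, means = 0.0, []
    for arm in arms:                               # explore phase
        samples = [arm() for _ in range(k)]
        total += sum(samples)
        means.append(sum(samples) / k)
    best = arms[means.index(max(means))]           # commit phase
    remaining = horizon - k * len(arms)
    total += sum(best() for _ in range(remaining))
    return total

arms = [lambda: random.gauss(0.3, 1), lambda: random.gauss(0.5, 1)]
print(explore_then_commit(arms, k=50, horizon=1000))
```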
DID I CRACK IT? I think I figured out at least a chunk of the math. It's trade deficit divided by their exports. EU: exports 531.6, imports 333.4, deficit 198.2. 198.2/531.6 ≈ 37%, close to the chart's 39. Israel: exports 22.2, imports 14.8, deficit 7.4. 7.4/22.2 ≈ 33%.
FULL LIST: Liberation Day
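The arithmetic above checks out; a minimal reproduction (figures taken from the tweet, in $bn):

```python
# "Tariff" = trade deficit / that partner's exports to the US, as a percentage.
pairs = {          # (their exports to the US, their imports from the US)
    "EU":     (531.6, 333.4),
    "Israel": (22.2, 14.8),
}
for name, (exports, imports) in pairs.items():
    deficit = exports - imports
    print(f"{name}: {deficit:.1f}/{exports:.1f} = {100 * deficit / exports:.0f}%")
# EU: 198.2/531.6 = 37%   (chart says 39)
# Israel: 7.4/22.2 = 33%  (chart says 33)
```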
Take a break from arxiv/LW/AF. Sit in the woods with a random textbook and mull new ideas away from interp community lockstep. Diverge. Don’t compete with a saturated subtopic, maybe you’ll get to take weekends off. Premature overinvestment comes from monoculture.
So what should the community do? I'd guess we're over-invested in fundamental SAE research, but we shouldn't abandon it completely; SAEs remain a valuable tool, esp. for exploration and debugging. I'm most keen on applied work, and on making targeted fixes for fundamental issues.