Adam Jermyn

@AdamSJermyn

AI Interpretability & Safety @AnthropicAI. Previously at @FlatironInst @FlatironCCA, @KITP_UCSB, PhD @Cambridge_Uni, BS @Caltech.

Joined July 2009

207 Following

2K Followers

Adam Jermyn @AdamSJermyn · Jul 25

A collection of small updates from the Anthropic Interpretability team: transformer-circuits.pub/2025/july-upda…

Adam Jermyn Retweeted

Liron Shapira @liron · Jul 19

Who knew you could win gold in the International Math Olympiad without truly reasoning?

Adam Jermyn @AdamSJermyn · Jun 15

Claude is real, and strong, and he's my friend.

Aryaman Arora @aryaman2020 · Jun 14

Claude

Adam Jermyn @AdamSJermyn · Jun 11

we've got some really cool new work looking at @AnthropicAI's circuit tracing work and comparing it to a circuit that's already been studied. some really interesting findings in here

Goodfire @GoodfireAI · Jun 11

New research update! We replicated @AnthropicAI's circuit tracing methods to test if they can recover a known, simple transformer mechanism.

Adam Jermyn @AdamSJermyn · May 18

Wild new examples of OpenAI shadiness. Great scoop

Garrison Lovely @GarrisonLovely · May 17

SCOOP: I obtained a previously unreported letter from OpenAI to California's Attorney General that includes surprising admissions about the company's restructuring plans – and shows how OpenAI is attacking critics who question its attempts to diminish its nonprofit governance.

Adam Jermyn @AdamSJermyn · Apr 30

A collection of small updates from the Anthropic Interpretability team (transformer-circuits.pub/2025/april-upd…). [Note this is different from the Attention update... it's been a busy time!]

Adam Jermyn @AdamSJermyn · Apr 28

An update on Attention from the Anthropic Interpretability team (transformer-circuits.pub/2025/attention…). This one is close to home since understanding attention is a lot of what I've worked on the last ... 18 months.
