Rohan Gupta
@RohDGupta
Joined April 2023
87 Following
13 Followers
Rohan Gupta Retweeted
Iván Arcuschin @IvanArcus · Jul 22, 2024
Circuit discovery techniques aim to find subgraphs of NNs for specific tasks. Are they correct? Which one is the best? 🕵️ Introducing InterpBench: 17 semi-synthetic, realistic transformers with known circuits to evaluate mechanistic interpretability. Read on... 🧵