Actionable Interpretability Workshop ICML2025
@ActInterp
🛠️ Actionable Interpretability🔎 @icmlconf 2025 | Bridging the gap between insights and actions ✨ https://actionable-interpretability.github.io
Many thanks to the @ActInterp organisers for highlighting our work - and congratulations to Pedro, Alex and the other awardees! Sad not to have been there in person, it looked like a fantastic workshop. @AmsterdamNLP @EdinburghNLP
Big congrats to Alex McKenzie, Pedro Ferreira, and their collaborators on receiving Outstanding Paper Awards!👏👏 and thanks for the fantastic oral presentations! Check out the papers here 👇
Great to present what’s coming next for NDIF at the @actinterp workshop at #ICML2025! If you missed us, let’s chat after the conference. Reach out here: forms.gle/AhTSBNNttA11JV…
maybe I will live tweet the actionable interp workshop panel
Huge thanks to Sarah Schwettmann for a fascinating keynote on "AI Investigators for Understanding AI Systems" 🤖 @cogconfluence @TransluceAI

Grab a ☕️ and join us for a keynote by @RICEric22: Explanations for Experts via Guarantees and Domain Knowledge: From Attributions to Reasoning

➡️ Join us for the keynote by @byron_c_wallace: “What (if anything) can interpretability do for healthcare?”

The second poster session is starting now!🙌🏻


Come see our poster about how to predict side effects of unlearning and Fine-Tuning at @ActInterp
Crazy amount of cool work concentrated in one room
The first poster session is happening now!
The first poster session is happening now!



The one and only @_beenkim on Agentic Interpretability and Neologism: What LLMs Can Offer Us!

We’ve started!👏 Looking forward to an exciting day!💫🔍⚙️

🚨The Actionable Interpretability Workshop is happening tomorrow at ICML! Join us for an exciting lineup of speakers, nearly 70 posters, and a great panel discussion 🙌 Don’t miss it! 🔍⚙️ @icmlconf @ActInterp


At ICML? Interested in how we can do more with interpretability to have practical impact on the rest of AI? Come to our workshop this Saturday!
Hope everyone’s getting the most out of #icml25. We’re excited and ready for the Actionable Interpretability (@ActInterp) workshop this Saturday! Check out the schedule and join us to discuss how we can move interpretability toward more practical impact.
I will be at the Actionable Interpretability Workshop (@ActInterp, #ICML) presenting *SSAEs* in the East Ballroom A from 1-2pm. Drop by (or send a DM) to chat about (actionable) interpretability, (actionable) identifiability, and everything in between!
1\ Hi, can I get an unsupervised sparse autoencoder for steering, please? I only have unlabeled data varying across multiple unknown concepts. Oh, and make sure it learns the same features each time! Yes! A freshly brewed Sparse Shift Autoencoder (SSAE) coming right up. 🧶