Actionable Interpretability Workshop ICML2025

@ActInterp

🛠️ Actionable Interpretability🔎 @icmlconf 2025 | Bridging the gap between insights and actions ✨ https://actionable-interpretability.github.io

Joined March 2025

11Following

241Followers

Actionable Interpretability Workshop ICML2025@ActInterp · Jul 21

Many thanks to the @ActInterp organisers for highlighting our work - and congratulations to Pedro, Alex and the other awardees! Sad not to have been there in person, it looked like a fantastic workshop. @AmsterdamNLP @EdinburghNLP

AActionable Interpretability Workshop ICML2025@ActInterp · Jul 20

Big congrats to Alex McKenzie, Pedro Ferreira, and their collaborators on receiving Outstanding Paper Awards!👏👏 and thanks for the fantastic oral presentations! Check out the papers here 👇

2.0K

Actionable Interpretability Workshop ICML2025 Retweeted

NDIF@ndif_team · Jul 19

Great to present what’s coming next for NDIF at the @actinterp workshop at #ICML2025! If you missed us, let’s chat after the conference. Reach out here: forms.gle/AhTSBNNttA11JV…

2.0K

Actionable Interpretability Workshop ICML2025 Retweeted

Aryaman Arora@aryaman2020 · Jul 19

maybe I will live tweet the actionable interp workshop panel

101

12.0K

Actionable Interpretability Workshop ICML2025@ActInterp · Jul 19

Huge thanks to Sarah Schwettmann for a fascinating keynote on "AI Investigators for Understanding AI Systems" 🤖 @cogconfluence @TransluceAI

ActInterp's tweet image. Huge thanks to Sarah Schwettmann for a fascinating keynote on "AI Investigators for Understanding AI Systems" 🤖 @cogconfluence @TransluceAI

4.0K

Actionable Interpretability Workshop ICML2025@ActInterp · Jul 19

Grab a ☕️ and join us for a keynote by @RICEric22: Explanations for Experts via Guarantees and Domain Knowledge: From Attributions to Reasoning

ActInterp's tweet image. Grab a ☕️ and join us for a keynote by @RICEric22: Explanations for Experts via Guarantees and Domain Knowledge: From Attributions to Reasoning

699

Actionable Interpretability Workshop ICML2025@ActInterp · Jul 19

➡️ Join us for the keynote by @byron_c_wallace: “What (if anything) can interpretability do for healthcare?”

897

Actionable Interpretability Workshop ICML2025@ActInterp · Jul 19

The second poster session is starting now!🙌🏻

640

Actionable Interpretability Workshop ICML2025 Retweeted

Aly M. Kassem@_AKassem · Jul 19

Come see our poster about how to predict side effects of unlearning and Fine-Tuning at @ActInterp

2.0K

Actionable Interpretability Workshop ICML2025@ActInterp · Jul 19

Crazy amount of cool work concentrated in one room

AActionable Interpretability Workshop ICML2025@ActInterp · Jul 19

The first poster session is happening now!

1.0K

Actionable Interpretability Workshop ICML2025@ActInterp · Jul 19

The first poster session is happening now!

4.0K

Actionable Interpretability Workshop ICML2025@ActInterp · Jul 19

The one and only @_beenkim on Agentic Interpretability and Neologism: What LLMs Can Offer Us!

2.0K

Actionable Interpretability Workshop ICML2025@ActInterp · Jul 19

We’ve started!👏 Looking forward to an exciting day!💫🔍⚙️

1.0K

Actionable Interpretability Workshop ICML2025@ActInterp · Jul 18

🚨The Actionable Interpretability Workshop is happening tomorrow at ICML! Join us for an exciting lineup of speakers, nearly 70 posters, and a great panel discussion 🙌 Don’t miss it! 🔍⚙️ @icmlconf @ActInterp

ActInterp's tweet image. 🚨The Actionable Interpretability Workshop is happening tomorrow at ICML!
Join us for an exciting lineup of speakers, nearly 70 posters, and a great panel discussion 🙌
Don’t miss it! 🔍⚙️

@icmlconf @ActInterp

2.0K

Actionable Interpretability Workshop ICML2025@ActInterp · Jul 18

At ICML? Interested in how we can do more with interpretability to have practical impact on the rest of AI? Come to our workshop this Saturday!

HHadas Orgad @ ICML@OrgadHadas · Jul 17

Hope everyone’s getting the most out of #icml25. We’re excited and ready for the Actionable Interpretability (@ActInterp) workshop this Saturday! Check out the schedule and join us to discuss how we can move interpretability toward more practical impact.

1.0K

Actionable Interpretability Workshop ICML2025@ActInterp · Jul 17

I will be at the Actionable Interpretability Workshop (@ActInterp, #ICML) presenting *SSAEs* in the East Ballroom A from 1-2pm. Drop by (or send a DM) to chat about (actionable) interpretability, (actionable) identifiability, and everything in between!

SShruti Joshi@_shruti_joshi_ · Feb 21

1\ Hi, can I get an unsupervised sparse autoencoder for steering, please? I only have unlabeled data varying across multiple unknown concepts. Oh, and make sure it learns the same features each time! Yes! A freshly brewed Sparse Shift Autoencoder (SSAE) coming right up. 🧶

2.0K