Rohit Gandikota
@rohitgandikota
Ph.D. in AI @ Northeastern University. Understanding, mapping, and editing knowledge in large generative models. Ex-scientist, Indian Space Research Organization
Can you ask a Diffusion Model to break down a concept? 👀 SliderSpace 🚀 reveals maps of the visual knowledge naturally encoded within diffusion models. It works by decomposing the model's capabilities into intuitive, composable sliders. Here's how 🧵👇
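The "decompose capabilities into composable sliders" idea can be caricatured in a few lines. This is an illustrative toy, not the actual SliderSpace method: assume we already have a batch of output embeddings and use plain PCA, so each principal direction of variation acts as a slider you can scale and recombine.

```python
import numpy as np

def discover_sliders(embeddings, n_sliders=3):
    """Toy 'slider discovery': PCA over a batch of output embeddings,
    so each principal direction becomes a composable slider."""
    mean = embeddings.mean(axis=0)
    # right singular vectors = principal directions of variation
    _, _, vt = np.linalg.svd(embeddings - mean, full_matrices=False)
    return mean, vt[:n_sliders]

def apply_slider(mean, sliders, weights):
    """Move along a weighted combination of discovered directions."""
    return mean + np.asarray(weights) @ sliders
```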
RAG for Wikipedia! Seems like a good way to streamline evaluations when editing knowledge in LLMs
RAGs are extremely useful, and yet there isn't an open-source RAG system for Wikipedia (or I couldn't find one). So I built WikiRAG, a simple open-source GitHub + Hugging Face repo✨ Spin up your own RAG server for Wikipedia in a single line. 🚗💨
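For a sense of what a Wikipedia RAG retriever does under the hood, here is a toy bag-of-words version. Purely illustrative, not the WikiRAG API: real systems use dense embeddings and a vector index, but the retrieve-then-rank skeleton is the same.

```python
import math
from collections import Counter

def tokenize(text):
    # crude normalization: lowercase, strip trailing punctuation
    return [w.strip(".,!?").lower() for w in text.split()]

def retrieve(query, passages, k=1):
    """Score each passage by TF-IDF overlap with the query
    and return the top-k passages."""
    n = len(passages)
    docs = [Counter(tokenize(p)) for p in passages]
    # document frequency: how many passages contain each word
    df = Counter(w for d in docs for w in d)
    def score(d):
        return sum(d[w] * math.log(n / df[w])
                   for w in tokenize(query) if w in d)
    ranked = sorted(range(n), key=lambda i: score(docs[i]), reverse=True)
    return [passages[i] for i in ranked[:k]]
```

A RAG server then just stuffs the retrieved passages into the LLM prompt as context.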
Yes indeed! Here are 3 pieces of advice from @davidbau for PhD students interested in mech interp
@davidbau's talk on "What is AI interpretability for?" is about to start! If you know David or have seen his talks in the past, you know it is going to be special🔥 Fan or skeptic of mech interp? Come ask David some questions! Location: Grand C1 Hall or Zoom
The Mechanistic Interpretability for Vision Workshop has officially begun at @CVPR! 🚀 Join us at Grand C1 Hall for insightful perspectives on the state of interpretability in vision models by @TamarRottShaham.
Come join us at the @miv_cvpr2025 workshop today at @CVPR in room C1! We have an amazing lineup of speakers 🔊 including @davidbau @soniajoseph_ @trevordarrell @aleks_madry Antonio Torralba and Michal Irani 🙌🏻
🔍 Curious about what's really happening inside vision models? Join us at the First Workshop on Mechanistic Interpretability for Vision (MIV) at @CVPR! 📢 Website: sites.google.com/view/miv-cvpr2… Meet our amazing invited speakers! #CVPR2025 #MIV25 #MechInterp #ComputerVision
Vision transformers have high-norm outliers that hurt performance and distort attention. While prior work removed them by retraining with “register” tokens, we find the mechanism behind outliers and make registers at ✨test-time✨—giving clean features and better performance! 🧵
Artifacts in your attention maps? Forgot to train with registers? Use 𝙩𝙚𝙨𝙩-𝙩𝙞𝙢𝙚 𝙧𝙚𝙜𝙞𝙨𝙩𝙚𝙧𝙨! We find that a sparse set of activations sets the artifact positions. We can shift them anywhere ("Shifted") — even outside the image into an untrained token. Clean maps, no retraining.
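The shift trick can be sketched in miniature. This is an illustrative toy, not the paper's code: assume activations arrive as a (tokens, dim) array, outliers are flagged by norm against the median, and their mass is moved into one appended "register" token.

```python
import numpy as np

def shift_outliers_to_register(tokens, threshold=5.0):
    """Append a blank register token and move high-norm outlier
    activations into it, zeroing the original patch positions.
    tokens: (n_tokens, dim) array of ViT activations."""
    norms = np.linalg.norm(tokens, axis=1)
    # flag tokens whose norm dwarfs the typical token norm
    outliers = norms > threshold * np.median(norms)
    register = tokens[outliers].sum(axis=0)  # collect outlier mass
    cleaned = tokens.copy()
    cleaned[outliers] = 0.0                  # clean patch positions
    return np.vstack([cleaned, register[None, :]]), outliers
```

Attention maps computed over the cleaned patch tokens then no longer show the high-norm spikes.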
Tired of looking at pixels and want to look at some neurons? Come join us @miv_cvpr2025 this Thursday @CVPR in room "Grand C1"
It might be a rainy @CVPR this time, but we at MIV workshop have you covered! Come to Grand C1 hall and listen to our great speakers talk about why mechanistic interpretability is important for vision models Date: June 12th, 9AM Location: Grand C1 hall More info?👇
Super excited about @miv_cvpr2025! What an incredible lineup of invited speakers! 👀 You instantly know that the workshop is going to be 🔥 Come to Grand C1 hall on June 12th
Is the knowledge of a concept **really** removed when we erase a concept from a diffusion model? @kevinlu4588 found that the answer is often NO! Check out these clever yet simple techniques to search for the traces of knowledge in your erased models.
When we "erase" a concept from a diffusion model, is that knowledge truly gone? 🤔 We investigated, and the answer is often 'no'! Using simple probing techniques, the knowledge traces of the erased concept can be easily resurfaced 🔍 Here is what we learned 🧵👇
1. Enhance a concept. You can enhance a concept inside a model's knowledge. For example, here is UCE enhancing "mustache" inside @vivago_ai's HiDream-I1 model in under 10 secs on a 40GB GPU
I used to think formal reasoning was central to language and intelligence, but now I’m not so sure. Wrote a short post about my thoughts on this, with a couple chewy anecdotes. Would love to get some feedback/pointers to further reading. sfeucht.github.io/syllogisms/
@amuuueller presenting sparse feature circuits at #ICLR2025 Also Aaron is starting as faculty at BU in the fall so reach out to him if you’re looking for PhD positions. You’ll get a terrific mentor!
Nice trick to make diffusion model outputs more diverse...
Why do distilled diffusion models generate similar-looking images? 🤔 Our Diffusion Target (DT) visualization reveals the secret to diversity. It is the very first time-step! And—there is a simple, training-free way to make them more diverse! Here is how: 🧵👇
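The training-free idea can be sketched roughly like this. Hypothetical names, not the paper's implementation: assume `denoise_fn` is a one-step distilled model, and the trick is to keep a fraction of the random seed noise in the very first step's output so different seeds stay distinguishable.

```python
import numpy as np

def diversify_first_step(denoise_fn, x_T, gamma=0.5):
    """Training-free diversity sketch: at the very first timestep,
    blend the model's prediction back toward the initial noise.
    denoise_fn(x, t) -> predicted clean latent; gamma in [0, 1]."""
    x0_pred = denoise_fn(x_T, t=1.0)
    # gamma=1 reproduces the (possibly collapsed) model output;
    # smaller gamma reinjects seed-dependent variation
    return gamma * x0_pred + (1.0 - gamma) * x_T
```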
An exciting and up-to-date implementation for text-to-image model editing of our TIME method! Great to see these ideas evolve and get applied to SOTA models. Project page & paper 👇🏻
You can now edit the HiDream-I1 model in under 10 seconds!🚀 (... FLUX and SDXL in under 2 seconds on an RTX4090) We are releasing a simple implementation of our UCE work to support any diffusion model. Here are some cool things you can do with this lightning-fast editing method 🧵👇
AI systems are sold as black boxes, so you might think it is impossible to understand their thoughts. But you can. And control their knowledge. Like fixing all sorts of unfair biases—heh, even showing that scientists don't actually wear white coats. x.com/rohitgandikota…
3. Erasing a visual association. In addition to erasing a concept entirely, you can also edit the association between two concepts. For example, here is UCE erasing the knowledge from @StabilityAI's SDXL model that "scientists wear glasses" in <2 secs on a 24GB GPU
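The closed-form flavor of edits like this can be sketched as a least-squares weight update. This is a sketch in the spirit of UCE, not its exact implementation: edit keys get mapped to new target values while preserved keys keep their old outputs, all in one matrix solve — which is why it runs in seconds.

```python
import numpy as np

def closed_form_edit(W, edit_keys, edit_values, preserve_keys, lam=1.0):
    """One-shot weight edit: find W_new minimizing
    sum ||W_new c_i - v_i||^2 + lam * sum ||W_new c_p - W c_p||^2.
    W: (d_out, d_in); keys: (n, d_in); values: (n, d_out)."""
    num = edit_values.T @ edit_keys + lam * W @ (preserve_keys.T @ preserve_keys)
    den = edit_keys.T @ edit_keys + lam * (preserve_keys.T @ preserve_keys)
    return num @ np.linalg.inv(den)
```

No gradient descent, no data loader — just a linear solve per edited projection matrix, which is what makes this style of editing so fast.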
Realize what @rohitgandikota and @OrgadHadas and @materzynska have done here. It goes beyond making an AI that can edit something for you. By understanding how the neurons store knowledge, they let you reach inside and change that knowledge directly. x.com/rohitgandikota…
Those are image editing tools. Here, we are editing the model weights themselves. It's like training your own custom model. Now, a common problem with creating custom models is that they take a lot of time and require huge GPU compute. Our method is super fast and requires little compute.