Alex Li
@alexlioralexli
researcher @AnthropicAI. prev @mldcmu, @AIatMeta, @berkeley_ai
Diffusion models have amazing image creation abilities. But how well does their generative knowledge transfer to discriminative tasks? We present Diffusion Classifier: strong classification results with pretrained conditional diffusion models, *with no additional training*! 1/9
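For context, the Diffusion Classifier idea is to score every candidate class by how well the class-conditional diffusion model denoises the image, then predict the class with the lowest error. A minimal sketch of that scoring loop, assuming a generic `eps_model(x_t, t, y)` noise-prediction interface and a standard `alphas_cumprod` schedule (illustrative only, not the released implementation):

```python
import torch

def diffusion_classify(x0, eps_model, alphas_cumprod, num_classes, n_trials=128):
    """Pick the class whose conditioning gives the lowest denoising error.

    Sketch only: `eps_model(x_t, t, y)` is an assumed interface for a
    conditional noise-prediction network; `alphas_cumprod` holds the
    schedule's cumulative alpha products.
    """
    device = x0.device
    T = alphas_cumprod.shape[0]
    errors = torch.zeros(num_classes, device=device)
    for _ in range(n_trials):
        # Share the same timestep and noise across classes to reduce variance.
        t = torch.randint(0, T, (1,), device=device)
        noise = torch.randn_like(x0)
        a_bar = alphas_cumprod[t].view(1, 1, 1, 1)
        x_t = a_bar.sqrt() * x0 + (1 - a_bar).sqrt() * noise
        for y in range(num_classes):
            with torch.no_grad():
                eps_hat = eps_model(x_t, t, torch.tensor([y], device=device))
            errors[y] += torch.mean((eps_hat - noise) ** 2)
    return int(errors.argmin())  # lowest average error = predicted class
```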
Robotic intelligence requires dexterous tool use, but generalizing across tools is hard. Our CoRL23 paper combines semantics (affordances) with low-level control (sim2real) to show functional grasping that generalizes to hammers, drills and more! dexfunc.github.io 1/n
Artifacts in your attention maps? Forgot to train with registers? Use 𝙩𝙚𝙨𝙩-𝙩𝙞𝙢𝙚 𝙧𝙚𝙜𝙞𝙨𝙩𝙚𝙧𝙨! We find that a sparse set of activations determines artifact positions. We can shift them anywhere ("Shifted"), even outside the image into an untrained token. Clean maps, no retraining.
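Roughly, the fix is to give the model somewhere to put the outliers at inference time. A loose sketch of that idea on a raw activation tensor, where the top-k-norm selection heuristic, the mean-token fill-in, and the single appended register are all assumptions rather than the paper's exact procedure:

```python
import torch

def shift_outliers_to_register(tokens, k=1):
    """tokens: [B, N, D] patch activations from a ViT block (no CLS token here).

    Loose sketch: append one untrained "register" token and move the k
    highest-norm patch activations into it, replacing them with the mean
    patch activation. The selection heuristic and k are assumptions.
    Modifies `tokens` in place.
    """
    B, N, D = tokens.shape
    norms = tokens.norm(dim=-1)                       # [B, N]
    idx = norms.topk(k, dim=1).indices                # candidate artifact positions
    register = torch.zeros(B, 1, D, device=tokens.device, dtype=tokens.dtype)
    mean_tok = tokens.mean(dim=1, keepdim=True)       # [B, 1, D]
    for b in range(B):
        register[b, 0] = tokens[b, idx[b]].sum(dim=0)
        tokens[b, idx[b]] = mean_tok[b]               # clean the patch positions
    return torch.cat([tokens, register], dim=1)       # [B, N + 1, D]
```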
Excited to be presenting at #ICLR2025 at 10am today on how generative classifiers are much more robust to distribution shift. Come by to chat and say hello!

Are current reasoning models optimal for test-time scaling? 🌠 No! Models make the same incorrect guess over and over again. We show that you can fix this problem w/o any crazy tricks 💫 – just do weight ensembling (WiSE-FT) for big gains on math! 1/N
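WiSE-FT itself is just a linear interpolation in weight space between the base checkpoint and the fine-tuned one. A minimal sketch, assuming both checkpoints share an architecture:

```python
import torch

def wise_ft(base_state, finetuned_state, alpha=0.5):
    """Weight-space ensemble (WiSE-FT): linear interpolation of two checkpoints.

    base_state / finetuned_state: state_dicts with identical keys and shapes.
    alpha = 0 returns the base model, alpha = 1 the fine-tuned one.
    Integer buffers (e.g. counters) are taken from the fine-tuned model.
    """
    return {
        k: ((1 - alpha) * base_state[k] + alpha * finetuned_state[k])
        if torch.is_floating_point(base_state[k]) else finetuned_state[k]
        for k in base_state
    }

# Usage sketch: model.load_state_dict(wise_ft(base.state_dict(), ft.state_dict(), alpha=0.7))
```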
*On the Surprising Effectiveness of Attention Transfer for Vision Transformers* by @tydsh @BeidiChen @pathak2206 @endernewton @alexlioralexli Shows that distilling attention patterns in ViTs is competitive with standard fine-tuning. arxiv.org/abs/2411.09702
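The core recipe is to train the student from scratch while matching its attention maps to a frozen pretrained teacher. A minimal sketch of such an attention-matching loss, assuming both models expose per-layer attention probabilities (the interface here is illustrative, not the paper's code):

```python
import torch
import torch.nn.functional as F

def attention_transfer_loss(student_attns, teacher_attns):
    """Match the student's attention maps to a frozen pretrained teacher.

    Each list element is a [B, heads, N, N] tensor of attention probabilities
    from one transformer block; both lists are assumed to be layer-aligned.
    """
    loss = 0.0
    for s, t in zip(student_attns, teacher_attns):
        # KL between teacher and student attention distributions per query.
        loss = loss + F.kl_div(s.clamp_min(1e-8).log(), t, reduction="batchmean")
    return loss / len(student_attns)

# Total objective (sketch): task_loss + lam * attention_transfer_loss(student_attns, teacher_attns)
```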
Do generative video models learn physical principles from watching videos? Very excited to introduce the Physics-IQ benchmark, a challenging dataset of real-world videos designed to test physical understanding of video models. Webpage: physics-iq.github.io
Have you ever wondered why we don’t use multiple visual encoders for VideoLLMs? We thought the same! Excited to announce our latest work MERV, on using Multiple Encoders for Representing Videos in VideoLLMs, outperforming prior works with the same data. 🧵
I'm presenting our #NeurIPS2024 work on Attention Transfer today! Key finding: Pretrained representations aren't essential - just using attention patterns from pretrained models to guide token interactions is enough for models to learn high-quality features from scratch and…

What happens when you train a video generation model to be conditioned on motion? Turns out you can perform "motion prompting," just like you might prompt an LLM! Doing so enables many different capabilities. Here are a few examples – check out this thread 🧵 for more results!
Chatbots are often augmented w/ new facts via context from the user or a retriever. Models must adapt instead of hallucinating outdated facts. In this work w/@goyalsachin007, @zicokolter, @AdtRaghunathan, we show that instruction tuning fails to reliably improve this behavior! [1/n]
1/ Happy to share VADER: Video Diffusion Alignment via Reward Gradients. We adapt foundational video diffusion models using pre-trained reward models to generate high-quality, aligned videos for various end-applications. Below we generated a short movie using VADER 😀, we used…
Want to scale RL with your shiny new GPU? 🚀 In our ICML24 Oral we find that RL algorithms hit a barrier when data is scaled up. Our new algorithm, SAPG, proposes a simple fix. It scales to 25k envs and solves hard tasks where PPO makes no progress. sapg-rl.github.io 1/n
I’ll be giving an oral presentation today on how generative classifiers are much more robust to distribution shift! (openreview.net/forum?id=02dpw…) Come by to the SPIGM #ICML2024 workshop at 10:40am for my talk or 3:10pm for the poster session!

📝 New from FAIR: An Introduction to Vision-Language Modeling. Vision-language models (VLMs) are an area of research with a lot of potential to change how we interact with technology; however, there are many challenges in building these models. Together with a set…
Did you know that the optimizer Sharpness-Aware Minimization (SAM) is very robust to heavy label noise, with gains of tens of percentage points over SGD? In our new work, we take a deep dive into how SAM achieves these gains. As it turns out, it’s not at all about sharpness at convergence!
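As a reminder, SAM perturbs the weights along the normalized gradient direction, recomputes the gradient at the perturbed point, and applies that gradient at the original weights. A minimal sketch of one update, assuming `loss_fn` is a closure that recomputes the forward pass:

```python
import torch

def sam_step(model, loss_fn, base_optimizer, rho=0.05):
    """One Sharpness-Aware Minimization update (minimal sketch).

    1) Ascend a distance rho along the normalized gradient.
    2) Recompute the gradient at the perturbed weights.
    3) Undo the perturbation and let the base optimizer apply that gradient.
    """
    params = [p for p in model.parameters() if p.requires_grad]

    base_optimizer.zero_grad()
    loss_fn().backward()                                   # gradient at w
    grad_norm = torch.norm(torch.stack([p.grad.norm() for p in params]))
    eps = []
    with torch.no_grad():
        for p in params:
            e = rho * p.grad / (grad_norm + 1e-12)
            p.add_(e)                                      # move to w + eps
            eps.append(e)

    base_optimizer.zero_grad()
    loss_fn().backward()                                   # gradient at w + eps
    with torch.no_grad():
        for p, e in zip(params, eps):
            p.sub_(e)                                      # return to w
    base_optimizer.step()                                  # descend with the SAM gradient
    base_optimizer.zero_grad()
```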
What do you see in these images? These are called hybrid images, originally proposed by Aude Oliva et al. They change appearance depending on size or viewing distance, and are just one kind of perceptual illusion that our method, Factorized Diffusion, can make.
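The classic hybrid-image construction just sums the low frequencies of one image with the high frequencies of another. A minimal sketch of that baseline with Gaussian filtering (Factorized Diffusion instead generates the frequency components with a diffusion model):

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def hybrid_image(img_low, img_high, sigma=6.0):
    """Low frequencies of img_low + high frequencies of img_high.

    img_low, img_high: float arrays in [0, 1] with the same shape (H, W[, C]).
    sigma sets the frequency cutoff (value here is arbitrary).
    """
    s = (sigma, sigma, 0) if img_low.ndim == 3 else sigma  # don't blur across channels
    low = gaussian_filter(img_low, sigma=s)                 # coarse structure, seen from afar
    high = img_high - gaussian_filter(img_high, sigma=s)    # fine detail, seen up close
    return np.clip(low + high, 0.0, 1.0)
```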
Are you interested in jailbreaking LLMs? Have you ever wished that jailbreaking research was more standardized, reproducible, or transparent? Check out JailbreakBench, an open benchmark and leaderboard for Jailbreak attacks and defenses on LLMs! jailbreakbench.github.io 🧵1/n
Have you ever done a dense grid search over neural network hyperparameters? Like a *really dense* grid search? It looks like this (!!). Bluish colors correspond to hyperparameters for which training converges, reddish colors to hyperparameters for which training diverges.
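The same kind of converge/diverge map can be reproduced on a toy problem: sweep a dense grid of settings, run the optimizer, and record whether the loss blows up. A minimal sketch on a quadratic, with the grid ranges chosen arbitrarily:

```python
import numpy as np

def converges(lr, curvature, steps=200):
    """Gradient descent on f(w) = 0.5 * curvature * w^2; diverges once lr > 2 / curvature."""
    w = 1.0
    for _ in range(steps):
        w -= lr * curvature * w
        if not np.isfinite(w) or abs(w) > 1e6:
            return False
    return True

# Dense grid over (learning rate, curvature); ranges are arbitrary choices.
lrs = np.logspace(-3, 1, 200)
curvatures = np.logspace(-1, 2, 200)
grid = np.array([[converges(lr, c) for lr in lrs] for c in curvatures])
# `grid` can be rendered with plt.imshow: one color for converged, another for diverged.
```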
🤖 VIRL 🌎 Grounding Virtual Intelligence In Real Life 🧐How can we embody agents in environments as rich/diverse as those we inhabit, without real hardware & control constraints? 🧐How can we ensure internet-trained vision/language models will translate to real life globally?
🌎 𝕤𝕒𝕪 𝕙𝕖𝕝𝕝𝕠 𝕥𝕠 𝕧𝕚𝕣𝕝 🌏 virl-platform.github.io
Can we use motion to prompt diffusion models? Our #ICLR2024 paper does just that. We propose Motion Guidance, a technique that allows users to edit an image by specifying “where things should move.”
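Guidance of this flavor usually works by differentiating a motion loss through sampling: estimate the flow between the source image and the current clean estimate, compare it to the user's target flow, and nudge the sample along the negative gradient. A loose sketch of one guided step, where `denoise_step`, `flow_net`, and the guidance scale are assumed interfaces, not the paper's exact procedure:

```python
import torch

def guided_step(x_t, t, src_img, target_flow, denoise_step, flow_net, scale=1.0):
    """One diffusion sampling step steered toward a user-specified motion field.

    Assumed interfaces: `denoise_step(x_t, t)` returns (x_prev, x0_hat) and is
    differentiable; `flow_net(src, tgt)` estimates optical flow between images.
    """
    x_t = x_t.detach().requires_grad_(True)
    x_prev, x0_hat = denoise_step(x_t, t)             # current clean estimate
    flow = flow_net(src_img, x0_hat)                  # motion from source to estimate
    loss = torch.mean((flow - target_flow) ** 2)      # distance to the requested motion
    grad = torch.autograd.grad(loss, x_t)[0]
    return (x_prev - scale * grad).detach()           # nudge the sample toward the target flow
```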