Apoorv Khandelwal
@apoorvkh
CS PhD student at Brown
Wondering how long it takes to train a 1B-param LM from scratch on your GPUs? 🧵 See our paper to learn about the current state of academic compute and how to efficiently train models! Use our code to test your own models/GPUs! arxiv.org/abs/2410.23261 github.com/apoorvkh/acade…
Check out our new paper: “How Do Vision-Language Models Process Conflicting Information Across Modalities?”! Vision-language models often struggle with conflicting inputs: we show how their internal representations and key attention heads reveal when and how this happens, and…
I wrote up this post about how we should **unify RL and next-token-prediction**, based on my perspective on how humans learn new languages. then realized @jxmnop wrote the exact same thing about how we should scale RL to 10^26 FLOPs
🚨 Registration is live! 🚨 The New England Mechanistic Interpretability (NEMI) Workshop is happening August 22nd 2025 at Northeastern University! A chance for the mech interp community to nerd out on how models really work 🧠🤖 🌐 Info: nemiconf.github.io/summer25/ 📝 Register:…
The NeurIPS paper checklist corroborates the bureaucratic theory of statistics. argmin.net/p/standard-err…
Is there a clear choice or difference between Cursor, VS Code + Copilot, or something else? They both seem quite similar to me (VS Code-based, chat, tab complete, same downstream LLMs, etc). Thoughts?
Molmo won the Best Paper Honorable Mention award @CVPR! This work was a long journey over 1.5 years, from failing to get strong performance with massive-scale, low-quality data, to focusing on modest-scale, extremely high-quality data! Proud to see what it became. #CVPR2025
🤔Ever wonder why LLMs give inconsistent answers in different languages? In our paper, we identify two failure points in the multilingual factual recall process and propose fixes that guide LLMs to the "right path." This can boost performance by 35% in the weakest language! 📈
excited to finally share on arxiv what we've known for a while now: All Embedding Models Learn The Same Thing. embeddings from different models are SO similar that we can map between them based on structure alone, without *any* paired data. feels like magic, but it's real: 🧵
this is sick. all i'll say is that these GIFs are proof that the biggest bet of my research career is gonna pay off. excited to say more soon
Giving your models more time to think before predicting, via smart decoding, chain-of-thought reasoning, latent thoughts, etc., turns out to be quite effective for unlocking the next level of intelligence. New post is here :) “Why we think”: lilianweng.github.io/posts/2025-05-…
The long-term goal of AI is to build models that can handle arbitrary tasks, not just ones they’ve been trained on. We hope our new *benchmark generator* can help measure progress toward this vision
🎮 Excited to announce gg-bench, a fully synthetic benchmark for LLMs consisting of games generated entirely by LLMs!! This benchmark centers on the fact that LLMs are capable of generating complex tasks that they themselves cannot even solve. 📄: arxiv.org/abs/2505.07215
Today, we’re announcing the preview release of ty, an extremely fast type checker and language server for Python, written in Rust. In early testing, it's 10x, 50x, even 100x faster than existing type checkers. (We've seen >600x speed-ups over Mypy in some real-world projects.)
📣 New paper! We observe that reasoning language models finetuned only on English data are capable of zero-shot cross-lingual reasoning through a "quote-and-think" pattern. However, this does not mean they reason the same way across all languages or in new domains. [1/N]
Excited to announce I'll be starting as an assistant professor at @TTIC_Connect for fall 2026! In the meantime, I'll be graduating and hanging around Ai2 in Seattle🏔️
Today we're excited to introduce Vy, our AI that sees and acts on your computer. At Vercept, our mission is to reinvent how humans use computers, enabling you to accomplish orders of magnitude more than what you can do today. Vy is a first glimpse at AI that sees and uses your…
14 Advanced Python Features blog.edward-li.com/tech/advanced-…
The university will not surrender its independence or relinquish its constitutional rights. Neither Harvard nor any other private university can allow itself to be taken over by the federal government. hrvd.me/ResearchFundin…
I joined @GoodfireAI a little over a month ago to do interpretability! I am really excited to extend my work beyond just LMs. I think interp has a lot to offer to e.g., scientific models. Understanding them might actually teach us something new about the world 🌎
Introducing API. A new era of agentic computer use begins today.
Why is interpretability, rather than winning the scaling race or banning China, the key to dominance in AI? Our answer to OSTP/NSF, w/ Goodfire's @banburismus_ Transluce's @cogconfluence MIT's @dhadfieldmenell resilience.baulab.info/docs/AI_Action… Here's why:🧵 ↘️