Sanjeev Arora
@prfsanjeevarora
Director, @PrincetonPLI and Professor @PrincetonCS. Seeks math/conceptual understanding of deep learning and large AI models. Also on the "other" social network
Really excited about the launch of this research initiative. Hiring Research Scientists now; Research Software Engineers and postdocs over the next few months. 300 H100 GPUs. Multidisciplinary teams. Princeton helps keep AI expertise in the open sphere. More: pli.princeton.edu
“The dramatic rise of AI capabilities…is a watershed event for humanity…It is also sure to transform research and teaching in every academic discipline.” – @prfsanjeevarora, director of the new @Princeton Language and Intelligence initiative. For more: pli.princeton.edu
I predict though that within the next year many other teams will achieve this milestone, and without using as much compute. Hoping Goedel Prover v3 from @PrincetonPLI will too.
Another AI system, ByteDance's SeedProver solved 4 out of 6 IMO problems *with* Lean, and solved a fifth with extended compute. This is becoming routine, like when we went to the moon for the fourth time. There is *nothing* "routine" about this!!...
Agree. Move away from open source will hurt US in the long run.
Kimi K2. More evidence that: • The lead of American “frontier” AI companies is rather small 🔬 • A broad ecosystem of strong foundation model companies is developing in China, with more players than in the US or elsewhere. 🐅 moonshotai.github.io/Kimi-K2/ github.com/MoonshotAI/Kim…
Also wanted to highlight the contributions made by amazing grad students and postdocs and collaborators. Especially @Yong18850571 @sangertang1999 ! 👏👋 Also, note that this is an AI model that is a **solver** of questions. It generates proofs, and is not a verifier of proofs.…
⏱️AI is making verification process easier, with models verifying proofs in minutes. 💻 Now, @prfsanjeevarora, @chijinML, @danqi_chen and @PrincetonPLI have released Goedel Prover V2, a model more efficient and more accurate than any previous model. 👉 blog.goedel-prover.com
Useful new SWE agent from @PrincetonPLI !
Releasing mini, a radically simple SWE-agent: 100 lines of code, 0 special tools, and gets 65% on SWE-bench verified! Made for benchmarking, fine-tuning, RL, or just for use from your terminal. It’s open source, simple to hack, and compatible with any LM! Link in 🧵
Analogous to what happened to some capabilities with the emergence of internet utilities in past decades (e.g., Google Search and Maps). Except now the effects happen across many other capabilities.
Is LLM use finally making me less capable? I started using LLMs three years ago for text and code gen. Now, I use several of them, for a ton more things. In fact, I feel like I use them for a huge fraction of the cognitive tasks that I perform that can be described in text.…
More impressive. But Lean provers have progressed a lot in the past 6-7 months, so that day isn't far off either blog.goedel-prover.com
Question: would it be more or less impressive if the IMO gold medals were achieved in Lean?
Everyone's talking about AI performance on the IMO. Let me highlight 🇨🇦Canadian 11th grader Warren Bei🇨🇦, one of five participants with a *perfect* 42/42. This is his *fifth* (and final) IMO representing Canada, with three golds and two silvers. (➡️ MIT undergrad in the fall)
Exactly. Thx
Prof Arora is simply pointing out an often-repeated move by skeptics of the current paradigm: "current capabilities are not evidence of complex abilities." But that's not the real claim. The real claim is that capabilities are arriving quickly. Imagine throwing an IMO problem at GPT-3.5.
Wish I were there! Let's catch up in Princeton
Just returned from ICML 2025 where I had the honor of keynoting three remarkable workshops. Grateful for the opportunity to delve into topics like self-evolving Alita agents, CRISPR-GPT for AI-driven science, Genome-Bench, reinforcement-learning agents, and AI biosafety. Special…
Agree!
Speculation: Within a year a <100B open weights model will also solve 5/6 IMO problems.
Congratulations on this milestone @demishassabis and GDM!
We achieved this year’s impressive result using an advanced version of Gemini Deep Think (an enhanced reasoning mode for complex problems). Our model operated end-to-end in natural language, producing rigorous mathematical proofs directly from the official problem descriptions –…
Completely misses the point. Nobody is suggesting that solving IMO problems is useful for math research. The point is that AI has become really good at complex reasoning, and is not just memorizing its training data. It can handle completely new IMO questions designed by a…
Quote of the day: I certainly don't agree that machines which can solve IMO problems will be useful for mathematicians doing research, in the same way that when I arrived in Cambridge UK as an undergraduate clutching my IMO gold medal I was in no position to help any of the…
A thoughtful analysis by @ErnestRyu but it is missing one key insight. To bring the AI model even to IMO gold level, one has to train it to generate new questions and then solve them. (There aren't enough human-generated questions for training.) This is a key idea in Deepmind's…
Two cents on AI getting International Math Olympiad (IMO) Gold, from a mathematician. Background: Last year, Google DeepMind (GDM) got Silver in IMO 2024. This year, OpenAI solved problems P1-P5 for IMO 2025 (but not P6), and this performance corresponds to Gold. (1/10)
Congratulations! Also thanks for making me win my bet with @JitendraMalikCV a year ahead of schedule.
1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
Thanks for your positive words. Numina's dataset release was a big enabler for this research area last year.
Impressive result! High performance with a low pass rate. Congrats to the Goedel Prover team
Next huge source of training data? Strong Indian students focus primarily on math/science. We should expect similar announcements from other AI actors with deep pockets.
Exciting news for students in India🇮🇳: get your free @GeminiApp Pro plan for 1 year! This gives you higher rate access to all our best models: 2.5 Pro, Veo 3, Deep Research, NotebookLM, and 2TB storage. Claim it at goo.gle/freepro - enjoy!
We’re proud that PLI students, post-docs, and faculty will be featuring over 20 papers at the @icmlconf in Vancouver this week! From safer AI agents to long-context reasoning and RL, we’re excited to showcase the cutting-edge research for you here: pli.princeton.edu/blog/2025/prin…
20+ papers (including several spotlights) from @PrincetonPLI being presented at ICML this week. pli.princeton.edu/blog/2025/prin…