Denny Zhou
@denny_zhou
Founded & lead the Reasoning Team in Google Brain (now part of Google DeepMind). Build LLMs to reason. Opinions my own.
Slides for my lecture “LLM Reasoning” at Stanford CS 25: dennyzhou.github.io/LLM-Reasoning-… Key points: 1. Reasoning in LLMs simply means generating a sequence of intermediate tokens before producing the final answer. Whether this resembles human reasoning is irrelevant. The crucial…
"Gemini solved the math problems end-to-end in natural language (English)."
An advanced version of Gemini with Deep Think has officially achieved gold medal-level performance at the International Mathematical Olympiad. 🥇 It solved 5️⃣ out of 6️⃣ exceptionally difficult problems, involving algebra, combinatorics, geometry and number theory. Here’s how 🧵
LLMs for IMO 2025: gemini-2.5-pro (31.55%), o3 high (16.67%), Grok 4 (11.90%). matharena.ai

AI now beats every single human in the hardest college entrance exam in India, the IIT JEE. Bytedance silently published this result this week. The top scorer was Rajit Gupta with 332/360, but Google's Gemini 2.5 Pro was at rank 1 with 336/360.
Say hello to the @geminicli, a local CLI to help you build and maintain software with 1,000 free Gemini 2.5 Pro requests per day : )
Scientific progress comes from scaling the right dimension. Scaling all dimensions is just wasteful engineering.
Reasoning is far from being solved. Looking forward to seeing new breakthroughs.
Terence Tao says today's AIs pass the eye test -- but fail miserably on the smell test. They generate proofs that look flawless. But the mistakes are subtle, and strangely inhuman. “There's a metaphorical mathematical smell.. it's not clear how to get AI to duplicate that.”…
Cannot have a better proof on data’s importance
Meta taking a 49% stake in Scale AI for $14.8B, investors and employees get paid. Meta prob wanted to acquire them but didn’t want to go through regulatory approval so found this roundabout scheme. Alexandr will run Meta’s superintelligence lab, and Scale will find a new ceo
Our latest Gemini 2.5 Pro update is now in preview. It’s better at coding, reasoning, science + math, shows improved performance across key benchmarks (AIDER Polyglot, GPQA, HLE to name a few), and leads @lmarena_ai with a 24pt Elo score jump since the previous version. We also…
🚨 This was the BEST Google I/O that I can remember. Google launched over 12 different insane things. Here is every single one of the launches and the best tweets about them: 1/12
At #GoogleIO, we shared how decades of AI research have now become reality. From a total reimagining of Search to Agent Mode, Veo 3 and more, Gemini season will be the most exciting era of AI yet. Some highlights 🧵
Deep Think in 2.5 Pro has landed. 🤯 It’s a new enhanced reasoning mode using our research in parallel thinking techniques - meaning it explores multiple hypotheses before responding. This enables it to handle incredibly complex math and coding problems more effectively.
What a finish! Gemini 2.5 Pro just completed Pokémon Blue!  Special thanks to @TheCodeOfJoel for creating and running the livestream, and to everyone who cheered Gem on along the way.