Melvin Johnson
@melvinjohnsonp
Researcher @ Google Research. Multilingual NLP and MT. Previously, Stanford CS.
This is very impressive. Thanks for doing the deep thinking for Gemini 2.5 Pro 🚀🚀
🚨 Olympiad math + AI: We ran Google’s Gemini 2.5 Pro on the fresh IMO 2025 problems. With careful prompting and pipeline design, it solved 5 out of 6 — remarkable for tasks demanding deep insight and creativity. The model could win gold! 🥇 #AI #Math #LLMs #IMO2025
Excited to share that a scaled-up version of Gemini DeepThink achieves gold-medal standard at the International Mathematical Olympiad. This result is official, and certified by the IMO organizers. Watch this space, more to come soon! deepmind.google/discover/blog/…
Welcome to our new GDM colleagues. Very excited to make Gemini work even better on coding!
Very excited to share that @windsurf_ai co-founders @_mohansolo & Douglas Chen, and some of their talented team, have joined @GoogleDeepMind to help advance our work in agentic coding in Gemini. Welcome to our new teammates from Windsurf! theverge.com/openai/705999/…
The Gemini 2.5 Pro and Flash models are stable now. We want to thank you all for the feedback and excitement around these two models and can't wait to see what you'll build with them. And we have a new child in the family: 2.5 Flash-Lite, a low-latency, cost-sensitive alternative.
Introducing the Gemini 2.5 model family: - Gemini 2.5 Pro (Stable, no changes from 06-05) - Gemini 2.5 Flash (Stable, updated pricing from 05-20) - Gemini 2.5 Flash-Lite (Preview, small reasoning model) More info in 🧵
Join us live to learn about the Gemini 2.5 model family ♊️
Join us TODAY at 10:30am PT for our live space where we'll talk about today's 2.5 launches and all things Gemini models with the team @OfficialLoganK @TulseeDoshi @ancadianadragan @melvinjohnsonp @ZacharyGleicher. Drop your questions below! x.com/i/spaces/1vAGR…
You go down a little only to come back stronger. Lots of progress ahead from us. Excited for the relentless model improvements coming.
Google’s updated Gemini 2.5 Pro now leads the AI intelligence frontier, matching OpenAI's o3 in our independent benchmarks Google’s May update of Gemini 2.5 Pro regressed in some performance evaluations compared to the initial March release. This June update not only fixes…
Not to forget #1 on LMArena across the board: x.com/lmarena_ai/sta…
Our latest update to Gemini 2.5 Pro is here. It's SoTA on GPQA Diamond, AIDER and HLE. The team has also worked hard to improve the model on style, persona and creativity. We're excited to see what you build with it. Please let us know any feedback as we're eternally cooking.
Our latest Gemini 2.5 Pro update is now in preview. It’s better at coding, reasoning, science + math, shows improved performance across key benchmarks (AIDER Polyglot, GPQA, HLE to name a few), and leads @lmarena_ai with a 24pt Elo score jump since the previous version. We also…
We’re excited to show Deep Think 2.5 Pro, which pushes further on inference-time scaling to achieve much better performance on hard math and coding tasks. The team worked hard, burning the midnight oil, to get this one out at I/O.
Deep Think in 2.5 Pro has landed. 🤯 It’s a new enhanced reasoning mode using our research in parallel thinking techniques - meaning it explores multiple hypotheses before responding. This enables it to handle incredibly complex math and coding problems more effectively.
An LMArena clean sweep by the new Gemini-2.5-Pro 🥇🏆
🚨Breaking: @GoogleDeepMind’s latest Gemini-2.5-Pro is now ranked #1 across all LMArena leaderboards 🏆 Highlights: - #1 in all text arenas (Coding, Style Control, Creative Writing, etc) - #1 on the Vision leaderboard with a ~70 pts lead! - #1 on WebDev Arena, surpassing Claude…