Richard Song (@XingyouSong)

Pinned


Richard Song@XingyouSong · Jun 30

Seeing text-to-text regression work for Google’s massive compute cluster (billion $$ problem!) was the final result to convince us we can reward model literally any world feedback. Paper: arxiv.org/abs/2506.21718 Code: github.com/google-deepmin… Just train a simple encoder-decoder…




Richard Song@XingyouSong · Jul 22

From being a kid passionate about IMO problems to now helping lead the effort at Google DeepMind to get an AI to that same level—what a journey. Thanks to my brilliant coworkers & the IMO board. Excited to see how AI will push the frontiers of science for humanity.

Google DeepMind@GoogleDeepMind · Jul 21

An advanced version of Gemini with Deep Think has officially achieved gold medal-level performance at the International Mathematical Olympiad. 🥇 It solved 5️⃣ out of 6️⃣ exceptionally difficult problems, involving algebra, combinatorics, geometry and number theory. Here’s how 🧵


Richard Song@XingyouSong · Jul 22

We did it! I'm super proud to have been part of this along with many amazing colleagues. Last year was a breakthrough for formal methods. This year we showed what Gemini is capable of. I'm looking forward to seeing what happens when we put the two of them together.

Google DeepMind@GoogleDeepMind · Jul 21

An advanced version of Gemini with Deep Think has officially achieved gold medal-level performance at the International Mathematical Olympiad. 🥇 It solved 5️⃣ out of 6️⃣ exceptionally difficult problems, involving algebra, combinatorics, geometry and number theory. Here’s how 🧵


Richard Song Retweeted


Archit Sharma@archit_sharma97 · Jul 21

I have been waiting for this to be announced, it’s so amazing to see such elegant scaling of the Deep Think system where the same system can now achieve a gold at IMO! deepmind.google/discover/blog/…


Richard Song@XingyouSong · Jul 22

Legendary inference magic from @jon_lee0 :)

Jonathan Lee@jon_lee0 · Jul 21

I’m excited to share the news of Gemini Deep Think’s gold-medal level performance 🥇 at the International Math Olympiad! It has been an absolute blast building Deep Think this year and then scaling it to the IMO.


Richard Song@XingyouSong · Jul 22

Willing to bet we’ll all converge on the same approach for long-form answer reasoning. See you next year :)

Noam Brown@polynoamial · Jul 21

Congrats to the GDM team on their IMO result! I think their parallel success highlights how fast AI progress is. Their approach was a bit different than ours, but I think that shows there are many research directions for further progress. Some thoughts on our model and results 🧵


Richard Song@XingyouSong · Jul 21

Let's climb the next mountain: arXiv-level math + theory research?!

Andreas Kirsch 🇺🇦@BlackHC · Jul 21

A personal shout-out: Special personal thanks to @freeetext, @LeiYu63, and @_wilcoln for the best onboarding one could have ever imagined and for making the work so enjoyable! Huge thanks also to @zuza777, @FredZhang0, @XingyouSong, and @fpedregosa 🤗 The last weeks have been an…


Richard Song@XingyouSong · Jul 21

💯

Vinay Ramasesh@vinayramasesh · Jul 21

It's worth noting that a DeepThink system with no access to this corpus also got gold (again according to the official graders), with exactly the same score.


Richard Song Retweeted


Vahab Mirrokni@mirrokni · Jul 21

Proud to announce an official Gold Medal at #IMO2025🥇 The IMO committee has certified the result from our general-purpose Gemini system—a landmark moment for our team and for the future of AI reasoning. deepmind.google/discover/blog/… (1/n) Highlights in thread:


Richard Song@XingyouSong · Jul 21

So much easier than the OAI's coordinate bash to P2: github.com/aw31/openai-im…

Jason Lee@jasondeanlee · Jul 21

At least the GDM imo proofs are readable!


Richard Song@XingyouSong · Jul 21

"Gemini solved the math problems end-to-end in natural language (English)."

Google DeepMind@GoogleDeepMind · Jul 21

An advanced version of Gemini with Deep Think has officially achieved gold medal-level performance at the International Mathematical Olympiad. 🥇 It solved 5️⃣ out of 6️⃣ exceptionally difficult problems, involving algebra, combinatorics, geometry and number theory. Here’s how 🧵


Richard Song@XingyouSong · Jul 21

Gemini + Deep Think won IMO gold this year 🏅 super honored to be part of this dream team!

Google DeepMind@GoogleDeepMind · Jul 21

An advanced version of Gemini with Deep Think has officially achieved gold medal-level performance at the International Mathematical Olympiad. 🥇 It solved 5️⃣ out of 6️⃣ exceptionally difficult problems, involving algebra, combinatorics, geometry and number theory. Here’s how 🧵


Richard Song Retweeted


Google DeepMind@GoogleDeepMind · Jul 21

An advanced version of Gemini with Deep Think has officially achieved gold medal-level performance at the International Mathematical Olympiad. 🥇 It solved 5️⃣ out of 6️⃣ exceptionally difficult problems, involving algebra, combinatorics, geometry and number theory. Here’s how 🧵


Richard Song Retweeted


Yuchen Jin@Yuchenj_UW · Jul 21

This wins my respect.


Richard Song Retweeted


Ankesh Anand@ankesh_anand · Jul 21

We can finally share this now: A Gemini model trained with new RL techniques and scaled up inference-time compute model has achieved gold-medal level performance at IMO 2025! 🥇


Richard Song Retweeted


Demis Hassabis@demishassabis · Jul 21

Official results are in - Gemini achieved gold-medal level in the International Mathematical Olympiad! 🏆 An advanced version was able to solve 5 out of 6 problems. Incredible progress - huge congrats to @lmthang and the team! deepmind.google/discover/blog/…


Richard Song Retweeted


Quoc Le@quocleix · Jul 21

Excited to share that a scaled-up version of Gemini DeepThink achieves gold-medal standard at the International Mathematical Olympiad. This result is official, and certified by the IMO organizers. Watch this space, more to come soon! deepmind.google/discover/blog/…


Richard Song@XingyouSong · Jul 21

Very excited to share that an advanced version of Gemini Deep Think is the first to have achieved gold-medal level in the International Mathematical Olympiad! 🏆, solving five out of six problems perfectly, as verified by the IMO organizers! It’s been a wild run to lead this…

Thang Luong@lmthang · Jul 25

Super thrilled to share that our AI has now reached silver medalist level in Math at #imo2024 (1 point away from 🥇)! Since Jan, we now not only have a much stronger version of #AlphaGeometry, but also an entirely new system called #AlphaProof, capable of solving many more…


Richard Song@XingyouSong · Jul 20

Yes, there is an official marking guideline from the IMO organizers which is not available externally. Without the evaluation based on that guideline, no medal claim can be made. With one point deducted, it is a Silver, not Gold.

Mikhail Samin@Mihonarium · Jul 20

🚨 According to a friend, the IMO asked AI companies not to steal the spotlight from kids and to wait a week after the closing ceremony to announce results. OpenAI announced the results BEFORE the closing ceremony. According to a Coordinator on Problem 6, the one problem OpenAI…


Richard Song@XingyouSong · Jul 8

Foundation modeling the real world is getting to be a hot topic - not just for videos but physical systems too!

Tung Nguyen@tungnd_13 · Jul 8

🚀 Introducing PhysiX: One of the first large-scale foundation models for physics simulations! PhysiX is a 4.5B parameter model that unifies a wide range of physical systems, from fluid dynamics to reaction-diffusion, outperforming specialized, state-of-the-art models.


Richard Song@XingyouSong · Jun 30

Thanks so much for the repost, @_akhaliq!!

AK@_akhaliq · Jun 30

Google presents Performance Prediction for Large Systems via Text-to-Text Regression
