Anian Ruoss
@anianruoss
Quantitative Developer at Quadrature Previously: Google DeepMind (Gemini Diffusion) | ETH Zurich
We're looking for people to join us to work on Gemini Diffusion and help revolutionize language modeling! Details below: job-boards.greenhouse.io/deepmind/jobs/…
Excited to share what my team has been working on lately - Gemini diffusion! We bring diffusion to language modeling, yielding more power and blazing speeds! 🚀🚀🚀 Gemini diffusion is especially strong at coding. In this example the model generates at 2000 tokens/sec,…
Thrilled to share a major step forward for AI for mathematical proof generation! We are releasing the Open Proof Corpus: the largest ever public collection of human-annotated LLM-generated math proofs, and a large-scale study over this dataset!
Very nice related paper that somehow flew under my radar. VLM/LLM playing simple games (see pic) without scaffold. But potentially with in-context demo or parsed (non-RGB) observation. Nothing works, ICL doesn't help, though o1 nails oxo and crosswords, and everyone can pathfind.
If you want to see how VLMs without scaffolding compare to a random baseline on gameplay, check out LMAct: arxiv.org/abs/2412.01441 🙂
Today we introduced Gemini Diffusion⚡️ (& DeepThink, Veo3, Imagen4, 2.5 updates...). It's been a dream of mine to remove the need for "left to right" text generation. It's so fast, that we had to *slow down* the video during the presentation. deepmind.google/models/gemini-…
The Gemini Diffusion release feels like a landmark moment. For text generation, autoregressive models have always outperformed diffusion models on quality, and it wasn't clear that the gap could ever be closed. The team behind this has stayed laser-focused, broken…
What can be, unburdened by what has been 😇
A similar one inspired by the 'Sparks of AGI' paper by @SebastienBubeck et al.: "How many primes are there between 150 and 250? The first thing you should output is the total number, then print the exact list inside [ ] brackets." (ans: 18) GPT-4o fails this one too:…
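The prompt's expected answer is easy to verify directly; a few lines of Python confirm there are indeed 18 primes between 150 and 250:

```python
# Count and list the primes in [150, 250] via trial division.
def is_prime(n: int) -> bool:
    if n < 2:
        return False
    for d in range(2, int(n**0.5) + 1):
        if n % d == 0:
            return False
    return True

primes = [n for n in range(150, 251) if is_prime(n)]
print(len(primes))  # 18
print(primes)       # [151, 157, ..., 241]
```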
Super excited to have been part of the incredible journey with our team, bringing this to you all the way from research idea to Google IO!
We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO
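Gemini Diffusion's internals aren't public, so purely as an illustration of the general idea the tweet describes (refining a fully noised sequence step-by-step instead of predicting tokens left to right), here is a toy masked-diffusion sketch. `toy_model`, its random confidence scores, and the tiny vocabulary are all stand-ins for a real trained denoiser:

```python
import random

MASK = "<mask>"
VOCAB = ["def", "add", "(", "a", ",", "b", ")", ":", "return", "+"]

def toy_model(tokens):
    # Stand-in for a real denoiser: propose a (token, confidence)
    # guess for every still-masked position. A real model would
    # condition on the prompt and the partially denoised sequence.
    return {i: (random.choice(VOCAB), random.random())
            for i, t in enumerate(tokens) if t == MASK}

def denoise(length=8, steps=4, seed=0):
    random.seed(seed)
    tokens = [MASK] * length           # start from pure "noise"
    per_step = max(1, length // steps)
    while MASK in tokens:
        guesses = toy_model(tokens)
        # Commit only the most confident positions this step,
        # leaving the rest masked for later refinement passes.
        best = sorted(guesses, key=lambda i: guesses[i][1], reverse=True)
        for i in best[:per_step]:
            tokens[i] = guesses[i][0]
    return tokens

print(denoise())
```

Because whole groups of positions are filled in parallel each step, this style of decoding is what makes the very high tokens/sec figures possible.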
🔥 Gemini Diffusion is blazing fast 🔥 Honored to have been part of this amazing team!
Come chat with @anianruoss @bonniesjli and me at our LMAct poster at the #ICLR25 workshop on Reasoning and Planning for LLMs (Garnet 212-213) to find out whether frontier models imitate expert behaviour purely in context!
LMs see, can LMs do? LMAct benchmarks current SOTA foundation models' ability to act in text/visual environments using text as low-level actions in many domains using in-context expert (multimodal) demonstrations. We're excited to see how this benchmark drives further progress!
We provide first insights into why prompting is hard. The training distribution matters a lot: if we don't know it (as with large language datasets), prompting is like shooting in the dark. Our results on prediction and in-context RL are intriguing! 1/n arxiv.org/pdf/2502.10760
Results of the second part of AIME 2025 are live on matharena.ai: Another convincing win for @openai's o3-mini 🥇 Great work by the entire MathArena team: @j_dekoninck, @ni_jovanovic and @IvoPetrov01!
We finally have an answer to the debate over whether LLMs generalize to new math problems or merely memorize the answers. We evaluated them on the AIME 2025 I competition from *yesterday*, and the results are good!
Check out the recent work by @anianruoss Eg openreview.net/forum?id=Xlpip…