Luca Ambrogioni
@LucaAmb
Assoc. Prof. of Machine Learning. PI of Generative Memory Lab (@DondersInst). Statistical physics, generative diffusion, memory, and generalization.
Consistency Variational Autoencoders (CoVAE) follow naturally from β-VAEs. A family of β-VAEs (with increasing β) can be organized as a sequence of latent encodings with decreasing SNR. This implicit definition of a 'forward process' is used to define a consistency-style loss!
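The decreasing-SNR view can be sketched numerically. This is a minimal toy illustration under an assumed linear-Gaussian encoding, not CoVAE's actual implementation; the schedule, the trivial decoder `f`, and all names here are hypothetical.

```python
import numpy as np

# Toy sketch: a family of beta-VAE-like encodings ordered by decreasing SNR.
# Assume each beta induces a linear-Gaussian latent z_t = alpha_t * x + sigma_t * eps,
# so SNR_t = alpha_t^2 / sigma_t^2 shrinks as beta grows (an implicit forward process).

rng = np.random.default_rng(0)
x = rng.normal(size=(8, 4))            # toy data batch

T = 5                                   # number of noise levels (beta values)
alphas = np.linspace(1.0, 0.2, T)       # signal coefficient shrinks with beta
sigmas = np.linspace(0.1, 1.0, T)       # posterior noise grows with beta
snr = alphas**2 / sigmas**2             # monotonically decreasing SNR schedule

def encode(x, t, rng):
    """Sample a latent at noise level t of the implicit forward process."""
    eps = rng.normal(size=x.shape)
    return alphas[t] * x + sigmas[t] * eps

def consistency_loss(f, x, t, rng):
    """Consistency-style objective: the decoder f applied at adjacent
    noise levels should map latents to the same reconstruction."""
    z_t = encode(x, t, rng)
    z_s = encode(x, t - 1, rng)
    return np.mean((f(z_t, t) - f(z_s, t - 1)) ** 2)

# trivial stand-in 'decoder' that rescales by the signal coefficient
f = lambda z, t: z / alphas[t]
loss = consistency_loss(f, x, T - 1, rng)
```

In a real model, `f` would be a trained network and the loss would be minimized over all adjacent noise levels; here it only shows the structure of the objective.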

Transformers haven't changed much since 2017, but there have been some innovations over the years. This is an excellent summary of architectural differences in recent LLMs. Nice diagrams too! 👏 It would be great to see something like this for diffusion Transformers as well 🤔
From GPT to MoE: I reviewed & compared the main LLMs of 2025 in terms of their architectural design, from DeepSeek-V3 to Kimi K2: Multi-head Latent Attention, sliding window attention, new Post- & Pre-Norm placements, NoPE, shared-expert MoEs, and more... magazine.sebastianraschka.com/p/the-big-llm-…
Apply for the AITHYRA-CeMM International PhD Program! 15-20 fully funded PhD fellowships available in Vienna in AI/ML and Life Sciences. Deadline for applications: 10 September 2025. apply.cemm.at
Math Olympiads are a very easy benchmark for LLMs. There is tons of nearly identical training data available, and the problems have clear, unambiguous solutions that can be used for RL. Most problems can be solved largely by memory and trial-and-error. This doesn't generalize to real math.
You’ve heard of water turning into steam. But have you heard of hot gas turning into a black hole? Meet the Hawking–Page transition 🧵
📢 Excited to announce that GenMol is now open-sourced. GenMol: A Drug Discovery Generalist with Discrete Diffusion Paper: arxiv.org/abs/2501.06158 Code: github.com/NVIDIA-Digital…
🚀 GenMol is now open‑sourced: you can now train and finetune on your data! It uses masked diffusion + a fragment library to craft valid SAFE molecules, from de novo design to lead optimization. #GenMol #DrugDiscovery #Biopharma
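The masked-diffusion idea behind GenMol can be sketched as a toy sampler: start from a fully masked sequence and iteratively unmask positions. This is a hypothetical illustration, not GenMol's code; the tiny vocabulary and uniform token draws stand in for the real fragment library and the learned denoising model.

```python
import random

# Toy masked discrete diffusion sampling (illustration only, not GenMol's code):
# begin fully masked, then reveal one position per reverse step, filling it with
# a token from the vocabulary (in practice, sampled from a trained model).

MASK = "[MASK]"
VOCAB = ["C", "N", "O", "c1ccccc1"]     # hypothetical fragment vocabulary

def sample(length, steps, rng):
    seq = [MASK] * length
    for _ in range(steps):
        masked = [i for i, tok in enumerate(seq) if tok == MASK]
        if not masked:
            break
        i = rng.choice(masked)          # pick one masked position per step
        seq[i] = rng.choice(VOCAB)      # stand-in for the model's prediction
    return seq

rng = random.Random(0)
out = sample(length=6, steps=6, rng=rng)
```

Conditioning on a partial scaffold (for lead optimization rather than de novo design) would simply mean initializing `seq` with some positions already fixed.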
I’m building a new team at @GoogleDeepMind to work on Open-Ended Discovery! We’re looking for strong Research Scientists and Research Engineers to help us push the frontier of autonomously discovering novel artifacts such as new knowledge, capabilities, or algorithms, in an…
How to build a factual but creative system? It is a question surrounding memory and creativity in modern ML systems. My colleagues from @IBMResearch and @MITIBMLab are hosting the @MemVis_ICCV25 workshop at #ICCV2025, which explores the intersection between memory and generative…
Interesting approach! However, we looked at the proofs and methodology and found a few problems, specifically with the use of hints given to the model. While the scaffold indeed improves performance, it does not solve all problems accurately and would not get a gold medal.🧵
🚨 Olympiad math + AI: We ran Google’s Gemini 2.5 Pro on the fresh IMO 2025 problems. With careful prompting and pipeline design, it solved 5 out of 6 — remarkable for tasks demanding deep insight and creativity. The model could win gold! 🥇 #AI #Math #LLMs #IMO2025
🌞🌞🌞 The third Structured Probabilistic Inference and Generative Modeling (SPIGM) workshop is **back** this year with @NeurIPSConf in San Diego! In the era of foundation models, we focus on a natural question: is probabilistic inference still relevant? #NeurIPS2025
Class act from Google DeepMind. Much respect 🫡
Huge thanks to all my friends and advisors who helped me develop this work. Specifically, this paper would never have happened without @wellingmax's guidance. See the blog for an intro, and the paper for all the proofs! Blog: kempnerinstitute.harvard.edu/research/deepe… Code: github.com/akandykeller/F…
The problem comes in when you believe that demographic diversity is of such overriding importance that it requires suppression of the diversity of ideas, which is what is actually core to the scientific endeavor.
To retreat from diversity, equity, and inclusion as a core aspect of the scientific endeavor is to close the door to possibilities for better science and a better future.
I have moved to substack. This is my first post, based on a couple of threads I did recently. Link below.
Are you studying how structure shapes computation in the brain and in AI systems? 🧠 Come share your work in San Diego at NeurReps 2025! There is one month left until the submission deadline on August 22: neurreps.org/call-for-papers
We are hiring on the Veo team!📽️ Some people asked me about this at #ICML2025. If that's you, I will have told you to check deepmind.google/careers/ regularly. 👀It's just been updated: Europe (London, Zurich) job-boards.greenhouse.io/deepmind/jobs/… US (Mountain View) job-boards.greenhouse.io/deepmind/jobs/…
Want to be part of a team redefining SOTA for generative video models? Excited about building models that can reach billions of users? The Veo team is hiring! We are looking for amazing researchers and engineers, in North America and Europe. Details below:
Google DeepMind followed IMO rules to earn gold, unlike OpenAI
Can open-data models beat DINOv2? Today we release Franca, a fully open-sourced vision foundation model. Franca with a ViT-G backbone matches (and often beats) proprietary models like SigLIPv2, CLIP, and DINOv2 on various benchmarks, setting a new standard for open-source research🧵
thirty years ago my very first journal article got accepted without revision. i remember thinking "this publishing thing isn't that bad" - it’s been downhill ever since.
stealing the spotlight from kids just to hype yourselves is not a good look
🚨 According to a friend, the IMO asked AI companies not to steal the spotlight from kids and to wait a week after the closing ceremony to announce results. OpenAI announced the results BEFORE the closing ceremony. According to a Coordinator on Problem 6, the one problem OpenAI…
We are hiring! If you are interested in efficient architecture or making training and inference on thousands of GPUs much faster, please feel free to dm me or @WeizhuChen! We are doing RL on very large scales!
We’re open-sourcing the pre-training code for Phi4-mini-Flash, our SoTA hybrid model that delivers 10× faster reasoning than Transformers — along with μP++, a suite of simple yet powerful scaling laws for stable large-scale training. 🔗 github.com/microsoft/Arch… (1/4)