Sander Dieleman
@sedielem
Research Scientist at Google DeepMind (WaveNet, Imagen, Veo). I tweet about deep learning (research + software), music, generative models (personal account).
New blog post: let's talk about latents! sander.ai/2025/04/15/lat…
We just discovered the 🔥 COOLEST 🔥 trick in Flow that we have to share: Instead of wordsmithing the perfect prompt, you can just... draw it. Take the image of your scene, doodle what you'd like on it (through any editing app), and then briefly describe what needs to happen…
We are hiring on the Veo team! 📽️ Some people asked me about this at #ICML2025. If that was you, I will have told you to check deepmind.google/careers/ regularly. 👀 It's just been updated: Europe (London, Zurich) job-boards.greenhouse.io/deepmind/jobs/… US (Mountain View) job-boards.greenhouse.io/deepmind/jobs/…
Want to be part of a team redefining SOTA for generative video models? Excited about building models that can reach billions of users? The Veo team is hiring! We are looking for amazing researchers and engineers, in North America and Europe. Details below:
What a wonderful event to let researchers sit and chat about interesting ideas! LOVE the Diffusion Circle!
We are sitting all the way at the back of the conference center (west building)!
Mission accomplished! Thank you so much @sedielem for organizing the (diffuse) diffusion circle. I met so many new people and learned a lot too!
Hello #ICML2025👋, anyone up for a diffusion circle? We'll just sit down somewhere and talk shop. 🕒Join us at 3PM on Thursday July 17. We'll meet here (see photo, near the west building's west entrance), and venture out from there to find a good spot to sit. Tell your friends!
On the way to #ICML2025! ✈️🇨🇦 Come find me and let's talk about: - diffusion models 😐😶🌫️🫥 - generative media 🖼️🎞️🔊 - what the topic of my next blog post should be 🤔💡✍️ Join us at the ML for audio workshop on Saturday! mlforaudioworkshop.github.io

Tokenization is just a special case of "chunking" - building low-level data into high-level abstractions - which is in turn fundamental to intelligence. Our new architecture, which enables hierarchical *dynamic chunking*, is not only tokenizer-free, but simply scales better.
Tokenization has been the final barrier to truly end-to-end language models. We developed the H-Net: a hierarchical network that replaces tokenization with a dynamic chunking process directly inside the model, automatically discovering and operating over meaningful units of data
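To make the chunking idea concrete, here is a toy sketch of what dynamic chunking could look like. This is not the H-Net routing module from the paper, just an illustrative heuristic: adjacent hidden states that look dissimilar start a new chunk, and each chunk is pooled into one higher-level vector. The function name and threshold are made up for illustration.

```python
import numpy as np

def dynamic_chunk(h, threshold=0.5):
    """Toy dynamic chunking (illustrative only, not the H-Net mechanism).

    h: (seq_len, d) low-level hidden states (e.g. byte embeddings).
    A boundary is placed wherever neighbouring states are dissimilar;
    each resulting span is mean-pooled into a single chunk embedding.
    """
    h_norm = h / (np.linalg.norm(h, axis=1, keepdims=True) + 1e-8)
    sim = np.sum(h_norm[1:] * h_norm[:-1], axis=1)        # cosine similarity of neighbours
    boundary = np.concatenate([[True], sim < threshold])  # position 0 always opens a chunk
    chunk_id = np.cumsum(boundary) - 1                    # map each position to its chunk
    n_chunks = int(chunk_id[-1]) + 1
    chunks = np.stack([h[chunk_id == i].mean(axis=0) for i in range(n_chunks)])
    return chunks, chunk_id
```

Unlike a fixed tokenizer, the number of chunks adapts to the content (more boundaries where the signal changes quickly), and the chunk embeddings would then feed the higher level of the hierarchy.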
Excellent blog post by @_albertgu about Transformers, SSMs and the role of tokenisation. Well worth a read. goombalab.github.io/blog/2025/trad…
I converted one of my favorite talks I've given over the past year into a blog post. "On the Tradeoffs of SSMs and Transformers" (or: tokens are bullshit) In a few days, we'll release what I believe is the next major advance for architectures.
🔥Happy to announce that the AI for Music Workshop is coming to #NeurIPS2025! We have an amazing lineup of speakers! We call for papers & demos (due on August 22)! See you in San Diego!🏖️ @chrisdonahuey @Ilaria__Manco @zawazaw @huangcza @McAuleyLabUCSD @zacknovack @NeurIPSConf
Roll call: #ICML2025 diffusion circle 📢 Who's coming? Please tag people that might be interested! Date/time TBD, probably Thursday afternoon. (Beware though👇 joining a diffusion circle is at your own risk!🫣)
We are sitting on the floor outside room 301 (west side)
Diffusion models have analytical solutions, but they involve sums over the entire training set, and they don't generalise at all. They are mainly useful to help us understand how practical diffusion models generalise. Nice blog + code by Raymond Fan: rfangit.github.io/blog/2025/opti…
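For concreteness, here is what that analytical solution looks like for a finite training set, assuming a variance-exploding forward process x_t = x_0 + σ·ε. This is a minimal NumPy sketch rather than the code from the linked post, and the variable names are illustrative: the optimal (MMSE) denoiser is a softmax-weighted average of the training examples.

```python
import numpy as np

def optimal_denoiser(x_t, train_x, sigma):
    """Posterior mean E[x_0 | x_t] when the data distribution is a sum of
    Diracs on the training points (variance-exploding noise, x_t = x_0 + sigma * eps).

    x_t:     (d,)   noisy input
    train_x: (n, d) training set
    sigma:   float  noise level
    """
    sq_dists = np.sum((train_x - x_t) ** 2, axis=1)   # (n,)
    log_w = -sq_dists / (2.0 * sigma ** 2)            # log N(x_t; x_i, sigma^2 I), up to a constant
    w = np.exp(log_w - log_w.max())
    w /= w.sum()                                      # softmax over training points
    return w @ train_x                                # weighted average of training examples
```

The corresponding score is (optimal_denoiser(x_t, train_x, sigma) - x_t) / sigma**2; sampling with it only ever retrieves (blends of) training points, which is why this solution is a tool for studying generalisation rather than a way of achieving it.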

This looks like a great deep dive on neural network architectures for diffusion models. tl;dr use a Transformer, but there's quite a bit more to it, and as always in this field, the devil is in the details!
Had the honor to present diffusion transformers at CS25, Stanford. The place is truly magical. Slides: bit.ly/dit-cs25 Recording: youtu.be/vXtapCFctTI?si… Thanks to @stevenyfeng for making it happen!
now wouldn't that be something...
Let me play a video game of my veo 3 videos already. Google cooked so good 👌 @OfficialLoganK playable world models wen?
High quality image editing no longer needs closed models. We release FLUX.1 Kontext [dev] - an open weights model for proprietary-level image editing performance. Runs on consumer chips. ✓ Open weights available ✓ Best in-class performance ✓ Self-serve commercial licensing
Excited to announce 🎵Magenta RealTime, the first open weights music generation model capable of real-time audio generation with real-time control. 👋 **Try Magenta RT on Colab TPUs**: colab.research.google.com/github/magenta… 👀 Blog post: g.co/magenta/rt 🧵 below
This work uncovers a profound connection between continuous and discrete (non-absorbing) diffusion models, allowing transfer of advanced techniques such as consistency distillation to the discrete setting! Also: amazing title, no notes! 🧑‍🍳😙🤌
🚨 “The Diffusion Duality” is out! @ICML2025 ⚡️ Few-step generation in discrete diffusion language models by exploiting the underlying Gaussian diffusion. 🦾Beats AR on 3/7 zero-shot likelihood benchmarks. 📄 Paper: arxiv.org/abs/2506.10892 💻 Code: github.com/s-sahoo/duo 🧠…