Sitan Chen
@sitanch
assistant professor of computer science @hseas, learning theorist, 🎹
Excited about this new work where we dig into the role of token order in masked diffusions! MDMs train on some horribly hard tasks, but careful planning at inference can sidestep the hardest ones, dramatically improving over vanilla MDM sampling (e.g. 7%->90% acc on Sudoku) 1/
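Since code doesn't fit in the thread, here is a rough sketch of what "careful planning at inference" can look like: instead of unmasking positions in a uniformly random order (vanilla MDM sampling), unmask greedily wherever the model is most confident. Everything below (the function name, the `model(x)` interface returning per-position logits) is an illustrative assumption, not the paper's code.

```python
import torch

def adaptive_mdm_sample(model, seq_len, mask_id):
    # Illustrative sketch, not the paper's code: `model(x)` is assumed
    # to return per-position logits of shape (seq_len, vocab_size).
    x = torch.full((seq_len,), mask_id, dtype=torch.long)
    for _ in range(seq_len):
        probs = model(x).softmax(dim=-1)
        conf, tok = probs.max(dim=-1)     # most likely token per position
        conf[x != mask_id] = -1.0         # only consider still-masked slots
        # Vanilla sampling unmasks a uniformly random masked position;
        # planning instead commits wherever the model is most confident,
        # deferring the genuinely hard decisions (e.g. ambiguous Sudoku cells).
        i = conf.argmax()
        x[i] = tok[i]
    return x
```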

Our sixth short plenary is a merged talk about stabilizer bootstrapping and learning the closest product state! (papers: arxiv.org/abs/2408.06967, arxiv.org/pdf/2411.04283). Overview below. @sitanch @gong_weiyuan @AineshBakshi @JohnBostanci @ewintang @jerryzli @BooleanAnalysis
Intro blog post for CS 2881: AI Safety. windowsontheory.org/2025/07/20/ai-…
A team from #KempnerInstitute, @hseas & @UTCompSci has won a best paper award at #ICML2025 for work unlocking the potential of masked diffusion models. Congrats to @Jaeyeon_Kim_0, @shahkulin98, Vasilis Kontonis, @ShamKakade6 and @sitanch. kempnerinstitute.harvard.edu/news/kempner-i… #AI
A bit of a belated announcement 😅 but I’ll be at ICML today presenting S4S, which learns a solver for few-NFE diffusion sampling in <1 hour on 1 A100! 📍East Exhibition Hall, E-3210, 11:00 - 1:30. Looking forward to chatting more about all things diffusion! #ICML2025
Want to quickly sample high-quality images from diffusion models, but can’t afford the time or compute to distill them? Introducing S4S, or Solving for the Solver, which learns the coefficients and discretization steps for a DM solver to improve few-NFE generation. Thread 👇 1/
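In the spirit of the thread, here is a toy version of the idea as I read it: make the solver itself learnable. Parameterize a k-step sampler by mixing coefficients over past denoiser evaluations and by its time discretization, then fit those parameters so the cheap sampler matches a slow high-NFE teacher. All names and shapes below are assumptions for illustration, not the S4S implementation.

```python
import torch

class LearnedSolver(torch.nn.Module):
    # Toy "solve for the solver" sketch (names/shapes are assumptions,
    # not the S4S implementation).
    def __init__(self, num_steps):
        super().__init__()
        # mixing weights over current + past denoiser evals (multistep-style);
        # identity init recovers a plain Euler-like update
        self.coeffs = torch.nn.Parameter(torch.eye(num_steps))
        # unconstrained params mapped to a monotone time discretization
        self.raw_t = torch.nn.Parameter(torch.zeros(num_steps))

    def timesteps(self, t_max=1.0, t_min=1e-3):
        w = self.raw_t.softmax(0).cumsum(0)   # increasing, ends at 1
        return t_max - (t_max - t_min) * w    # learned decreasing schedule

    def sample(self, denoiser, x):
        history, t_prev = [], x.new_tensor(1.0)
        for i, t in enumerate(self.timesteps()):
            history.append(denoiser(x, t_prev))
            # learned linear combination of all evaluations so far
            d = sum(self.coeffs[i, j] * h for j, h in enumerate(history))
            x = x + (t - t_prev) * d          # Euler-style update, learned d
            t_prev = t
        return x

# Training idea: minimize ||solver.sample(denoiser, x_T) - teacher(x_T)||^2
# over a few noise draws x_T, where `teacher` is a slow many-NFE sampler.
```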
Thrilled to share that our work received the Outstanding Paper Award at ICML! I will be giving the oral presentation on Tuesday at 4:15 PM. @Jaeyeon_Kim_0 and I both will be at the poster session shortly after the oral presentation. Please attend if possible!
Ecstatic to present an oral paper at ICML this year!!🎉 📚 “Blink of an Eye: a simple theory for feature localization in generative models” 🔗 arxiv.org/abs/2502.00921 Catch me at the poster session right after! See you there! 🚀
Excited to share that I’ll be presenting two oral papers at this ICML. See you in Vancouver!!🇨🇦 1️⃣ arxiv.org/abs/2502.06768 Understanding Masked Diffusion Models theoretically/scientifically 2️⃣ arxiv.org/abs/2502.09376 A theoretical analysis of LoRA training
Nice thread by Aayush on our new work on diffusion reward guidance! Was quite surprised how well this worked and how simple the algorithm is. Also happy that we finally managed to prove some rigorous guarantees for DPS (diffusion posterior sampling)
Steering diffusion models with external rewards has recently led to exciting results, but what happens when the reward is inherently difficult? Introducing ReGuidance: a simple algorithm to (provably!) boost your favorite guidance method on hard problems! 🚀🚀🚀 A thread: (1/n)
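For context on the DPS baseline mentioned above: at each step, DPS forms the posterior-mean estimate of the clean sample via Tweedie's formula and nudges the iterate down the gradient of the measurement error. A minimal sketch under a variance-exploding parameterization, with the `score_model` and forward-operator interfaces assumed:

```python
import torch

def dps_step(x_t, t, score_model, uncond_update, A, y, sigma_t, zeta):
    # One DPS-style guidance step (sketch; interfaces here are assumptions).
    # `uncond_update` is whatever unconditional sampler step you already use.
    x_t = x_t.detach().requires_grad_(True)
    # Tweedie's formula: posterior mean of x0 given x_t (VE scaling)
    x0_hat = x_t + sigma_t**2 * score_model(x_t, t)
    # measurement-consistency loss ||y - A(x0_hat)||^2, differentiated
    # through the score network
    loss = (y - A(x0_hat)).pow(2).sum()
    grad = torch.autograd.grad(loss, x_t)[0]
    # usual unconditional step, then a pull toward data consistency
    return uncond_update(x_t.detach(), t) - zeta * grad
```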
Bright spot amidst the chaos: my undergrad advisees @marvin_li03 and Kerem Dayi won the Hoopes Prize! Marvin also got a Fay Prize for top 3 Harvard theses + ICML spotlight. Kerem, a CRA finalist, will present his work at COLT and start a PhD at MIT this fall. Very proud of them🥲

Circumventing recent no-go theorems, this work shows that a logarithmic amount of #quantum memory is sufficient to give exponential advantages in the task of learning the structure of Pauli noise channels. @sitanch @gong_weiyuan Check it out: go.aps.org/3ROvLfT
Check out Eric's thread on our new lightweight method to learn an optimal solver for any diffusion model! High-quality generation in 5 NFEs, universal gains over previous training-free methods essentially "for free" (<1 hr on 1 A100). Kudos to @esfrankel for the amazing work!
🚀 Very excited to share our new work on understanding the benefits/drawbacks of training/inference in Masked Diffusion Models (MDMs) with amazing collaborators! 📜 Paper: arxiv.org/pdf/2502.06768 1/
Check out my first work at Harvard! We deepen our understanding of Masked Diffusion Models, decomposing them into ‘training’ (Section 3) and ‘inference’ (Section 4). Special thanks to my amazing collaborators @shahkulin98, Vasilis, @ShamKakade6, and @sitanch!!! arxiv.org/pdf/2502.06768