Sarthak Mittal
@sarthmit
Graduate Student at @Mila_Quebec and Visiting Researcher @Meta. Prior Research Intern at @Apple, @MorganStanley, @NVIDIAAI and @YorkUniversity
🚀 New Preprint! 🚀 In-Context Parametric Inference: Point or Distribution Estimators? Thrilled to share our work on inferring probabilistic model parameters explicitly conditioned on data, in collab with @Yoshua_Bengio, @FelineAutomaton & @g_lajoie_! 🔗arxiv.org/abs/2502.11617
Explicit latents or implicit marginalization? At #ICML2025 📌 Tue, 11 am 📍East Exhibition Hall A-B (E-1603) Come check out surprising results on whether explicitly incentivizing the learning of correct latents improves generalization over implicitly marginalizing them!

🎉Personal update: I'm thrilled to announce that I'm joining Imperial College London @imperialcollege as an Assistant Professor of Computing @ICComputing starting January 2026. My future lab and I will continue to work on building better Generative Models 🤖, the hardest…
⛵️ Excited to share 𝚂𝙰𝙸𝙻𝙾𝚁: a method for *learning to search* with learned world + reward models to plan in the latent space at test-time. Unlike behavior cloning, 𝚂𝙰𝙸𝙻𝙾𝚁 recovers from mistakes without any additional data, DAgger corrections, or ground truth rewards.
Say ahoy to 𝚂𝙰𝙸𝙻𝙾𝚁⛵: a new paradigm of *learning to search* from demonstrations, enabling test-time reasoning about how to recover from mistakes w/o any additional human feedback! 𝚂𝙰𝙸𝙻𝙾𝚁 ⛵ outperforms Diffusion Policies trained via behavioral cloning on 5-10x the data!
Physics says it's fine to be lazy New preprint on minimum-excess-work guidance: arxiv.org/abs/2505.13375 Check out the thread below 👇
New preprint alert 🚨 How can you guide diffusion and flow-based generative models when data is scarce but you have domain knowledge? We introduce Minimum Excess Work, a physics-inspired method for efficiently integrating sparse constraints. Thread below 👇arxiv.org/abs/2505.13375
A great collab with former labmates @AntChen_ & Dongyan! Interesting cognitive limitation in LMs: strong disjunctive bias leads to poor performance on conjunctive causal inference tasks. Mirrors adult human biases—possibly a byproduct of training data priors.
Language model (LM) agents are all the rage now—but they may have cognitive biases when inferring causal relationships! We evaluate LMs on psychology tasks and find: - LMs struggle with certain simple causal relationships - They show biases similar to human adults (but not children) 🧵⬇️
Happy to share that Compositional Risk Minimization has been accepted at #ICML2025 📌Extensive theoretical analysis along with a practical approach for extrapolating classifiers to novel compositions! 📜 arxiv.org/abs/2410.06303
that’s a wrap for #AI4Mat and #FPIWorkshop at #ICLR2025 🎁 thanks for coming, off to sleep for the next 2 weeks 😴 #FPIWorkshop: @tara_aksa @AlexanderTong7 @bose_joey @sarthmit @k_neklyudov @YuanqiD #AI4Mat: @MiretSantiago Rocío M @anoopnm007 @SteMartiniani @MoosaviSMohamad
Great talk by Grant Rotskoff linking sampling with nonequilibrium physics at the #FPIworkshop. Come by the poster session (Peridot 202-203) during the next hour. #ICLR25
FPI workshop off to a great start with Emtiyaz Khan talking about Adaptive Bayesian Intelligence! Come check it out in Peridot 202-203 #FPIWorkshop #ICLR25
Come check out the workshop and hear about novel works and contributions from an exciting lineup of speakers and panelists!
GDM Mech Interp Update: We study if SAEs help probes generalise OOD (they don't 😢). Based on this + parallel negative results on real-world tasks, we're de-prioritising SAE work. Our guess is that SAEs aren't useless, but also aren't a game-changer More + new research in 🧵
Can only speak for my ICML reviewing batch, but the hack of putting scary, convoluted, and wrong math in a paper still works.
If you're interested in how to keep challenging neural networks throughout training, check out our latest preprint! #sample_efficiency #scaling_laws
🚀 New Paper Alert! Can we generate informative synthetic data that truly helps a downstream learner? Introducing Deliberate Practice for Synthetic Data (DP)—a dynamic framework that focuses on where the model struggles most to generate useful synthetic training examples. 🔥…