Luke Rowe
@Luke22R
Research Intern @Waymo. PhD Student at @Mila_Quebec, focusing on machine learning and autonomous driving.
🚀 Our method, Poutine, was the best-performing entry in the 2025 Waymo Vision-based End-to-End Driving Challenge at #CVPR2025! Our 3B-parameter VLM scored 7.99 RFS on the official test set, comfortably ahead of every other entry (see figure).

Most of these innovations originated in Canada 🇨🇦 though 😜
🚨 Excited to share our new work: "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning"! 📈 We propose gradient interventions that enable stable, scalable learning, achieving significant performance gains across agents and environments! Details below 👇
How can we make behavioural cloning (BC) achieve better combinatorial generalization on out-of-distribution goals? We propose BYOL-γ: an auxiliary self-predictive loss to improve generalization for goal-conditioned BC. 🧵1/6
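A minimal sketch of that objective as I read it (illustrative only; `online_enc`, `target_enc`, `predictor`, and the future-state sampling are assumptions, not the paper's actual code):

```python
import torch
import torch.nn.functional as F

def byol_gamma_objective(online_enc, target_enc, predictor, policy,
                         s_t, s_future, goal, a_t, aux_weight=1.0):
    """Goal-conditioned BC loss plus a BYOL-style self-predictive
    auxiliary loss. `s_future` is assumed to be a future state on the
    same trajectory, sampled with a gamma-discounted horizon."""
    # Behavioural cloning term: predict the demonstrated action
    bc_loss = F.cross_entropy(policy(online_enc(s_t), goal), a_t)

    # Self-predictive term: predict the (stop-gradient) target-network
    # embedding of the future state from the current embedding
    pred = predictor(online_enc(s_t))
    with torch.no_grad():
        tgt = target_enc(s_future)
    aux_loss = F.mse_loss(F.normalize(pred, dim=-1),
                          F.normalize(tgt, dim=-1))
    return bc_loss + aux_weight * aux_loss
```

In BYOL fashion, `target_enc` would be an EMA copy of `online_enc`; the stop-gradient plus slow target is what keeps the auxiliary loss from collapsing.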
Poutine: Vision-Language-Trajectory Pre-Training and Reinforcement Learning Post-Training Enable Robust End-to-End Autonomous Driving. arxiv.org/abs/2506.11234
Why do we keep sampling from the same distribution the model was trained on? We rethink this old paradigm by introducing Feynman-Kac Correctors (FKCs) – a flexible framework for controlling the distribution of samples at inference time in diffusion models! Without re-training…
🧵(1/6) Delighted to share our @icmlconf 2025 spotlight paper: the Feynman-Kac Correctors (FKCs) in Diffusion. Picture this: it’s inference time and we want to generate new samples from our diffusion model. But we don’t want to just copy the training data – we may want to sample…
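For intuition, a hedged sequential-Monte-Carlo-style sketch of inference-time distribution control (the weighting function and resampling scheme here are illustrative assumptions, not the exact FKC algorithm):

```python
import torch

def smc_guided_sampling(denoise_step, log_weight_fn, x_T, n_steps):
    """Propagate a population of diffusion samples and repeatedly
    resample them by importance weights, steering the final samples
    toward a target distribution without retraining the model."""
    x = x_T  # (num_particles, ...) pure-noise initialization
    for t in reversed(range(n_steps)):
        x = denoise_step(x, t)               # one reverse-diffusion step
        log_w = log_weight_fn(x, t)          # per-particle log-weights
        probs = torch.softmax(log_w, dim=0)  # normalize over particles
        idx = torch.multinomial(probs, num_samples=x.shape[0],
                                replacement=True)
        x = x[idx]                           # multinomial resampling
    return x
```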
One of the most striking non-text AI plots I've seen since ChatGPT launched. Scaling keeps working, this time for Waymo's tooling.
Excited that our paper "Addressing Concept Mislabeling in Concept Bottleneck Models Through Preference Optimization" was accepted to ICML 2025! We show how Preference Optimization can reduce the impact of noisy concept labels in CBMs. 🧵/9
(1/n)🚨You can train a model that solves DFT for any geometry almost without training data!🚨 Introducing Self-Refining Training for Amortized Density Functional Theory – a variational framework for learning a DFT solver that predicts the ground-state solutions for different…
New preprint! 🧠🤖 How do we build neural decoders that are: ⚡️ fast enough for real-time use 🎯 accurate across diverse tasks 🌍 generalizable to new sessions, subjects, and species? We present POSSM, a hybrid SSM architecture that optimizes for all three of these axes! 🧵1/7
🚗💥Introducing Ctrl-Crash: controllable video generation for autonomous driving! SOTA models struggle to generate physically realistic car crashes. We propose an image2video diffusion model with bounding box and crash type control. Website: anthonygosselin.github.io/Ctrl-Crash-Pro… 🧵->
Is AdamW the best inner optimizer for DiLoCo? Does the inner optimizer affect the compressibility of the DiLoCo delta? Excited to introduce MuLoCo: Muon is a practical inner optimizer for DiLoCo! 🧵arxiv.org/abs/2505.23725 1/N
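For context, a rough sketch of a DiLoCo-style round, where the inner optimizer (AdamW in DiLoCo, Muon in MuLoCo) is swappable; the `worker` interface here is an assumption for illustration:

```python
import copy
import torch

def diloco_round(global_model, workers, outer_opt, inner_steps=500):
    """One communication round: each worker optimizes a local copy with
    its inner optimizer, then the averaged parameter delta is applied as
    a pseudo-gradient by the outer optimizer (Nesterov-momentum SGD in
    the original DiLoCo recipe)."""
    init = [p.detach().clone() for p in global_model.parameters()]
    deltas = [torch.zeros_like(p) for p in init]

    for worker in workers:
        local = copy.deepcopy(global_model)
        inner_opt = worker.make_inner_optimizer(local)  # AdamW or Muon
        for _ in range(inner_steps):
            loss = worker.loss(local)
            inner_opt.zero_grad()
            loss.backward()
            inner_opt.step()
        # Accumulate the averaged delta (init minus final params)
        for d, p0, p in zip(deltas, init, local.parameters()):
            d += (p0 - p.detach()) / len(workers)

    # Outer update: treat the averaged delta as a gradient
    for p, d in zip(global_model.parameters(), deltas):
        p.grad = d
    outer_opt.step()
    outer_opt.zero_grad()
```

The compressibility question in the tweet is about that delta: how well `init - final` survives quantization or sparsification before the outer step.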
Confused about recent LLM RL results where models improve without any ground-truth signal? We were too. Until we looked at the reported numbers of the Pre-RL models and realized they were severely underreported across papers. We compiled the discrepancies in a blog below🧵👇
Thanks @_akhaliq for sharing our work! Excited to present our next generation of SVG models, now using Reinforcement Learning from Rendering Feedback (RLRF). 🧠 We think we cracked SVG generalization with this one. Go read the paper! arxiv.org/abs/2505.20793 More details on…
Rendering-Aware Reinforcement Learning for Vector Graphics Generation
RLRF significantly outperforms supervised fine-tuning, addressing common failure modes and enabling precise, high-quality SVG generation with strong structural understanding and generalization
Is there a universal strategy to turn any generative model (GANs, VAEs, diffusion models, or flows) into a conditional sampler, or to finetune it to optimize a reward function? Yes! Outsourced Diffusion Sampling (ODS), accepted to @icmlconf, does exactly that!
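The high-level idea, as a hedged sketch (assuming a learned `latent_sampler` over the frozen generator's latent space; not the paper's actual API):

```python
import torch

def outsourced_sample(latent_sampler, generator, num_samples):
    """Leave the pretrained generator (GAN/VAE/flow/diffusion) frozen
    and instead learn a diffusion sampler over its latent space that
    targets the conditional or reward-tilted distribution; pushing
    those latents through the generator yields the desired samples."""
    z = latent_sampler.sample(num_samples)  # learned latent-space sampler
    with torch.no_grad():
        x = generator(z)                    # frozen pretrained generator
    return x
```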
We find training unified multimodal understanding and generation models is so easy that you do not need to tune the MLLM at all. The MLLM's knowledge/reasoning/in-context learning can be transferred from multimodal understanding (text output) to generation (pixel output) even when it is FROZEN!
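A minimal sketch of that recipe (module names, output fields, and sizes are assumptions; the frozen MLLM conditions a trainable decoder through a small projector):

```python
import torch
import torch.nn as nn

class FrozenMLLMGenerator(nn.Module):
    """Keep the multimodal LLM frozen and train only a lightweight
    projector plus an image decoder, so the MLLM's understanding-side
    knowledge is reused for pixel generation."""
    def __init__(self, mllm, decoder, hidden=4096, cond=1024):
        super().__init__()
        self.mllm = mllm
        for p in self.mllm.parameters():
            p.requires_grad = False          # MLLM stays frozen
        self.proj = nn.Linear(hidden, cond)  # trainable bridge
        self.decoder = decoder               # e.g. a diffusion image decoder

    def forward(self, tokens, noisy_latents, timesteps):
        with torch.no_grad():
            h = self.mllm(tokens).last_hidden_state  # frozen features
        cond = self.proj(h)                          # condition the generator
        return self.decoder(noisy_latents, timesteps, cond)
```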