Allen Z. Ren
@allenzren
Generalist robot policy @physical_int, PhD @Princeton
👇Introducing DPPO, Diffusion Policy Policy Optimization. DPPO optimizes a pre-trained Diffusion Policy using policy gradients from RL, showing 𝘀𝘂𝗿𝗽𝗿𝗶𝘀𝗶𝗻𝗴 𝗶𝗺𝗽𝗿𝗼𝘃𝗲𝗺𝗲𝗻𝘁𝘀 over a variety of baselines across benchmarks and in sim2real transfer. diffusion-ppo.github.io
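The core DPPO idea — treat each denoising step of the diffusion policy as an action in a two-layer MDP and run PPO on the whole chain — can be sketched in a few lines. Everything below (the toy means, std, advantage) is a hypothetical stand-in, not the paper's implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

def gaussian_logpdf(x, mean, std):
    # log N(x; mean, std^2 I), summed over action dims
    return float(np.sum(-0.5 * ((x - mean) / std) ** 2
                        - np.log(std) - 0.5 * np.log(2 * np.pi)))

# Toy "denoising policy": each of K steps is a Gaussian whose mean the
# network would predict; the chain a_K -> ... -> a_0 is one rollout.
K, act_dim, std = 4, 2, 0.1
means = [rng.normal(size=act_dim) for _ in range(K)]          # hypothetical
samples = [m + std * rng.normal(size=act_dim) for m in means]

# Each denoising step is an "action" in the two-layer MDP, so the
# log-prob of the sampled chain is the sum of per-step log-probs.
logp_old = sum(gaussian_logpdf(s, m, std) for s, m in zip(samples, means))

def ppo_objective(logp_new, logp_old, advantage, clip=0.2):
    # standard PPO clipped surrogate, applied to the denoising chain
    ratio = np.exp(logp_new - logp_old)
    return min(ratio * advantage,
               np.clip(ratio, 1 - clip, 1 + clip) * advantage)

adv = 1.0  # advantage of this action chunk, from a critic
print(ppo_objective(logp_old, logp_old, adv))  # ratio = 1 -> equals adv
```

The clipping keeps the fine-tuned denoiser close to the pre-trained one per update, which is what makes gradient steps through the long denoising chain stable.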
We’re releasing the RoboArena today!🤖🦾 Fair & scalable evaluation is a major bottleneck for research on generalist policies. We’re hoping that RoboArena can help! We provide data, model code & sim evals for debugging! Submit your policies today and join the leaderboard! :) 🧵
Want robot imitation learning to generalize to new tasks? Blindfold your human demonstrator! Best robotics paper at EXAIT Workshop #ICML2025 openreview.net/forum?id=zqfT2… Wait, why does this make sense? Read below!
Action chunking + expressive action distribution —> Better exploration for RL! This was one of the biggest lessons we learned in DPPO as well
Action chunking works really well in imitation learning, and is essential to learning good BC policies in robotics. Can/should we apply the same idea in RL? We find that RL in the action chunk space, when done right (we call it ✨Q-chunking ✨), can be highly efficient🧵👇
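The efficiency argument behind Q-chunking can be seen in the TD backup: a critic over whole chunks bootstraps with an h-step return, so value propagates h steps per update instead of one. A toy sketch with made-up numbers, not the paper's code:

```python
# Toy Q-chunking backup: the critic scores an entire action chunk
# a_{t:t+h}; the TD target uses the h-step discounted return.
h, gamma = 4, 0.99

def chunk_td_target(rewards, q_next):
    # rewards: the h rewards collected while executing one chunk
    # q_next: critic value of the next (state, chunk) pair
    ret = sum(gamma ** i * r for i, r in enumerate(rewards))
    return ret + gamma ** h * q_next

rewards = [1.0, 0.0, 0.0, 1.0]   # hypothetical per-step rewards
print(chunk_td_target(rewards, q_next=5.0))
```

Pairing this with an expressive (e.g. flow/diffusion) chunk sampler gives temporally coherent exploration, the same lesson the DPPO tweet above points at.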
Diffusion policies have demonstrated impressive performance in robot control, yet are difficult to improve online when 0-shot performance isn’t enough. To address this challenge, we introduce DSRL: Diffusion Steering via Reinforcement Learning. (1/n) diffusion-steering.github.io
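The steering idea in DSRL, as described here, is to leave the diffusion policy frozen and instead learn which latent noise to feed it. A minimal illustration with a one-shot random search in noise space standing in for the RL policy (all functions and numbers are hypothetical):

```python
import numpy as np

rng = np.random.default_rng(0)

# Frozen "diffusion policy": deterministically maps an initial noise
# vector w (plus the observation) to an action -- a stand-in for the
# full denoising chain, which is a deterministic function of w.
W = rng.normal(size=(2, 2))
def frozen_policy(obs, w):
    return np.tanh(W @ w + obs)

def reward(action, goal):
    return -float(np.linalg.norm(action - goal))

# Steering: search over w, never touching the policy's weights.
obs, goal = np.zeros(2), np.array([0.5, -0.5])
candidates = rng.normal(size=(64, 2))            # sampled latent noises
scores = [reward(frozen_policy(obs, w), goal) for w in candidates]
best_w = candidates[int(np.argmax(scores))]
print(reward(frozen_policy(obs, best_w), goal) == max(scores))  # True
```

Because the noise space is low-dimensional relative to the policy's weights, online improvement can be far cheaper than fine-tuning the denoiser itself.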
Join us at two workshops #RSS2025 on 6/21! 📍 Resource Constrained Robotics (RTH109) 🗣️ Oral talk: 11:00–11:15 📍 Continual Robot Learning from Humans (OHE132) 🖼️ Spotlight poster: 10:30–11:00 Come by and chat—we’re excited to share our work!
Want your imitation learning policy to generalize better, but not sure how to collect data to achieve this? 🤖🤔 Enter Factored Scaling Curves (FSC): a tool that quantifies how policy success scales with demos for each environmental factor, enabling principled data collection 📈 . 🌐…
🔎Can robots search for objects like humans? Humans explore unseen environments intelligently—using prior knowledge to actively seek information and guide search. But can robots do the same? 👀 🚀Introducing WoMAP (World Models for Active Perception): a novel framework for…
In LLM land, a slow model is annoying. In robotics, a slow model can be disastrous! Visible pauses at best, dangerously jerky motions at worst. But large VLAs are slow by nature. What can we do about this? An in-depth 🧵:
Our models need to run in real time on real robots, but inference with big VLAs takes a long time. We developed Real-Time Action Chunking (RTC) to enable real-time inference with flow matching for the π0 and π0.5 VLAs! More in the thread👇
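The scheduling trick behind real-time chunking can be sketched independently of the flow-matching details: while the robot executes the first d actions of the current chunk (d = inference latency in control steps), the next chunk is generated, and its first d actions are frozen to match what is already committed, so execution never pauses or jumps. A toy splice, not the actual RTC inpainting procedure:

```python
def splice(prev_remaining, fresh, d):
    # prev_remaining: actions of the old chunk still scheduled to run
    # fresh: newly generated chunk covering the same time window
    # The first d steps are already committed (they execute while the
    # new chunk is being computed), so they come from the old plan;
    # the remainder comes from the fresh chunk.
    return prev_remaining[:d] + fresh[d:]

old = [f"a{i}" for i in range(8)]     # hypothetical old chunk
fresh = [f"b{i}" for i in range(8)]   # hypothetical new chunk
print(splice(old, fresh, d=3))        # ['a0','a1','a2','b3',...,'b7']
```

In RTC proper, the frozen prefix is enforced inside the flow-matching sampler (inpainting-style) rather than by list slicing, so the fresh suffix stays consistent with the committed prefix.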
Always fun to chat about generalization with Ani :)
Great to have @allenzren back at @Princeton for the PhD hooding ceremony, so I could ask him a whole bunch of questions about the pi_0.5 paper from @physical_int!
Our newest VLA training recipe achieves fast training, fast inference, and great performance, by carefully designing the interface between model backbone and continuous actions. Many lessons learned along the way👇
We figured out how to train VLAs with diffusion outputs much faster (7.5x faster), inheriting better language following from the VLM, and leading to better results. The key: protect the VLM backbone during training with knowledge insulation. Let’s talk about what we learned👇
How to build vision-language-action models that train fast, run fast & generalize? In our new paper, we formalize & analyze the approach of our π-0.5 model & further improve it with a single stage recipe. Blog: pi.website/research/knowl… Paper: pi.website/download/pi05_…
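The insulation idea can be shown with a hand-derived gradient: the continuous-action loss flows into the action expert but is detached before the VLM backbone, so it cannot degrade the pretrained weights (the backbone still trains on its own token-level loss). A toy linear stand-in, not the actual π-0.5 code:

```python
import numpy as np

x = np.array([1.0, 2.0])                    # toy "observation"
Wb = np.array([[0.5, -0.3], [0.1, 0.4]])    # backbone weights
We = np.array([[0.2, 0.7]])                 # action-expert weights

feat = Wb @ x                 # backbone features
feat_sg = feat.copy()         # stop_gradient boundary (detach in torch,
                              # jax.lax.stop_gradient in JAX)
action = We @ feat_sg
loss = 0.5 * float(action @ action)

# Backprop the action loss by hand:
dloss_daction = action
dWe = np.outer(dloss_daction, feat_sg)      # the expert still learns
dWb = np.zeros_like(Wb)                     # insulated: nothing reaches Wb
print(np.allclose(dWb, 0))                  # True
```

Because no diffusion-style gradient ever touches `Wb`, the backbone keeps its language-following behavior from VLM pretraining, which is the "knowledge insulation" being named above.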
📢 Excited for the second workshop on Out-of-Distribution Generalization in Robotics: Towards Reliable Learning-based Autonomy at RSS! #RSS2025 🎯 How can we build reliable robotic autonomy for the real world? 📅 Short papers due 05/25/25 🌐 tinyurl.com/rss2025ood 🧵(1/4)
Nice to see the use of ManiSkill3 in this work! Simulation is not just useful for RL training; it also provides cheap, deterministic test beds, perfect for testing imitation learning scaling laws at scale. Years of data in hours.
In the era of generalist robot foundation models, how do you get their pre-trained model to work well on your robot and task? 🌐factored-data-scaling.github.io 📈 We introduce Factored Scaling Curves (FSC): a principled approach for modeling how policy performance scales with data for…
Data is the fuel that drives robot learning, but we don't have great strategies for figuring out what data to collect to enable strong generalization. Check out @LihanZha's first paper as a PhD student at @Princeton! 𝐆𝐮𝐢𝐝𝐢𝐧𝐠 𝐃𝐚𝐭𝐚 𝐂𝐨𝐥𝐥𝐞𝐜𝐭𝐢𝐨𝐧 𝐯𝐢𝐚…
Guided Data Collection via Factored Scaling Curves: a principled method for deciding what data to collect, and how much to collect for each factor, by constructing factored scaling curves.
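The decision rule can be sketched as: fit a success-vs-demos curve per environmental factor, then spend the next collection budget on the factor with the largest predicted marginal gain. The saturating curve form and all numbers below are illustrative choices, not the paper's fits:

```python
import numpy as np

def curve(n, a, b):
    # toy saturating scaling curve: success rate vs. demo count n
    return a * (1 - np.exp(-b * n))

# hypothetical fitted (a, b) per environmental factor
factors = {"lighting": (0.9, 0.02), "pose": (0.8, 0.10)}
demos_so_far = {"lighting": 50, "pose": 50}

def marginal_gain(name, extra=10):
    # predicted success improvement from `extra` more demos of this factor
    a, b = factors[name]
    n = demos_so_far[name]
    return curve(n + extra, a, b) - curve(n, a, b)

best = max(factors, key=marginal_gain)
print(best)  # "lighting": its curve is far from saturating
```

Here "pose" has nearly saturated (b = 0.10 at 50 demos), so the next demos go to "lighting" — the kind of allocation the curves are meant to make principled.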
Introducing ✨Latent Diffusion Planning✨ (LDP)! We explore how to use expert, suboptimal, & action-free data. To do so, we learn a diffusion-based *planner* that forecasts latent states, and an *inverse-dynamics model* that extracts actions. w/ @_oleh @DorsaSadigh @chelseabfinn
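The planner/inverse-dynamics split can be sketched with linear stand-ins: a planner proposes future latent states (which action-free data can supervise), and a separate inverse-dynamics model recovers the action between consecutive latents. The dynamics, plan, and least-squares inverse model below are all hypothetical, not LDP's networks:

```python
import numpy as np

# Toy latent dynamics z' = A z + B a, standing in for the environment.
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [1.0]])

def inverse_dynamics(z, z_next):
    # least-squares action explaining the transition z -> z_next
    a, *_ = np.linalg.lstsq(B, z_next - A @ z, rcond=None)
    return a

# "Planner" output: a hypothetical latent trajectory toward a goal.
plan = [np.array([0.0, 0.0]), np.array([0.0, 1.0]), np.array([0.1, 1.5])]

# Extract the action for each consecutive pair of planned latents.
actions = [inverse_dynamics(z, zn) for z, zn in zip(plan, plan[1:])]
print([round(float(a[0]), 3) for a in actions])  # [1.0, 0.5]
```

The split is what lets suboptimal and action-free data help: they can train the latent forecaster even when no action labels exist for the inverse model.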
Check out our newest work in bringing robots closer to open-world generalization! It was truly amazing to see (1) data scaling and (2) iterating over the cross-embodiment co-training recipe solved the tasks that the robot struggled with when I first joined Pi.
We got a robot to clean up homes that were never seen in its training data! Our new model, π-0.5, aims to tackle open-world generalization. We took our robot into homes that were not in the training data and asked it to clean kitchens and bedrooms. More below⤵️