Lihan Zha (@LihanZha)

Pinned

L

Lihan Zha@LihanZha · May 13

Want your imitation learning policy to generalize better, but how to collect data to achieve this? 🤖🤔 Enter Factored Scaling Curves (FSC): a tool that quantifies how policy success scales with demos for each environmental factor, enabling principled data collection 📈 . 🌐…

4

7

76

29

19.0K

Lihan Zha Retweeted

B

Binghao Huang@binghao_huang · Jul 16

Tactile interaction in the wild can unlock fine-grained manipulation! 🌿🤖✋ We built a portable handheld tactile gripper that enables large-scale visuo-tactile data collection in real-world settings. By pretraining on this data, we bridge vision and touch—allowing robots to:…

3

51

280

123

46.0K

Lihan Zha Retweeted

A

Albert Gu@_albertgu · Jul 8

I converted one of my favorite talks I've given over the past year into a blog post. "On the Tradeoffs of SSMs and Transformers" (or: tokens are bullshit) In a few days, we'll release what I believe is the next major advance for architectures.

26

114

782

540

113.0K

L

Lihan Zha@LihanZha · Jun 25

Join us at two workshops #RSS2025 on 6/25! 📍 Benchmarking Robot Manipulation: Improving Interoperability and Modularity (RTH 526, poster stand 88) 📷 Oral talk: 10.30-10.35 📍 Large Foundation Models for Interactive Robot Learning (SGM 123, poster stand 135) 📷 Lightning…

LLihan Zha@LihanZha · May 13

Want your imitation learning policy to generalize better, but how to collect data to achieve this? 🤖🤔 Enter Factored Scaling Curves (FSC): a tool that quantifies how policy success scales with demos for each environmental factor, enabling principled data collection 📈 . 🌐…

0

1

4

0

807

Lihan Zha Retweeted

Q

Qiao Gu@qiaogu1997 · Jun 23

🚀 Excited to introduce SAFE, our work on multitask failure detection for Vision-Language-Action (VLA) models! 🔍 SAFE is a simple yet powerful detector that leans from VLAs’ semantic-rich internal feature space and outputs a scalar score indicating the likelihood of task failure

2

25

125

80

13.0K

L

Lihan Zha@LihanZha · Jun 21

Imitation learning is not merely collecting large-scale demonstration data. It requires effective data collection and curation. FSC is a great example of this! Join Lihan’s session and chat with him to learn how to make your policy more general from a data-centric perspective!

LLihan Zha@LihanZha · Jun 20

Join us at two workshops #RSS2025 on 6/21! 📍 Resource Constrained Robotics (RTH109) 🗣️ Oral talk: 11:00–11:15 📍 Continual Robot Learning from Humans (OHE132) 🖼️ Spotlight poster: 10:30–11:00 Come by and chat—we’re excited to share our work!

0

1

3

0

591

L

Lihan Zha@LihanZha · Jun 20

Check out our recent work, Particle-Grid Neural Dynamics, led by the brilliant @kaiwynd , on 3D world modeling for diverse and challenging deformable objects 🧵👕🧸🛍️📦🍞!!!! The key insight is the hybrid Particle+Grid representation, enabling PGND to capture both fine-grained…

KKaifeng Zhang@kaiwynd · Jun 19

Can we learn a 3D world model that predicts object dynamics directly from videos? Introducing Particle-Grid Neural Dynamics: a learning-based simulator for deformable objects that trains from real-world videos. Website: kywind.github.io/pgnd ArXiv: arxiv.org/abs/2506.15680…

2

3

27

6

3.0K

L

Lihan Zha@LihanZha · Jun 20

Join us at two workshops #RSS2025 on 6/21! 📍 Resource Constrained Robotics (RTH109) 🗣️ Oral talk: 11:00–11:15 📍 Continual Robot Learning from Humans (OHE132) 🖼️ Spotlight poster: 10:30–11:00 Come by and chat—we’re excited to share our work!

LLihan Zha@LihanZha · May 13

Want your imitation learning policy to generalize better, but how to collect data to achieve this? 🤖🤔 Enter Factored Scaling Curves (FSC): a tool that quantifies how policy success scales with demos for each environmental factor, enabling principled data collection 📈 . 🌐…

1

5

25

1

4.0K

Lihan Zha Retweeted

M

Maximilian Du@du_maximilian · Jun 18

Normally, changing robot policy behavior means changing its weights or relying on a goal-conditioned policy. What if there was another way? Check out DynaGuide, a novel policy steering approach that works on any pretrained diffusion policy. dynaguide.github.io 🧵

5

34

141

67

16.0K

L

Lihan Zha@LihanZha · Jun 16

Excited to share more of what I've been working on for while now! We've trained a world model to solve policy evaluation, where we see enough correlation to real-world evaluations for practical production usage.

11X@1x_tech · Jun 16

1X World Model Scaling Evaluation for Robots

5

23

329

75

26.0K

Lihan Zha Retweeted

Y

Yunzhi Zhang@zhang_yunzhi · Jun 14

(1/n) Time to unify your favorite visual generative models, VLMs, and simulators for controllable visual generation—Introducing a Product of Experts (PoE) framework for inference-time knowledge composition from heterogeneous models.

5

64

302

218

44.0K

L

Lihan Zha@LihanZha · Jun 12

Robots struggle to find objects like humans. Why? Understanding "behind the box" (semantics) isn't enough – they need to plan precise, efficient actions to get there. Key Insight: VLMs propose where to look ("Maybe behind the box?"). World models evaluate VLM proposals and…

TTenny Yin@tennyyin · Jun 11

🔎Can robots search for objects like humans? Humans explore unseen environments intelligently—using prior knowledge to actively seek information and guide search. But can robots do the same? 👀 🚀Introducing WoMAP (World Models for Active Perception): a novel framework for…

2

3

21

3

3.0K

Lihan Zha Retweeted

P

Priya Sundaresan@priyasun_ · Jun 9

How can we move beyond static-arm lab setups and learn robot policies in our messy homes? We introduce HoMeR, an imitation learning agent for in-the-wild mobile manipulation. 🧵1/8

6

51

234

128

28.0K

L

Lihan Zha@LihanZha · Jun 6

Check out a recent talk I gave about how we can use structured world models + inference/planning to construct embodied agents that can generalize to many unseen tasks!

EEli Sennesh@EliSennesh · Jun 6

Check out @du_yilun 's talk at the Frontiers in NeuroAI Symposium yesterday! youtu.be/UKbLBOuHQ-0

4

7

85

39

10.0K

Lihan Zha Retweeted

M

Mandi Zhao@ZhaoMandi · Jun 3

How to learn dexterous manipulation for any robot hand from a single human demonstration? Check out DexMachina, our new RL algorithm that learns long-horizon, bimanual dexterous policies for a variety of dexterous hands, articulated objects, and complex motions.

19

104

617

326

113.0K

Lihan Zha Retweeted

J

Jingyun Yang@yjy0625 · Jun 5

Introducing Mobi-π: Mobilizing Your Robot Learning Policy. Our method: ✈️ enables flexible mobile skill chaining 🪶 without requiring additional policy training data 🏠 while scaling to unseen scenes 🧵↓

7

74

296

156

36.0K

Lihan Zha Retweeted

M

May Mei@MeiZhiting · May 29

How can we embed human-like reasoning into driving policies, without having to incur extra inference time? verdi-driving.github.io VERDI distills the reasoning process and commonsense knowledge from a VLM into an e2e model, boosting performance while maintaining inference speed!

6

1

6

1

2.0K

Lihan Zha Retweeted

P

Pedro Milcent@MilcentPedro · May 16

Great new paper by @LihanZha, @ApurvaBadithela, @mzhangio, @justinlidard, @Majumdar_Ani, @allenzren, @shahdhruv_, and team. As we scale data collection for robotics, it must be done intelligently by choosing which data variations to focus resources on. This paper provides a…

1

6

19

7

2.0K

L

Lihan Zha@LihanZha · May 16

Nice to see the use of ManiSkill3 in this work! Simulation is not just useful for RL training. It provides some good cheap deterministic test beds, perfect for testing imitation learning scaling laws at scale. Years of data in hours

LLihan Zha@LihanZha · May 13

Want your imitation learning policy to generalize better, but how to collect data to achieve this? 🤖🤔 Enter Factored Scaling Curves (FSC): a tool that quantifies how policy success scales with demos for each environmental factor, enabling principled data collection 📈 . 🌐…

2

7

43

15

6.0K

L

Lihan Zha@LihanZha · May 15

Happy to share recent work on Factored Scaling Curves for Guiding Data Collection led by my colleague @LihanZha in collaboration with @allenzren @shahdhruv_ @Majumdar_Ani @justinlidard and others! x.com/LihanZha/statu…

LLihan Zha@LihanZha · May 13

Want your imitation learning policy to generalize better, but how to collect data to achieve this? 🤖🤔 Enter Factored Scaling Curves (FSC): a tool that quantifies how policy success scales with demos for each environmental factor, enabling principled data collection 📈 . 🌐…

2

1

4

0

276

L

Lihan Zha@LihanZha · May 15

Data really matters for policy generalizing to diverse environment conditions as seen in pi05 etc. In this work we find strong correlation b/w # data and test performance, enabling targeted data collection. We also find offline metric (policy embedding) to give strong signal!

LLihan Zha@LihanZha · May 13

Want your imitation learning policy to generalize better, but how to collect data to achieve this? 🤖🤔 Enter Factored Scaling Curves (FSC): a tool that quantifies how policy success scales with demos for each environmental factor, enabling principled data collection 📈 . 🌐…

2

25

9

2.0K