Joel Jang
@jang_yoel
Senior Research Scientist @nvidiaai GEAR Lab working on Project GR00T. Leading video world model & latent actions. CS PhD student @uwcse.
Introducing 𝐃𝐫𝐞𝐚𝐦𝐆𝐞𝐧! We got humanoid robots to perform totally new 𝑣𝑒𝑟𝑏𝑠 in new environments through video world models. We believe video world models will solve the data problem in robotics, shifting the paradigm from scaling human hours to scaling GPU hours. Quick 🧵
A humanoid robot policy trained solely on synthetic data generated by a world model. Research Scientist Joel Jang presents NVIDIA's DreamGen pipeline: ⦿ Post-train the world model Cosmos-Predict2 with a small set of real teleoperation demos. ⦿ Prompt the world model to…
I've been a bit quiet on X recently. The past year has been a transformational experience. Grok-4 and Kimi K2 are awesome, but the world of robotics is a wondrous wild west. It feels like NLP in 2018 when GPT-1 was published, along with BERT and a thousand other flowers that…
Compete for a $560,000 Prize Pool at IROS 2025 AgiBot World Challenge! 💰 The AgiBot World Challenge – Manipulation Track is LIVE! Hosted by @AgiBot and @OpenDriveLab at #IROS2025. 🚀 Challenge: Tackle 10 complex Sim2Real manipulation tasks. 🛠️ Resources: Access a unique…
Check out Cosmos-Predict2, a new SOTA video world model trained specifically for Physical AI (powering GR00T Dreams & DreamGen)!
We build Cosmos-Predict2 as a world foundation model for Physical AI builders — fully open and adaptable. Post-train it for specialized tasks or different output types. Available in multiple sizes, resolutions, and frame rates. 📷 Watch the repo walkthrough…
🚀 GR00T Dreams code is live! NVIDIA GEAR Lab's open-source solution for robotics data via video world models. Fine-tune on any robot, generate 'dreams', extract actions with IDM, and train visuomotor policies with LeRobot datasets (GR00T N1.5, SmolVLA). github.com/NVIDIA/GR00T-D…
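For readers who want to see how those four stages fit together, here is a minimal sketch. The function names and signatures below are hypothetical placeholders, not the actual GR00T-Dreams repo API; each stub simply marks where the corresponding stage would run.

```python
# Hypothetical sketch of the four GR00T Dreams stages named above.
# None of these names are the real repo's API; each stub just marks
# where the corresponding stage would plug in.

def finetune_world_model(base_model: str, teleop_videos: list[str]) -> str:
    """Stage 1: post-train a video world model on a few real demos."""
    print(f"fine-tuning {base_model} on {len(teleop_videos)} demos")
    return base_model + "-finetuned"

def generate_dreams(model: str, prompts: list[str]) -> list[str]:
    """Stage 2: prompt the tuned model for novel synthetic rollouts."""
    return [f"{model}:dream:{p}" for p in prompts]

def extract_actions_idm(dreams: list[str]) -> list[tuple[str, str]]:
    """Stage 3: label each dreamed video with pseudo-actions via an IDM."""
    return [(d, "pseudo-action") for d in dreams]

def train_policy(labeled: list[tuple[str, str]]) -> None:
    """Stage 4: train a visuomotor policy (e.g. via a LeRobot dataset)."""
    print(f"training policy on {len(labeled)} labeled trajectories")

model = finetune_world_model("cosmos-predict2", ["demo_00.mp4", "demo_01.mp4"])
dreams = generate_dreams(model, ["pour water into the mug", "fold the towel"])
train_policy(extract_actions_idm(dreams))
```

In the real repo, each stub would be replaced by the corresponding fine-tuning, dreaming, IDM-labeling, and policy-training entry points.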
How do we improve VLA generalization? 🤔 Last week we upgraded #NVIDIA GR00T N1.5 with minor VLM tweaks, FLARE, and richer data mixtures (DreamGen, etc.) ✨. N1.5 yields better language following: post-trained on the unseen Unitree G1 with 1K trajectories, it follows commands on…
🚀 Introducing Cosmos-Predict2! Our most powerful open video foundation model for Physical AI. Cosmos-Predict2 significantly improves upon Predict1 in visual quality, prompt alignment, and motion dynamics—outperforming popular open-source video foundation models. It’s openly…
Assuming that we need ~2 trillion tokens to get to a robot GPT, how can we get there? I went through a few scenarios looking at how we can combine simulation data, human video data, and looking at the size of existing robot fleets. Some assumptions: - We probably need some real…
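A rough version of that scenario math, in code. Only the ~2 trillion token target comes from the tweet; every rate below (tokens per robot-hour, fleet size, dream throughput per GPU) is an illustrative assumption, not a measured figure.

```python
# Back-of-envelope for the ~2T-token target in the thread above.
# Only the 2e12 goal comes from the tweet; all rates are assumptions.

TARGET_TOKENS = 2e12

tokens_per_hour = 1e5          # assumed tokens per robot-hour of data
fleet_size = 1_000             # assumed number of deployed robots
hours_per_robot_per_day = 4    # assumed daily collection per robot

daily_tokens = fleet_size * hours_per_robot_per_day * tokens_per_hour
years_teleop = TARGET_TOKENS / daily_tokens / 365
print(f"teleop alone: ~{years_teleop:.1f} years at these rates")

# Scaling GPU hours instead of human hours: if one GPU can dream the
# equivalent of 50 robot-hours per day, the same math shifts to GPUs.
gpus = 10_000
gpu_equiv_hours_per_day = 50   # assumed dream throughput per GPU
gpu_daily = gpus * gpu_equiv_hours_per_day * tokens_per_hour
print(f"with dreams: ~{TARGET_TOKENS / gpu_daily / 365:.2f} years")
```

Under these made-up rates, teleoperation alone takes over a decade while dreamed data closes the gap in weeks, which is the human-hours-to-GPU-hours argument in miniature.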
🔥 ReAgent-V Released! 🔥 A unified video framework with reflection and reward-driven optimization. ✨ Real-time self-correction. ✨ Triple-view reflection. ✨ Auto-selects high-reward samples for training.
Giving a talk about GR00T N1, GR00T N1.5, and GR00T Dreams at NVIDIA GTC Paris, 06.11, 2:00-2:45 PM CEST. If you are at Vivatech in Paris, please stop by the "An Introduction to Humanoid Robotics" session!
Are you curious about #humanoidrobotics? Join our experts at #GTCParis for a deep dive into the #NVIDIAIsaac GR00T platform and its four pillars: 🧠 Robot foundation models for cognition and control 🌐 Simulation frameworks built on @nvidiaomniverse and #NVIDIACosmos 📊 Data…
Representation also matters for VLA models! Introducing FLARE: Robot Learning with Implicit World Modeling. With a future latent alignment objective, FLARE significantly improves policy performance on multitask imitation learning & unlocks learning from egocentric human videos.
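A minimal sketch of what a future latent alignment objective can look like, in the spirit of FLARE: the policy backbone predicts the latent of a future observation, and that prediction is aligned with a stop-gradient target embedding alongside the usual imitation loss. Dimensions, encoders, and the loss weight below are assumptions, not the paper's exact recipe.

```python
# Sketch of a future latent alignment objective in the spirit of FLARE.
# Sizes, encoders, and the 0.5 weight are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

obs_dim, act_dim, latent_dim = 64, 7, 128

policy_backbone = nn.Sequential(nn.Linear(obs_dim, 256), nn.ReLU(),
                                nn.Linear(256, latent_dim))
action_head = nn.Linear(latent_dim, act_dim)
future_head = nn.Linear(latent_dim, latent_dim)  # predicts future latent
target_encoder = nn.Linear(obs_dim, latent_dim)

obs_t = torch.randn(32, obs_dim)       # current observation
obs_future = torch.randn(32, obs_dim)  # observation k steps ahead
expert_action = torch.randn(32, act_dim)

h = policy_backbone(obs_t)
bc_loss = F.mse_loss(action_head(h), expert_action)  # imitation term

with torch.no_grad():                                 # stop-gradient target
    z_future = target_encoder(obs_future)
align_loss = 1 - F.cosine_similarity(future_head(h), z_future).mean()

loss = bc_loss + 0.5 * align_loss  # 0.5 is an assumed weight
loss.backward()
```

On action-free egocentric human video, the imitation term would presumably be dropped and only the alignment term kept, which is what would make learning from such videos possible.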
NVIDIA also announced DreamGen, a new engine that scales robot learning with digital dreams. It produces large volumes of photorealistic robot videos (using video models) paired with motor action labels, and unlocks generalization to new environments.
NVIDIA has published a paper on DREAMGEN – a powerful 4-step pipeline for generating synthetic data for humanoids that enables task and environment generalization. - Step 1: Fine-tune a video generation model using a small number of human teleoperation videos - Step 2: Prompt…
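The step list truncates here, but per the GR00T Dreams tweet above, the dreamed videos are then labeled with actions by an inverse dynamics model (IDM) before policy training. A minimal hypothetical IDM that predicts the action between two consecutive frames, with architecture and sizes chosen purely for illustration:

```python
# Minimal inverse dynamics model (IDM) of the kind used to label
# dreamed videos with pseudo-actions. Architecture and sizes are
# assumptions for illustration only.
import torch
import torch.nn as nn

class InverseDynamicsModel(nn.Module):
    """Predict the action that takes frame_t to frame_t+1."""
    def __init__(self, act_dim: int = 7):
        super().__init__()
        # 6 input channels: two stacked RGB frames
        self.encoder = nn.Sequential(
            nn.Conv2d(6, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.head = nn.Sequential(nn.Linear(64, 128), nn.ReLU(),
                                  nn.Linear(128, act_dim))

    def forward(self, frame_t, frame_t1):
        x = torch.cat([frame_t, frame_t1], dim=1)  # stack along channels
        return self.head(self.encoder(x))

idm = InverseDynamicsModel()
f0, f1 = torch.randn(4, 3, 96, 96), torch.randn(4, 3, 96, 96)
pseudo_actions = idm(f0, f1)  # (4, 7) action labels for dreamed frames
print(pseudo_actions.shape)
```

In practice such a model would be trained on the real teleoperation demos (where actions are known) and then run over the dreamed videos to produce action labels for policy training.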
It’s not a matter of if but when: video models and world models are going to be a central tool for building robot foundation models.
Getting robot data is difficult for those who don’t have the resources, and I’m glad to see @NVIDIARobotics is offering an API for everyone to use!
Introducing 😶🌫️DreamGen, the pioneering approach to neural trajectories + robotics at NVIDIA GEAR Lab. We’re among the first to show how large-scale synthetic data can significantly improve a robot’s ability to generalize to new actions and environments. If you’re interested,…
This has long been what was missing from video world models imo. Exciting progress.