Bowen Wen
@bowenwen_me
Senior Research Scientist @NVIDIA, Computer Vision, Robotics | previously@GoogleX, @Meta, @Amazon. Opinions are my own.
📢Time to upgrade your depth camera! Introducing **FoundationStereo**, a foundation model for zero-shot stereo depth estimation (accepted to CVPR 2025 with full scores) [1/n] Code: github.com/NVlabs/Foundat… Website: nvlabs.github.io/FoundationSter… Paper: arxiv.org/abs/2501.09898
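For anyone wiring a stereo model like this into a pipeline, the last step is usually converting the predicted disparity map into metric depth using the rectified-stereo relation depth = fx · baseline / disparity. The sketch below is not FoundationStereo's API; it only shows that standard conversion, and the focal length, baseline, and disparity values are made-up placeholders for your own calibration.

```python
import numpy as np

def disparity_to_depth(disparity, fx, baseline_m, min_disp=1e-6):
    """Convert a predicted disparity map (pixels) to metric depth (meters).

    Standard rectified-stereo geometry: depth = fx * baseline / disparity,
    where fx is the focal length in pixels and baseline_m the stereo baseline in meters.
    """
    disparity = np.asarray(disparity, dtype=np.float32)
    depth = np.full_like(disparity, np.nan)      # invalid pixels stay NaN
    valid = disparity > min_disp
    depth[valid] = fx * baseline_m / disparity[valid]
    return depth

# Toy example: a 480x640 disparity map, fx = 700 px, 9.5 cm baseline (placeholder values).
disp = np.random.uniform(1.0, 64.0, size=(480, 640)).astype(np.float32)
depth = disparity_to_depth(disp, fx=700.0, baseline_m=0.095)
print(depth.shape, np.nanmin(depth), np.nanmax(depth))
```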
Stereo depth sensing is set to revolutionize 3D perception. Can't wait to see the new innovations and applications that emerge! #3Dperception #computervision #robotics realsenseai.com/news-insights/…
Want a better representation for collision avoidance and grasping from dense clutter? Try out RaySt3R: our new 3D shape completion pipeline from single-view RGBD (led by @BDuisterhof)!
Imagine if robots could fill in the blanks in cluttered scenes. ✨ Enter RaySt3R: a single masked RGB-D image in, complete 3D out. It infers depth, object masks, and confidence for novel views, and merges the predictions into a single point cloud. rayst3r.github.io
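To make the "merge the predictions into a single point cloud" step concrete, here is a minimal, generic sketch (my own illustration, not RaySt3R's implementation): back-project each predicted depth map with the camera intrinsics K, keep points that pass the object mask and a confidence threshold, transform them into a shared world frame with each view's pose, and concatenate. All variable names and the 0.5 threshold are assumptions.

```python
import numpy as np

def backproject(depth, K):
    """Back-project a depth map (H, W) into camera-frame 3D points (H*W, 3)."""
    H, W = depth.shape
    u, v = np.meshgrid(np.arange(W), np.arange(H))
    z = depth.reshape(-1)
    x = (u.reshape(-1) - K[0, 2]) * z / K[0, 0]
    y = (v.reshape(-1) - K[1, 2]) * z / K[1, 1]
    return np.stack([x, y, z], axis=1)

def merge_views(preds, K, conf_thresh=0.5):
    """Fuse per-view (depth, boolean mask, confidence, cam-to-world pose) into one point cloud."""
    points = []
    for depth, mask, conf, T_wc in preds:
        keep = (mask & (conf > conf_thresh) & (depth > 0)).reshape(-1)
        pts_cam = backproject(depth, K)[keep]
        # Rigid transform into the shared world frame: p_w = R p_c + t.
        pts_world = pts_cam @ T_wc[:3, :3].T + T_wc[:3, 3]
        points.append(pts_world)
    return np.concatenate(points, axis=0)
```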
More progress on developing a straightforward method to collect first-person (ego) and third-person (exo) data for robotic training with @rerundotio . I’ve been using the HO-cap dataset to establish a baseline, and here are some updates I’ve made: * added in MANO parameters from…
After a short detour to Mast3r SLAM, I’m starting back up on exo-ego data collection, this time bringing in the HOCap dataset (irvlutd.github.io/HOCap/). It has a permissive license, MANO poses, and RGB-D with camera parameters! I managed to get the camera and images so far.
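Since the dataset ships MANO hand poses, one quick way to turn them into meshes is the MANO layer in the smplx package. This is a generic sketch with all-zero placeholder parameters and a hypothetical model path, not HOCap's own tooling; check the dataset loader for the exact parameter layout (e.g., PCA vs. full axis-angle pose).

```python
import torch
import smplx  # pip install smplx; also requires the MANO model files from the MANO project

# Placeholder values standing in for one frame's annotation.
betas = torch.zeros(1, 10)          # hand shape coefficients
global_orient = torch.zeros(1, 3)   # wrist rotation (axis-angle)
hand_pose = torch.zeros(1, 45)      # 15 joints x 3 axis-angle params (use_pca=False)

mano = smplx.create(
    model_path="path/to/mano",      # hypothetical folder containing MANO_RIGHT.pkl
    model_type="mano",
    is_rhand=True,
    use_pca=False,
)
out = mano(betas=betas, global_orient=global_orient, hand_pose=hand_pose)
print(out.vertices.shape, out.joints.shape)  # (1, 778, 3) mesh vertices plus hand joints
```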
Why don't you just say "this message is for Chinese researchers"? Besides, I am also amazed by your superpower to recognize the ethnicity of anonymous reviewers. Otherwise, how could one just assume a negative review is from a WeChat user?
Repeat after me: not everybody has a slave army of MSc students that (will do anything to have a paper and) executes every possible (boring) quantitative comparison you can think of. Seems I should post this on wechat though.
Do not miss this great research internship opportunity!
📢📢We have a last-minute internship opening on my team at @NVIDIAAI for this summer. If you are interested and have experience with large feedforward reconstruction models or post-training image/video diffusion models, please get in touch!
Explore a variety of perception models and systems from #NVIDIAResearch that support a unified 3D perception stack for #robotics. These tools enable robots to understand and interact with unfamiliar environments in real-time. 🤖 Learn more 👉 nvda.ws/4jXKSPE
Join us today to learn how to push the boundaries of stereo depth estimation!!
Come and say 👋 tomorrow (06/13) for our oral (1pm, Karl Dean Ballroom) and poster sessions (4pm, ExHall D, #81)! #CVPR2025 @CVPR @CVPRConf @NVIDIAAIDev @NVIDIARobotics #NVIDIA
Had a great PIRA workshop at CVPR, thanks to Vincent Vanhoucke, Kartik Iyer, @bowenwen_me, Kartik Venkataraman, and @tomhodan
I use two factors to analyze robot autonomy: environment diversity and task diversity. If a robot just replays data from a single task and environment, of course it’ll succeed. Real autonomy lies in pushing toward the top-right corner of this figure—generalizing both.
Rodney Brooks, robotics legend and iRobot co-founder, just spoke at Stanford about our current AI hype cycle. His blunt take: We're repeating the same mistakes. On hype cycles: We're like "Five-year-olds playing soccer—they all run to the ball. Nothing else is important." This…
Don’t miss this exciting workshop happening tomorrow at #GTCParis.
🤖 Build smarter robots at #GTCParis. Join our new full-day workshop on Tuesday, June 10, to master simulation-first #robotics: 🔧 Structure modular assets 🔌 Test with #ROS 2 📊 Train with synthetic data ⚡ Accelerate AI with NVIDIA GPUs ➡️ nvda.ws/3HiMkhT
Incredible learned behavior (assuming no human intervention) at 48:05, where it failed a couple of times but then suddenly figured out how to make it right. Amazing progress!
Uncut hour-long footage of Figure 02 autonomously transferring and flattening packages for a scanner down the line. The robot is using Figure’s Helix model, a generalist VLA that now incorporates upgrades in temporal memory and force feedback.
Kudos to the Aria team and exciting support of FoundationStereo (nvlabs.github.io/FoundationSter…)! High-quality 3D human demonstration data collection for robot learning will be a breeze 😌 @NVIDIAAIDev @AIatMeta #xr #technology
Here’s a great breakdown of Aria Gen 2 and why these research glasses are so impressive! 📌 Few highlights: 🕶️ Design & Comfort - Lighter (74–76g) with folding arms and 8 size options for better fit and comfort 📸 Vision Upgrades - 4 HDR cameras (up from 2), now with 120 dB…
Thrilled to be nominated as a best paper award candidate!! Looking forward to more chats at CVPR. #CVPR2025
Come and join us! Also make sure you have signed up for our social event (events.nvidia.com/nvcvprresearch…) and get a free GPU 😍 #CVPR @CVPR
🔎 Explore NVIDIA’s technical workshops at #CVPR2025—dive into volumetric video, 3D point cloud deep learning, and hands-on Kaolin demos. 📝 Plus, discover 60+ papers advancing generative AI, AV, and robotics. Join us in Nashville to push computer vision forward ➡️…
Super cool project! Glad to see FoundationPose (github.com/NVlabs/Foundat…) enables learning from low-cost hand-object demonstrations.
🧑🤖 Introducing Human2Sim2Robot! 💪🦾 Learn robust dexterous manipulation policies from just one human RGB-D video. Our Real→Sim→Real framework crosses the human-robot embodiment gap using RL in simulation. #Robotics #DexterousManipulation #Sim2Real 🧵1/7
Synthetic data generation tools like MimicGen create large sim datasets with ease, but using them in the real world is difficult due to the large sim-to-real gap. Our new work uses simple co-training to unlock the potential of synthetic sim data for real-world manipulation!
How to use simulation data for real-world robot manipulation? We present sim-and-real co-training, a simple recipe for manipulation. We demonstrate that sim data can significantly enhance real-world performance, even with notable differences between the simulated and real setups. (1/n)
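As a rough illustration of what sim-and-real co-training can look like in practice (my own minimal sketch, not the paper's exact recipe), each gradient step mixes simulated and real demonstration batches into one behavior-cloning loss, with a mixing weight controlling how much the synthetic data influences the update. The policy, dimensions, and weighting below are all placeholder choices.

```python
import torch
import torch.nn as nn

def cotrain_step(policy, sim_batch, real_batch, optimizer, sim_weight=0.5):
    """One co-training step: mix sim and real demos in a single behavior-cloning loss."""
    obs = torch.cat([sim_batch["obs"], real_batch["obs"]], dim=0)
    act = torch.cat([sim_batch["act"], real_batch["act"]], dim=0)
    pred = policy(obs)
    # Weight sim vs. real samples so neither domain dominates the gradient.
    weights = torch.cat([
        torch.full((len(sim_batch["obs"]),), sim_weight),
        torch.full((len(real_batch["obs"]),), 1.0 - sim_weight),
    ]).to(obs.device)
    loss = (weights * ((pred - act) ** 2).mean(dim=-1)).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

# Toy usage with random stand-in batches (obs_dim=32, act_dim=7).
policy = nn.Linear(32, 7)
opt = torch.optim.Adam(policy.parameters(), lr=1e-4)
sim_batch = {"obs": torch.randn(64, 32), "act": torch.randn(64, 7)}
real_batch = {"obs": torch.randn(16, 32), "act": torch.randn(16, 7)}
print(cotrain_step(policy, sim_batch, real_batch, opt))
```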
Curious whether it can stay consistent from frame to frame on video. Also about processing speed, and how it handles roundish things like people.
I was preparing a video to introduce our lab @IRVLUTD for a meeting. Happy to share the video here! We are looking forward to collaborating with both academia and industry. Please feel free to reach out.