Wenhao Yu
@Stacormed
Research Scientist @DeepMind
No more head bumping from picking up stuff from under the bed! Check out our recent work that tightly integrates legs and manipulators, opening up so many applications and interesting loco-manipulation research questions!
LocoMan = Quadrupedal Robot + 2 × Loco-Manipulators. Powered by dual lightweight 3-DoF loco-manipulators and a whole-body controller, LocoMan achieves various challenging tasks, such as manipulation in narrow spaces and bimanual manipulation. linchangyi1.github.io/LocoMan 👇👇👇
How do we imbue robots with the ability to imagine the world and complete tasks better? Join us at the CoRL 2025 workshop on Robotics World Modeling and share your latest work in this area!
🤖🌎 We are organizing a workshop on Robotics World Modeling at @corl_conf 2025! We have an excellent group of speakers and panelists, and are inviting you to submit your papers with a July 13 deadline. Website: robot-world-modeling.github.io
Deadline extended! You now have until May 25th (10 days after the NeurIPS deadline) to submit to our ICML World Model Workshop. Looking forward to your papers!
How can we connect world models to the physical world? Come join our 2025 workshop at ICML on Building Physically Plausible World Models! physical-world-modeling.github.io (1/2)
Complementary to Gemini Robotics -- the massive vision-language-action (VLA) model released yesterday -- we also investigated how far we can push Gemini for robotics _purely from simulation data_ in Proc4Gem: 🧵
We’ve always thought of robotics as a helpful testing ground for translating AI advances into the physical world. Today we’re taking our next step in this journey with our newest Gemini 2.0 robotics models. They show state-of-the-art performance on two important benchmarks -…
Super excited to share what we’ve been working on!
Meet Gemini Robotics: our latest AI models designed for a new generation of helpful robots. 🤖 Based on Gemini 2.0, they bring capabilities such as better reasoning, interactivity, dexterity and generalization into the physical world. 🧵 goo.gle/gemini2-roboti…
2nd Earth Rover Challenge is coming! Eager to see how much progress AI will make in navigating real cities against real human agents!
Announcing the 2nd Earth Rover Challenge: an "AI vs Gamers" global navigation competition (to be held at #ICRA2025 in May in Atlanta) Co-organized with researchers from DeepMind, Meta & academia A thread 🧵 - 1/n
💪💪
Welcome to the world, Gemini 2.0 ✨ our most capable AI model yet. We're first releasing an experimental version of 2.0 Flash ⚡ It has better performance, new multimodal output, @Google tool use - and paves the way for new agentic experiences. 🧵 goo.gle/gemini-2
Wow, this is really good! In some ways I’m more impressed that it’s teleoperated than if it were autonomous, cuz it feels very plausible to develop a highly specialized RL-based policy to do this, but being able to teleop this opens up a wide range of data to be collected.
Got a new hand for Black Friday
How can we leverage the common-sense knowledge of a VLM to understand the progress (and even quality!) of a robotics trajectory? Check out GVL for a surprisingly simple and elegant way to do that! Awesome work by Jason!
Excited to finally share Generative Value Learning (GVL), my @GoogleDeepMind project on extracting universal value functions from long-context VLMs via in-context learning! We discovered a simple method to generate zero-shot and few-shot values for 300+ robot tasks and 50+…
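To make the in-context value idea above concrete, here is a minimal sketch (not the GVL or Gemini implementation; `query_vlm` is a hypothetical stand-in for any multimodal model API): shuffle the trajectory frames, ask the model for a per-frame task-completion percentage, then restore the original order.

```python
# Minimal sketch of VLM-based value estimation via in-context prompting.
# Hypothetical: `query_vlm` stands in for a multimodal LLM call and is
# NOT the GVL codebase or an official Gemini client.
import random
from typing import Callable, List

def estimate_progress(frames: List[bytes], task: str,
                      query_vlm: Callable[..., str]) -> List[float]:
    """Ask a VLM for per-frame task-completion values in [0, 1].

    Shuffling the frames forces the model to judge progress from image
    content rather than from temporal position in the prompt.
    """
    order = list(range(len(frames)))
    random.shuffle(order)

    prompt = (
        f"Task: {task}\n"
        "For each image below, reply with the estimated task completion "
        "percentage (0-100), one number per line, in the order given."
    )
    reply = query_vlm(prompt, images=[frames[i] for i in order])
    shuffled_values = [float(line) / 100.0 for line in reply.strip().splitlines()]

    # Undo the shuffle so values align with the original trajectory order.
    values = [0.0] * len(frames)
    for value, idx in zip(shuffled_values, order):
        values[idx] = value
    return values
```

A nice property of framing it this way is that adding a few annotated example frames to the prompt turns the same call into few-shot value estimation, with no training involved.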
We just open sourced the hardware and software of LocoMan: github.com/linchangyi1/Lo…. Try it out yourself!
In Vienna for ICML this week! Let me know if you are down to catch up; looking forward to the great discussions and talks ahead! Also come check out our work on PIVOT (pivot-prompt.github.io) during Tuesday’s poster session!

How can Gemini 1.5 Pro’s long context window help robots navigate the world? 🤖 A thread of our latest experiments. 🧵
For the past year we've been working on ALOHA Unleashed 🌋 @GoogleDeepmind - pushing the scale and dexterity of tasks on our ALOHA 2 fleet. Here is a thread with some of the coolest videos! The first task is hanging a shirt on a hanger (autonomous 1x)
Soccer players have to master a range of dynamic skills, from turning and kicking to chasing a ball. How could robots do the same? ⚽ We trained our AI agents to demonstrate a range of agile behaviors using reinforcement learning. Here’s how. 🧵 dpmd.ai/3vUlgjC
Can we use wearable devices to collect robot data without actual robots? Yes! With a pair of gloves🧤! Introducing DexCap, a portable hand motion capture system that collects 3D data (point cloud + finger motion) for training robots with dexterous hands Everything open-sourced
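For intuition about the kind of data a glove-based setup like this yields, here is a hypothetical sketch of a per-timestep record and how it might be flattened into an imitation-learning sample. Field names and shapes are assumptions for illustration, not DexCap’s actual schema.

```python
# Hypothetical per-timestep record from a glove-based hand mocap rig
# (field names/shapes are illustrative assumptions, not DexCap's format).
from dataclasses import dataclass
import numpy as np

@dataclass
class HandMocapFrame:
    timestamp: float                 # seconds since recording start
    point_cloud: np.ndarray          # (N, 3) scene points from a depth camera
    finger_joint_angles: np.ndarray  # (n_joints,) glove-reported joint angles, radians
    wrist_pose: np.ndarray           # (4, 4) SE(3) wrist pose in the camera frame

def to_training_sample(frame: HandMocapFrame) -> dict:
    """Turn one mocap frame into an observation/action pair for training
    a dexterous-hand policy by imitation."""
    return {
        "obs": {"points": frame.point_cloud, "wrist": frame.wrist_pose},
        "action": frame.finger_joint_angles,
    }
```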