Congyue Deng
@CongyueD
CS PhD student @Stanford | Previous: math undergrad @Tsinghua_Uni | ❤️ 3D vision, geometry, and art
🤖 What if a humanoid robot could make a hamburger from raw ingredients—all the way to your plate? 🔥 Excited to announce ViTacFormer: our new pipeline for next-level dexterous manipulation with active vision + high-resolution touch. 🎯 For the first time ever, we demonstrate…
Compression is the heart of intelligence From Occam to Kolmogorov—shorter programs=smarter representations Meet KARL: Kolmogorov-Approximating Representation Learning. Given an image, token budget T & target quality 𝜖 —KARL finds the smallest t≤T to reconstruct it within 𝜖🧵
The deadline has been extended to July 18. Looking forward to your contributions to the field! 🤩 #ICCV2025 @ICCV
Can scaling data and models alone solve computer vision? 🤔 Join us at the SP4V Workshop at #ICCV2025 in Hawaii to explore this question! 🎤 Speakers: @danfei_xu, @joaocarreira, @jiajunwu_cs, Kristen Grauman, @sainingxie, @vincesitzmann 🔗 sp4v.github.io
Our paper on learning controllable 3D robot models from vision is published in Nature! Huge congrats to Lester and the team, @annan__zhang, @BoyuanChen0, Hanna Matusik, Chao Liu, and Daniela Rus!! Learning joint world models for the environment & the agent is super exciting :)
Now in Nature! 🚀 Our method learns a controllable 3D model of any robot from vision, enabling single-camera closed-loop control at test time! This includes robots previously uncontrollable, soft, and bio-inspired, potentially lowering the barrier of entry to automation! Paper:…
What would a World Model look like if we start from a real embodied agent acting in the real world? It has to have: 1) A real, physically grounded and complex action space—not just abstract control signals. 2) Diverse, real-life scenarios and activities. Or in short: It has to…
Hope you enjoyed our workshop 😛 We have now released the slides of the presentations. Thanks to the amazing speakers! Check it if you didn't attend!
🔍 3D is not just pixels—we care about geometry, physics, topology, and functions. But how to balance these inductive biases with scalable learning? 👀 Join us at Ind3D workshop @CVPR (June 12, afternoon) for discussions on the future of 3D models! 🌐 ind3dworkshop.github.io/cvpr2025
Join our #RSS2025 MoMa Workshop tomorrow to hear about latest advancements and challenges in mobile manipulation. 📰 Learn more: rss-moma-2025.github.io 🛜 Also available on Zoom!
We welcome submissions on all kinds of structural priors — including scene-centric models, 3D geometry, temporal and motion cues, egocentric behavior, physics-based reasoning, and more! 🗓️ Submission Deadline: July 3, 2025 👉 sp4v.github.io/call_for_paper…
Can scaling data and models alone solve computer vision? 🤔 Join us at the SP4V Workshop at #ICCV2025 in Hawaii to explore this question! 🎤 Speakers: @danfei_xu, @joaocarreira, @jiajunwu_cs, Kristen Grauman, @sainingxie, @vincesitzmann 🔗 sp4v.github.io
Interested in 3D Vision, 3D Scene Understanding and 3D Generation? We have open PhD/PostDoc positions in Lugano, Switzerland - I'm at #CVPR2025, please reach out 🚀 or send me an email @CVPR
Great opening talk by Bill Freeman at the #CVPR2025 Workshop on Visual Generative Modeling: What’s After Diffusion?
This was a really fun and exciting workshop #CVPR2025! Huge thanks to all the speakers, organizers and reviewers @CVPR! We hope to be able to release the video recordings soon!
Join us for the 4D Vision Workshop @CVPR on June 11 starting at 9:20am! We'll have an incredible lineup of speakers discussing the frontier of 3D computer vision techniques for dynamic world modeling across spatial AI, robotics, astrophysics, and more. 4dvisionworkshop.github.io
And finally, talk by the great David Forsyth!
🔍 3D is not just pixels—we care about geometry, physics, topology, and functions. But how to balance these inductive biases with scalable learning? 👀 Join us at Ind3D workshop @CVPR (June 12, afternoon) for discussions on the future of 3D models! 🌐 ind3dworkshop.github.io/cvpr2025
Daniel Cremers talking about classical and learning-based approaches for 4D!
🔍 3D is not just pixels—we care about geometry, physics, topology, and functions. But how to balance these inductive biases with scalable learning? 👀 Join us at Ind3D workshop @CVPR (June 12, afternoon) for discussions on the future of 3D models! 🌐 ind3dworkshop.github.io/cvpr2025
Maks Ovsjanikov talking about physically plausible generative modeling!
🔍 3D is not just pixels—we care about geometry, physics, topology, and functions. But how to balance these inductive biases with scalable learning? 👀 Join us at Ind3D workshop @CVPR (June 12, afternoon) for discussions on the future of 3D models! 🌐 ind3dworkshop.github.io/cvpr2025
Talk by @KostasPenn happening now!
🔍 3D is not just pixels—we care about geometry, physics, topology, and functions. But how to balance these inductive biases with scalable learning? 👀 Join us at Ind3D workshop @CVPR (June 12, afternoon) for discussions on the future of 3D models! 🌐 ind3dworkshop.github.io/cvpr2025
Happening now! Join us at Davidson C3!
🔍 3D is not just pixels—we care about geometry, physics, topology, and functions. But how to balance these inductive biases with scalable learning? 👀 Join us at Ind3D workshop @CVPR (June 12, afternoon) for discussions on the future of 3D models! 🌐 ind3dworkshop.github.io/cvpr2025
At #CVPR2025 today (6/12) and looking forward to give two workshop talks on Mobile Manipulation! 9:30-10am at the MEIS Workshop: Towards Collaborative Mobile Manipulation 2-2:30pm at the OpenSUN3D Workshop: The Challenges and Opportunities of Mobile Manipulation