Ian Huang (@IanHuang3D)

Pinned

I

Ian Huang@IanHuang3D · Mar 26

🏡Building realistic 3D scenes just got smarter! Introducing our #CVPR2025 work, 🔥FirePlace, a framework that enables Multimodal LLMs to automatically generate realistic and geometrically valid placements for objects into complex 3D scenes. How does it work?🧵👇

23

97

378

172

113.0K

Ian Huang Retweeted

J

Jason Wei@_jasonwei · Jul 16

New blog post about asymmetry of verification and "verifier's law": jasonwei.net/blog/asymmetry… Asymmetry of verification–the idea that some tasks are much easier to verify than to solve–is becoming an important idea as we have RL that finally works generally. Great examples of…

50

242

1.0K

330.0K

I

Ian Huang@IanHuang3D · Jun 14

We are presenting 17:00-19:00 today at Poster 267 in ExHall D for #CVPR25! Come and check out the first #VLM #3D #Graphics Benchmark! 📣📣📣

YYunqi (Richard) Gu@richard_yunqigu · Apr 12

Which multimodal LLM should you be using to edit graphics in Blender? Today, we’re releasing our #CVPR2025 Highlight🌟 work, #BlenderGym 🏋️‍♀️, the first agentic 3D graphics editing benchmark that will tell you exactly how multimodal LLMs compare in their Blender-editing skills.…

0

2

1

424

I

Ian Huang@IanHuang3D · Jun 14

📣Happening at Poster #269 TODAY between 10:30am - 12:30am at #CVPR2025 ! Come learn about multimodal #VLM reasoning for #3D scene generation!

IIan Huang@IanHuang3D · Mar 26

🏡Building realistic 3D scenes just got smarter! Introducing our #CVPR2025 work, 🔥FirePlace, a framework that enables Multimodal LLMs to automatically generate realistic and geometrically valid placements for objects into complex 3D scenes. How does it work?🧵👇

0

8

1

384

I

Ian Huang@IanHuang3D · Apr 12

If you're wondering which multimodal LLMs you should be using to build 3D graphics agents 🧑‍💻 , check out our #CVPR2025 Highlight work, BlenderGym -- not only does BlenderGym benchmark the top open and closed models, it also reveals a trick about *how* you should be allocating…

YYunqi (Richard) Gu@richard_yunqigu · Apr 12

Which multimodal LLM should you be using to edit graphics in Blender? Today, we’re releasing our #CVPR2025 Highlight🌟 work, #BlenderGym 🏋️‍♀️, the first agentic 3D graphics editing benchmark that will tell you exactly how multimodal LLMs compare in their Blender-editing skills.…

0

1

28

7

2.0K

Ian Huang Retweeted

H

Hansheng Chen@HanshengCh · Apr 8

Excited to share our work: Gaussian Mixture Flow Matching Models (GMFlow) github.com/lakonik/gmflow GMFlow generalizes diffusion models by predicting Gaussian mixture denoising distributions, enabling precise few-step sampling and high-quality generation.

1

32

124

54

11.0K

I

Ian Huang@IanHuang3D · Apr 5

📣 Happy to share that FirePlace got a #CVPR2025 Highlight ! See you all in Nashville! 🎷

IIan Huang@IanHuang3D · Mar 26

🏡Building realistic 3D scenes just got smarter! Introducing our #CVPR2025 work, 🔥FirePlace, a framework that enables Multimodal LLMs to automatically generate realistic and geometrically valid placements for objects into complex 3D scenes. How does it work?🧵👇

0

28

3

1.0K

I

Ian Huang@IanHuang3D · Apr 5

Impressive demo and work! Amazing stuff going on with MLLMs.

IIan Huang@IanHuang3D · Mar 26

🏡Building realistic 3D scenes just got smarter! Introducing our #CVPR2025 work, 🔥FirePlace, a framework that enables Multimodal LLMs to automatically generate realistic and geometrically valid placements for objects into complex 3D scenes. How does it work?🧵👇

0

1

0

345

I

Ian Huang@IanHuang3D · Mar 29

Thanks for sharing our work! And yes — sound ON is a good idea.

nnaveen manwani@NaveenManwani17 · Mar 29

🚨CVPR 2025 Paper Alert 🚨 ➡️Paper Title: FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement 🌟Few pointers from the paper 🎯Scene generation with 3D assets presents a complex challenge, requiring both high-level semantic understanding and…

1

0

5

0

610

I

Ian Huang@IanHuang3D · Mar 28

Uh. What. 🤯

0

5

0

356

I

Ian Huang@IanHuang3D · Mar 27

y'all wanted Ghibli, so here it is. FirePlace 3D scene 🏠 -> render 📸-> ChatGPT4o + prompting 🎨 fireplace3d.github.io

IIan Huang@IanHuang3D · Mar 26

🏡Building realistic 3D scenes just got smarter! Introducing our #CVPR2025 work, 🔥FirePlace, a framework that enables Multimodal LLMs to automatically generate realistic and geometrically valid placements for objects into complex 3D scenes. How does it work?🧵👇

0

7

2

560

I

Ian Huang@IanHuang3D · Mar 27

Thanks for sharing our work! If you’re interested in a bitesized breakdown of how it works 👉🏻x.com/IanHuang3D/sta…

AAK@_akhaliq · Mar 26

FirePlace Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement

0

6

0

513

I

Ian Huang@IanHuang3D · Mar 26

neat looking pipeline for mixing vision LLM with 3d objects simply asks the MLLM stuff like object size, rotation, surface alignment constraints, etc and then applies them to 3d scene overlayed on the image & anchor object

AAK@_akhaliq · Mar 26

FirePlace Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement

2

8

7

6.0K

I

Ian Huang@IanHuang3D · Mar 27

Okay, we have to be mere days away from this personal household item inventory app idea I keep talking about.

AAK@_akhaliq · Mar 26

FirePlace Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement

0

1

0

273

I

Ian Huang@IanHuang3D · Mar 27

People should combine this, 4o image gen and hunyuan 3d in an AR glasses to create “physical” objects out of thin air

AAK@_akhaliq · Mar 26

FirePlace Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement

0

1

3

1

1.0K

Ian Huang Retweeted

A

AK@_akhaliq · Mar 26

FirePlace Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement

12

74

433

319

49.0K