Weijie Wang

@wjwang2003

Undergraduate student at @ZJU_China, incoming PhD student at ZIP Lab, Zhejiang University

Zhejiang, China

Joined February 2024

80 Following

44 Followers

Pinned

Weijie Wang@wjwang2003 · May 30

🚀 We're excited to introduce ZPressor, a bottleneck-aware compression module for scalable feed-forward 3DGS. Existing feed-forward 3DGS models struggle with dense views, facing performance drops & massive redundancy. ZPressor leverages Information Bottleneck Theory to compress…

8.0K

Pinned

Weijie Wang@wjwang2003 · Jun 6

With thanks to @janusch_patas for the recommendation. See our homepage for further details and results: aim-uofa.github.io/PMLoss

MrNeRF@janusch_patas · Jun 6

Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting • We pinpoint an unexposed yet critical issue that leads to lower-quality 3D Gaussians predicted by feed-forward 3DGS models, rooted in the long-standing discontinuity issue of depth. • We introduce a…

458

Weijie Wang Retweeted

MrNeRF@janusch_patas · Jun 9

4DGT: Learning a 4D Gaussian Transformer Using Real-World Monocular Videos Abstract: We propose 4DGT, a 4D Gaussian-based Transformer model for dynamic scene reconstruction, trained entirely on real-world monocular posed videos. Using 4D Gaussian as an inductive bias, 4DGT…

386

258

24.0K

Weijie Wang Retweeted

Zhenjun Zhao@zhenjun_zhao · Jun 8

Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting @Duochao_Shi, @wjwang2003, @donydchen, Zeyu Zhang, Jia-Wang Bian, @supremeZhuang, @chunhua_shen tl;dr: pre-trained 3D reconstruction models->pointmaps->geometry prior->loss arxiv.org/abs/2506.05327

7.0K

Weijie Wang Retweeted

Renaud Vandeghen@rvandeghen · May 27

🚀 𝐓𝐫𝐢𝐚𝐧𝐠𝐥𝐞 𝐒𝐩𝐥𝐚𝐭𝐭𝐢𝐧𝐠 𝐟𝐨𝐫 𝐑𝐞𝐚𝐥-𝐓𝐢𝐦𝐞 𝐑𝐚𝐝𝐢𝐚𝐧𝐜𝐞 𝐅𝐢𝐞𝐥𝐝 𝐑𝐞𝐧𝐝𝐞𝐫𝐢𝐧𝐠 is out! We bring triangles back to the spotlight for photorealistic, real-time novel view synthesis. arxiv.org/abs/2505.19175 🧵👇

125

27.0K

Weijie Wang Retweeted

Francis Engelmann@FrancisEngelman · May 31

What makes a good 3D scene representation? Instead of meshes or Gaussians, we propose Superquadrics to decompose 3D scenes into extremely compact representations ➡️ check out our paper for exciting use-cases in robotics🤖 and GenAI🚀 super-dec.github.io w/ @efedele16 @mapo1

364

192

24.0K

Weijie Wang@wjwang2003 · May 30

Thanks to @zhenjun_zhao for the recommendation. For further information and video results, please visit our project page at lhmd.top/zpressor

Zhenjun Zhao@zhenjun_zhao · May 30

ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS @wjwang2003, @donydchen, @SteveZeyuZhang, Duochao Shi, Akide Liu, @supremeZhuang tl;dr: views->anchor & support sets; support view info->anchor views->compressed latent state Z arxiv.org/abs/2505.23734

Weijie Wang Retweeted

Wenbo Hu@gordonhu608 · May 29

🤔How to maintain a long-term memory for a 3D embodied AI agent across dynamic spatial-temporal environment changes in complex tasks? 🚀Introducing 3DLLM-Mem, a memory-enhanced 3D embodied agent that incrementally builds and maintains a task-relevant long-term memory while it…

32.0K

Weijie Wang@wjwang2003 · May 26

Check out our recent work on RL for computer-use agents! 💻🎮

Pengxiang Li@oliverlee1999 · May 26

🔥Introducing SPORT, a multimodal agent that explores tool usage without human annotation. It leverages step-wise DPO to further enhance tool-use capabilities following SFT. SPORT achieves improvements on the GTA and GAIA benchmarks. sport-agents.github.io

2.0K

Weijie Wang Retweeted

Songlin Yang@SonglinYang4 · May 24

📢 (1/16) Introducing PaTH 🛣️ — a RoPE-free contextualized position encoding scheme, built for stronger state tracking, better extrapolation, and hardware-efficient training. PaTH outperforms RoPE across short and long language modeling benchmarks arxiv.org/abs/2505.16381

530

329

71.0K