Jianwei Yang
@jw2yang4ai
Research Scientist at @Meta SuperIntelligence Lab; ex-MSR; core contributor to Project Florence, Phi-3V, and OmniParser; inventor of FocalNet, SEEM, SoM, and Magma.
Life Update: Now that I have finished presenting my last @MSFTResearch project, Magma, at @CVPR, I am excited to share that I have joined @AIatMeta as a research scientist to keep pushing the boundaries of multimodal foundation models! I have always been passionate…

We only need ONE example for RLVR on LLMs to achieve significant improvements on math tasks! 📍RLVR with one training example can boost MATH500 accuracy:
- Qwen2.5-Math-1.5B: 36.0% → 73.6%
- Qwen2.5-Math-7B: 51.0% → 79.2%
📄 Paper: arxiv.org/abs/2504.20571…
I am excited to announce that I am not at #ICLR presenting Matryoshka Multimodal Models matryoshka-mm.github.io. 😀 Rather, I am attending online from the Bay Area. Ping me if you have any questions or ideas w.r.t. the paper! Feel free to check out the poster at Hall 3 + Hall 2B #86 this morning!
VLMs struggle badly to interpret 3D from 2D observations, but what if they had a good mental model of the world? Check out our MindJourney, a test-time scaling approach for spatial reasoning in the 3D world. Without any task-specific training, MindJourney imagines (acts mentally) step-by-step…
Test-time scaling nailed code & math—next stop: the real 3D world. 🌍 MindJourney pairs any VLM with a video-diffusion World Model, letting it explore an imagined scene before answering. One frame becomes a tour—and the tour leads to new SOTA in spatial reasoning. 🚀 🧵1/
VLMs often struggle with physical reasoning tasks such as spatial reasoning. Excited to share how we can use world models + test-time search to improve spatial reasoning in VLMs zero-shot!
MindJourney: Test-Time Scaling with World Models for Spatial Reasoning
Wow, this is so cool! I have been dreaming of building agents that can interact with humans via language and with the world via physical interaction (locomotion, manipulation, etc.). Definitely a great stepping stone and playground!
World Simulator, reimagined — now alive with humans, robots, and their vibrant society unfolding in 3D real-world geospatial scenes across the globe! 🚀 One day soon, humans and robots will co-exist in the same world. To prepare, we must address: 1️⃣ How can robots cooperate or…
Check out our poster at 240 in Exhibition Hall D at 10:30 today!
(1/10) 🔥Thrilled to introduce OneDiffusion—our latest work in unified diffusion modeling! 🚀 This model bridges the gap between image synthesis and understanding, excelling in a wide range of tasks: T2I, conditional generation, image understanding, identity preservation,…
Our afternoon session is starting very soon with Prof. @RanjayKrishna in Room 101B!
🔥@CVPR2025 CVinW 2025 is about to take place very soon!! We have plenty of great talks and spotlight talks coming up (@BoqingGo, @RanjayKrishna, @furongh, @YunzhuLiYZ, @sainingxie, @CordeliaSchmid, Shizhe Chen). Looking forward to seeing you all at 101B from 9am-5pm, June 11th!…
🚀 Excited to announce our 4th Workshop on Computer Vision in the Wild (CVinW) at @CVPR 2025! 🔗 computer-vision-in-the-wild.github.io/cvpr-2025/ ⭐We have invited a great lineup of speakers: Prof. Kaiming He, Prof. @BoqingGo, Prof. @CordeliaSchmid, Prof. @RanjayKrishna, Prof. @sainingxie, Prof.…
Excited to speak at the Workshop on Computer Vision in the Wild @CVPR 2025! 🎥🌍 🗓️ June 11 | 📍 Room 101 B, Music City Center, Nashville, TN 🎸 🧠 Talk: From Perception to Action: Building World Models for Generalist Agents Let’s connect if you're around! #CVPR2025 #robotics…
Our community-led Computer Vision group is thrilled to host @jw2yang4ai, Principal Researcher at Microsoft Research for a session on "Magma: A Foundation Model for Multimodal AI Agents" Thanks to @cataluna84 and @Arkhymadhe for organizing this speaker session 👏
Hope your #NeurIPS2025 submissions went well and you are getting some good rest! We are still open to submissions to our CVinW workshop at @CVPR! You are welcome to share your work at our workshop with a few clicks! 👉Submit Portal: openreview.net/group?id=thecv…
The latest episode of the Derby Mill Podcast is just out and focused on the "Era of Experience" paper by David Silver and myself. Substack: insights.intrepidgp.com/p/welcome-to-t… Spotify: open.spotify.com/episode/254sxl… Apple: podcasts.apple.com/us/podcast/wel… YouTube: youtube.com/watch?v=dhfJfQ…
Introducing Phi-4-reasoning, adding reasoning models to the Phi family of SLMs. The model is trained with both supervised finetuning (using a carefully curated dataset of reasoning demonstrations) and reinforcement learning. 📌Competitive results on reasoning benchmarks with…