Fu-En (Fred) Yang
@FuEnYang1
Research Scientist @NVIDIAAI | Ph.D. @NTU_TW | Prev. Research Intern @NVIDIAAI | Vision & Language | Multimodal AI
🤖 How can we teach embodied agents to think before they act? 🚀 Introducing ThinkAct — a hierarchical Reasoning VLA framework with an MLLM for complex, slow reasoning and an action expert for fast, grounded execution. Slow think, fast act. 🧠⚡🤲

New paper introduces ThinkAct - a framework that teaches robots to reason before acting. It's like giving robots a moment to think through their next moves, just like we do. 🧵
ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning
Author's Explanation: x.com/FuEnYang1/stat…
Overview: ThinkAct introduces a dual-system framework that generates embodied reasoning plans with a multimodal LLM guided by reinforced action-aligned…
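The dual-system idea described above (a slow MLLM planner handing a latent plan to a fast action expert) can be pictured as a simple two-rate control loop. Below is only a hedged sketch of that structure; ReasoningMLLM, ActionExpert, DummyEnv, and replan_every are made-up names for illustration, not the paper's actual interfaces.

```python
# Minimal sketch of a slow-think / fast-act dual-system loop, in the spirit of
# the ThinkAct description above. All names here (ReasoningMLLM, ActionExpert,
# DummyEnv, replan_every) are hypothetical stand-ins, not the paper's API.

class ReasoningMLLM:
    """Slow system: a multimodal LLM that turns the observation and the
    language instruction into a latent plan (run infrequently)."""
    def plan(self, observation, instruction):
        # In the real system this would be reinforced, action-aligned latent
        # planning; here we just return a fixed-size placeholder vector.
        return [0.0] * 128

class ActionExpert:
    """Fast system: a lightweight policy that maps the current observation
    plus the latent plan to a grounded low-level action (run every step)."""
    def act(self, observation, latent_plan):
        return {"delta_pose": [0.0] * 6, "gripper": 0.0}

class DummyEnv:
    """Stand-in environment so the sketch runs end to end."""
    def reset(self):
        return {"image": None, "proprio": [0.0] * 7}

    def step(self, action):
        obs = {"image": None, "proprio": [0.0] * 7}
        done = False
        return obs, done

def control_loop(env, mllm, expert, instruction, max_steps=100, replan_every=10):
    obs = env.reset()
    latent_plan = mllm.plan(obs, instruction)          # slow think (once up front)
    for step in range(max_steps):
        if step > 0 and step % replan_every == 0:
            latent_plan = mllm.plan(obs, instruction)  # occasional re-planning
        action = expert.act(obs, latent_plan)          # fast act, every step
        obs, done = env.step(action)
        if done:
            break

control_loop(DummyEnv(), ReasoningMLLM(), ActionExpert(), "pick up the red block")
```

The point of the split is that the expensive reasoning model only runs every few control steps, while the small action expert keeps up with the robot's control rate in between.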
🚨This week's top AI/ML research papers: - GSPO - Diffusion Beats Autoregressive in Data-Constrained Settings - Gemini 2.5 Pro Capable of Winning Gold at IMO 2025 - Rubrics as Rewards - Deep Researcher with Test-Time Diffusion - Learning without training - Stabilizing Knowledge,…
Very excited to announce Llama-Nemotron-Super-V1.5! Super-V1.5 is now better than Ultra-V1. This is currently the best model that can be deployed on a single H100. Reasoning On/Off and a drop-in replacement for V1. Open-weight, with code and data on HF huggingface.co/nvidia/Llama-3…
Thanks @_akhaliq for sharing our latest VLA Reasoning work! Please see more details here: x.com/FuEnYang1/stat… @NVIDIAAIDev @NVIDIAAI @nvidia #NVIDIA #NVIDIAResearch #VLA #reasoning #RL
Nvidia presents ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning
Official results are in - Gemini achieved gold-medal level in the International Mathematical Olympiad! 🏆 An advanced version was able to solve 5 out of 6 problems. Incredible progress - huge congrats to @lmthang and the team! deepmind.google/discover/blog/…
And today we have just open-sourced the Eagle 2.5 model huggingface.co/nvidia/Eagle2.… You are welcome to download it and give it a try! We will also open-source the fine-tuning code for Eagle 2/2.5 soon at github.com/NVlabs/Eagle. Stay tuned.
I did not notice this until just now. Thank you @andimarafioti for the recommendation! Very glad that even though Eagle 2 is not our latest work, people still find it very useful.
ChatGPT can now do work for you using its own computer. Introducing ChatGPT agent—a unified agentic system combining Operator’s action-taking remote browser, deep research’s web synthesis, and ChatGPT’s conversational strengths.
GenRecal: Generation after Recalibration from Large to Small Vision-Language Models
Today we're excited to share a glimpse of what we're building at Generalist. As a first step towards our mission of making general-purpose robots a reality, we're pushing the frontiers of what end-to-end AI models can achieve in the real world. Here's a preview of our early…