Longtao Zheng
@ltzheng01
PhD student @NTUsg. Building open-ended agents in open-ended worlds.
🛠️🤖 Introducing SimpleTIR: An end-to-end solution for stable multi-turn tool use RL 📈 Multi-turn RL training suffers from catastrophic instability, but we find a simple fix ✨ The secret? Strategic trajectory filtering keeps training rock-solid! 🎯 Stable gains straight from…
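A rough sketch of the filtering idea as I read it (illustrative only; the `Turn`/`Trajectory` containers and the void-turn criterion below are my assumptions, not SimpleTIR's actual API): drop any trajectory containing a turn that produced neither a tool call nor a final answer, before the batch ever reaches the policy-gradient loss.

```python
from dataclasses import dataclass

@dataclass
class Turn:
    text: str             # model output for this turn
    has_tool_call: bool   # did the turn emit a well-formed tool call?
    has_answer: bool      # did the turn emit a final answer?

@dataclass
class Trajectory:
    turns: list[Turn]
    reward: float

def is_void_turn(turn: Turn) -> bool:
    # A "void" turn produces neither a tool call nor a final answer;
    # in multi-turn rollouts such turns tend to snowball and destabilize training.
    return not (turn.has_tool_call or turn.has_answer)

def filter_trajectories(batch: list[Trajectory]) -> list[Trajectory]:
    # Keep only trajectories in which every turn did something useful.
    # Filtered-out trajectories are simply excluded from the loss.
    return [traj for traj in batch if not any(is_void_turn(t) for t in traj.turns)]
```

The point is that the filtering happens at the batch level, so unstable trajectories never contribute gradients at all.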
The first film from our partnership with @primordialsoup_ - a storytelling venture founded by visionary director Darren Aronofsky - is debuting at @Tribeca. Directed by Eliza McNitt, ANCESTRA uses traditional filmmaking alongside Veo, our generative video model. Take a look ↓…
This I did not expect. Cool.
Perhaps the most important thing you can read about AI this year: "Welcome to the Era of Experience." This excellent paper from two senior DeepMind researchers argues that AI is entering a new phase—the "Era of Experience"—which follows the prior phases of simulation-based…
Good researchers can smell the BS without even reading the papers :) x.com/agihippo/statu…
Lol the RL papers in the wilderness are wonky ngl
Rich’s slogans for AI research (revised 2006): 1. Approximate the solution, not the problem (no special cases) 2. Drive from the problem 3. Take the agent’s point of view 4. Don’t ask the agent to achieve what it can’t measure 5. Don’t ask the agent to know what it can’t verify…
Rich Sutton, the godfather of reinforcement learning, gave me some golden advice today: work hard, think hard, and play—and don’t have too much respect. I can’t agree more—it’s advice I’ll take to heart.
lol this song is funny🤣
Gave a quick test to MEMO (actually... not that quick: ~20 minutes for a 30s video, on an A100😳). Bonus: #Udio song about the project, and a portrait of yours truly from @artflow_ai. Links, including a Colab, in the next message if anyone is interested.
🔥 MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation 😎 Open source is the way! 🔊
Introducing 🧞Genie 2 🧞 - our most capable large-scale foundation world model, which can generate a diverse array of consistent worlds, playable for up to a minute. We believe Genie 2 could unlock the next wave of capabilities for embodied agents 🧠.
The secret to doing good research is always to be a little underemployed. You waste years by not being able to waste hours. - Amos Tversky
I just tried out playing Counter-Strike in a neural network on my MacBook. In my first run, it diverged into mush pretty quickly. The recording is sped up 5x.
Ever wanted to play Counter-Strike in a neural network? These videos show people playing (with keyboard & mouse) in 💎 DIAMOND's diffusion world model, trained to simulate the game Counter-Strike: Global Offensive. 💻 Download and play it yourself → github.com/eloialonso/dia… 🧵
I've made this point before: video generation systems are not good world models (at least, not necessarily). They could be mode-collapsed, and you wouldn't know.
Coming back to video generation, IMO we have to be careful when we instinctively see it as a world model. The latter is far less tolerant of any form of mode collapse. 1/4
Our new paper on using YouTube videos to learn language conditioned navigation is out! By leveraging pretrained models and video data mined from the web, we can get robots to better understand language instructions.
Excited to share our recent research, LeLaN, for learning language-conditioned navigation policies from in-the-wild video, done at UC Berkeley and Toyota Motor North America. We will present LeLaN at CoRL 2024. @CatGlossop @ajaysridhar0 @shahdhruv_ @oier_mees and @svlevine
Really promising results we got recently: Generative CoT Verifiers trained on only grade-school math problems in GSM8K generalize quite well to much harder *high-school competition* problems in MATH!
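For readers unfamiliar with the setup, here is roughly how a generative CoT verifier can score a solution (a minimal sketch under my assumptions; the checkpoint name is hypothetical and this may differ from the paper's exact recipe): the verifier generates its own chain of thought about the candidate solution, then the score is the probability it assigns to a "Yes" verdict token.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical checkpoint name, for illustration only.
tok = AutoTokenizer.from_pretrained("my-genrm-verifier")
model = AutoModelForCausalLM.from_pretrained("my-genrm-verifier")

def verify(question: str, solution: str) -> float:
    # The verifier first reasons about the solution (CoT), then we read off
    # the probability of "Yes" as the correctness score.
    prompt = (
        f"Question: {question}\nProposed solution: {solution}\n"
        "Let's verify step by step.\n"
    )
    inputs = tok(prompt, return_tensors="pt")
    # Let the verifier write its own chain of thought, then ask for a verdict.
    cot = model.generate(**inputs, max_new_tokens=256, do_sample=False)
    verdict_prompt = tok.decode(cot[0], skip_special_tokens=True) + "\nIs the solution correct? "
    v_inputs = tok(verdict_prompt, return_tensors="pt")
    with torch.no_grad():
        logits = model(**v_inputs).logits[0, -1]
    probs = torch.softmax(logits, dim=-1)
    yes_id = tok.encode("Yes", add_special_tokens=False)[0]
    return probs[yes_id].item()
```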
Ever wondered if we can model motion as a language? Can we tokenize this new language? Is it useful? Turns out, tremendously! 🚀 In our latest #NeurIPS2024 paper on QueST: Self-Supervised Skill Abstractions for Learning Continuous Control, we find that action tokenization…
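To make "action tokenization" concrete, here is a toy vector-quantization sketch (my illustration, assuming a nearest-neighbor codebook lookup; QueST's actual architecture is more involved): chunks of continuous actions are mapped to discrete token ids that a sequence model can consume like text.

```python
import torch
import torch.nn as nn

class ActionTokenizer(nn.Module):
    """Toy VQ tokenizer: continuous action chunks -> discrete tokens."""

    def __init__(self, action_dim: int, chunk_len: int, vocab_size: int):
        super().__init__()
        # Codebook of `vocab_size` prototype action chunks (flattened).
        self.codebook = nn.Parameter(torch.randn(vocab_size, action_dim * chunk_len))
        self.chunk_len = chunk_len

    def tokenize(self, actions: torch.Tensor) -> torch.Tensor:
        # actions: (T, action_dim) -> tokens: (T // chunk_len,)
        T, d = actions.shape
        # Drop any trailing actions that don't fill a whole chunk.
        chunks = actions[: T - T % self.chunk_len].reshape(-1, self.chunk_len * d)
        # Nearest codebook entry per chunk is its token id.
        dists = torch.cdist(chunks, self.codebook)
        return dists.argmin(dim=-1)

    def detokenize(self, tokens: torch.Tensor, action_dim: int) -> torch.Tensor:
        # Look up prototypes and unflatten back to a (T, action_dim) trajectory.
        return self.codebook[tokens].reshape(-1, action_dim)
```

Usage: `tokenizer.tokenize(actions)` turns a `(T, action_dim)` trajectory into a short discrete sequence, and `detokenize` reconstructs an approximate trajectory from token ids.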