Mikael Henaff
@HenaffMikael
Research Scientist at @MetaAI, previously postdoc at @MSFTResearch and PhD at @nyuniversity. All views my own.
Super stoked to share this work led by @proceduralia & @MartinKlissarov. Our method Motif uses LLMs to rank pairs of observation captions and synthesize dense intrinsic rewards specified by natural language. New SOTA on NetHack while being easily steerable. Paper+code in thread!
Can reinforcement learning from AI feedback unlock new capabilities in AI agents? Introducing Motif, an LLM-powered method for intrinsic motivation from AI feedback. Motif extracts reward functions from Llama 2's preferences and uses them to train agents with reinforcement…
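The reward-extraction step described above (turning an LLM's pairwise preferences over caption pairs into a dense reward) can be sketched as Bradley-Terry reward fitting. A minimal pure-Python toy, assuming a linear reward over made-up "caption embeddings" (the actual method trains a neural reward model on Llama 2's rankings):

```python
import math

def bradley_terry_loss(r_pref, r_other):
    """Negative log-likelihood that the preferred item wins under the
    Bradley-Terry model: P(pref > other) = sigmoid(r_pref - r_other)."""
    return -math.log(1.0 / (1.0 + math.exp(-(r_pref - r_other))))

def train_reward(features, preferences, lr=0.1, epochs=200):
    """Fit a linear reward r(x) = w . x from pairwise preferences.
    `preferences` is a list of (i, j) index pairs meaning the annotator
    (here, an LLM judging observation captions) preferred i over j."""
    dim = len(features[0])
    w = [0.0] * dim
    for _ in range(epochs):
        for i, j in preferences:
            ri = sum(wk * xk for wk, xk in zip(w, features[i]))
            rj = sum(wk * xk for wk, xk in zip(w, features[j]))
            # gradient of the Bradley-Terry NLL w.r.t. the score gap
            g = 1.0 / (1.0 + math.exp(-(ri - rj))) - 1.0  # in (-1, 0)
            for k in range(dim):
                w[k] -= lr * g * (features[i][k] - features[j][k])
    return w

# Toy example: 2-D "embeddings"; the annotator consistently prefers
# observations with a larger first coordinate.
feats = [[2.0, 0.0], [1.0, 1.0], [0.0, 2.0]]
prefs = [(0, 1), (1, 2), (0, 2)]
w = train_reward(feats, prefs)
scores = [sum(wk * xk for wk, xk in zip(w, f)) for f in feats]
```

The fitted scores then serve as a dense intrinsic reward for the RL agent; in the toy above they recover the annotator's ordering.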
At #ICML2025 (16 Jul, 11 AM) we present Meta Locate 3D: a model for accurate object localization in 3D environments. Meta Locate 3D can help robots accurately understand their surroundings and interact more naturally with humans. Demo, model, paper: go.fb.me/2lx31s
Happy "@NetHack_LE is still completely unsolved" day for those of you who are celebrating it. We released The NetHack Learning Environment (arxiv.org/abs/2006.13760) on this day five years ago. Current frontier models achieve only ~1.7% progression (see balrogai.com).…
A couple bits of news: 1. Happy to share my first (human) NetHack ascension; next step is RL agents :) 2. I wrote a post discussing some @NetHack_LE challenges & how they map to open problems in RL & agentic AI. Still the best RL benchmark imo. mikaelhenaff.substack.com/p/first-nethac…
Introducing Meta Locate 3D: a model for accurate object localization in 3D environments. Learn how Meta Locate 3D can help robots accurately understand their surroundings and interact more naturally with humans. You can download the model and dataset, read our research paper,…
Can visual SSL match CLIP on VQA? Yes! We show with controlled experiments that visual SSL can be competitive even on OCR/Chart VQA, as demonstrated by our new Web-SSL model family (1B-7B params) which is trained purely on web images – without any language supervision.
My good friend @arcanelibrary designs old-school D&D games and her latest kickstarter is up! I've had lots of fun playing Shadowdark, highly recommend if you're into RPGs :)
Shadowdark: The Western Reaches is now live on Kickstarter and funded in two minutes! kickstarter.com/projects/shado…
Introducing ⚡️Fast3R: the bitter lesson comes for SfM. By using a big dumb ViT, we can reconstruct pointmaps for 1000 images in a single forward pass @ 250 FPS. How do we do this? Using techniques from LLMs. Website: fast3r-3d.github.io Demo: fast3r.ngrok.app 🧵
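The "one forward pass over all views" idea can be sketched with toy global self-attention: patch tokens from every image are concatenated into one sequence, so each view attends to all others, and a small head regresses a 3-D point per patch. All sizes, weights, and the identity Q/K/V projections below are illustrative, not the real model:

```python
import math

def matmul(A, B):
    """Naive matrix multiply for small lists-of-lists."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]

def softmax(row):
    m = max(row)
    e = [math.exp(x - m) for x in row]
    s = sum(e)
    return [x / s for x in e]

def global_attention(tokens):
    """Toy single-head self-attention with identity Q/K/V projections:
    every patch token from every view attends to all tokens of all
    views at once, i.e. the single-forward-pass design."""
    d = len(tokens[0])
    transpose = [list(col) for col in zip(*tokens)]
    scores = matmul(tokens, transpose)                       # Q K^T
    att = [softmax([s / math.sqrt(d) for s in row]) for row in scores]
    return matmul(att, tokens)

def point_head(fused):
    """Stand-in pointmap head: keep the first 3 dims as (x, y, z)."""
    return [row[:3] for row in fused]

# 2 views x 3 patches, 4-dim tokens (made-up numbers).
tokens = [[0.1 * (i + j) for j in range(4)] for i in range(6)]
pointmap = point_head(global_attention(tokens))
```

Because attention is over the full token set, adding more images just lengthens the sequence, which is where the LLM-style efficiency techniques come in.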
Btw, the lead author @jed_yang is graduating this year and will be on the job market. Jed is highly motivated and creative, a great engineer and researcher who gets stuff to work, and has been a pleasure to work with...if you're hiring I suggest reaching out to him!
Excited to share our Fast3R paper, to be presented at CVPR 2025. This recasts 3D reconstruction and camera pose estimation from video as an end-to-end learning problem, leading to ~4x-300x improvements in speed while maintaining performance. Code, model & demo in thread!
⚡️ Excited to announce Fast3R: 3D reconstruction of 1000+ images in a single forward pass! Fast3R achieves 251 FPS at its peak. 🔥 Try the demo with your images or video! 🔗 Website: fast3r-3d.github.io 🎮 Demo: fast3r.ngrok.app #CVPR2025 #3D @AIatMeta
MaestroMotif has been selected for an oral presentation at ICLR! 🙏 See how your AI could solve tasks like: “Do not leave the first dungeon level until you achieve XP level 4, then find a shopkeeper and sell an item that you have collected; finally survive for another 300 steps”
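The instruction quoted above lends itself to a programmatic policy over skills. A heavily simplified, hypothetical version of the kind of program an LLM controller might emit (skill names and state fields are invented for illustration, not the paper's actual interface):

```python
def policy_over_skills(state):
    """Hypothetical LLM-written program sequencing Motif-style skills
    for: 'stay on dungeon level 1 until XP 4, then sell an item to a
    shopkeeper, then survive'."""
    if state["xp"] < 4 and state["dlvl"] == 1:
        return "fight"        # grind on the first level until XP level 4
    if not state["sold_item"]:
        return "go_to_shop"   # then find a shopkeeper and sell an item
    return "survive"          # finally, just stay alive
```

Each returned name would dispatch to a pretrained skill policy; composing skills in code rather than in weights is what makes the agent steerable zero-shot.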
Can AI agents adapt zero-shot to complex multi-step language instructions in open-ended environments? We present MaestroMotif, a method for AI-assisted skill design that produces highly capable and steerable hierarchical agents. To the best of our knowledge, it is the first…
The era of Hierarchical Agents has begun
Martin led this great work, check it out. For a dinosaur like me, let me say that, in more classical RL terms, this is a demonstration of how we can effectively combine options and LLMs through programmatic policies.
Super excited to see MaestroMotif out into the world--the first hierarchical LLM agent that can solve open-ended compositional tasks requiring hundreds of steps 🚀🚀🚀 🤖 What can MaestroMotif do? - solve complex tasks by re-combining skills - adapt zero-shot to new instructions…
Another banger led by dream team @MartinKlissarov and @proceduralia, to be presented at ICLR 2025. MaestroMotif is a hierarchical agent which zero-shot composes Motif skills using an LLM controller, reaching new depths of the NetHack dungeon. Code available!
🚨 DeepSeek crushed existing benchmarks. But how does it fare in embodied agentic tasks? We tested @deepseek_ai R1 Distill Qwen 32B on BALROG, and the results were both inspiring and entertaining. The good news for those still beginning their careers is… lots to do here! 🚀⬇️
Yearly reminder
"Is it AGI" flow chart. Developed with @_rockt at NeurIPS 2022.
🤔 How to extract knowledge from LLMs to train better RL agents? 📚 Our new paper (with @qqyuzu @HenaffMikael @yayitsamyzhang @adityagrover_ ) studies LLM-driven rewards for NetHack! Paper: arxiv.org/abs/2410.23022 Code: github.com/facebookresear…
ONI offers concurrent policy training & reward synthesis, a good fit for long-horizon sparse-reward problems! I also believe it has great potential to be extended to multimodal inputs and complex planning/reasoning environments!
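The concurrency described here can be sketched in one loop: while the policy keeps stepping, a subset of observation captions is periodically sent off for labelling, and the (growing) label set supplies the intrinsic reward. A single-threaded toy, with a made-up annotation rule and reward stub standing in for the asynchronous LLM server:

```python
def llm_annotate(caption):
    """Stand-in for the LLM annotator: flags 'interesting' events.
    (Hypothetical criterion; the real system queries an LLM.)"""
    return 1.0 if "level up" in caption else 0.0

def synthesize_reward(labels, caption):
    """Reward stub: use the label if this caption has been annotated,
    otherwise give no bonus yet."""
    return labels.get(caption, 0.0)

def train_concurrently(stream, n_steps, annotate_every=4):
    """Interleave 'policy' steps with reward annotation, mimicking the
    asynchronous design in a single thread: every few environment steps
    one new caption is labelled while stepping continues."""
    labels = {}
    returns = 0.0
    for t in range(n_steps):
        caption = stream[t % len(stream)]
        if t % annotate_every == 0:            # simplified async annotation
            labels[caption] = llm_annotate(caption)
        returns += synthesize_reward(labels, caption)  # intrinsic reward
    return returns, labels

captions = ["you hit the newt", "level up! welcome to level 2", "you see a door"]
ret, labels = train_concurrently(captions, n_steps=12)
```

The point is that training never blocks on annotation: unlabelled observations simply earn no bonus until their label arrives.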