nathan monette

@nathanrmonette

msc student @FLAIR_Ox

Oxford, England

Joined July 2024

346 Following

105 Followers

Pinned

nathan monette@nathanrmonette · May 28

Excited to announce my first paper, with @j_foerst and @FLAIR_Ox, was accepted into @rl_conference 2025! We establish a new UED method called NCC that obtains strong performance based on principles of optimisation theory.

8.0K

nathan monette@nathanrmonette · Jul 25

He's terrible. Screwed my buddy sgd.

Yiping Lu@2prime_PKU · Jul 25

Anyone knows adam?

138

11.0K

nathan monette@nathanrmonette · Jul 26

reinforcement learning infrastructure

Norgard@BrianNorgard · Jul 25

Every great consumer app is just a slot machine in disguise.

238

19.0K

nathan monette Retweeted

Alex Goldie@AlexDGoldie · Jul 24

1/ 🕵️ Algorithm discovery could lead to huge AI breakthroughs! But what is the best way to learn or discover new algorithms? I'm so excited to share our brand new @rl_conference paper which takes a step towards answering this! 🧵

200

144

15.0K

nathan monette Retweeted

darren@darrenangle · Jul 15

you can change history with an 11-page paper

524

365

33.0K

nathan monette Retweeted

Karim Abdel Sadek@Karim_abdelll · Jul 8

*New AI Alignment Paper* 🚨 Goal misgeneralization occurs when AI agents learn the wrong reward function, instead of the human's intended goal. 😇 We show that training with a minimax regret objective provably mitigates it, promoting safer and better-aligned RL policies!

138

15.0K

nathan monette Retweeted

Theo Wolf@TheoW0lf · Jul 3

🚀 Excited to announce Hyperoptax, a library for parallel hyperparameter tuning in JAX. Implements Grid, Random, and Bayesian search in pure JAX so that you can rapidly search across parameter configurations in parallel ‖. 📦 pip install hyperoptax github.com/TheodoreWolf/h…

4.0K

nathan monette Retweeted

Ola Kalisz@OlaKalisz8 · Jul 2

Antiviral therapy design is myopic 🦠🙈 optimised only for the current strain. That's why you need a different Flu vaccine every year! Our #ICML2025 paper ADIOS proposes "shaper therapies" that steer viral evolution in our favour & remain effective. Work done @FLAIR_Ox 🧵👇

9.0K

nathan monette Retweeted

Andrei Lupu@_andreilupu · Jun 26

Theory of Mind (ToM) is crucial for next gen LLM Agents, yet current benchmarks suffer from multiple shortcomings. Enter 💽 Decrypto, an interactive benchmark for multi-agent reasoning and ToM in LLMs! Work done with @TimonWilli & @j_foerst at @AIatMeta & @FLAIR_Ox 🧵👇

103

22.0K

nathan monette Retweeted

Ilya Zisman@suessmannn · Jun 15

Had a blast together with @how_uhh at @LeRobotHF hackathon this weekend. Built phone-based teleoperation for my SO-100 arm using pose estimation. Here’s a quick BTS of the final demo with teleop working (+ a small victory dance 🎉)

414

130

53.0K

nathan monette@nathanrmonette · May 9

True dat

finbarr@finbarrtimbers · May 7

now that RL is hot again, you should all register for RLC and come visit Edmonton in August rl-conference.cc/index.html

334

26.0K

nathan monette Retweeted

Dimitris Papailiopoulos@DimitrisPapail · Apr 30

I am afraid to report, RL works.

864

133

111.0K

nathan monette Retweeted

Jack Parker-Holder@jparkerholder · Apr 26

So many incredible, inspiring ideas in @_rockt’s keynote…. But my personal favorite slide was a clarification on the world model definition 😱

115

6.0K

nathan monette Retweeted

Foerster Lab for AI Research@FLAIR_Ox · Apr 24

FLAIR is at ICLR 🇸🇬 Find out our schedule for the week 👇

6.0K

nathan monette Retweeted

Kunal Jha@kjha02 · Apr 18

Our new paper (first one of my PhD!) on cooperative AI reveals a surprising insight: Environment Diversity > Partner Diversity. Agents trained in self-play across many environments learn cooperative norms that transfer to humans on novel tasks. shorturl.at/fqsNN 🧵

142

44.0K