Matthew Jackson (@JacksonMattT)

Pinned

M

Matthew Jackson@JacksonMattT · Apr 18

🌹 Today we're releasing Unifloral, our new library for Offline Reinforcement Learning! We make research easy: ⚛️ Single-file 🤏 Minimal ⚡️ End-to-end Jax Best of all, we unify prior methods into one algorithm - a single hyperparameter space for research! ⤵️

5

36

144

80

30.0K

Matthew Jackson Retweeted

A

Alex Goldie@AlexDGoldie · Jul 24

1/ 🕵️ Algorithm discovery could lead to huge AI breakthroughs! But what is the best way to learn or discover new algorithms? I'm so excited to share our brand new @rl_conference paper which takes a step towards answering this! 🧵

2

30

190

141

14.0K

Matthew Jackson Retweeted

O

Ola Kalisz@OlaKalisz8 · Jul 2

Antiviral therapy design is myopic 🦠🙈 optimised only for the current strain. That's why you need a different Flu vaccine every year! Our #ICML2025 paper ADIOS proposes "shaper therapies" that steer viral evolution in our favour & remain effective. Work done @FLAIR_Ox 🧵👇

1

18

52

2

9.0K

Matthew Jackson Retweeted

C

Cong Lu@cong_ml · Jun 10

🚀Introducing “StochasTok: Improving Fine-Grained Subword Understanding in LLMs”!🚀 LLMs are incredible but still struggle disproportionately with subword tasks, e.g., for character counts, wordplay, multi-digit numbers, fixing typos… Enter StochasTok, led by @anyaasims! [1/]

1

24

77

29

18.0K

Matthew Jackson Retweeted

n

nathan monette@nathanrmonette · May 28

Excited to announce my first paper, with @j_foerst and @FLAIR_Ox, was accepted into @rl_conference 2025! We establish a new UED method called NCC that obtains strong performance based on principles of optimisation theory.

1

10

69

27

8.0K

Matthew Jackson Retweeted

H

Haider.@slow_developer · Apr 14

Google DeepMind, David Silver reveals: we built a system that used RL to discover its own RL algorithms. this AI-designed system outperformed all human-created RL algorithms developed over the years.

82

515

4.0K

2.0K

395.0K

M

Matthew Jackson@JacksonMattT · Apr 21

The best of RL research, brought to Offline RL! 🚀 TL;DR 1. CleanRL-style implementations ⚡️ 2. Rainbow-style algorithm unification 🦾 3. Rliable-style evaluation protocol 🔬 Check out our paper + library!

MMatthew Jackson@JacksonMattT · Apr 18

🌹 Today we're releasing Unifloral, our new library for Offline Reinforcement Learning! We make research easy: ⚛️ Single-file 🤏 Minimal ⚡️ End-to-end Jax Best of all, we unify prior methods into one algorithm - a single hyperparameter space for research! ⤵️

0

1

14

10

1.0K

M

Matthew Jackson@JacksonMattT · Apr 21

Making offline RL more honest, reproducible, and robust.

MMatthew Jackson@JacksonMattT · Apr 18

🌹 Today we're releasing Unifloral, our new library for Offline Reinforcement Learning! We make research easy: ⚛️ Single-file 🤏 Minimal ⚡️ End-to-end Jax Best of all, we unify prior methods into one algorithm - a single hyperparameter space for research! ⤵️

1

7

96

30

18.0K

M

Matthew Jackson@JacksonMattT · Apr 18

🔮Looking forward, we intend Unifloral🌹to be more than a library—it's a scaffolding 🌱 for indexing current & future ORL work!🏵️ We encourage 🥺 you to: 🔄 PR your awesome work using the 🌹 format 🎮 Explore the unified implementation 🧩 Try to find new SOTA algos with it

MMatthew Jackson@JacksonMattT · Apr 18

🌹 Today we're releasing Unifloral, our new library for Offline Reinforcement Learning! We make research easy: ⚛️ Single-file 🤏 Minimal ⚡️ End-to-end Jax Best of all, we unify prior methods into one algorithm - a single hyperparameter space for research! ⤵️

2

4

20

3

1.0K

M

Matthew Jackson@JacksonMattT · Apr 18

⁉️ While trying to find the best hyperparameter setting of ORL algorithms using a bandit, we noticed something unexpected: 🤯 After evaluating the episodic returns of more and more policies online, the bandit's performance *decreased*! x.com/JacksonMattT/s…

MMatthew Jackson@JacksonMattT · Apr 18

🌹 Today we're releasing Unifloral, our new library for Offline Reinforcement Learning! We make research easy: ⚛️ Single-file 🤏 Minimal ⚡️ End-to-end Jax Best of all, we unify prior methods into one algorithm - a single hyperparameter space for research! ⤵️

1

4

17

3

2.0K

Matthew Jackson Retweeted

W

Wayve@wayve_ai · Mar 26

Introducing GAIA-2 🌎Generative world modeling just stepped up a gear. GAIA-2 is the latest development of Wayve’s video-generative world model tailored for driving. GAIA-2 offers richer, more realistic, and highly controllable synthetic driving scenarios, accelerating Wayve’s…

8

85

356

157

41.0K

Matthew Jackson Retweeted

J

Jakob Foerster@j_foerst · Mar 12

My group @FLAIR_Ox is recruiting a postdoc and looking for someone who can get started by the end of April. Deadline to apply is in one week (!), 19th of March at noon, so please help spread the word: my.corehr.com/pls/uoxrecruit…

0

14

34

3

10.0K

Matthew Jackson Retweeted

J

Jack Parker-Holder@jparkerholder · Dec 4

Introducing 🧞Genie 2 🧞 - our most capable large-scale foundation world model, which can generate a diverse array of consistent worlds, playable for up to a minute. We believe Genie 2 could unlock the next wave of capabilities for embodied agents 🧠.

284

474

3.0K

1.0K

2.6M

M

Matthew Jackson@JacksonMattT · Nov 12

Huge unlock for RL foundation models and environment design... 1) Grounded in real-world physics, 2) Level design in the browser, 3) End-to-end Jax, so lightning fast! More great work from the Michaels 🔥

MMichael Beukman@mcbeukman · Nov 12

🏋️‍♂️Go from creating an environment to having a trained expert agent within minutes! As part of Kinetix, we are releasing an editor that can create custom physics-based RL environments, and import them seamlessly into an RL training loop. 1/

0

24

2

2.0K

Matthew Jackson Retweeted

M

Michael Beukman@mcbeukman · Nov 12

🏋️‍♂️Go from creating an environment to having a trained expert agent within minutes! As part of Kinetix, we are releasing an editor that can create custom physics-based RL environments, and import them seamlessly into an RL training loop. 1/

3

17

96

34

20.0K