Joseph Suarez (e/🐡)
@jsuarez5341
I build sane open-source RL tools. MIT PhD, creator of Neural MMO and founder of PufferAI. https://puffer.ai
PufferLib 3.0: We trained reinforcement learning agents on 1 Petabyte / 12,000 years of data with 1 server. Now you can, too! Our latest release includes algorithmic breakthroughs, massively faster training, and 10 new environments. Live demos on our site. Volume on for trailer!
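For a sense of scale, here is a rough back-of-envelope conversion of "12,000 years of data" into environment steps. The simulated timestep is an assumption for illustration, not a number from the release.

```python
# Back-of-envelope: what "12,000 years of experience" means in environment steps.
# The 1/60 s per step is an assumption; PufferLib environments vary, and the
# announcement does not state the simulated timestep.

SECONDS_PER_YEAR = 365.25 * 24 * 3600
years_of_experience = 12_000
assumed_dt = 1 / 60  # seconds of simulated time per environment step (assumption)

total_steps = years_of_experience * SECONDS_PER_YEAR / assumed_dt
print(f"~{total_steps:.2e} environment steps")  # ~2.27e13 steps under these assumptions
```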
[1/9] We created a performant Lipschitz transformer by spectrally regulating the weights—without using activation stability tricks: no layer norm, QK norm, or logit softcapping. We think this may address a “root cause” of unstable training.
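A minimal sketch of the general idea, using PyTorch's stock spectral-norm parametrization rather than PufferLib's actual implementation: constraining each weight matrix's largest singular value bounds the Lipschitz constant of every linear map, so the usual activation-side stabilizers (layer norm, QK norm, logit softcapping) are not needed.

```python
import torch
import torch.nn as nn
from torch.nn.utils.parametrizations import spectral_norm

class SpectralMLPBlock(nn.Module):
    """Transformer-style feedforward block with spectrally constrained weights.

    Generic illustration only: spectral_norm rescales each weight matrix by its
    largest singular value (estimated via power iteration), making each linear
    map 1-Lipschitz. No LayerNorm, QK norm, or logit softcapping is used.
    """

    def __init__(self, dim: int, hidden: int):
        super().__init__()
        self.fc1 = spectral_norm(nn.Linear(dim, hidden))
        self.fc2 = spectral_norm(nn.Linear(hidden, dim))
        self.act = nn.ReLU()  # ReLU is exactly 1-Lipschitz

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Residual connection: the block's Lipschitz constant is bounded by
        # 1 + Lip(f), so the whole network stays well-conditioned by construction.
        return x + self.fc2(self.act(self.fc1(x)))

block = SpectralMLPBlock(dim=256, hidden=1024)
y = block(torch.randn(8, 256))
```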
Reinforcement Learning Research Live x.com/i/broadcasts/1…
I propose we hold an IMO gold tiebreaker. The winner is whoever doesn't lobby for AI regulation to favor incumbents.
Reinforcement Learning Research Live x.com/i/broadcasts/1…
I post so much about how good RL is with PufferLib that I'm realizing it sounds increasingly grifty. Please just go try it. It's free. If you last did RL years ago, it will feel like a different field. We have new programmers doing RL on custom sims. That wasn't a thing before.
Kyoung is the most meticulously organized person I've ever worked with. Spun up in AI in around a year with a rare level of discipline.
I recently started using the Apple Vision Pro (kind of late, right?) and was amazed at how well they nailed the eye-tracking. The AVP's eye-tracking calibration is performed against black, gray, and white backgrounds, and I happen to know from my previous life (i.e., during my…
When life randomizes your size, mass, axial inertia, etc., just generalize superhuman control!
Found the bug. Trains in 2 minutes. No collisions or randomization for this first test, but it's pretty zippy! Multi-task + domain randomization next.
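A minimal sketch of the domain-randomization setup described above, with hypothetical parameter ranges; the actual bounds used in the drone environment aren't given in these posts.

```python
import random
from dataclasses import dataclass

@dataclass
class DroneParams:
    mass: float           # kg
    arm_length: float     # m
    inertia_scale: float  # multiplier on nominal axial inertia

def sample_domain() -> DroneParams:
    """Resample physical parameters at each episode reset (domain randomization)."""
    return DroneParams(
        mass=random.uniform(0.5, 2.0),          # assumed range
        arm_length=random.uniform(0.08, 0.20),  # assumed range
        inertia_scale=random.uniform(0.5, 1.5), # assumed range
    )

# Typical usage: call sample_domain() inside env.reset() and feed the result to
# the physics step, so the policy must generalize across dynamics rather than
# memorize one vehicle.
```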
Reinforcement Learning Research Live x.com/i/broadcasts/1…
Peter is the original creator of the Pokemon Red RL agent that beat the first gym. Go check out his latest project! Very cool alife work.
I’m giving a talk on my new project next month!
Reinforcement Learning Research Live x.com/i/broadcasts/1…
I will be working on RL for drone racing and swarms on stream here/YT/Twitch for the next few hours. Goal is a ~100k param multitask model that we can deploy on real hardware
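For a sense of what ~100k parameters buys, here is a hypothetical small policy network and its parameter count. The observation/action sizes and layer widths are assumptions, not the architecture built on stream.

```python
import torch
import torch.nn as nn

# Hypothetical sizes for illustration only.
obs_dim, act_dim, hidden = 64, 4, 192

policy = nn.Sequential(
    nn.Linear(obs_dim, hidden), nn.ReLU(),
    nn.Linear(hidden, hidden), nn.ReLU(),
    nn.Linear(hidden, hidden), nn.ReLU(),
    nn.Linear(hidden, act_dim),
)

n_params = sum(p.numel() for p in policy.parameters())
print(f"{n_params:,} parameters")  # ~87k with these sizes, in the ballpark of the ~100k target
```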