Spencer Cheng

@spenccheng

2x founder | AI + Construction | I build insanely fast simulators for reinforcement learning at http://puffer.ai

Dallas, TX

Joined May 2013

291 Following

2K Followers

Pinned

Spencer Cheng@spenccheng · Jul 25

x.com/i/article/1948…

1.0K

2.0K

221.0K

Spencer Cheng Retweeted

Joseph Suarez (e/🐡)@jsuarez5341 · Jul 12

Since I've been getting lots of questions today - PufferAI is a private reinforcement learning lab with all OSS research and tools. Our business is helping companies solve RL problems and in-house the capabilities. DM if you would like to chat!

216

33.0K

Spencer Cheng@spenccheng · Jul 3

I’ve spent the past year learning RL from Joseph and am extremely grateful for the mentorship. Excited to share that I’ll be helping bring Puffer’s research to industry. If reinforcement learning or simulation is critical to your company, let’s chat!

Joseph Suarez (e/🐡)@jsuarez5341 · Jul 2

PufferLib 3.0 was a truly massive amount of work. Here's a quick thread crediting some of the people involved! @spenccheng made a ton of new environments and has been a major force behind this release. @DanAdvantage has gotten us new envs, tons of fixes, and user support.

2.0K

Spencer Cheng@spenccheng · Jul 26

Puffer AI is truly a game changer. You actually don't understand how much room there is left as well. Trust me on this.

Spencer Cheng@spenccheng · Jul 25

x.com/i/article/1948…

273

154

21.0K

Spencer Cheng@spenccheng · Jul 25

Working with Spencer on RL for self-driving has been awesome. Check out the story of what scaling up in RL looks like

Spencer Cheng@spenccheng · Jul 25

x.com/i/article/1948…

5.0K

Spencer Cheng@spenccheng · Jul 17

Great example of utilizing domain randomization to help close the sim to real gap. Performance on real drones coming soon!

Joseph Suarez (e/🐡)@jsuarez5341 · Jul 17

Early prototype of the new drone racing sim. Every drone here is a different size, weight, axial inertias, etc. We reinforcement learn the policy in <2 minutes with PufferLib. This is an extension to the original sim submitted by Fin and Sam

2.0K
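The domain randomization described in the drone tweet above can be illustrated with a minimal C sketch. Everything in it is an assumption made for illustration: the `DroneParams` struct, the sampling ranges, and the function names are hypothetical and are not taken from PufferLib or the drone racing sim. The idea is only that each drone gets its own mass, size, and axial inertias drawn from a range, so the policy cannot overfit to one set of physical constants.

```c
#include <stddef.h>
#include <stdint.h>

// Hypothetical physical parameters for one simulated drone.
typedef struct {
    float mass_kg;
    float arm_length_m;
    float inertia[3];  // axial moments of inertia, kg*m^2
} DroneParams;

// Illustrative sampling ranges; not measured from any real drone.
#define MASS_MIN 0.5f
#define MASS_MAX 1.5f
#define ARM_MIN 0.125f
#define ARM_MAX 0.375f
#define INERTIA_MIN 0.002f
#define INERTIA_MAX 0.006f

// Small linear congruential generator so runs are reproducible from a seed.
static uint32_t lcg_next(uint32_t* state) {
    *state = *state * 1664525u + 1013904223u;
    return *state;
}

// Uniform sample between lo and hi, built from the top 24 bits of the generator.
static float rand_uniform(uint32_t* state, float lo, float hi) {
    float u = (float)(lcg_next(state) >> 8) / 16777216.0f;
    return lo + u * (hi - lo);
}

// Draw a fresh set of physical parameters for one drone.
void randomize_drone(DroneParams* p, uint32_t* rng) {
    p->mass_kg = rand_uniform(rng, MASS_MIN, MASS_MAX);
    p->arm_length_m = rand_uniform(rng, ARM_MIN, ARM_MAX);
    for (int axis = 0; axis < 3; axis++) {
        p->inertia[axis] = rand_uniform(rng, INERTIA_MIN, INERTIA_MAX);
    }
}

// Re-sample every drone, for example at the start of each episode.
void randomize_fleet(DroneParams* drones, size_t count, uint32_t* rng) {
    for (size_t i = 0; i < count; i++) {
        randomize_drone(&drones[i], rng);
    }
}
```

Calling `randomize_fleet` on episode reset with a seeded `uint32_t` gives every drone different dynamics while keeping runs reproducible. A real sim would pick ranges that bracket the hardware it intends to fly.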

Spencer Cheng@spenccheng · Jul 13

There is so much untapped value in applying RL to niche industries without touching LLM land. I’ve been having so much fun talking to different domain experts.

Joseph Suarez (e/🐡)@jsuarez5341 · Jul 13

And LLMs won't even be the biggest application! Massive but diffuse impact across industries. Anywhere you can build sims

4.0K

Spencer Cheng@spenccheng · Jul 11

Great article for anyone new to programming.

Joseph Suarez (e/🐡)@jsuarez5341 · Jul 11

x.com/i/article/1941…

1.0K

Spencer Cheng@spenccheng · Jul 11

Joseph's guide provides clear tactical advice on how to learn RL. Give it a read. Build Environments. You can just learn RL.

Joseph Suarez (e/🐡)@jsuarez5341 · Jul 11

x.com/i/article/1940…

4.0K

Spencer Cheng@spenccheng · Jul 9

x.com/i/article/1934…

507

908

106.0K

Spencer Cheng@spenccheng · Jul 5

The highest leverage thing unskilled engineers can do rn is learn to code and then build RL environments correctly. Plenty of PufferLib contributors have done so already!

Justus Mattern@MatternJustus · Jul 5

Highest leverage thing unskilled engineers can do rn to contribute to frontier AI research is vibecoding RL environments

104

10.0K

Spencer Cheng@spenccheng · Jul 2

The trick to writing fast RL sims? Build it in C. Contiguous memory + memcpy is a cheat code for performance.

1.0K
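A minimal sketch of what "contiguous memory + memcpy" can look like, assuming a toy vectorized environment. The `EnvState` layout, the `VecEnv` struct, the timestep, and all function names are hypothetical and are not PufferLib's API. The point is that when per-environment state is a plain struct with no pointers, an array of them is one flat block, so resetting every environment or exporting observations is a single `memcpy` rather than a loop of small allocations and field copies.

```c
#include <stdlib.h>
#include <string.h>

#define OBS_SIZE 3  // floats per observation (hypothetical layout)
#define DT 0.5f     // hypothetical fixed timestep

// Hypothetical per-environment state: plain floats, no pointers,
// so an array of these is one flat block that memcpy can move.
typedef struct {
    float x;
    float y;
    float velocity;
} EnvState;

// The observation here is the raw state, so the two layouts must match.
_Static_assert(sizeof(EnvState) == OBS_SIZE * sizeof(float),
               "EnvState must pack into OBS_SIZE floats");

typedef struct {
    int num_envs;
    EnvState* states;     // num_envs states, contiguous
    EnvState* initial;    // snapshot of the starting states
    float* observations;  // num_envs * OBS_SIZE floats, contiguous
} VecEnv;

// Allocate all buffers; returns 0 on success, -1 on allocation failure.
int vec_env_init(VecEnv* v, int num_envs) {
    v->num_envs = num_envs;
    v->states = calloc((size_t)num_envs, sizeof(EnvState));
    v->initial = calloc((size_t)num_envs, sizeof(EnvState));
    v->observations = calloc((size_t)num_envs * OBS_SIZE, sizeof(float));
    if (!v->states || !v->initial || !v->observations) {
        free(v->states);
        free(v->initial);
        free(v->observations);
        return -1;
    }
    for (int i = 0; i < num_envs; i++) {
        v->initial[i].y = 1.0f;  // arbitrary starting height
    }
    return 0;
}

// Copy every state into the observation buffer in one call.
static void write_observations(VecEnv* v) {
    memcpy(v->observations, v->states, (size_t)v->num_envs * sizeof(EnvState));
}

// Reset all environments with a single memcpy from the snapshot.
void vec_env_reset(VecEnv* v) {
    memcpy(v->states, v->initial, (size_t)v->num_envs * sizeof(EnvState));
    write_observations(v);
}

// Advance every environment one step; actions[i] is treated as an acceleration.
void vec_env_step(VecEnv* v, const float* actions) {
    for (int i = 0; i < v->num_envs; i++) {
        EnvState* s = &v->states[i];
        s->velocity += actions[i] * DT;
        s->x += s->velocity * DT;
    }
    write_observations(v);
}

void vec_env_free(VecEnv* v) {
    free(v->states);
    free(v->initial);
    free(v->observations);
}
```

The step loop walks memory in order, which is friendly to the cache, and the trainer can read `observations` as one flat float array without any per-environment marshalling.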

Spencer Cheng@spenccheng · Jul 1

Working on Multi-GPU RL training for the first time. Little bit of tinkering with hyperparams but then I got an expert policy in ~10 min instead of an hour. This is wild. Training Details: Same Total Timesteps: 1.8B Orange: 6 GPUs - Score: 0.995 in 10 min. 0.997 by end of…

3.0K

Spencer Cheng@spenccheng · Jul 1

500x cheaper training!! Puffer provides an unfair advantage to anyone in RL. Check out Joseph's article on training Neural MMO3 to see the power of simulation at scale.

Joseph Suarez (e/🐡)@jsuarez5341 · Jul 1

x.com/i/article/1940…

800

Spencer Cheng@spenccheng · Jun 29

At Puffer, we build sims as lofi video games. You want to be able to play your own sim. Debugging RL problems when you don't even know if UP actually goes UP is not fun.

5.0K

Spencer Cheng@spenccheng · Jun 28

Raylib is the best! Building envs in C is actually quite easy. Here's pong in 300 lines. It's all simple for loops and conditionals! github.com/PufferAI/Puffe… We have people who have never coded before building and contributing environments in C.

justboulatbek@1258632 · Jun 28

do you use raylib for this? man must be tough building envs in C

853