drubinstein

@dsrubinstein

Making models go brrr | Engineering @reflection_ai

Joined March 2024

54Following

390Followers

Pinned

drubinstein@dsrubinstein · Mar 5

Excited to finally share our progress in developing a reinforcement learning system to beat Pokémon Red. Our system successfully completes the game using a policy under 10M parameters, PPO, and a few novel techniques. Blog posted below

388

193

53.0K

drubinstein@dsrubinstein · Jul 16

I'm very excited for @reflection_ai 's product announcement! The product team here been hard at work building the next generation of AI Agents.

MMisha Laskin@MishaLaskin · Jul 16

Engineers spend 70% of their time understanding code, not writing it. That’s why we built Asimov at @reflection_ai. The best-in-class code research agent, built for teams and organizations.

245

drubinstein Retweeted

Misha Laskin@MishaLaskin · Jul 16

Engineers spend 70% of their time understanding code, not writing it. That’s why we built Asimov at @reflection_ai. The best-in-class code research agent, built for teams and organizations.

100

174

1.0K

317.0K

drubinstein Retweeted

Seth Karten@sethkarten · Jul 14

🚀 Launch day! The NeurIPS 2025 PokéAgent Challenge is live. Two tracks: ① Showdown Battling – imperfect-info, turn-based strategy ② Pokemon Emerald Speedrunning – long horizon RPG planning 5 M labeled replays • starter kit • baselines. Bring your LLM, RL, or hybrid…

145

41.0K

drubinstein@dsrubinstein · Jul 13

When people read ICML, do they think "I See ML; must be a computer vision conference?"

187

drubinstein@dsrubinstein · Apr 30

I wonder if this is how NeurIPs was coined: Neural Network Neural IPs NeurIPs? NeurIPs!

456

drubinstein@dsrubinstein · Apr 30

All protobufs should have field number 1 be the .proto schema. The user could use it to deserialize the received proto message.

317