Yoram Bachrach
@yorambac
Research Scientist at Meta (prev Google DeepMind and Microsoft Research). Working on LLM Agents and Multi-Agent Systems.
Super excited to share 🧠MLGym 🦾 – the first Gym environment for AI Research Agents 🤖🔬 We introduce MLGym and MLGym-Bench, a new framework and benchmark for evaluating and developing LLM agents on AI research tasks. The key contributions of our work are: 🕹️ Enables the…
I just published the story of how I created the world’s first No-Limit Holdem poker solver and made $500k by age 23 medium.com/@olegostroumov… I had to keep the story secret since 2013, but now you can read how I went from near broke to reshaping the world's toughest poker games
Hiring! We're looking to fill contractor Research Engineer roles in New York City to work with us in FAIR on AI Research Agents. If that sounds fun, please fill out the expression of interest here: forms.gle/7m4fVqLXY5GwuL…
📢We show that continuous latent reasoning has a theoretical advantage over discrete token reasoning (arxiv.org/abs/2505.12514): For a graph with n vertices and graph diameter D, a two-layer transformer with D steps of continuous CoTs can solve the directed graph reachability…
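The claim above can be illustrated with a plain reachability check: if a graph has diameter D, then D frontier expansions (one hop each) are enough to decide whether a target is reachable, which mirrors why D continuous CoT steps suffice in the paper's construction. This is a minimal sketch of the underlying graph fact only, not the paper's transformer construction; the function name and example graph are illustrative.

```python
# Sketch: D frontier expansions decide directed reachability when D >= diameter.
# Each expansion is one "step", analogous to one continuous CoT step.

def reachable_within_d_steps(edges, source, target, d):
    """True iff `target` is reachable from `source` within `d` hops."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)   # build adjacency lists
    reached = {source}
    for _ in range(d):                    # one hop of expansion per step
        reached |= {v for u in reached for v in adj.get(u, ())}
        if target in reached:
            return True
    return target in reached

# Example: path 0 -> 1 -> 2 -> 3, so the diameter is 3.
edges = [(0, 1), (1, 2), (2, 3)]
print(reachable_within_d_steps(edges, 0, 3, 3))  # True: 3 hops reach vertex 3
print(reachable_within_d_steps(edges, 0, 3, 2))  # False: 2 hops are not enough
```

The key point is that the number of sequential steps needed scales with the diameter D, not with the vertex count n.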
Excited to release AlgoTune!! It's a benchmark and coding agent for optimizing the runtime of numerical code 🚀 algotune.io 📚 algotune.io/paper.pdf 🤖 github.com/oripress/AlgoT… with @OfirPress @ori_press @PatrickKidger @b_stellato @ArmanZharmagam1 & many others 🧵
AI Research Agents are becoming proficient at machine learning tasks, but how can we help them search the space of candidate solutions and codebases? Read our new paper looking at MLE-Bench: arxiv.org/pdf/2507.02554 #LLM #Agents #MLEBench

Our research on embodied AI agents that can perceive, learn, act and interact in the virtual and physical worlds. #metaAI #AIAgent #embodied #worldmodel #superintelligence arxiv.org/abs/2506.22355
Love this project: nanoGPT -> recursive self-improvement benchmark. Good old nanoGPT keeps on giving and surprising :) - First I wrote it as a small little repo to teach people the basics of training GPTs. - Then it became a target and baseline for my port to direct C/CUDA…
Recently, there has been a lot of talk of LLM agents automating ML research itself. If Llama 5 can create Llama 6, then surely the singularity is just around the corner. How can we get a pulse check on whether current LLMs are capable of driving this kind of total…
This project was co-led by @BingChenZhao2, @MarlaMagka and myself, with the support of a tremendous team under @yorambac and @j_foerst. Read the full paper detailing the benchmark design and our findings here: arxiv.org/abs/2506.22419
🚨Self-Challenging Language Model Agents🚨 📝: arxiv.org/abs/2506.01716 A new paradigm to train LLM agents to use different tools with challenging self-generated data ONLY: Self-challenging agents (SCA) both propose new tasks and solve them, using self-generated verifiers to…
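The loop described above can be sketched as: the same model proposes a task, writes a verifier for it, then attempts the task, and only self-verified solutions are kept as training data. This is a hedged sketch of the idea as stated in the post, not the paper's actual implementation; all function names and the toy task are illustrative stand-ins.

```python
# Sketch of a self-challenging round: propose a task, write a verifier,
# attempt the task, and keep the solution only if the verifier accepts it.

def self_challenging_round(propose_task, write_verifier, solve, max_attempts=4):
    task = propose_task()            # model generates a new task
    verifier = write_verifier(task)  # model writes a checker for that task
    for _ in range(max_attempts):
        attempt = solve(task)        # model tries to complete the task
        if verifier(attempt):        # keep only self-verified solutions
            return (task, attempt)   # -> candidate training example
    return None                      # unverified attempts are discarded

# Toy instantiation: the "task" is a target sum, the verifier checks it.
result = self_challenging_round(
    propose_task=lambda: 7,
    write_verifier=lambda t: (lambda ans: sum(ans) == t),
    solve=lambda t: [3, 4],
)
print(result)  # (7, [3, 4])
```

The design point is that task generation, verification, and solving all come from the model itself, so no externally labeled data is needed.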
Hello World: My team at FAIR / @metaai (AI Research Agent) is looking to hire contractors across software engineering and ML. If you are interested and based in the UK, please fill in the following short EoI form: docs.google.com/forms/d/e/1FAI…
Come join us! We have a crack team across US + UK (@yorambac) working on agents that can do AI research. We're hiring a full-time PhD new grad Research Scientist based in New York. Ideal candidate has published on RL / reasoning with LLMs.
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution arxiv.org/abs/2502.18449 by @YuxiangWei9 @sidawxyz and the whole team! Get started with your favorite model here github.com/facebookresear…
Tired of using FID for evaluating generative models? Come to our #NeurIPS2023 poster on FLS, a new complete metric for generative models that also penalizes overfitting! neurips.cc/virtual/2023/p… github.com/marcojira/fls @bose_joey @drimgemp Chongli Qin @yorambac @gauthier_gidel
How can metrics for evaluating generative models take into account generalization? In our new paper, we propose a new sample-based metric to address exactly this challenge: the Feature Likelihood Score (FLS). Paper: arxiv.org/abs/2302.04440 Github: github.com/marcojira/fls 1/12
What do haggling, debate, and convincing your kids to go to bed all have in common with Poker? With #LLMs, we map them all onto the framework of #gametheory; we then generate conversational strategies using the same methods that beat top Poker pros. arxiv.org/abs/2402.01704
Student researcher positions at @GoogleDeepMind are now open for applications until Dec 15 – see our careers webpage. Also a good opportunity to re-share my article of how I prepared for my internship back in 2019: davidstutz.de/how-i-prepared…
⚽🌐🕸️🤖 arXiv:2310.10553 arxiv.org/abs/2310.10553