Martin Josifoski

@MartinJosifoski

Researching AI research agents at @AIatMeta. Spent time at @EPFL, @MSFTResearch, @ETH. Anon feedback: https://admonymous.co/mj

London

Joined April 2019

332 Following

420 Followers

Pinned

Martin Josifoski@MartinJosifoski · Jul 7

Scaling AI research agents is key to tackling some of the toughest challenges in the field. But what's required to scale effectively? It turns out that simply throwing more compute at the problem isn't enough. We break down an agent into four fundamental components that shape…

Pinned

Martin Josifoski@MartinJosifoski · Mar 23, 2024

Did you know that everything is a Flow👀? Happy to see a packed room at the workshop on developing and customizing AI workflows with aiFlows (github.com/epfl-dlab/aifl…) at AMLD earlier today! @NickyBaldwin3

Martin Josifoski Retweeted

Alexander Holden Miller@alex_h_miller · Jul 23

Hiring! We're looking to fill contractor Research Engineer roles in New York City to work with us in FAIR on AI Research Agents. If that sounds fun, please fill out the expression of interest here: forms.gle/7m4fVqLXY5GwuL…

Martin Josifoski Retweeted

Roberta Raileanu@robertarail · Jul 24

I’m building a new team at @GoogleDeepMind to work on Open-Ended Discovery! We’re looking for strong Research Scientists and Research Engineers to help us push the frontier of autonomously discovering novel artifacts such as new knowledge, capabilities, or algorithms, in an…

Martin Josifoski Retweeted

Rohan Paul@rohanpaul_ai · Jul 8

AIRA‑dojo shows that picking smarter code‑tweaking moves and a better search style lifts automated Kaggle agents from 39.6% to 47.7% medal hits. AIRA-dojo is a computer playground where an AI research agent can write code, train models, and test ideas while staying inside its…

Martin Josifoski Retweeted

elvis@omarsar0 · Jul 7

AI Research Agents for ML Achieves state-of-the-art on MLE-bench lite! Using AI to automate the training of ML models is one of the most exciting and promising areas of research today. Lots of cool ideas in this paper:

Martin Josifoski@MartinJosifoski · Jul 7

Solid work from @AIatMeta on ablating and improving AIDE on MLE-Bench! The rigor of empirical evaluation has reached a new level, making the experimental signals super strong. Highly recommended for anyone interested in AI-Driven R&D/Agentic Search!

Yoram Bachrach@yorambac · Jul 7

AI Research Agents are becoming proficient at machine learning tasks, but how can we help them search the space of candidate solutions and codebases? Read our new paper looking at MLE-Bench: arxiv.org/pdf/2507.02554 #LLM #Agents #MLEBench

Martin Josifoski Retweeted

Brandon Amos@brandondamos · Jul 8

Excited to release AlgoTune!! It's a benchmark and coding agent for optimizing the runtime of numerical code 🚀 algotune.io 📚 algotune.io/paper.pdf 🤖 github.com/oripress/AlgoT… with @OfirPress @ori_press @PatrickKidger @b_stellato @ArmanZharmagam1 & many others 🧵

Martin Josifoski Retweeted

Andrei Lupu@_andreilupu · Jun 26

Theory of Mind (ToM) is crucial for next gen LLM Agents, yet current benchmarks suffer from multiple shortcomings. Enter 💽 Decrypto, an interactive benchmark for multi-agent reasoning and ToM in LLMs! Work done with @TimonWilli & @j_foerst at @AIatMeta & @FLAIR_Ox 🧵👇

Martin Josifoski Retweeted

Giovanni Monea@giomonea · Jun 14, 2024

What happens when LLMs encounter information that contradicts their static knowledge? 🤔 Discover our findings, including a new dataset and interpretability method, in our ACL 2024 paper! 🧵👇 📄 Read the paper: arxiv.org/abs/2312.02073 🖥️ Explore more: epfl-dlab.github.io/llm-grounding-…

Martin Josifoski Retweeted

Maxime Peyrard@peyrardMax · Mar 25, 2024

Orchestrated interactions between LLMs, humans, and tools show great promise. Today, we introduce "Semantic Decoding" - a perspective that views these interactions as optimization and search in the space of semantic tokens (thoughts). 📄 arxiv.org/abs/2403.14562

Martin Josifoski@MartinJosifoski · Mar 22, 2024

Tomorrow, with @NickyBaldwin3, we'll be hosting the "Learn to develop and customize AI workflows with Flows" workshop at the #AMLDEPFL2024 conference! Join us to learn about the new version of aiFlows (github.com/epfl-dlab/aifl…) coming out later today! @cervisiarius @peyrardMax

Martin Josifoski Retweeted

Saibo-Creator@SaiboGeng · Jan 22, 2024

When I talk with people about constrained decoding, I'm always asked: "Can it be applied to blackbox LLMs like GPT-4?" My response has been a bit pessimistic due to the limited logit access.🤔 🚀However, we're excited to announce Sketch-Guide Constrained Decoding (SGCD), a…
