Minqi Jiang
@MinqiJiang
Recently, there has been a lot of talk of LLM agents automating ML research itself. If Llama 5 can create Llama 6, then surely the singularity is just around the corner. How can we get a pulse check on whether current LLMs are capable of driving this kind of total…

Excited to release AlgoTune!! It's a benchmark and coding agent for optimizing the runtime of numerical code 🚀 algotune.io 📚 algotune.io/paper.pdf 🤖 github.com/oripress/AlgoT… with @OfirPress @ori_press @PatrickKidger @b_stellato @ArmanZharmagam1 & many others 🧵
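
To make the task format concrete, here's a hedged Python sketch of the kind of problem a runtime benchmark like this scores (the names baseline_solve/optimized_solve and the scoring are illustrative assumptions, not AlgoTune's actual API): keep the output identical, make it faster.

import time
import numpy as np

def baseline_solve(a):
    # Naive O(n^2) pairwise-distance loop: the reference implementation.
    n = len(a)
    out = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            out[i, j] = abs(a[i] - a[j])
    return out

def optimized_solve(a):
    # Same result via numpy broadcasting, with far less Python overhead.
    a = np.asarray(a)
    return np.abs(a[:, None] - a[None, :])

x = np.random.rand(1000)
t0 = time.perf_counter(); ref = baseline_solve(x); t_ref = time.perf_counter() - t0
t0 = time.perf_counter(); fast = optimized_solve(x); t_fast = time.perf_counter() - t0
assert np.allclose(ref, fast)              # correctness gate: speed only counts if outputs match
print(f"speedup: {t_ref / t_fast:.1f}x")   # score is runtime relative to the baseline

The key design point in this kind of setup: correctness is checked first, and only then does the speedup become the score.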
Love this project: nanoGPT -> recursive self-improvement benchmark. Good old nanoGPT keeps on giving and surprising :) - First I wrote it as a little repo to teach people the basics of training GPTs. - Then it became a target and baseline for my port to direct C/CUDA…
This is the most serious work I've seen on the path to recursive self-improvement (RSI), from Meta AI. It tasks agents with reproducing the chain of human innovations that improved LLMs. Proud that the authors used a scaffold extending AIDE (@WecoAI's core tech)!
The AIRA team @metaai has the ambitious goal of building and training an agent that can do frontier AI research to help the open-source ecosystem leapfrog closed-source LLMs. As a relatively small team, we cannot succeed in this mission without the support of the community, so we'll…
A mental model I find useful: all data acquisition (web scrapes, synthetic data, RL rollouts, etc.) is really an exploration problem 🔍. This perspective has some interesting implications for where AI is heading. Wrote down some thoughts: yidingjiang.github.io/blog/post/expl…
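
One way to make that framing concrete: treat each data source as an arm of a bandit and pick where to sample next by expected learning progress. A minimal hedged sketch in Python (the source names, payoffs, and reward definition below are invented for illustration; they are not from the post):

import math
import random

sources = ["web_scrape", "synthetic", "rl_rollouts"]
counts = {s: 0 for s in sources}
rewards = {s: 0.0 for s in sources}

def measure_improvement(source):
    # Stand-in for the real reward: train on one batch from `source` and
    # measure the drop in validation loss. These payoffs are made up.
    payoff = {"web_scrape": 0.3, "synthetic": 0.5, "rl_rollouts": 0.7}
    return random.random() * payoff[source]

def ucb1(source, t):
    if counts[source] == 0:
        return float("inf")  # force every arm to be tried once
    mean = rewards[source] / counts[source]
    bonus = math.sqrt(2.0 * math.log(t) / counts[source])  # exploration bonus
    return mean + bonus

for t in range(1, 201):
    s = max(sources, key=lambda src: ucb1(src, t))  # pick the most promising source
    counts[s] += 1
    rewards[s] += measure_improvement(s)

print(counts)  # sampling concentrates on the most informative source

Here UCB1 is just a stand-in for any exploration strategy: the collector keeps probing undersampled sources while concentrating budget on the ones that most improve the model.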
Theory of Mind (ToM) is crucial for next-gen LLM agents, yet current benchmarks suffer from multiple shortcomings. Enter 💽 Decrypto, an interactive benchmark for multi-agent reasoning and ToM in LLMs! Work done with @TimonWilli & @j_foerst at @AIatMeta & @FLAIR_Ox 🧵👇
You may not like it, but this is what a world-class AI + hardware team looks like.

The next few years of AI in a nutshell: Reinforcement learning with humans in the loop.
LLMs are both astonishing and brittle. The difference between superintelligence and a stochastic parrot can be as small as a single whitespace character in your prompt.
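
A quick way to see why: tokenization. A hedged sketch using the tiktoken library (my choice of tokenizer; the tweet names no specific one):

import tiktoken  # pip install tiktoken

enc = tiktoken.get_encoding("cl100k_base")
print(enc.encode("Answer:"))   # token ids without a trailing space
print(enc.encode("Answer: "))  # the trailing space yields a different token sequence

Since the model conditions on token ids, two prompts that look identical on screen can be genuinely different inputs.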
Can you predict when the brain is processing speech versus non-speech? The 2025 PNPL Competition 🏆 features two foundational decoding tasks that make efficient use of the PNPL🍍LibriBrain data. The first task to launch will be Speech Detection 🚀. More details: libribrain.com