Super Dario
@inductionheads
Autoregression is the secret sauce
The most important thing to understand for continuous learning is that memory is a tool. Agents must be RLed to learn how to use their own memories
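A minimal sketch of the "memory is a tool" idea: the agent only touches memory through explicit read/write calls, and reward comes from downstream task success, so an RL-trained policy has to learn when storing and retrieving pays off. All names here (MemoryTool, run_episode, the policy interface) are hypothetical illustrations, not anyone's actual implementation.

```python
# Sketch: memory exposed to the agent only as a tool, with RL credit assignment
# (task reward at the end of the episode) teaching the policy when to use it.
from dataclasses import dataclass, field

@dataclass
class MemoryTool:
    notes: list[str] = field(default_factory=list)

    def write(self, note: str) -> str:
        self.notes.append(note)
        return f"stored note #{len(self.notes)}"

    def read(self, query: str) -> str:
        # Toy retrieval: return notes that share a word with the query.
        hits = [n for n in self.notes
                if set(query.lower().split()) & set(n.lower().split())]
        return "\n".join(hits) if hits else "(no matching notes)"

def run_episode(policy, task, memory: MemoryTool) -> float:
    """policy(observation) -> {"tool": "write"|"read"|"answer", "arg": str}.
    Reward is 1.0 only for a correct final answer, so the policy is rewarded
    for memory use only insofar as it helps the task."""
    observation = task["prompt"]
    for _ in range(8):  # small step budget
        action = policy(observation)
        if action["tool"] == "write":
            observation = memory.write(action["arg"])
        elif action["tool"] == "read":
            observation = memory.read(action["arg"])
        else:
            return 1.0 if action["arg"] == task["answer"] else 0.0
    return 0.0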
much more empirical/interpy work needed to understand why RL with CoT is so much better than without (not looking for theoretical explanations like test-time scaling expressivity or latent variable expectation maximization)
pictured: non-reasoning model doing non-reasoning things
It's becoming more and more clear that Claude Code is the everything agent
Welcome home Ilya
He said there have been over 70 million user videos made with Veo 3, and also mentioned Ilya's SSI. He said Safe Superintelligence will exclusively use Google TPUs.
Andrej, my long lost brother, let us work together again!
Thrilled to introduce "Deep Researcher with Test-Time Diffusion," a new deep research agent designed to mimic the iterative nature of human research, complete with cycles of planning, drafting, and revision. 🚀🚀 arxiv.org/pdf/2507.16075
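A rough sketch of the plan, draft, and revise loop the announcement describes. This is only an illustration of the idea, not the paper's algorithm; `llm(prompt)` and `search(query)` are hypothetical stand-ins for a language model call and a retrieval tool.

```python
def deep_research(question: str, llm, search, num_revisions: int = 4) -> str:
    # Plan, then write a rough first draft.
    plan = llm(f"Write a research plan (key sub-questions) for: {question}")
    draft = llm(f"Write a first-pass report on: {question}\nPlan:\n{plan}")
    for _ in range(num_revisions):
        # One revision pass: find weak spots, gather evidence, rewrite the draft.
        gaps = llm(f"List the weakest or least-supported claims in this report:\n{draft}")
        evidence = search(gaps)
        draft = llm(
            "Revise the report below. Keep what is well supported and "
            "integrate the new evidence.\n\n"
            f"Report:\n{draft}\n\nNew evidence:\n{evidence}"
        )
    return draft
```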
The point isn't that IMO problems are representative of the things we want these models to do, necessarily. The point is they are more closely representative of the ability to build new architectures for systems that can
A blue whale eats between 30 and 50 million krill a day. They can live to be over 100 years old. Trillions of krill tortured to death for the benefit of a single animal life. Is it worth it? Let us exterminate the blue whale!
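A back-of-the-envelope check of the "trillions" claim, using only the figures in the post (treating the daily intake as year-round, which is the post's own simplification):

```python
# Lifetime krill consumption implied by the post's numbers.
krill_per_day_low, krill_per_day_high = 30e6, 50e6
days_per_year = 365
lifetime_years = 100

low = krill_per_day_low * days_per_year * lifetime_years    # ~1.1e12
high = krill_per_day_high * days_per_year * lifetime_years  # ~1.8e12
print(f"lifetime krill: {low:.2e} to {high:.2e}")  # roughly 1 to 2 trillion
```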
Okay let's clarify some things. Link below
What a dearth of imagination. Build orphanages. Fund telomere research. Hell, construct a mechasuit. Build a blimp city. Create a digital tongue so you can send taste over the internet. Create a swarm of drones that give you giant stereoscopic vision of the earth. Fly your drone…
RL went from not working at all to working so well that code can have major correctness bugs and you don't notice because it still just works
>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…
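For context, a minimal sketch of how an open checkpoint like this is typically loaded with Hugging Face transformers. The repo id below is inferred from the model name in the announcement and may differ, and in practice a 480B-parameter MoE needs a multi-GPU serving stack rather than a single-process script like this.

```python
# Hedged usage sketch: loading an open checkpoint with Hugging Face transformers.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen3-Coder-480B-A35B-Instruct"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype="auto")

messages = [{"role": "user", "content": "Write a Python function that checks if a number is prime."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```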
x.com/AnthropicAI/st…
In a joint paper with @OwainEvans_UK as part of the Anthropic Fellows Program, we study a surprising phenomenon: subliminal learning. Language models can transmit their traits to other models, even in what appears to be meaningless data. x.com/OwainEvans_UK/…
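A schematic of the kind of experiment the thread describes, as I understand the summary: a teacher model that has some trait generates data that looks meaningless (for example, bare number sequences), a student is fine-tuned on that data, and the trait is then measured in the student. `generate`, `finetune`, `measure_trait`, and the "likes owls" trait are hypothetical placeholders, not the paper's code.

```python
def subliminal_learning_experiment(teacher, student, generate, finetune, measure_trait):
    # 1. Teacher (which has the trait) produces ostensibly trait-free data.
    prompts = ["Continue this sequence of numbers: 381, 204, 77, ..."] * 1000
    data = [generate(teacher, p) for p in prompts]

    # 2. Filter out anything that mentions the trait explicitly.
    data = [d for d in data if "owl" not in d.lower()]  # example trait: "likes owls"

    # 3. Fine-tune the student on the filtered data and test for the trait.
    before = measure_trait(student)
    student = finetune(student, data)
    after = measure_trait(student)
    return before, after  # the surprising result: `after` shifts toward the teacher's trait
```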
Willing to bet we’ll all converge on the same approach for long-form answer reasoning. See you next year :)
Congrats to the GDM team on their IMO result! I think their parallel success highlights how fast AI progress is. Their approach was a bit different from ours, but I think that shows there are many research directions for further progress. Some thoughts on our model and results 🧵