rohan anil
@_arohan_
all about training algorithms & efficiency. @AnthropicAI Ex: Meta (2025), Google DeepMind, Google Brain, Sibyl (2013-2024). Views are my own.
A Saturday reminder to all new followers that Shampoo stands for a preconditioner. It’s called Shampoo because that’s what comes pre/before using a conditioner.
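For anyone curious what "preconditioner" means here, below is a minimal sketch of a Shampoo-style update for a single weight matrix, in the spirit of Gupta et al. (2018). It is my own illustration, not the production optimizer; names and hyperparameters are assumptions.

```python
# Sketch of one Shampoo-style preconditioned step for a 2-D weight matrix.
import numpy as np

def shampoo_step(W, G, L, R, lr=1e-2, eps=1e-6):
    """Accumulate Kronecker-factor statistics, then precondition the gradient G."""
    L += G @ G.T          # left statistics  (out_dim x out_dim)
    R += G.T @ G          # right statistics (in_dim  x in_dim)

    def inv_fourth_root(M):
        # Matrix inverse fourth root via eigendecomposition; real implementations
        # use iterative root solvers and refresh this only occasionally.
        vals, vecs = np.linalg.eigh(M + eps * np.eye(M.shape[0]))
        return vecs @ np.diag(vals ** -0.25) @ vecs.T

    precond_G = inv_fourth_root(L) @ G @ inv_fourth_root(R)
    return W - lr * precond_G, L, R
```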
With the use of computers, docs, and now LLMs, I think I am losing my ability to think with pen and paper, or at least it feels foreign to me. I wonder if I am losing access to some circuits in my brain to think better.
I’ve joined Cognition to continue to work on the future of software engineering. I was employee #2 at Windsurf and have worked on AI+code for years. There’s never been a more exciting time and place for it than now at Cognition. I had a place at Google DeepMind as part of the…
Dang! This is very good!
Compared to the first version in our paper, this code removes problem-specific hints completely. It just works!
Code release! 🚀 Following up on our IMO 2025 results with the public LLM Gemini 2.5 Pro — here’s the full pipeline & general (non-problem-specific) prompts. 👉 [github.com/lyang36/IMO25] Have fun exploring! #AI #Math #LLMs #IMO2025
🚨 Olympiad math + AI: We ran Google’s Gemini 2.5 Pro on the fresh IMO 2025 problems. With careful prompting and pipeline design, it solved 5 out of 6 — remarkable for tasks demanding deep insight and creativity. The model could win gold! 🥇 #AI #Math #LLMs #IMO2025
It's becoming more and more clear that Claude Code is the everything agent
HLE has recently become the benchmark to beat for frontier agents. We @FutureHouseSF took a closer look at the chem and bio questions and found about 30% of them are likely invalid based on our analysis and third-party PhD evaluations. 1/7
Really enjoyed reading this work! One way I tried to explain subliminal learning is by drawing a parallel to watermarking text, which generally works by biasing generation at each step toward one partition of the token vocabulary (the partitioning happens at every step using a private…
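A hedged sketch of the watermarking scheme the analogy refers to ("green list" watermarking in the style of Kirchenbauer et al., 2023): a private key plus the previous token seeds a PRNG that partitions the vocabulary at each step, and the logits of one partition get a small boost. The function and parameter names here (bias_logits, key, delta) are illustrative, not a specific library API.

```python
import numpy as np

def bias_logits(logits, prev_token, key, green_fraction=0.5, delta=2.0):
    """Return logits with this step's 'green' partition boosted by delta."""
    vocab_size = logits.shape[0]
    # Re-derivable partition: seed a PRNG with (private key, previous token),
    # both integers, so a detector holding the key can recompute it later.
    rng = np.random.default_rng([key, prev_token])
    green = rng.permutation(vocab_size)[: int(green_fraction * vocab_size)]
    biased = logits.copy()
    biased[green] += delta  # nudge sampling toward the green set
    return biased
```

A detector with the key recomputes each step's partition and counts how often the sampled tokens landed in their green set; far above chance means watermarked. The parallel to subliminal learning: the bias is invisible in any single token but statistically recoverable across many.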
Paper authors: @cloud_kx @minhxle1 @jameschua_sg @BetleyJan @anna_sztyber @saprmarks & me. Arxiv pdf: arxiv.org/abs/2507.14805 Blogpost: alignment.anthropic.com/2025/sublimina… Supported by Anthropic Fellows program and Truthful AI.
Subliminal learning may be a general property of neural net learning. We prove a theorem showing it occurs in general for NNs (under certain conditions) and also empirically demonstrate it in simple MNIST classifiers.
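Roughly the flavor of that MNIST demonstration, as I read it: a student trained only to match a teacher's logits on inputs unrelated to the task can still pick up some of the teacher's behavior. The architecture, noise inputs, and hyperparameters below are my assumptions for illustration, not the paper's exact setup.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def make_mlp():
    return nn.Sequential(nn.Flatten(), nn.Linear(784, 256), nn.ReLU(), nn.Linear(256, 10))

def distill_on_noise(teacher, steps=1000, batch=128, lr=1e-3):
    """Train a fresh student to imitate the teacher on pure-noise images only."""
    student = make_mlp()
    opt = torch.optim.Adam(student.parameters(), lr=lr)
    teacher.eval()
    for _ in range(steps):
        x = torch.rand(batch, 1, 28, 28)      # auxiliary inputs: no real digits at all
        with torch.no_grad():
            target = teacher(x)
        loss = F.kl_div(F.log_softmax(student(x), dim=-1),
                        F.softmax(target, dim=-1), reduction="batchmean")
        opt.zero_grad(); loss.backward(); opt.step()
    return student  # evaluating this student on real MNIST is the interesting part
```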
New paper & surprising result. LLMs transmit traits to other models via hidden signals in data. Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵
In a joint paper with @OwainEvans_UK as part of the Anthropic Fellows Program, we study a surprising phenomenon: subliminal learning. Language models can transmit their traits to other models, even in what appears to be meaningless data. x.com/OwainEvans_UK/…
Everyone get your top 1% quality dataset and train 100 epochs right now