Mahesh Sathiamoorthy
@madiator
Data curation, RL, post training. Open thoughts. Co-founder @bespokelabsai. Ex-GoogleDeepMind.
Very proud of this work and the team! Nvidia released Nemotron recently, which is a great open reasoning model. The OpenThinker team worked tirelessly and heroically and curated what's arguably the best reasoning data, and got the model to be better than Nemotron (and GPT-4.1)…
Announcing OpenThinker3-7B, the new SOTA open-data 7B reasoning model: improving over DeepSeek-R1-Distill-Qwen-7B by 33% on average over code, science, and math evals. We also release our dataset, OpenThoughts3-1.2M, which is the best open reasoning dataset across all data…
Everything is from scratch at Google. I mean even the hardware!
Same question but for the training stack: a fork of Megatron-LM is used by the Kimi folks I think, but idk about other labs or how far that fork is from the original codebase. Another question is if you're starting a big lab rn, do you start from scratch or fork something like…
Interesting read
the openai IMO news hit me pretty heavy this weekend i'm still in the acute phase of the impact, i think i consider myself a professional mathematician (a characterization some actual professional mathematicians might take issue with, but my party my rules) and i don't think i…
which model are they going to drop? maximally truth seeking model?
Tomorrow should be a huge day for American AI. 🇺🇸
The Veo videos are so much fun. Someone should create a YouTube for Veo videos.
in case you missed it ;)
🆕 Releasing our entire RL + Reasoning track! featuring: • @willccbb, Prime Intellect • @GregKamradt, Arc Prize • @natolambert, AI2/Interconnects • @corbtt, OpenPipe • @achowdhery, Reflection • @ryanmart3n, Bespoke • @ChrSzegedy, Morph with special 3 hour workshop from:…
So it's time to start doing multi agent RL.
Sheryl (@sherylhsu02) was our first hire onto the multi-agent team. Within a few months of joining, she helped to make this possible. We're so lucky to have her on the team!
This is incredible and congrats to the team, but I don't know why half of Twitter is surprised by this result.
1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
I know OpenAI has two employees one called Rhythm and another called Lyric.
Open Thoughts delivers again. Congrats team for a small but powerful reasoning model. Writeup: open-thoughts.ai/blog/ot3_small
📢📢📢 Releasing OpenThinker3-1.5B, the top-performing SFT-only model at the 1B scale! 🚀 OpenThinker3-1.5B is a smaller version of our previous 7B model, trained on the same OpenThoughts3-1.2M dataset.
We are organizing a dinner today for researchers in RL and Data. There are a limited number of slots remaining. Please DM me to join.
Nice, good job Devin!
Cognition has signed a definitive agreement to acquire Windsurf. The acquisition includes Windsurf’s IP, product, trademark and brand, and strong business. Above all, it includes Windsurf’s world-class people, whom we’re privileged to welcome to our team. We are also honoring…
Are you doing RL and at #ICML? Let's chat. DM me if possible!
There are many algorithms for constructing pre-training data mixtures—which one should we use? Turns out: many of them fall under one framework, have similar issues, and can be improved with a straightforward modification. Introducing Aioli! 🧄 1/9
India just needs more traffic signals and more people who will follow those signals. Unnecessary slowdowns because people are going any which way at intersections.
There is no better explanation of reward hacking on the internet than this.