surya

@suryaasub

working on triton and pytorch distributed @aiatmeta, ml systems @georgiatech

bay + atl

Joined August 2020

717 Following

138 Followers

surya@suryaasub · Jul 23

humanity needs this reunion

Elon Musk@elonmusk · Jul 23

Andrej, my long lost brother, let us work together again!

surya@suryaasub · Jul 17

perhaps a great eval opportunity here!

Edward Z. Yang@ezyang · Jul 17

PyTorch is sooo hard to use LLMs on 😂

surya Retweeted

Misha Laskin@MishaLaskin · Jul 16

Engineers spend 70% of their time understanding code, not writing it. That’s why we built Asimov at @reflection_ai. The best-in-class code research agent, built for teams and organizations.

surya@suryaasub · Jul 14

holy shit. cognition to the rescue. happy the early windsurf engs finally got rewarded for their hard work!

Cognition@cognition_labs · Jul 14

Cognition has signed a definitive agreement to acquire Windsurf. The acquisition includes Windsurf’s IP, product, trademark and brand, and strong business. Above all, it includes Windsurf’s world-class people, whom we’re privileged to welcome to our team. We are also honoring…

surya Retweeted

Tri Dao@tri_dao · Jul 10

I really like Phil Tillet's framing of different tools having different tradeoffs in productivity and performance: torch compile, triton, CUDA, PTX. It's still early but CuTe-DSL and similar Python-based DSL might bend this curve. And soon we can probably get LLMs to generate…

surya Retweeted

TBPN@tbpn · Jun 30

surya Retweeted

will brown@willccbb · Jun 18

everybody wants to do fun experiments nobody wants to write core infrastructure code

surya Retweeted

Elon Musk@elonmusk · Jun 7

Most entertaining outcome is most likely

surya Retweeted

Greg Brockman@gdb · May 24

o3 for finding a security vulnerability in the Linux kernel: sean.heelan.io/2025/05/22/how…

surya@suryaasub · May 9

my exhaustive list of productivity enhancing habits i have stuck to for more than a year:
- sleep consistently and 8h
- eat less fast carbs like bread or sugar
- meditation
- weight lifting + running
- no phone first hour after waking up

gabriel@GabrielPeterss4 · May 9

a couple times every year i feel super productive, and attribute it to some random thing i changed in my habits like:
- oh i ate this yesterday
- i did this thing x before sleep
- i took this random vitamin

extremely few things actually stick around

surya Retweeted

Sam Altman@sama · Apr 1

when the run name ends like this you know it's surely going to work this time

-restart-0331-final-final2-restart-forreal-omfg3

surya Retweeted

Laura Modiano@LauraModiano · Mar 14

I was at the office today and spoke to an OpenAI researcher who said he never studied CS formally, learned everything on ChatGPT making his own learning plan with relentless focus. If you want to be more technical there is no longer an excuse!

surya@suryaasub · Feb 8

sonnet 3.6 + windsurf might be agi

surya Retweeted

Logan Kilpatrick@OfficialLoganK · Feb 8

I end almost every night wishing I had more time in the day to work.

surya@suryaasub · Jan 31

why not 100x?

Paul Graham@paulg · Jan 30

This may be the most inspiring sentence I've ever read. Which is interesting because it's not phrased in the way things meant to be inspiring usually are.

surya@suryaasub · Jan 23

absolutely - i think it comes down to a small team having full visibility of the entire stack and thinking about everything from first principles (incredible how powerful this is!)

plus being compute-constrained forces them to get creative for training/inference (e.g. mla)

Andrew Carr (e/🤸)@andrew_n_carr · Jan 23

I completely believe DeepSeek is making such good progress because the whole team is so close to hardware. Many many other big labs are so far removed from hardware and that changes many aspects of ability and velocity. It's more painful, but pain is good for research.

surya@suryaasub · Jan 20

deepseek is rewriting history rn!!

DeepSeek@deepseek_ai · Jan 20

🚀 DeepSeek-R1 is here!

⚡ Performance on par with OpenAI-o1
📖 Fully open-source model & technical report
🏆 MIT licensed: Distill & commercialize freely!
🌐 Website & API are live now! Try DeepThink at chat.deepseek.com today!

🐋 1/n

surya@suryaasub · Jan 17

beyond just grabbing top-k: softmax probabilities have some p cool signals ab what your model actually thinks

seen this from MoE routing to transformer logit probs - leveraging these confidence scores is a promising direction

exciting to see more research in this space lately!
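
A rough sketch of what "signals beyond top-k" could look like, in plain Python. The post does not say which signals it has in mind, so the three here (top-1 probability, top-1 minus top-2 margin, and entropy) are assumptions picked for illustration, and `confidence_signals` is a hypothetical helper name rather than anything from a library.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def confidence_signals(logits):
    """Hypothetical helper: summarize how confident a softmax distribution is.

    Returns the top-1 probability, the gap between the two most likely
    entries, and the Shannon entropy (in nats) of the whole distribution.
    """
    probs = softmax(logits)
    ranked = sorted(probs, reverse=True)
    entropy = -sum(p * math.log(p) for p in probs if p > 0.0)
    return {
        "top1": ranked[0],
        "margin": ranked[0] - ranked[1],
        "entropy": entropy,
    }


# A peaked distribution (one clear winner) vs. a flat one (no preference).
peaked = confidence_signals([8.0, 1.0, 0.5, 0.2])
flat = confidence_signals([1.0, 1.0, 1.0, 1.0])
```

Both inputs yield a top-1 pick, but the peaked case has a large margin and low entropy while the flat case has zero margin and the maximum entropy for four options. That difference is information a bare top-k selection throws away.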

surya Retweeted

adi@adonis_singh · Dec 25

o1 pro asked to give one insight about humans

surya Retweeted

Greg Brockman@gdb · Dec 2

vibe checks are great evals
