Tanishq Kumar

@tanishqkumar07

incoming CS PhD student @Stanford, prev math undergrad @Harvard

Boston, SF & Bombay

Joined July 2022

86Following

1KFollowers

Pinned

Tanishq Kumar@tanishqkumar07 · Apr 16

trained a nanoGPT? feeling behind before o4-mini? 🚨🚨i'm open-sourcing beyond-nanoGPT, an internal codebase to help people go from LLM basics to research-level understanding. 🚨🚨 it contains thousands of lines of from-scratch, annotated pytorch implementing advanced…

tanishqkumar07's tweet image. trained a nanoGPT? feeling behind before o4-mini?

🚨🚨i'm open-sourcing beyond-nanoGPT, an internal codebase to help people go from LLM basics to research-level understanding. 🚨🚨

it contains thousands of lines of from-scratch, annotated pytorch implementing advanced…

317

306

374.0K

Pinned

Tanishq Kumar@tanishqkumar07 · Jun 13

i find it entertaining that under the hood, most open source "GRPO" implementations (eg. trl) by default actually implement REINFORCE with monte carlo group-advantages (by not reusing rollouts & making clipping/ratios redundant)

469

Tanishq Kumar@tanishqkumar07 · Jun 23

discussing classic literature with frontier models really exposes their overconfidence. presumably because Dickens appears often in high-quality pretraining corpora, gpt-4o believes it can respond to all my questions without searching, resulting in near-constant hallucination.

987

Tanishq Kumar@tanishqkumar07 · Jun 19

hello friends, i will be in SF for july/aug/some of sept - if you know any summer sublets/rentals in the city i should look at on short notice, dm me :)

698