Yuchen Jin
@Yuchenj_UW
Co-founder & CTO @hyperbolic_labs 🧑🍳 fun AI systems. Previously at OctoAI (acquired by @nvidia) building @ApacheTVM, PhD @uwcse 🤖
Outperform GPT-3 with @karpathy's llm.c using just 1/3 training tokens ✨ Another day has passed, and I trained GPT-2 (124M) with llm.c for 150B tokens, achieving 35.5% accuracy on HellaSwag. This surpasses the GPT-3 paper’s 33.7% accuracy trained for 300B tokens. It matched the…
Apparently today is the 4th year anniversary of GPT-3! arxiv.org/abs/2005.14165 Which I am accidentally celebrating by re-training the smallest model in the miniseries right now :). HellaSwag 33.7 (Appendix H) almost reached this a few steps ago (though this is only 45% of the…
Heard GPT-5 is imminent, from a little bird. - It’s not one model, but multiple models. It has a router that switches between reasoning, non-reasoning, and tool-using models. - That’s why Sam said they’d “fix model naming”: prompts will just auto-route to the right model. -…
OpenAI and DeepMind models winning IMO golds is super cool, but not surprising if you remember AlphaGo beat Lee Sedol. What’s easy for AI can be hard for humans, and vice versa. That’s Moravec’s Paradox. So yes, AI can win math gold medals and beat humans in competitive coding…
Heard Zuck poached 4 more OpenAI researchers, including some behind the open-source model. how deep are Zuck’s pockets?
I will pay $3000 a month if the male Grok companion is named Andrej and speaks with his voice.

Dinner with OpenAI friends feels like a Cold War thriller. Their eyes sweep the restaurant - subtle, practiced, alert. Then they lean in and whisper: “So… did Zuck email you?”
Looks like Meta is turning into another OpenAI. As a Chinese, I never thought we’d end up relying on China to keep open source AI alive.
ChatGPT wants to replace Google. Claude wants to replace software engineers. Grok wants to replace your wife.
All those “GPT-5 leaks” are fake: – it’s not launching July 31 – those “benchmarks” are just made up random bar charts The reason is simple: no one has seen that GPT5-final-final yet, not even Sam. My best guess is September.
Rumors that OpenAI delayed their open-source model because of Kimi are fun, but from what I hear: - the model is much smaller than Kimi K2 (<< 1T parameters) - super powerful - but due to some (frankly absurd) reason I can’t say, they realized a big issue just before release, so…
Google is the greatest acquirer of all time: 2005: Android 2006: Youtube 2013: Hinton, Ilya & Alex 2014: DeepMind now: Windsurf, snatched from OpenAI’s hands 4D chess move!