Yufan Zhuang
@yufan_zhuang
ai researcher | research intern @Apple siri | phd student @UCSanDiego | prev @AMD @Meta @MSFTResearch @IBMResearch
🤯Your LLM just threw away 99.9 % of what it knows. Standard decoding samples one token at a time and discards the rest of the probability mass. Mixture of Inputs (MoI) rescues that lost information, feeding it back for more nuanced expressions. It is a brand new…
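For intuition, here is a hedged sketch of the "keep the distribution" idea the tweet describes: instead of feeding back only the embedding of the sampled token, feed a blend of that embedding and the probability-weighted average of all token embeddings. The fixed `beta` mixing weight below is a placeholder; the actual MoI paper estimates the mixture differently, so treat this as an illustration rather than the method itself.

```python
# Hedged sketch of mixing the full output distribution back into the next-step input.
# `beta` is a made-up fixed blend weight; MoI's real weighting scheme is in the paper.
import torch

def moi_step_input(logits, sampled_id, embedding, beta=0.5, temperature=1.0):
    """logits: (vocab,) scores for this step; sampled_id: the token actually drawn;
    embedding: (vocab, d_model) token embedding matrix."""
    probs = torch.softmax(logits / temperature, dim=-1)       # the full probability mass
    expected_emb = probs @ embedding                          # E[embedding] under the model
    sampled_emb = embedding[sampled_id]                       # standard decoding keeps only this
    return beta * sampled_emb + (1.0 - beta) * expected_emb   # "mixture of inputs"

# toy usage with made-up sizes
vocab, d_model = 100, 16
emb, logits = torch.randn(vocab, d_model), torch.randn(vocab)
tok = int(torch.multinomial(torch.softmax(logits, -1), 1))
print(moi_step_input(logits, tok, emb).shape)   # torch.Size([16])
```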

Muon still seems more unstable than AdamW; it needs a soft-clipping trick, similar to qk-norm, to work. While scaling up, we encountered a persistent challenge: training instability caused by exploding attention logits, an issue that occurs more frequently with Muon but less…
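Two generic ways to tame attention logits, sketched for illustration (tanh soft-capping as popularized by Gemma-2-style logit capping, and qk-norm); this is not necessarily the exact mechanism the quoted report uses.

```python
# Illustrative only: generic logit soft-capping and QK-norm, not the quoted report's code.
import torch
import torch.nn.functional as F

def softcap(logits, cap=50.0):
    # smoothly bounds logits to (-cap, cap) instead of letting them explode
    return cap * torch.tanh(logits / cap)

def qk_norm_attention(q, k, v, scale=10.0):
    # QK-norm: L2-normalize queries and keys so logits are bounded by `scale`
    q, k = F.normalize(q, dim=-1), F.normalize(k, dim=-1)
    logits = scale * (q @ k.transpose(-2, -1))     # each entry lies in [-scale, scale]
    return torch.softmax(logits, dim=-1) @ v

q = k = v = torch.randn(2, 8, 16)                  # (batch, seq, head_dim)
print(softcap(torch.tensor([3.0, 300.0])))         # the huge logit is squashed toward 50
print(qk_norm_attention(q, k, v).shape)            # torch.Size([2, 8, 16])
```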
🚨1T LLM released by Moonshot AI, the largest yet. Kimi-K2: a 384-expert MoE with 8 routed experts plus 1 shared expert per token, ~32B active parameters. Context length 128K, MLA attention. Outperforms DeepSeek-V3 at nearly twice its size.
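To make the "8 routed experts + 1 shared expert" spec concrete, here is a toy sketch of top-k MoE routing with an always-on shared expert. Dimensions and expert counts are shrunk and made up; this is not Kimi-K2's implementation.

```python
# Toy MoE with a router, top-k gating, and a shared expert (illustrative, not Kimi-K2 code).
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyMoE(nn.Module):
    def __init__(self, d_model=32, d_ff=64, n_experts=16, top_k=2):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )
        # the shared expert runs on every token regardless of routing
        self.shared = nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
        self.top_k = top_k

    def forward(self, x):                                   # x: (tokens, d_model)
        scores = self.router(x)                             # (tokens, n_experts)
        topw, topi = scores.topk(self.top_k, dim=-1)
        # sparse gates: softmax over the selected experts, zero everywhere else
        gates = torch.zeros_like(scores).scatter(-1, topi, F.softmax(topw, dim=-1))
        out = self.shared(x)                                 # shared expert contribution
        for e, expert in enumerate(self.experts):
            g = gates[:, e : e + 1]
            if g.any():                                      # real systems dispatch only routed
                out = out + g * expert(x)                    # tokens; this toy runs densely
        return out

moe = ToyMoE()
print(moe(torch.randn(5, 32)).shape)   # torch.Size([5, 32])
```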
We have a full-time position for a research scientist on our team at #Apple. The topic is understanding and improving #reasoning abilities of #LLMs. We're also interested in developing new, efficient transformer-based architectures for language modeling, again with reasoning…
🧵 1/8 The Illusion of Thinking: Are reasoning models like o1/o3, DeepSeek-R1, and Claude 3.7 Sonnet really "thinking"? 🤔 Or are they just throwing more compute towards pattern matching? The new Large Reasoning Models (LRMs) show promising gains on math and coding benchmarks,…
Can AI file your taxes? Not yet. 😅 interesting benchmarks arxiv.org/pdf/2507.16126

Proud to introduce Group Sequence Policy Optimization (GSPO), our stable, efficient, and performant RL algorithm that powers the large-scale RL training of the latest Qwen3 models (Instruct, Coder, Thinking) 🚀 📄 huggingface.co/papers/2507.18…
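For readers curious what "sequence policy optimization" means in practice, below is my paraphrase of the core GSPO objective; see the linked report for the exact formulation. The key difference from token-level PPO/GRPO ratios is a length-normalized sequence-level importance ratio, with clipping applied per sequence.

```python
# Sketch of a sequence-level clipped surrogate in the spirit of GSPO (paraphrase, not the
# official implementation). One group-relative advantage per response; one ratio per sequence.
import torch

def gspo_loss(logp_new, logp_old, advantages, mask, eps=0.2):
    """logp_new, logp_old: (batch, seq) per-token log-probs under current / behavior policy
    advantages: (batch,) one advantage per response; mask: (batch, seq) 1 on response tokens."""
    lengths = mask.sum(dim=-1).clamp(min=1)
    # sequence ratio = (pi_new(y|x) / pi_old(y|x)) ** (1 / |y|), computed in log space
    log_ratio = ((logp_new - logp_old) * mask).sum(dim=-1) / lengths
    ratio = log_ratio.exp()
    clipped = ratio.clamp(1 - eps, 1 + eps)
    # maximize the clipped surrogate -> minimize its negation
    return -torch.minimum(ratio * advantages, clipped * advantages).mean()

# toy call just to show the shapes
lp_new, lp_old = torch.randn(4, 6), torch.randn(4, 6)
adv, m = torch.randn(4), torch.ones(4, 6)
print(gspo_loss(lp_new, lp_old, adv, m))
```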
I'm hiring for our AI safety team at xAI! We urgently need strong engineers/researchers to work across all stages of the frontier AI development cycle: data, training, evals, and product. 1. job-boards.greenhouse.io/xai/jobs/47992… 2. job-boards.greenhouse.io/xai/jobs/47992…
HLE has recently become the benchmark to beat for frontier agents. We @FutureHouseSF took a closer look at the chem and bio questions and found about 30% of them are likely invalid based on our analysis and third-party PhD evaluations. 1/7
In this report we describe the 2025 Apple Foundation Models ("AFM"). We also introduce the new Foundation Models framework, which gives app developers direct access to the on-device AFM model. machinelearning.apple.com/research/apple…
Just arrived at ICML. Please drop me a message if you are here and would like to chat. We are hiring.
Excited to present our #ICML2025 paper ActionPiece this *Thursday afternoon*! Come to our poster and let's chat about #Tokenization and Recommendation! 🗓 Thu, July 17 | 🕟 4:30–7:00 PM PDT 📍 East Exhibition Hall A-B, Poster # E-2209
Did you know tokenization for generative recommendation today looks a lot like LLM tokenization did *10 years* ago? Meet ActionPiece, our #ICML2025 Spotlight paper, the first context-aware action tokenizer. 1/5 🧵
Big news: we've figured out how to make a *universal* reward function that lets you apply RL to any agent with: - no labeled data - no hand-crafted reward functions - no human feedback! A 🧵 on RULER
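A rough sketch of the general idea as I read the tweet (not RULER's actual code): an LLM judge scores a group of candidate trajectories relative to each other, and those relative scores become the RL rewards, with no labels, hand-written reward function, or human feedback. The judge model name and prompt below are placeholders, and the sketch assumes the judge replies with clean JSON.

```python
# Hedged sketch of group-relative, judge-based rewards; placeholder model and prompt.
import json
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set

def relative_rewards(task: str, trajectories: list[str]) -> list[float]:
    numbered = "\n\n".join(f"[{i}] {t}" for i, t in enumerate(trajectories))
    prompt = (
        f"Task: {task}\n\nHere are {len(trajectories)} candidate agent trajectories:\n"
        f"{numbered}\n\n"
        "Score each trajectory from 0 to 1 relative to the others and reply with JSON only, "
        'e.g. {"scores": [0.2, 0.9, ...]}.'
    )
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder judge model
        messages=[{"role": "user", "content": prompt}],
    )
    return json.loads(resp.choices[0].message.content)["scores"]

# these group-relative scores can then be plugged into a GRPO-style update as rewards
```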
new blog: How to scale RL to 10^26 FLOPs. everyone is trying to figure out the right way to scale reasoning with RL. ilya compared the Internet to fossil fuel: it may be the only useful data we have, and it's expendable. perhaps we should learn to reason from The Internet (not…
Applications are now open for our fall student programs.
The bottleneck in AI isn't just compute - it's access to diverse, high-quality data, much of which is locked away due to privacy, legal, or competitive concerns. What if there was a way to train better models collaboratively, without actually sharing your data? Introducing…
Introducing FlexOlmo, a new paradigm for language model training that enables the co-development of AI through data collaboration. 🧵
I've always thought the SSM+Transformer hybrid arch has huge potential, especially for long-context use cases; the speed advantage from SSM is unmatched
Now live. A new update to our Jamba open model family 🎉 Same hybrid SSM-Transformer architecture, 256K context window, efficiency gains & open weights. Now with improved grounding & instruction following. Try it on AI21 Studio or download from @huggingface 🤗 More on what…
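To show what "hybrid SSM-Transformer" means structurally, here is a toy sketch of an interleaved stack: most layers are a minimal gated linear recurrence (O(L) in sequence length, constant-size state), with a standard O(L²) attention layer mixed in every few blocks. The layer ratio and dimensions are made up for illustration; this is not Jamba's architecture or code.

```python
# Toy interleaved SSM/attention stack (illustrative only).
import torch
import torch.nn as nn

class ToySSMBlock(nn.Module):
    def __init__(self, d):
        super().__init__()
        self.in_proj, self.out_proj = nn.Linear(d, d), nn.Linear(d, d)
        self.decay = nn.Parameter(torch.rand(d))          # per-channel state decay

    def forward(self, x):                                  # x: (B, L, d)
        u = self.in_proj(x)
        a = torch.sigmoid(self.decay)
        h, out = torch.zeros_like(u[:, 0]), []
        for t in range(u.shape[1]):                        # recurrent scan: O(L), fixed-size state
            h = a * h + (1 - a) * u[:, t]
            out.append(h)
        return x + self.out_proj(torch.stack(out, dim=1))

class ToyAttnBlock(nn.Module):
    def __init__(self, d, heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(d, heads, batch_first=True)

    def forward(self, x):
        return x + self.attn(x, x, x, need_weights=False)[0]

# e.g. one attention layer for every three SSM layers (ratio chosen arbitrarily here)
d_model = 64
layers = [ToyAttnBlock(d_model) if i % 4 == 3 else ToySSMBlock(d_model) for i in range(8)]
model = nn.Sequential(*layers)
print(model(torch.randn(2, 16, d_model)).shape)   # torch.Size([2, 16, 64])
```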
usually this is called a 128K context window... doesn't matter. surprised grok 4 didn't go for long context as gemini did; it's key to large-scale repo-level coding (aka vibe coding)
Grok 4 is coming this week! Very exciting and with a 132k context window!
😵💫 Struggling with 𝐟𝐢𝐧𝐞-𝐭𝐮𝐧𝐢𝐧𝐠 𝐌𝐨𝐄? Meet 𝐃𝐞𝐧𝐬𝐞𝐌𝐢𝐱𝐞𝐫 — an MoE post-training method that offers more 𝐩𝐫𝐞𝐜𝐢𝐬𝐞 𝐫𝐨𝐮𝐭𝐞𝐫 𝐠𝐫𝐚𝐝𝐢𝐞𝐧𝐭, making MoE 𝐞𝐚𝐬𝐢𝐞𝐫 𝐭𝐨 𝐭𝐫𝐚𝐢𝐧 and 𝐛𝐞𝐭𝐭𝐞𝐫 𝐩𝐞𝐫𝐟𝐨𝐫𝐦𝐢𝐧𝐠! Blog: fengyao.notion.site/moe-posttraini……
gonna come in handy for all the “implement kv-cache for me” interviews 😂
Since it's summer, and more or less internship and tech interview season, I made all 30 chapters of my Machine Learning Q and AI book freely available for the summer: sebastianraschka.com/books/ml-q-and… Hope it’s helpful! Happy reading, and good luck if you are interviewing!
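On the "implement kv-cache for me" interview question mentioned above: a minimal single-head sketch of what a KV cache buys you. Keys and values for past tokens are stored once, so each decoding step only computes attention for the newest token instead of re-running the whole prefix. Whiteboard-level illustration, not production code.

```python
# Minimal single-head KV-cache sketch.
import torch

class KVCache:
    def __init__(self):
        self.k = None   # (1, tokens_so_far, d)
        self.v = None

    def append(self, k_new, v_new):
        self.k = k_new if self.k is None else torch.cat([self.k, k_new], dim=1)
        self.v = v_new if self.v is None else torch.cat([self.v, v_new], dim=1)
        return self.k, self.v

def decode_step(x_new, wq, wk, wv, cache):
    # x_new: (1, 1, d) hidden state of just the newest token
    q, k_new, v_new = x_new @ wq, x_new @ wk, x_new @ wv
    k, v = cache.append(k_new, v_new)                               # all keys/values so far
    attn = torch.softmax(q @ k.transpose(-2, -1) / k.shape[-1] ** 0.5, dim=-1)
    return attn @ v                                                 # (1, 1, d)

d = 16
wq, wk, wv = (torch.randn(d, d) for _ in range(3))
cache = KVCache()
for _ in range(5):                                                  # O(1) new work per step
    out = decode_step(torch.randn(1, 1, d), wq, wk, wv, cache)
print(out.shape, cache.k.shape)   # torch.Size([1, 1, 16]) torch.Size([1, 5, 16])
```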