Benny (Yufei) Chen
@the_bunny_chen
https://fireworks.ai Co-founder
Look at that SWE-bench score. Qwen team is killing it @JustinLin610!
>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…
1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
Peak context management tutorial from @peakji x.com/peakji/status/…
After four overhauls and millions of real-world sessions, here are the lessons we learned about context engineering for AI agents: manus.im/blog/Context-E…
After 24 hours of non-stop stress-testing, Kimi-K2-Instruct cleared every benchmark in our playbook. 🚀 Now it’s LIVE on the Fireworks Serverless API. It is the first open-weights SOTA 🔧 agentic tool-caller, holding its own on SWE Bench, Tau2 & AceBench. Same prod weights, zero infra…
And it is not a purely additive process: as the model learns the skill, you can remove the "lessons" from the system prompt simonwillison.net/2025/May/25/cl…
Scaling up RL is all the rage right now; I had a chat with a friend about it yesterday. I'm fairly certain RL will continue to yield more intermediate gains, but I also don't expect it to be the full story. RL is basically "hey this happened to go well (/poorly), let me slightly…
🚀 Exciting news! Fireworks AI is one of the first platforms to offer Llama 3.1 for production use from day one in partnership with @AIatMeta. With expanded context length, multilingual support, and the powerful Llama 3.1 405B model, developers can now leverage unmatched AI…
Fireworks AI has raised $52M in Series B funding led by @sequoia! This round propels our mission to enhance our inference platform and lead the shift to compound AI systems. Huge thanks to our investors @nvidia, @AMD, @MongoDB, @benchmark, Sheryl Sandberg, Frank…
🔥 Firefunction-v2, new open-weights function-calling model🔥 I'm super excited to announce Firefunction-v2, our latest open-weights! - Competitive with GPT-4o at function-calling - 1/10 of GPT-4o cost and 2x the speed - Retains both conversation and function-calling…