Zhihu Frontier

@ZhihuFrontier

🚀Bringing China's AI & tech trends, voices, and perspectives to the global stage. ⚡️Powered by Zhihu, China's leading knowledge platform.

Beijing

Joined June 2025

42Following

281Followers

Pinned

Zhihu Frontier@ZhihuFrontier · Jun 12

🚀 Zhihu Frontier is now live. We're here to bring China's AI & tech trends, voices, and perspectives to the global stage — powered by Zhihu Inc., China's leading knowledge platform. 🧭 Exploring the frontiers of technology and ideas. 📌 From developer insights to deep tech…

ZhihuFrontier's tweet image. 🚀 Zhihu Frontier is now live.
We're here to bring China's AI &amp; tech trends, voices, and perspectives to the global stage — powered by Zhihu Inc., China's leading knowledge platform.
🧭 Exploring the frontiers of technology and ideas.
📌 From developer insights to deep tech…

2.0K

Zhihu Frontier@ZhihuFrontier · Jul 25

🎤 At #WAIC2025, Zhihu brings the Discussion Arena to life: real people, real debates, immersive Q&A and exclusive swag! 📅 July 26–29 📍 Shanghai Expo Exhibition Center, H1-D1339 1. Live Debates - Join real convos on AI, drop a “like” sticker on views you vibe with 💬 2. Zhida…

ZhihuFrontier's tweet image. 🎤 At #WAIC2025, Zhihu brings the Discussion Arena to life: real people, real debates, immersive Q&amp;A and exclusive swag!
📅 July 26–29
📍 Shanghai Expo Exhibition Center, H1-D1339

1. Live Debates - Join real convos on AI, drop a “like” sticker on views you vibe with 💬
2. Zhida…

200

Zhihu Frontier@ZhihuFrontier · Jul 23

🚀 Zhihu Frontier launches the "Unofficial AI Product Drop Zone" - a promo initiative to help devs connect directly with users & get real feedback in real context. ⭐ We support: New/updated AI apps (e.g. Agents) Foundation & vertical models (incl. multimodal / on-device)…

ZhihuFrontier's tweet image. 🚀 Zhihu Frontier launches the "Unofficial AI Product Drop Zone" - a promo initiative to help devs connect directly with users &amp; get real feedback in real context.

⭐ We support:
New/updated AI apps (e.g. Agents)
Foundation &amp; vertical models (incl. multimodal / on-device)…

189

Zhihu Frontier@ZhihuFrontier · Jul 23

💻 Zhihu contributor toyama nao reviews Alibaba's open-source coding model Qwen3-Coder: 🧠 TL;DR: One model, shaking up the coding LLM landscape. 📊 Basics: Cost: $5 / 1M tokens Avg: ~200 lines/code (13% comments) Speed: ~160 tokens/sec Avg time per task: 46s ✅ Strengths:…

291

Zhihu Frontier@ZhihuFrontier · Jul 22

🔥 What’s the verdict on Trae 2.0 and its brand-new SOLO mode? How does it feel in action? 👉 Check out more reviews & deep dives on Zhihu: zhihu.com/question/19307…

TTrae@Trae_ai · Jul 21

Introducing TRAE SOLO: the first all-in-one Context Engineer, built right into your dev workspace.🤖 SOLO thinks, plans, builds, tests, and ships complete software with deep context and the right tools. From idea to production, you only need one piece of software now.

208

Zhihu Frontier@ZhihuFrontier · Jul 21

🔥 Zhihu contributor & Ex Meta RL lead, Stanford PhD @ZheqingZhu just dropped his new startup: Pokee - a universal AI agent built for next-level workflows. He shared his founder's journey and unique insights into building Agent products on Zhihu - worth the read if you're into…

ZhihuFrontier's tweet image. 🔥 Zhihu contributor &amp; Ex Meta RL lead, Stanford PhD @ZheqingZhu just dropped his new startup: Pokee - a universal AI agent built for next-level workflows. He shared his founder's journey and unique insights into building Agent products on Zhihu - worth the read if you're into…

208

Zhihu Frontier@ZhihuFrontier · Jul 17

📢 ICML 2025 | From Language to Vision: VARSR unlocks a new paradigm for Image Super-Resolution Recent advances show autoregressive models (AR) - successful in NLP - are now thriving in vision tasks (like DALL·E, GPT-4o). Compared to Diffusion Models, AR methods better capture…

ZhihuFrontier's tweet image. 📢 ICML 2025 | From Language to Vision: VARSR unlocks a new paradigm for Image Super-Resolution

Recent advances show autoregressive models (AR) - successful in NLP - are now thriving in vision tasks (like DALL·E, GPT-4o). Compared to Diffusion Models, AR methods better capture…

216

Zhihu Frontier@ZhihuFrontier · Jul 17

🧠 Zhihu contributor & @Kimi_Moonshot senior researcher @Jianlin_S shares insights on K2’s architecture & optimization: 🔥One key breakthrough? Solving the MaxLogit explosion with a combo of Muon + QK-Clip (Technical details see previous post👇) 🧩 More reflections on model…

ZZhihu Frontier@ZhihuFrontier · Jul 14

🚀 Zhihu contributor & @Kimi_Moonshot senior researcher @Jianlin_S drops a new piece: "QK-Clip: Taking Muon Further on the Scaleup Journey" While scaling Muon to 100B+ params, a new bottleneck hit: MaxLogit explosion 💥 Enter QK-Clip - a post-hoc fix to Q/K weights (unlike…

592

Zhihu Frontier@ZhihuFrontier · Jul 16

🧠Zhihu contributor & @Kimi_Moonshot dev Dylan shares his thoughts on building Kimi K2: Why RL? Because compute may be infinite, but data is not. RL improves data efficiency - that's why we invest in scaling test-time compute. Why large models? Why Muon optimizer? → It's all…

ZhihuFrontier's tweet image. 🧠Zhihu contributor &amp; @Kimi_Moonshot dev Dylan shares his thoughts on building Kimi K2:

Why RL? Because compute may be infinite, but data is not. RL improves data efficiency - that's why we invest in scaling test-time compute.
Why large models? Why Muon optimizer?
→ It's all…

9.0K

Zhihu Frontier@ZhihuFrontier · Jul 15

🤖 Zhihu contributor & @Kimi_Moonshot RL Lead @RotekSong shares how they pushed Kimi K2 toward better general-purpose Agent abilities - by scaling up tool-use data. In short: a fully automated agent data factory 🏭 that simulates end-to-end workflows to filter out high-quality…

ZhihuFrontier's tweet image. 🤖 Zhihu contributor &amp; @Kimi_Moonshot RL Lead @RotekSong shares how they pushed Kimi K2 toward better general-purpose Agent abilities - by scaling up tool-use data.
In short: a fully automated agent data factory 🏭 that simulates end-to-end workflows to filter out high-quality…

14.0K

Zhihu Frontier@ZhihuFrontier · Jul 14

ZhihuFrontier's tweet image. 🚀 Zhihu contributor &amp; @Kimi_Moonshot senior researcher @Jianlin_S drops a new piece:
"QK-Clip: Taking Muon Further on the Scaleup Journey"

While scaling Muon to 100B+ params, a new bottleneck hit: MaxLogit explosion 💥
Enter QK-Clip - a post-hoc fix to Q/K weights (unlike…

12.0K

Zhihu Frontier@ZhihuFrontier · Jul 14

🎉 Great to see @Kimi_Moonshot infra dev 刘少伟 sharing why Kimi K2's config "looks the way it does" - from an inference perspective 🤖 🧩 Constraints: Inherits DSv3 structure Adjust internal model parameters to fit needs Training & inference cost ≈ DSv3 🎯 Goal: Lower loss…

KKimi.ai@Kimi_Moonshot · Jul 14

Some thoughts on the decisions behind Kimi K2's architecture - from our infra staff

453

Zhihu Frontier@ZhihuFrontier · Jul 13

🚀🔥 @Kimi_Moonshot drops the K2 model — now #1 trending on Hugging Face! 💡 Devs break it down, tech folks dive deep. 👀📎 Full discussion & hands-on reviews on Zhihu: zhihu.com/question/19271… #Kimi #K2Model #LLM #HuggingFace #AI

HHugging Face@huggingface · Jul 12

Kimi K2 is number one trending on HF, congrats!

336

Zhihu Frontier@ZhihuFrontier · Jul 11

🚀 Kimi releases its first trillion-parameter open-source agentic model — K2! What's new? What's powerful? What's next? 🤖🔥 Full discussions, hands-on reviews & benchmarks now live on Zhihu: 👉 zhihu.com/question/19271… Welcome to join the convo! 🧠💬 Try it now at…

KKimi.ai@Kimi_Moonshot · Jul 11

🚀 Hello, Kimi K2! Open-Source Agentic Model! 🔹 1T total / 32B active MoE model 🔹 SOTA on SWE Bench Verified, Tau2 & AceBench among open models 🔹Strong in coding and agentic tasks 🐤 Multimodal & thought-mode not supported for now With Kimi K2, advanced agentic intelligence…

351

Zhihu Frontier@ZhihuFrontier · Jul 11

🧠 Zhihu contributor @langfengq shared his thinkings about open-sourced verl-agent - an RL framework designed to train reasoning-capable LLM agents! It extends veRL, and unlike methods that simply concatenate full interaction history, verl-agent treats each step as an…

ZhihuFrontier's tweet image. 🧠 Zhihu contributor @langfengq shared his thinkings about open-sourced verl-agent - an RL framework designed to train reasoning-capable LLM agents!

It extends veRL, and unlike methods that simply concatenate full interaction history, verl-agent treats each step as an…

472

Zhihu Frontier@ZhihuFrontier · Jul 10

💬 How do people view Grok 4, the new-gen model from Musk’s xAI? What are its standout features? 🔗Dive into discussions from Chinese tech community Zhihu users here: zhihu.com/question/19246… #Grok4 #xAI #AI

xxAI@xai · Jul 10

Introducing Grok 4, the world's most powerful AI model. Watch the livestream now: x.com/i/broadcasts/1…

231

Zhihu Frontier@ZhihuFrontier · Jul 8

Zhihu contributor 周国睿 has spent the past year exploring a big Q: 👉 Can we take recommender systems to the next level? Based on hands-on work with OneRec, they found E2E recsys might be the future. Here are his key thoughts: 🔍How big should a rec model be? ⚙️How to scale…

ZhihuFrontier's tweet image. Zhihu contributor 周国睿 has spent the past year exploring a big Q:
👉 Can we take recommender systems to the next level?

Based on hands-on work with OneRec, they found E2E recsys might be the future. Here are his key thoughts:
🔍How big should a rec model be?
⚙️How to scale…

253

Zhihu Frontier@ZhihuFrontier · Jul 7

🧠 Zhihu contributor @jackbai_jkb shares insights from his paper "Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction". How to do multi-step training on a model post-trained in single-step environments? They found the ideal training setup should be: ⚙️ no…

ZhihuFrontier's tweet image. 🧠 Zhihu contributor @jackbai_jkb shares insights from his paper "Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction".

How to do multi-step training on a model post-trained in single-step environments? They found the ideal training setup should be:
⚙️ no…

478

Zhihu Frontier@ZhihuFrontier · Jul 4

「Tech Insight」 🔥 #HackerNews Top Post: "The new skill in AI is not prompting, it's context engineering" — thoughts? 💬 Zhihu contributor Navis Li: What really makes or breaks your AI app is what kind of "ingredients" you feed it. Context Engineering = crafting the entire…

ZhihuFrontier's tweet image. 「Tech Insight」
🔥 #HackerNews Top Post: "The new skill in AI is not prompting, it's context engineering" — thoughts?

💬 Zhihu contributor Navis Li:
What really makes or breaks your AI app is what kind of "ingredients" you feed it.
Context Engineering = crafting the entire…

158