Zhihu Frontier
@ZhihuFrontier
🚀Bringing China's AI & tech trends, voices, and perspectives to the global stage. ⚡️Powered by Zhihu, China's leading knowledge platform.
🚀 Zhihu Frontier is now live. We're here to bring China's AI & tech trends, voices, and perspectives to the global stage — powered by Zhihu Inc., China's leading knowledge platform. 🧭 Exploring the frontiers of technology and ideas. 📌 From developer insights to deep tech…

🎤 At #WAIC2025, Zhihu brings the Discussion Arena to life: real people, real debates, immersive Q&A and exclusive swag! 📅 July 26–29 📍 Shanghai Expo Exhibition Center, H1-D1339 1. Live Debates - Join real convos on AI, drop a “like” sticker on views you vibe with 💬 2. Zhida…




🚀 Zhihu Frontier launches the "Unofficial AI Product Drop Zone" - a promo initiative to help devs connect directly with users & get real feedback in real context. ⭐ We support: New/updated AI apps (e.g. Agents) Foundation & vertical models (incl. multimodal / on-device)…


💻 Zhihu contributor toyama nao reviews Alibaba's open-source coding model Qwen3-Coder: 🧠 TL;DR: One model, shaking up the coding LLM landscape. 📊 Basics: Cost: $5 / 1M tokens Avg: ~200 lines/code (13% comments) Speed: ~160 tokens/sec Avg time per task: 46s ✅ Strengths:…
🔥 What’s the verdict on Trae 2.0 and its brand-new SOLO mode? How does it feel in action? 👉 Check out more reviews & deep dives on Zhihu: zhihu.com/question/19307…
Introducing TRAE SOLO: the first all-in-one Context Engineer, built right into your dev workspace.🤖 SOLO thinks, plans, builds, tests, and ships complete software with deep context and the right tools. From idea to production, you only need one piece of software now.
🔥 Zhihu contributor & Ex Meta RL lead, Stanford PhD @ZheqingZhu just dropped his new startup: Pokee - a universal AI agent built for next-level workflows. He shared his founder's journey and unique insights into building Agent products on Zhihu - worth the read if you're into…




📢 ICML 2025 | From Language to Vision: VARSR unlocks a new paradigm for Image Super-Resolution Recent advances show autoregressive models (AR) - successful in NLP - are now thriving in vision tasks (like DALL·E, GPT-4o). Compared to Diffusion Models, AR methods better capture…



🧠 Zhihu contributor & @Kimi_Moonshot senior researcher @Jianlin_S shares insights on K2’s architecture & optimization: 🔥One key breakthrough? Solving the MaxLogit explosion with a combo of Muon + QK-Clip (Technical details see previous post👇) 🧩 More reflections on model…
🚀 Zhihu contributor & @Kimi_Moonshot senior researcher @Jianlin_S drops a new piece: "QK-Clip: Taking Muon Further on the Scaleup Journey" While scaling Muon to 100B+ params, a new bottleneck hit: MaxLogit explosion 💥 Enter QK-Clip - a post-hoc fix to Q/K weights (unlike…
🧠Zhihu contributor & @Kimi_Moonshot dev Dylan shares his thoughts on building Kimi K2: Why RL? Because compute may be infinite, but data is not. RL improves data efficiency - that's why we invest in scaling test-time compute. Why large models? Why Muon optimizer? → It's all…




🤖 Zhihu contributor & @Kimi_Moonshot RL Lead @RotekSong shares how they pushed Kimi K2 toward better general-purpose Agent abilities - by scaling up tool-use data. In short: a fully automated agent data factory 🏭 that simulates end-to-end workflows to filter out high-quality…



🚀 Zhihu contributor & @Kimi_Moonshot senior researcher @Jianlin_S drops a new piece: "QK-Clip: Taking Muon Further on the Scaleup Journey" While scaling Muon to 100B+ params, a new bottleneck hit: MaxLogit explosion 💥 Enter QK-Clip - a post-hoc fix to Q/K weights (unlike…




🎉 Great to see @Kimi_Moonshot infra dev 刘少伟 sharing why Kimi K2's config "looks the way it does" - from an inference perspective 🤖 🧩 Constraints: Inherits DSv3 structure Adjust internal model parameters to fit needs Training & inference cost ≈ DSv3 🎯 Goal: Lower loss…
Some thoughts on the decisions behind Kimi K2's architecture - from our infra staff
🚀🔥 @Kimi_Moonshot drops the K2 model — now #1 trending on Hugging Face! 💡 Devs break it down, tech folks dive deep. 👀📎 Full discussion & hands-on reviews on Zhihu: zhihu.com/question/19271… #Kimi #K2Model #LLM #HuggingFace #AI
Kimi K2 is number one trending on HF, congrats!
🚀 Kimi releases its first trillion-parameter open-source agentic model — K2! What's new? What's powerful? What's next? 🤖🔥 Full discussions, hands-on reviews & benchmarks now live on Zhihu: 👉 zhihu.com/question/19271… Welcome to join the convo! 🧠💬 Try it now at…
🚀 Hello, Kimi K2! Open-Source Agentic Model! 🔹 1T total / 32B active MoE model 🔹 SOTA on SWE Bench Verified, Tau2 & AceBench among open models 🔹Strong in coding and agentic tasks 🐤 Multimodal & thought-mode not supported for now With Kimi K2, advanced agentic intelligence…
🧠 Zhihu contributor @langfengq shared his thinkings about open-sourced verl-agent - an RL framework designed to train reasoning-capable LLM agents! It extends veRL, and unlike methods that simply concatenate full interaction history, verl-agent treats each step as an…




💬 How do people view Grok 4, the new-gen model from Musk’s xAI? What are its standout features? 🔗Dive into discussions from Chinese tech community Zhihu users here: zhihu.com/question/19246… #Grok4 #xAI #AI
Introducing Grok 4, the world's most powerful AI model. Watch the livestream now: x.com/i/broadcasts/1…
Zhihu contributor 周国睿 has spent the past year exploring a big Q: 👉 Can we take recommender systems to the next level? Based on hands-on work with OneRec, they found E2E recsys might be the future. Here are his key thoughts: 🔍How big should a rec model be? ⚙️How to scale…




🧠 Zhihu contributor @jackbai_jkb shares insights from his paper "Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction". How to do multi-step training on a model post-trained in single-step environments? They found the ideal training setup should be: ⚙️ no…




「Tech Insight」 🔥 #HackerNews Top Post: "The new skill in AI is not prompting, it's context engineering" — thoughts? 💬 Zhihu contributor Navis Li: What really makes or breaks your AI app is what kind of "ingredients" you feed it. Context Engineering = crafting the entire…



