Yushi Bai
@realYushiBai
Ph.D. from Tsinghua University, currently focusing on long-context LLMs and reasoning models
🚀 New milestone in ultra-long text generation! LongWriter-Zero uses pure RL (no SFT, no synthetic data) to produce ultra-long, coherent texts (10k+ words). Beats open-source models like DeepSeek-R1 and Qwen3-235B in many domains. 👉 huggingface.co/THU-KEG/LongWr…

This paper proposes LongWriter-Zero, an incentivization-based reinforcement learning method. It trains models with RL directly from a base model, enabling high-quality, ultra-long text generation without relying on pre-existing synthetic data. Methods 🔧: - LongWriter-Zero uses Group Relative…
LongWriter-Zero 🔥 A purely RL-trained LLM that generates coherent 10K+ token passages, by @Tsinghua_Uni Model: huggingface.co/THU-KEG/LongWr… Dataset: huggingface.co/datasets/THU-K… Paper: huggingface.co/papers/2506.18… ✨ 32B ✨ Multi-reward GRPO: length, fluency, structure, non-redundancy ✨ Enforces…
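The multi-reward GRPO setup above lends itself to a compact sketch. Below is a minimal, hypothetical Python illustration of group-relative advantages computed over a composite reward; the reward functions, weights, and group size here are assumptions for illustration, not the paper's actual reward models.

```python
import numpy as np

# Hypothetical sketch of a multi-reward GRPO scoring step. The reward
# components and weights are illustrative assumptions, not the paper's.

def composite_reward(text: str, target_len: int) -> float:
    """Combine length, fluency, structure, and non-redundancy signals."""
    words = text.split()
    # Length reward: peaks when output length matches the target.
    r_length = max(0.0, 1.0 - abs(len(words) - target_len) / target_len)
    # Non-redundancy: fraction of distinct words (crude stand-in for an RM).
    r_novel = len(set(words)) / max(len(words), 1)
    # Fluency/structure would come from learned reward models in practice;
    # stubbed as constants here.
    r_fluency, r_structure = 0.5, 0.5
    return 0.4 * r_length + 0.2 * r_fluency + 0.2 * r_structure + 0.2 * r_novel

def group_relative_advantages(rewards: list[float]) -> np.ndarray:
    """GRPO-style advantage: normalize rewards within a sampled group."""
    r = np.asarray(rewards, dtype=np.float64)
    return (r - r.mean()) / (r.std() + 1e-8)

# Usage: score a group of G completions sampled from the policy for one prompt.
group = ["..." for _ in range(8)]
advantages = group_relative_advantages(
    [composite_reward(t, target_len=10_000) for t in group]
)
```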
🚨 4B open-recipe model beats Claude-4-Opus 🔓 100% open data, recipe, model weights, and code. Introducing Polaris ✨, a post-training recipe for scaling RL on advanced reasoning models. 🥳 Check out how we boost open-recipe reasoning models to incredible performance levels…
Large language models struggle to generate coherent, high-quality text as output length grows. This paper mimics the human process of iteratively planning and refining a piece of writing, using an AI agent framework to train a new model. Methods 🔧: → The agent…
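As a rough illustration of the plan-and-refine loop this tweet describes, here is a minimal sketch assuming a generic `llm(prompt) -> str` callable; the prompt wording, loop structure, and the `write_long` helper are hypothetical, not the paper's actual agent.

```python
# Toy plan -> write -> refine writing agent. `llm` is a hypothetical
# text-completion callable; the real agent's prompts and loop differ.

def write_long(llm, task: str, n_refine: int = 2) -> str:
    # 1) Plan: break the writing task into an ordered outline.
    outline = llm(f"Write a detailed section-by-section outline for: {task}")
    # 2) Write: expand each outline item, conditioning on the text so far.
    draft = ""
    for section in outline.splitlines():
        if not section.strip():
            continue
        draft += llm(
            f"Task: {task}\nOutline item: {section}\n"
            f"Text so far:\n{draft}\n\nContinue the document for this item:"
        )
    # 3) Refine: iteratively critique and rewrite the full draft.
    for _ in range(n_refine):
        critique = llm(f"List coherence and redundancy issues in:\n{draft}")
        draft = llm(f"Rewrite the text to fix these issues:\n{critique}\n\n{draft}")
    return draft
```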
Promoting Our Preliminary Work on Efficient Video Understanding for LMMs Grateful for the support from Yuan Yao and all mentors! Since videos inherently exhibit varying temporal density (static/dynamic segments), a natural idea is to dynamically segment and compress a video to…
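A minimal sketch of that density-aware idea, assuming per-frame feature vectors have already been extracted: split the sequence where consecutive frames change sharply, then keep fewer frames from static segments. The `segment_and_compress` helper, threshold, and stride are illustrative assumptions, not the actual method.

```python
import numpy as np

# Hypothetical density-aware segmentation: cut where inter-frame feature
# change is large (dynamic content), subsample heavily within static runs.

def segment_and_compress(frames: np.ndarray, thresh: float = 0.3,
                         static_stride: int = 8) -> list[np.ndarray]:
    """frames: (T, D) array of per-frame features."""
    # Normalized inter-frame distances; peaks mark segment boundaries.
    diffs = np.linalg.norm(np.diff(frames, axis=0), axis=1)
    diffs = diffs / (diffs.max() + 1e-8)
    cuts = [0] + [i + 1 for i, d in enumerate(diffs) if d > thresh] + [len(frames)]
    segments = []
    for a, b in zip(cuts[:-1], cuts[1:]):
        seg = frames[a:b]
        mean_change = diffs[a:b - 1].mean() if b - a > 1 else 0.0
        # Static segments: keep every static_stride-th frame; dynamic: keep all.
        segments.append(seg[::static_stride] if mean_change < thresh else seg)
    return segments
```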
Off to #ICLR2025 🇸🇬 to present LongWriter: Unleashing 10,000+ Word Generation from Long-Context LLMs. Catch our poster during the first poster session on the morning of 4/24. Excited to reconnect and chat about long context, reasoning models, and more!

Try out the ⚡ lightning-fast models at z.ai! We're continuously optimizing our reasoning model, z1, to deliver the best possible experience.
🚀 New name, same mission — ChatGLM is now Z.ai. To mark this new chapter, we’re launching the fully open-source GLM-4-0414 model family under the MIT License — use it, build on it, profit from it. We’re open-sourcing six models across 9B and 32B sizes. Here…
o3's visual reasoning with tool use closely resembles our work CogCoM (visual Chain-of-Manipulation), published in February 2024. Learn more in the CogCoM paper linked in the quoted thread!
Today, on April 17th, OpenAI released a new visual reasoning model, o3, capable of solving complex tasks by analyzing and manipulating images. We have observed that the reasoning approach employed by the o3 model bears a striking resemblance to our earlier work, CogCoM…
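For intuition, here is a toy chain-of-manipulation loop in the spirit of CogCoM, assuming a hypothetical `vlm(image, question) -> str` interface and a single CROP manipulation; CogCoM's real manipulation set and protocol differ in detail.

```python
from PIL import Image

# Toy manipulation loop: the model either answers or requests an image
# operation; the operation is executed and the new view is fed back.
# `vlm` and the CROP/ANSWER reply format are illustrative assumptions.

def chain_of_manipulation(vlm, image: Image.Image, question: str,
                          max_steps: int = 4) -> str:
    view = image
    for _ in range(max_steps):
        reply = vlm(view, question)  # e.g. "CROP 120 80 400 300" or "ANSWER ..."
        if reply.startswith("ANSWER"):
            return reply.removeprefix("ANSWER").strip()
        if reply.startswith("CROP"):
            left, top, right, bottom = map(int, reply.split()[1:5])
            view = view.crop((left, top, right, bottom))  # zoom into the region
    return vlm(view, question)  # fall back to a direct answer
```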
Congrats on the impressive long-context capabilities of Gemini 2.5 Pro! It’s exciting to see the rapid advancement of long-context LLMs reflected on LongBench v2! 🌐Website: longbench2.github.io
Just saw these LongBench v2 results - enjoy 1M context in our model, soon to be 2M! longbench2.github.io/#leaderboard
StdGEN: Semantic-Decomposed 3D Character Generation from Single Images
📢 LongWriter got accepted for #ICLR2025. See you in Singapore!
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs discuss: huggingface.co/papers/2408.07… Current long-context large language models (LLMs) can process inputs up to 100,000 tokens, yet struggle to generate outputs exceeding even a modest length of 2,000 words.…