Chenxin An
@AnChancy46881
PhD Candidate @ HKUNLP · Awardee of the Hong Kong PhD Fellowship Scheme
# 🚨 4B open-recipe model beats Claude-4-Opus 🔓 100% open data, recipe, model weights and code. Introducing Polaris✨--a post-training recipe for scaling RL on advanced reasoning models. 🥳 Check out how we boost open-recipe reasoning models to incredible performance levels…

amazing
🚀The era of overpriced, black-box coding assistants is OVER. Thrilled to lead the @Agentica_ team in open-sourcing and training DeepSWE—a SOTA software engineering agent trained end-to-end with @deepseek_ai-like RL on Qwen32B, hitting 59% on SWE-Bench-Verified and topping the…
Excited to share that Describe Anything has been accepted at ICCV 2025! 🎉 Describe Anything Model (DAM) is a powerful Multimodal LLM that generates detailed descriptions for user-specified regions in images or videos using points, boxes, scribbles, or masks. Open-source code,…
Nvidia just dropped Describe Anything on Hugging Face: Detailed Localized Image and Video Captioning
What happened after Dream 7B? First, Dream-Coder 7B: a fully open diffusion LLM for code delivering strong performance, trained exclusively on public data. Plus, DreamOn cracks the variable-length generation problem! It enables code infilling that goes beyond a fixed canvas.
We present DreamOn: a simple yet effective method for variable-length generation in diffusion language models. Our approach boosts code infilling performance significantly and even catches up with oracle results.
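The tweet doesn't spell out the mechanism, so here is a minimal toy sketch of the variable-length idea as I read it: during iterative denoising, a mask slot can resolve to a real token, split into two masks, or be deleted, letting the infill region grow or shrink beyond a fixed canvas. The EXPAND/DELETE control tokens and the stub predictor are illustrative assumptions, not DreamOn's actual interface.

```python
# Toy sketch of variable-length infilling in a masked diffusion LM.
# The EXPAND/DELETE control tokens and stub predictor are illustrative
# assumptions -- NOT DreamOn's actual implementation.
import random

MASK, EXPAND, DELETE = "<mask>", "<expand>", "<delete>"

def stub_predict(canvas, i):
    """Stand-in for the model: propose a (token, confidence) for the
    mask at position i. A real dLLM would score the whole canvas."""
    return random.choice([("pass", 0.9), (EXPAND, 0.6), (DELETE, 0.5)])

def infill(prefix, suffix, n_masks=4, steps=16):
    canvas = [MASK] * n_masks
    for _ in range(steps):
        masks = [i for i, t in enumerate(canvas) if t == MASK]
        if not masks:
            break
        # Resolve the single most confident mask this step (MaskGIT-style).
        i, (tok, _) = max(
            ((j, stub_predict(canvas, j)) for j in masks),
            key=lambda x: x[1][1],
        )
        if tok == EXPAND:            # grow: one mask becomes two
            canvas[i:i + 1] = [MASK, MASK]
        elif tok == DELETE:          # shrink: drop this slot entirely
            canvas.pop(i)
        else:                        # commit a concrete token
            canvas[i] = tok
    return prefix + canvas + suffix

print(infill(["def", "f", "(", ")", ":"], ["return", "x"]))
```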
📢 Update: Announcing Dream's next-phase development. - Dream-Coder 7B: A fully open diffusion LLM for code delivering strong performance, trained exclusively on public data. - DreamOn: targeting the variable-length generation problem in dLLM!
Check out Dream-Coder 🔥
🚀 Thrilled to announce Dream-Coder 7B — the most powerful open diffusion code LLM to date.
Check out Reka Flash 3.1. It's finally open-source!🥳
Very excited to lead the continued pre-training (Math, Coding & Long-Context) and long-reasoning cold start & RL of this model. Proud moment seeing it go open source!🚀(1/2)
Reasoning can be made much, much faster—with fundamental changes in neural architecture. 😮 Introducing Phi4-mini-Flash-Reasoning: a 3.8B model that surpasses Phi4-mini-Reasoning on major reasoning tasks (AIME24/25, MATH500, GPQA-D), while delivering up to 10× higher throughput…
Why does your RL training always collapse? In our new paper on RAGEN, we explore what breaks when you train LLM *Agents* with multi-turn reinforcement learning—and possibly how to fix it. 📄 github.com/RAGEN-AI/RAGEN… 🌐 ragen-ai.github.io 1/🧵👇
🎙️ Welcome to try MOSS-TTSD~ When we first heard our AI voices naturally chatting and even interrupting each other, the shock was indescribable. This isn't cold TTS anymore - it's dialogue with real warmth. Try it online! huggingface.co/spaces/fnlp/MO…
Polaris results are quite impressive! I converted the 4B to MLX. I'm not a fan of quantization on small models, but I created 4-, 5-, 6-, and 8-bit quants plus bf16. Enjoy!
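For anyone reproducing the conversion: mlx-lm (`pip install mlx-lm`) exposes a `convert` function that downloads a Hugging Face checkpoint and writes MLX weights, with optional quantization. A minimal sketch, assuming the Polaris repo id below (swap in the actual path) and mlx-lm's current keyword names:

```python
# Sketch of the MLX conversion with mlx-lm. The Hugging Face repo id is
# an assumed placeholder; keyword names follow mlx_lm.convert in recent
# mlx-lm releases -- check your installed version.
from mlx_lm import convert

HF_REPO = "POLARIS-Project/Polaris-4B-Preview"  # assumed repo id

# One quantized variant per bit width the tweet mentions...
for bits in (4, 5, 6, 8):
    convert(
        hf_path=HF_REPO,
        mlx_path=f"polaris-4b-mlx-{bits}bit",
        quantize=True,
        q_bits=bits,
    )

# ...plus an unquantized bf16 copy.
convert(hf_path=HF_REPO, mlx_path="polaris-4b-mlx-bf16", dtype="bfloat16")
```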
Excited to introduce our 7B Coding Diffusion LLM, DiffuCoder — advancing open-source diffusion models for high-quality code generation! Diffusion offers powerful global planning via iterative generation, and code is the perfect testbed to push its limits! #DiffusionModels
🤖Can diffusion models write code competitively? Excited to share our latest 7B coding diffusion LLM!!💻 With DiffuCoder, we explore how they decode, why temperature🔥 matters, and how to improve them via coupled-GRPO that speaks diffusion!!📈 Code: github.com/apple/ml-diffu… 🧵
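For readers new to dLLM decoding, here is a minimal confidence-based unmasking loop (MaskGIT/LLaDA-style) that shows where temperature enters: all masks are predicted in parallel each step, temperature reshapes the per-token confidences, and only the most confident fills are committed. `model` is a stand-in returning per-position logits; this is a generic sketch, not DiffuCoder's actual decoder.

```python
# Minimal confidence-based unmasking loop for a masked diffusion LM
# (MaskGIT/LLaDA-style). `model` is any stand-in mapping a [seq] id
# tensor to [seq, vocab] logits; NOT DiffuCoder's actual decoder.
import torch

def diffusion_decode(model, ids, mask_id, steps=8, temperature=1.0):
    ids = ids.clone()
    for step in range(steps):
        masked = (ids == mask_id).nonzero(as_tuple=True)[0]
        if masked.numel() == 0:
            break
        logits = model(ids)[masked] / temperature   # temperature scaling
        probs = torch.softmax(logits, dim=-1)
        samples = torch.multinomial(probs, 1).squeeze(-1)
        conf = probs.gather(-1, samples.unsqueeze(-1)).squeeze(-1)
        # Commit only the most confident fills this step; the rest stay
        # masked and are re-predicted with more context next step.
        k = max(1, masked.numel() // (steps - step))
        keep = conf.topk(k).indices
        ids[masked[keep]] = samples[keep]
    return ids
```

Lower temperature sharpens the confidence ranking (more deterministic commits); higher temperature spreads probability mass, which changes both what gets sampled and the order in which positions are unmasked.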
🚀 New work: OpenMOSS Embodied Planner-R1 - A step toward AI self-improvement in interactive planning! We've developed an RL framework where LLMs learn to plan through autonomous environmental exploration - no human demonstrations needed. 🤖 🧵 Thread below 👇
🚀 New milestone in ultra-long text generation! LongWriter-Zero uses pure RL (no SFT, no synthetic data) to produce ultra-long, coherent texts (10k+ words). Beats open-source models like DeepSeek-R1 and Qwen3-235B in many domains. 👉 huggingface.co/THU-KEG/LongWr…
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
Apple introduces DiffuCoder, a 7B diffusion LLM trained on 130B tokens of code. The authors also propose a diffusion-native RL training framework, coupled-GRPO. Decoding of dLLMs differs from…
We release the fully async RL system AReaL-boba² for LLMs & SOTA code RL with Qwen3-14B! @Alibaba_Qwen #opensource
🚀 system & algorithm co-design → 2.77× faster
✅ 69.1 on LiveCodeBench
🔥 multi-turn RL ready
🔗 Project: github.com/inclusionAI/AR…
📄 Paper: arxiv.org/pdf/2505.24298
1/3👇
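The "fully async" claim is the interesting systems part: rollout generation and training are decoupled so neither blocks the other. A generic producer/consumer skeleton of that pattern (illustrative only, not AReaL's actual architecture):

```python
# Generic async-RL skeleton: rollout workers keep generating while the
# trainer consumes batches, so neither side idles waiting on the other.
# Illustrates the decoupling pattern only, not AReaL's actual code.
import queue, threading, time, random

rollouts: queue.Queue = queue.Queue(maxsize=64)  # bounds rollout staleness

def rollout_worker(wid: int):
    while True:
        time.sleep(random.uniform(0.01, 0.05))   # stand-in for generation
        rollouts.put({"worker": wid, "reward": random.random()})

def trainer(num_steps: int, batch_size: int = 8):
    for step in range(num_steps):
        batch = [rollouts.get() for _ in range(batch_size)]
        avg = sum(r["reward"] for r in batch) / batch_size
        print(f"step {step}: avg reward {avg:.3f}")  # stand-in for update

for wid in range(4):
    threading.Thread(target=rollout_worker, args=(wid,), daemon=True).start()
trainer(num_steps=5)
```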
Some nice ablations confirming lots of what we know. A recurring theme I keep explaining these days is how much more technically complex the current RL regime is than previous post-training approaches (at least in open research).