Yasmine
@CyouSakura
Researcher @StepFun_ai. Working on scalable RL methods in #LLM. she/her/hers. Guardian of the best kitty @biliacat_public
We are excited to introduce Open Vision Reasoner (OVR) 🚀 — transferring linguistic cognitive behavior to unlock advanced visual reasoning! 💡 Two-stage recipe • Massive linguistic cold-start on Qwen-2.5-VL-7B sparks “mental imagery” • ~1K-step multimodal RL refines & scales…
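The recipe compresses into a simple two-stage shape: first imitate linguistic reasoning traces, then optimize against a verifiable reward. Below is a toy numpy sketch of that shape, where a 4-armed bandit stands in for the task; the demonstrations, reward, and hyperparameters are all illustrative and are not OVR's training code.

```python
import numpy as np

rng = np.random.default_rng(0)
n_actions = 4
logits = np.zeros(n_actions)   # stand-in "policy" over 4 actions

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

# Stage 1: linguistic cold start. Imitate demonstrations that favor
# action 2, analogous to SFT on chain-of-thought traces.
demos = rng.choice(n_actions, size=500, p=[0.1, 0.1, 0.7, 0.1])
for a in demos:
    p = softmax(logits)
    grad = -p
    grad[a] += 1.0             # gradient of log p(a) w.r.t. logits
    logits += 0.1 * grad

# Stage 2: ~1K steps of RL against a verifiable reward (action 2 pays),
# analogous to the multimodal RL phase refining the behavior.
for _ in range(1000):
    p = softmax(logits)
    a = rng.choice(n_actions, p=p)
    reward = 1.0 if a == 2 else 0.0
    grad = -p
    grad[a] += 1.0
    logits += 0.05 * reward * grad  # REINFORCE update, no baseline

print("final policy:", softmax(logits).round(3))
```

The point is only the ordering: the RL stage starts from a policy already biased toward the demonstrated behavior rather than from scratch.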
Honestly blown away by how fast I built the Open Vision Reasoner (OVR) homepage — all thanks to Anycoder + Kimi K2! 🛠️ Anycoder is a free, open-source coding playground built entirely in Gradio — super intuitive and perfect for rapid prototyping. Give it a try 👉…
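For context on why Gradio suits this kind of playground: one Python function plus one gr.Interface gives a working web UI. A minimal sketch (not Anycoder's actual source; echo_code is a hypothetical placeholder for the model call):

```python
import gradio as gr

def echo_code(prompt: str) -> str:
    # Hypothetical stand-in for the LLM call that a real playground
    # like Anycoder would make here.
    return f"# You asked for: {prompt}\nprint('hello from a stub')"

# One function, one Interface: Gradio renders the input box, the
# output pane, and the submit button for you.
demo = gr.Interface(fn=echo_code, inputs="text", outputs="text",
                    title="Tiny coding playground")

if __name__ == "__main__":
    demo.launch()
```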
awards.acm.org/about/2024-tur… Machines that learn from experience were explored by Alan Turing almost eighty years ago, which makes it particularly gratifying and humbling to receive an award in his name for reviving this essential but still nascent idea.
Thanks for the enthusiasm! We’re finishing final checks on the training datasets and will release soon so everyone can build on Open Vision Reasoner. Stay tuned!
Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning
🔥 Thrilled to release our new multimodal RL work: Open Vision Reasoner! A powerful 7B model with SOTA performance on language & vision reasoning benchmarks, trained with nearly 1K steps of multimodal RL. Our journey begins with a central question: Can the cognitive behaviors…
Thrilled to introduce Kimi-Dev-72B, our new open-source coding LLM for software engineering tasks. Kimi-Dev-72B achieves a 60.4% resolve rate on SWE-bench Verified, setting a new SOTA among open-source models. (1/5)
Day 1/5 of #MiniMaxWeek: We’re open-sourcing MiniMax-M1, our latest LLM — setting new standards in long-context reasoning. - World’s longest context window: 1M-token input, 80k-token output - State-of-the-art agentic use among open-source models - RL at unmatched efficiency:…
📢 (1/16) Introducing PaTH 🛣️ — a RoPE-free contextualized position encoding scheme, built for stronger state tracking, better extrapolation, and hardware-efficient training. PaTH outperforms RoPE across short and long language modeling benchmarks arxiv.org/abs/2505.16381
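As a point of comparison, here is a minimal numpy sketch of the RoPE baseline that PaTH dispenses with: each pair of channels is rotated by an angle proportional to the token position (the half-split variant used in common implementations). PaTH's own data-dependent construction is in the paper, not reproduced here.

```python
import numpy as np

def rope(x, base=10000.0):
    """Apply rotary position embedding to x of shape (seq_len, d), d even.
    Channel pair i is rotated at position t by angle t * base**(-i/(d/2))."""
    seq_len, d = x.shape
    half = d // 2
    freqs = base ** (-np.arange(half) / half)           # (half,)
    angles = np.arange(seq_len)[:, None] * freqs[None]  # (seq_len, half)
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[:, :half], x[:, half:]
    return np.concatenate([x1 * cos - x2 * sin,
                           x1 * sin + x2 * cos], axis=-1)

q = np.random.randn(8, 16)
print(rope(q).shape)  # (8, 16): same shape, positions now encoded
```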
Introducing Qwen3! We are releasing Qwen3, our latest large language models, with open weights: 2 MoE models and 6 dense models ranging from 0.6B to 235B. Our flagship model, Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, general…
Big update to our MathArena USAMO evaluation: Gemini 2.5 Pro, which was released *the same day* as our benchmark, is the first model to achieve a non-trivial score (24.4%). The speed of progress is really mind-blowing.
Open-Reasoner-Zero is out on Hugging Face: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model
🥳 Excited to announce major updates to Open-Reasoner-Zero (ORZ), our open-source initiative scaling Reinforcement Learning on base models! 🌊 Updated Paper & Superior Results Using the same base model as DeepSeek-R1-Zero, ORZ-32B achieves better performance on AIME2024,…
After hacking GPT-4o's frontend, I made amazing discoveries: 💡The line-by-line image generation effect users see is just a browser-side animation (pure frontend trick) 🔦OpenAI's server sends only 5 intermediate images per generation, captured at different stages 🎾Patch size=8
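Those observations imply the animation can be reproduced entirely client-side: keep the handful of intermediates and sweep a scanline that reveals the newest one. A hypothetical Pillow reconstruction of the effect (solid-color dummy frames stand in for the server's ~5 intermediates; the compositing is a guess at the effect, not OpenAI's code):

```python
from PIL import Image

W, H, FRAMES = 64, 64, 5
# Dummy stand-ins for the ~5 intermediate images the server sends.
intermediates = [Image.new("RGB", (W, H), (40 * i, 80, 160))
                 for i in range(1, FRAMES + 1)]

def reveal_frame(step, total_steps):
    """What the user sees at animation `step`: the newest intermediate
    pasted over the previous one, down to a moving scanline."""
    progress = step / total_steps
    idx = min(int(progress * FRAMES), FRAMES - 1)
    newest, prev = intermediates[idx], intermediates[max(idx - 1, 0)]
    out = prev.copy()
    cut = max(int(progress * H), 1)    # scanline position in pixels
    out.paste(newest.crop((0, 0, W, cut)), (0, 0))
    return out

frames = [reveal_frame(s, 60) for s in range(60)]
frames[0].save("reveal.gif", save_all=True,
               append_images=frames[1:], duration=50)
```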
Today, we release QwQ-32B, our new reasoning model with only 32 billion parameters that rivals cutting-edge reasoning models such as DeepSeek-R1. Blog: qwenlm.github.io/blog/qwq-32b HF: huggingface.co/Qwen/QwQ-32B ModelScope: modelscope.cn/models/Qwen/Qw… Demo: huggingface.co/spaces/Qwen/Qw… Qwen Chat:…