Wenhao Zhu
@Wenhao_NLP
AI researcher @ByteDance Seed | prev. @EdinburghNLP | Multilingual LLM & machine translation
📢 Participate in the *WMT25 terminology task* to showcase how you customise translations! What's new? More languages, more domains, sentence- and document-level tracks, and a Pareto trade-off between term accuracy and overall quality. Don't miss it cuz it only happens once every two years. statmt.org/wmt25/terminol…
The video in the link will surprise you. Trust me!
Not a social media / X person, but still glad to announce Seed LiveInterpret 2.0. In short, it is an end-to-end, full-duplex speech-to-speech simultaneous interpretation model that achieves high-quality, ultra-low-latency S2S translation. Website: seed.bytedance.com/en/seed_livein…
Could multi-turn interaction be the next promising direction for scaling?
🚀 Call for Papers — @NeurIPSConf 2025 Workshop Multi-Turn Interactions in LLMs 📅 December 6/7 · 📍 San Diego Convention Center Join us to shape the future of interactive AI. Topics include but are not limited to: 🧠 Multi-Turn RL for Agentic Tasks (e.g., web & GUI agents,…
🚀 Introducing Prefix-RFT to blend SFT and RFT! SFT can learn harder problems by imitating demonstrations but may generalize poorly; RFT generalizes better overall but is limited by the initial policy. Our method, Prefix-RFT, makes the best of both worlds!
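Not the authors' code, just a toy sketch of how such a blend could look: imitate a prefix of an expert demonstration, let the policy complete the rest, and weight that continuation by a verifiable reward. The policy interface, `prefix_frac`, and `beta` below are made-up placeholders.

```python
# Hypothetical sketch of the prefix-blending idea (not the paper's implementation):
# imitation loss on a demonstration prefix + reward-weighted loss on the policy's
# own continuation of that prefix.
import random

class ToyPolicy:
    """Stand-in for an LLM policy; only the interface matters here."""
    def log_prob(self, context, tokens):
        return -0.1 * len(tokens)          # pretend log-probability

    def sample(self, context, max_new=5):
        return [random.choice("0123456789") for _ in range(max_new)]

def prefix_rft_loss(policy, prompt, demo, reward_fn, prefix_frac=0.5, beta=1.0):
    k = int(len(demo) * prefix_frac)       # how much of the demonstration to keep
    prefix = demo[:k]

    sft_loss = -policy.log_prob(prompt, prefix)                       # imitate the prefix
    continuation = policy.sample(prompt + prefix)                     # policy finishes the solution
    reward = reward_fn(prompt, prefix + continuation)                 # verifiable reward, e.g. answer check
    rft_loss = -reward * policy.log_prob(prompt + prefix, continuation)

    return sft_loss + beta * rft_loss

if __name__ == "__main__":
    demo = list("3+4=7")                                              # toy "expert" solution
    reward = lambda prompt, resp: float("".join(resp).endswith("7"))  # toy verifier
    print(prefix_rft_loss(ToyPolicy(), list("solve:"), demo, reward))
```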
Check this out if you are working on RL!
The RL codebase I like the most:
- The NanoGPT of RL
- Supports multi-turn RL
- Just 1k lines of Python
- Data, Tensor, and Sequence Parallelism
github.com/ChenmienTan/RL2
ByteDance Seed released Seed-X, a Mistral-7B-shaped LLM specialized for translation, apparently pretrained on ≈6.4B tokens, equaling the likes of R1 and 2.5-Pro in human evaluation. «We deliberately exclude STEM, coding, and reasoning-focused data» lol unexpected data paper
Introducing RL2: Ray-Less Reinforcement Learning for LLMs 🚀 Want to run RL experiments but tired of complicated abstractions? We've got you covered with a <1K-line PPO/REINFORCE implementation:
🎯 Ray-less = launch RL experiments with torchrun, just like SFT
⚡ Long-context…
Why do Long Context Language Models (LCLMs) excel at needle-in-a-haystack tasks but struggle with real-world applications? Can we evaluate them in a fully controlled setting? 🎉 Introducing our latest work: "A Controllable Examination for Long-Context Language Models" TL;DR:…
😕 Feeling frustrated with this round of ACL Rolling Review (February). The interaction between reviewers and authors seems to have deteriorated compared to previous rounds.
- As an Area Chair, I noticed almost no reviewers responded or updated their reviews after the rebuttal…
🚀 New Paper Alert! 🚀 We introduce Q-Filters, a training-free method for efficient KV Cache compression! It is compatible with FlashAttention and can compress along generation which is particularly useful for reasoning models ⚡ ⬇️R1-Distill-Llama-8B with 128 KV pairs ⬇️ 🧵
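I haven't checked the exact Q-Filters scoring rule, so the snippet below is only a generic illustration of compress-as-you-generate KV-cache pruning down to a fixed budget (e.g. the 128 pairs mentioned above); the key-norm score is a placeholder heuristic, not the paper's criterion.

```python
# Generic sketch of on-the-fly KV-cache pruning to a fixed budget.
# The scoring rule here is a placeholder, NOT the Q-Filters criterion.
import torch

def prune_kv_cache(keys, values, scores, budget=128):
    """Keep only the `budget` highest-scoring cached positions.

    keys, values: [seq_len, num_heads, head_dim]
    scores      : [seq_len] importance score per cached position
    """
    if keys.shape[0] <= budget:
        return keys, values
    keep = torch.topk(scores, k=budget).indices.sort().values   # preserve original order
    return keys[keep], values[keep]

# Toy usage: score each position by its key norm (placeholder heuristic).
keys = torch.randn(512, 8, 64)
values = torch.randn(512, 8, 64)
scores = keys.norm(dim=(-2, -1))
keys, values = prune_kv_cache(keys, values, scores, budget=128)
print(keys.shape)  # torch.Size([128, 8, 64])
```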
Tired of mGSM & multilingual MMLU? Saturated performance, limited task types & complexity... Academic researchers and industry LLM teams alike need a better way to comprehensively evaluate LLM multilingual capabilities. Introducing BenchMAX! Maximizing the spectrum of…
🤩Excited to announce our new work BenchMAX!🥳 BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Paper: huggingface.co/papers/2502.07… Repo: github.com/CONE-MT/BenchM… Datasets: huggingface.co/collections/LL…
What Dance Would You Like to Perform with Unitree G1? With the upgraded algorithm, G1 can learn any dance. Leave a comment to tell us what dance you'd like to see!😘 #Unitree #AGI #EmbodiedAI #SpringFestivalGalaRobot #AI #Humanoid #Bipedal #WorldModel #Dance
We replicated the DeepSeek-R1-Zero and DeepSeek-R1 training on 7B model with only 8K examples, the results are surprisingly strong. 🚀 Starting from Qwen2.5-Math-7B (base model), we perform RL on it directly. No SFT, no reward model, just 8K MATH examples for verification, the…
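For context, "examples for verification" suggests a purely rule-based reward with no learned reward model. A minimal sketch of what such a verifier could look like (my guess, not the repo's code): extract the final boxed answer from the response and compare it with the reference.

```python
# Hypothetical rule-based verification reward for math RL (no reward model).
import re

def extract_boxed(text: str) -> str | None:
    """Return the content of the last \\boxed{...} in a model response."""
    matches = re.findall(r"\\boxed\{([^{}]*)\}", text)
    return matches[-1].strip() if matches else None

def verification_reward(response: str, reference: str) -> float:
    """1.0 if the extracted answer matches the reference exactly, else 0.0."""
    pred = extract_boxed(response)
    return 1.0 if pred is not None and pred == reference.strip() else 0.0

print(verification_reward(r"... so the answer is \boxed{42}", "42"))  # 1.0
print(verification_reward("no boxed answer here", "42"))              # 0.0
```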