Xiaohang Tang
@xiaohang_tang
PhD student @ucl
🧶1/ Diffusion-based LLMs (dLLMs) are fast and promising, but hard to fine-tune with RL. Why? Their likelihoods are intractable, which makes common RL methods (like GRPO) inefficient and biased. 💡We present 𝐰𝐝𝟏, a novel method that mitigates these headaches. Let’s break it down.👇
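A schematic of the issue in illustrative notation (not the paper's exact objective): GRPO needs the sequence likelihood ratio, which a dLLM can only approximate (e.g., via an ELBO), so the estimate is noisy and biased; a weighted log-likelihood surrogate sidesteps the ratio entirely.

```latex
% Illustrative notation only; see the wd1 paper for the exact objective.
% GRPO-style update: needs the (intractable) sequence likelihood ratio.
\mathcal{J}_{\mathrm{GRPO}}(\theta) =
  \mathbb{E}\!\left[\min\!\left(
    \frac{\pi_\theta(o \mid q)}{\pi_{\theta_{\mathrm{old}}}(o \mid q)}\,\hat{A},\;
    \mathrm{clip}\!\left(\frac{\pi_\theta(o \mid q)}{\pi_{\theta_{\mathrm{old}}}(o \mid q)},\,
      1-\epsilon,\, 1+\epsilon\right)\hat{A}\right)\right]
% Ratio-free alternative: reweight a tractable log-likelihood proxy
% \tilde{\pi}_\theta (e.g., an ELBO) by a function of the advantage.
\mathcal{J}_{\mathrm{weighted}}(\theta) =
  \mathbb{E}\!\left[\, w(\hat{A}) \,\log \tilde{\pi}_\theta(o \mid q) \,\right]
```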
Since our launch earlier this year, we have been thrilled to watch the community around dLLMs grow. The Mercury tech report from @InceptionAILabs is now on @arxiv with more extensive evaluations: arxiv.org/abs/2506.17298 New model updates dropping later this week!
Huge milestone from the team! A blazing-fast diffusion LLM built for chat, delivering real-time performance at commercial scale. If you liked Mercury Coder for code, you'll love this for conversation.
We’re excited to launch Mercury, the first commercial-scale diffusion LLM tailored for chat applications! Ultra-fast and efficient, Mercury brings real-time responsiveness to conversations, just like Mercury Coder did for code.
New paper 🚨📜🚀 Introducing “Agents of Change: Self-Evolving LLM Agents for Strategic Planning”! In this work, we show how LLM-powered agents can rewrite their own prompts & code to climb the learning curve in the board game Settlers of Catan 🎲 🧵👇
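A minimal sketch of a self-evolution loop as I read the abstract (hypothetical names `llm` and `play_game`; not the paper's actual agent framework): the agent critiques its own prompt and policy code after playing, rewrites both, and greedily keeps a revision only when it improves the game score.

```python
# Hedged sketch: `llm` is any text-in/text-out model call and `play_game`
# returns a numeric score for a (prompt, policy_code) pair. Hypothetical
# interfaces; this illustrates the critique-rewrite-evaluate loop only.
def self_evolve(llm, play_game, prompt, policy_code, rounds=10):
    best = play_game(prompt, policy_code)
    for _ in range(rounds):
        critique = llm(
            f"Score={best}. Critique this Catan strategy.\n"
            f"PROMPT:\n{prompt}\nCODE:\n{policy_code}"
        )
        new_prompt = llm(f"Rewrite the prompt using this critique:\n{critique}")
        new_code = llm(f"Rewrite the policy code using this critique:\n{critique}")
        score = play_game(new_prompt, new_code)
        if score > best:  # keep only revisions that actually help
            prompt, policy_code, best = new_prompt, new_code, score
    return prompt, policy_code, best
```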
🚀Introducing “StochasTok: Improving Fine-Grained Subword Understanding in LLMs”!🚀 LLMs are incredible but still struggle disproportionately with subword-level tasks, e.g., counting characters, wordplay, multi-digit numbers, fixing typos… Enter StochasTok, led by @anyaasims! [1/]
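A rough sketch of the core idea as I understand it (illustrative, not the authors' implementation): during training, occasionally re-split a token into a pair of vocabulary tokens whose strings concatenate back to the original, so the model sees many tokenizations of the same text and learns sub-token structure.

```python
import random

# Illustrative sketch of stochastic tokenization. `id_to_str` / `str_to_id`
# map between token ids and their surface strings (assumed available).
def stochastok(token_ids, id_to_str, str_to_id, p=0.1):
    out = []
    for tid in token_ids:
        s = id_to_str[tid]
        # all ways to split s into two strings that are both in the vocab
        splits = [(s[:i], s[i:]) for i in range(1, len(s))
                  if s[:i] in str_to_id and s[i:] in str_to_id]
        if splits and random.random() < p:
            left, right = random.choice(splits)         # same surface string,
            out += [str_to_id[left], str_to_id[right]]  # different tokens
        else:
            out.append(tid)
    return out
```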
What if LLMs knew when to stop? 🚧 HALT finetuning teaches LLMs to only generate content they’re confident is correct. 🔍 Insight: Post-training must be adjusted to the model’s capabilities. ⚖️ Tunable trade-off: Higher correctness 🔒 vs. More completeness 📝 with @AIatMeta 🧵
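One plausible way to build such finetuning targets (hypothetical `correctness` scorer; this is my reading of the idea, not Meta's pipeline): keep a response only up to the first chunk the model is unlikely to get right, then end with an explicit unsure statement. Raising the threshold buys correctness at the cost of completeness.

```python
# Hedged sketch: `chunks` is a response split into pieces and `correctness`
# is a hypothetical per-chunk confidence scorer returning values in [0, 1].
UNSURE = "I am not confident about the rest of this answer."

def halt_target(chunks, correctness, threshold=0.8):
    kept = []
    for chunk in chunks:
        if correctness(chunk) < threshold:  # first shaky chunk: stop here
            kept.append(UNSURE)
            break
        kept.append(chunk)
    return " ".join(kept)
```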
We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO
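For intuition, here is a generic masked-diffusion decoding loop (a sketch of text diffusion in general, not Gemini Diffusion's actual algorithm): start from an all-mask sequence and, at each step, commit the model's most confident predictions for still-masked positions.

```python
import torch

# Sketch: `model` maps token ids of shape (1, L) to logits (1, L, vocab).
def denoise(model, length, steps, mask_id):
    x = torch.full((1, length), mask_id, dtype=torch.long)  # pure "noise"
    for _ in range(steps):
        masked = x == mask_id
        if not masked.any():
            break
        conf, pred = model(x).softmax(-1).max(-1)
        conf = conf.masked_fill(~masked, -1.0)   # only refine masked slots
        k = min(max(1, length // steps), int(masked.sum()))
        top = conf.topk(k, dim=-1).indices[0]    # most confident positions
        x[0, top] = pred[0, top]                 # commit them this step
    return x
```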
After 1.5 years of work, I'm so excited to announce AlphaEvolve – our new LLM + evolution agent! Learn more in the blog post: deepmind.google/discover/blog/… White paper PDF: storage.googleapis.com/deepmind-media… (1/2)
I personally reviewed the ICLR 2025 submission, and it was one of the best papers I have read in a long time.
🎉🎉We are excited to announce the acceptance of our paper, “Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift”, to ICML 2025! We look forward to seeing you in Vancouver 🥰
How can we align LLMs when preferences change over time? We’re thrilled to introduce Non-Stationary DPO (NS-DPO), a preference optimization fine-tuning approach that is provably robust to non-stationary preferences! See our paper: arxiv.org/pdf/2407.18676
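The gist in illustrative notation (see the paper for the exact non-stationary estimator): keep the DPO logistic loss, but down-weight older preference pairs, e.g. exponentially in their age, so recent preferences dominate the fit.

```latex
% Illustrative form: t_i is pair i's timestamp, T the current time,
% and \gamma \in (0,1] a discount; \gamma = 1 recovers standard DPO.
\mathcal{L}_{\mathrm{NS\text{-}DPO}}(\theta) = -\sum_{i=1}^{N} \gamma^{\,T - t_i}
  \log \sigma\!\Big(
    \beta \log \frac{\pi_\theta(y_w^i \mid x^i)}{\pi_{\mathrm{ref}}(y_w^i \mid x^i)}
  - \beta \log \frac{\pi_\theta(y_l^i \mid x^i)}{\pi_{\mathrm{ref}}(y_l^i \mid x^i)}
  \Big)
```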
We are launching our API in open beta! Visit the Inception Platform to create your account and get started using the first commercial-scale diffusion large language models (dLLMs). platform.inceptionlabs.ai
We just released DeepSeek-Prover V2. - Solves nearly 90% of miniF2F problems - Significantly improves SoTA performance on PutnamBench - Achieves a non-trivial pass rate on the formalized versions of AIME 24 & 25 problems Github: github.com/deepseek-ai/De…
With a stellar lineup of speakers and panelists, including Yoshua Bengio 🙀, the Scaling Self-Improving Foundation Models workshop at @iclr_conf promises to be 🔥 ⏰ Sunday, April 27 📍 Garnet 214-215