Xiusi Chen
@xiusi_chen
Postdoc @UofIllinois @uiuc_nlp, Ph.D. @UCLA, BS @PKU1898. RM-R1. Ex-Intern @AmazonScience (x2), @NECLabsAmerica. LLM, Neuro-Symbolic AI.
🚀 Can we cast reward modeling as a reasoning task? 📖 Introducing our new paper: RM-R1: Reward Modeling as Reasoning 📑 Paper: arxiv.org/pdf/2505.02387 💻 Code: github.com/RM-R1-UIUC/RM-… Inspired by recent advances in long chain-of-thought (CoT) on reasoning-intensive tasks, we…
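Below is a minimal sketch of the core idea of treating reward modeling as a reasoning task: the reward model writes a chain-of-thought critique before emitting a preference verdict, instead of predicting a scalar score. The prompt template, the `generate` callable, and the `[[A]]`/`[[B]]` verdict format are illustrative assumptions, not the RM-R1 implementation.

```python
# Minimal sketch (not the RM-R1 implementation): a generative "reasoning" reward model
# that reasons step by step about a pair of responses before emitting a verdict token.
import re

JUDGE_TEMPLATE = """You are a reward model. First reason step by step about which
response better answers the prompt, then output your verdict as [[A]] or [[B]].

Prompt: {prompt}
Response A: {resp_a}
Response B: {resp_b}

Reasoning:"""

def reasoning_reward(generate, prompt: str, resp_a: str, resp_b: str) -> str:
    """Return 'A' or 'B' by letting the model reason (CoT) before judging."""
    output = generate(JUDGE_TEMPLATE.format(prompt=prompt, resp_a=resp_a, resp_b=resp_b))
    match = re.search(r"\[\[([AB])\]\]", output)
    return match.group(1) if match else "A"  # arbitrary fallback if no verdict is found
```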
Learning to perceive while learning to reason! We introduce PAPO: Perception-Aware Policy Optimization, a direct upgrade to GRPO for multimodal reasoning. PAPO relies on internal supervision signals. No extra annotations, reward models, or teacher models needed. 🧵1/3
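A rough sketch of what a "perception-aware" GRPO-style objective could look like: group-relative advantages as in GRPO, plus an internal supervision term that rewards the policy for actually depending on the visual input (here, the likelihood gap with and without the image). The specific form of the perception term and the `gamma` weight are assumptions for illustration; see the paper for PAPO's actual objective.

```python
# Rough sketch, not PAPO's exact objective: GRPO's group-relative advantage, plus a
# placeholder internal "perception" bonus derived from the model itself.
import torch

def group_relative_advantages(rewards: torch.Tensor) -> torch.Tensor:
    """GRPO core: normalize each rollout's reward within its group (shape: [groups, samples])."""
    mean = rewards.mean(dim=1, keepdim=True)
    std = rewards.std(dim=1, keepdim=True) + 1e-6
    return (rewards - mean) / std

def perception_aware_loss(policy_loss: torch.Tensor,
                          logp_full_image: torch.Tensor,
                          logp_masked_image: torch.Tensor,
                          gamma: float = 0.01) -> torch.Tensor:
    """Internal supervision: encourage a gap between the policy's likelihood of its own
    answer with the real image versus a masked one, so it cannot ignore the visual input."""
    perception_bonus = (logp_full_image - logp_masked_image).mean()
    return policy_loss - gamma * perception_bonus
```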
🚀 I'm looking for full-time research scientist jobs on foundation models! I study pre-training and post-training of foundation models, and LLM-based coding agents. The figure highlights my research/publications. Please DM me if there is any good fit! Highly appreciated!
😲 Not only reasoning?! Inference scaling can now boost LLM safety! 🚀 Introducing Saffron-1:
- Reduces attack success rate from 66% to 17.5%
- Uses only 59.7 TFLOPs of compute
- Counters the latest jailbreak attacks
- No model finetuning
On the AI2 Refusals benchmark. 📖 Paper:…
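As a generic illustration of inference-time scaling for safety (not Saffron-1's algorithm), the snippet below spends extra test-time compute by sampling several candidate responses and keeping the one a safety scorer rates highest, with no finetuning of the base model; `generate` and `safety_score` are placeholder callables.

```python
# Generic illustration only: safest-of-N sampling with an external safety scorer.
def safest_of_n(generate, safety_score, prompt: str, n: int = 8) -> str:
    candidates = [generate(prompt) for _ in range(n)]  # extra inference-time compute
    return max(candidates, key=safety_score)           # keep the safest candidate
```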
📢 New Paper Drop: From Solving to Modeling! LLMs can solve math problems — but can they model the real world? 🌍 📄 arXiv: arxiv.org/pdf/2505.15068 💻 Code: github.com/qiancheng0/Mod… Introducing ModelingAgent, a breakthrough system for real-world mathematical modeling with LLMs.
We are extremely excited to announce mCLM, a Modular Chemical Language Model that is friendly to automatable block-based chemistry and mimics bilingual speakers by “code-switching” between functional molecular modules and natural language descriptions of the functions. 1/2
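Purely to illustrate the "code-switching" idea, the toy snippet below interleaves hypothetical functional-module tokens with a natural-language description of their function in a single sequence; the token names are made up, and this is not mCLM's tokenizer or data format.

```python
# Illustrative only (not mCLM's actual representation): mixing hypothetical
# functional-module tokens with natural language, like bilingual code-switching.
modules = ["<mod:sulfonamide>", "<mod:piperazine>"]  # hypothetical block tokens
description = "improves aqueous solubility while retaining the core scaffold"
sequence = f"The molecule {modules[0]} {modules[1]} {description}."
print(sequence)
```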
What are the capabilities of current Conversational Agents? What challenges persist, and what should we actually expect from these agents next? 🚀 We are excited to share our recent survey: ✨ A Desideratum for Conversational Agents: Capabilities, Challenges, and Future…
💥 We are so excited to introduce OTC-PO, the first RL framework for optimizing LLMs' tool-use behavior in Tool-Integrated Reasoning. arXiv: arxiv.org/pdf/2504.14870 Hugging Face: huggingface.co/papers/2504.14… ⚙️ Simple, generalizable, plug-and-play (just a few lines of code) 🧠…
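A hedged sketch of the general idea behind optimizing tool-use behavior (not OTC-PO's exact reward): scale the task reward by how economically a rollout used its tools, so the policy learns to avoid redundant calls. The `budget` parameter and the penalty shape are illustrative assumptions.

```python
# Illustrative tool-economy reward, a few lines that could plug into any RL loop.
def tool_economy_reward(correct: bool, tool_calls: int, budget: int = 3) -> float:
    task_reward = 1.0 if correct else 0.0
    # Full credit within the budget; linearly discounted credit for extra calls.
    efficiency = max(0.0, 1.0 - max(0, tool_calls - budget) / budget)
    return task_reward * efficiency
```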
🚀 ToolRL unlocks LLMs' true tool mastery! The secret? Smart rewards > more data. 📖 Introducing our newest paper: ToolRL: Reward is All Tool Learning Needs. Paper Link: arxiv.org/pdf/2504.13958 GitHub Link: github.com/qiancheng0/Too…
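In the spirit of "smart rewards", here is a rough sketch of a structured tool-use reward: a small credit for well-formed output plus credit for selecting the right tool and matching its arguments. The weights and matching rules are illustrative assumptions, not ToolRL's exact reward design.

```python
# Illustrative structured reward for a single predicted tool call vs. a gold call.
def tool_use_reward(pred_call: dict | None, gold_call: dict) -> float:
    if pred_call is None:                      # unparseable / malformed output
        return 0.0
    reward = 0.2                               # format credit: the call parses at all
    if pred_call.get("name") == gold_call["name"]:
        reward += 0.4                          # correct tool selected
        pred_args = pred_call.get("arguments", {})
        gold_args = gold_call["arguments"]
        if gold_args:
            overlap = sum(k in pred_args and pred_args[k] == v for k, v in gold_args.items())
            reward += 0.4 * overlap / len(gold_args)   # partial credit for matching arguments
        else:
            reward += 0.4
    return reward
```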