Peixuan Han (韩沛煊)
@peixuanhakhan
1st-year Ph.D. student at UIUC @IllinoisCS | Amazon '25 Summer Intern | LLM researcher
How can we unlock generalized reasoning? ⚡️Introducing Energy-Based Transformers (EBTs), an approach that out-scales (feed-forward) transformers and unlocks generalized reasoning/thinking on any modality/problem without rewards. TL;DR:
- EBTs are the first model to out-scale the…
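A minimal sketch of the energy-based "thinking" loop this thread describes, assuming a toy quadratic energy in place of a learned transformer scorer (all names here are illustrative, not the authors' code). Prediction becomes inference-time optimization: start from a guess and descend the energy.

import torch

# Toy stand-in for a learned energy function: low energy = good prediction.
# In a real EBT, a transformer would score (context, candidate) pairs.
def energy(context: torch.Tensor, candidate: torch.Tensor) -> torch.Tensor:
    return ((candidate - context.mean()) ** 2).sum()

context = torch.randn(8)
candidate = torch.randn(4, requires_grad=True)  # initial guess

# "Thinking" = iterative refinement of the candidate at inference time.
optimizer = torch.optim.SGD([candidate], lr=0.1)
for _ in range(50):
    optimizer.zero_grad()
    loss = energy(context, candidate)
    loss.backward()
    optimizer.step()

print(f"final energy: {energy(context, candidate).item():.4f}")

Under this reading, spending more descent steps is how the model "thinks longer" about a harder input.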
Super excited to begin my Applied Scientist internship at @amazon, my first internship in industry. I'm looking forward to conducting interesting and insightful research on efficient reasoning in LLMs!
Can LLMs make rational decisions like human experts? 📖Introducing DecisionFlow: Advancing Large Language Model as Principled Decision Maker We introduce a novel framework that constructs a semantically grounded decision space to evaluate trade-offs in hard decision-making…
⚠️ Rogue AI scientists? 🛡️ SafeScientist rejects unsafe prompts for ethical discoveries. Check out paper ➡️ (arxiv.org/pdf/2505.23559) #AISafety #LLM #SafeAI #AI
📢 New Paper Drop: From Solving to Modeling! LLMs can solve math problems — but can they model the real world? 🌍 📄 arXiv: arxiv.org/pdf/2505.15068 💻 Code: github.com/qiancheng0/Mod… Introducing ModelingAgent, a breakthrough system for real-world mathematical modeling with LLMs.
💥Time-R1 is here! Can a 3B LLM truly grasp time? 🤔 YES! Excited to share our new work, Time-R1: Towards Comprehensive Temporal Reasoning in LLMs 🚀 Check it out: 📖 Paper: arxiv.org/abs/2505.13508 💻 Code: github.com/ulab-uiuc/Time… #TemporalReasoning #RL #LLMs
We introduce Gradient Variance Minimization (GVM)-RAFT, a principled dynamic sampling strategy that minimizes gradient variance to improve the efficiency of chain-of-thought (CoT) training in LLMs.
– Achieves 2–4× faster convergence than RAFT
– Improves accuracy on math…
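A hedged sketch of the dynamic-sampling idea, assuming the rollout budget is split in proportion to each prompt's estimated gradient noise (the allocation rule and all names are illustrative, not the paper's implementation):

import numpy as np

def allocate_budget(variance_estimates, total_budget, min_samples=1):
    # Stddev-proportional allocation: noisier prompts get more rollouts.
    s = np.sqrt(np.asarray(variance_estimates, dtype=float))
    n = np.maximum(min_samples, np.floor(total_budget * s / s.sum())).astype(int)
    order = np.argsort(-s)  # hand out any leftover budget, noisiest prompts first
    i = 0
    while n.sum() < total_budget:
        n[order[i % len(order)]] += 1
        i += 1
    return n

per_prompt_variance = [0.02, 0.50, 0.10, 0.90]  # estimated per-prompt gradient variance
print(allocate_budget(per_prompt_variance, total_budget=32))  # -> [ 2 11  4 15]

The design intuition: sampling uniformly wastes rollouts on prompts whose gradient estimate is already stable, so reallocating toward high-variance prompts reduces overall estimator variance for the same budget.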
🚀 Can we cast reward modeling as a reasoning task? 📖 Introducing our new paper: RM-R1: Reward Modeling as Reasoning 📑 Paper: arxiv.org/pdf/2505.02387 💻 Code: github.com/RM-R1-UIUC/RM-… Inspired by recent advances in long chain-of-thought (CoT) on reasoning-intensive tasks, we…
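A hedged sketch of the "reward modeling as reasoning" recipe: the judge writes a rationale first and only then a verdict, which is parsed into a preference. judge() below is a dummy stand-in for a chat-completion call, and the template is illustrative rather than the paper's prompt.

JUDGE_TEMPLATE = """Compare two answers to the question below.
Question: {q}
Answer A: {a}
Answer B: {b}
Reason step by step, then finish with exactly 'Verdict: A' or 'Verdict: B'."""

def judge(prompt: str) -> str:
    # Dummy judge; wire this to a real model in practice.
    return "A is arithmetically correct while B is not. Verdict: A"

def preference(q: str, a: str, b: str) -> str:
    rationale = judge(JUDGE_TEMPLATE.format(q=q, a=a, b=b))
    return rationale.rsplit("Verdict:", 1)[-1].strip()

print(preference("What is 2+2?", "4", "5"))  # -> A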
🧪 Want an AI-generated paper draft in just 1 minute? Or dreaming of building auto-research apps but frustrated with setups? Meet tiny-scientist, a minimal package to start AI-powered research: 👉 pip install tiny-scientist 🔗 github.com/ulab-uiuc/tiny… #AIAgent #pythonpackages
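A hypothetical usage sketch after installing; the class and method names below are placeholders, not the package's documented API (check the linked repo for the real interface):

# NOTE: hypothetical interface, for illustration only.
from tiny_scientist import TinyScientist  # placeholder import

scientist = TinyScientist(model="gpt-4o")                     # placeholder constructor
idea = scientist.think(intent="efficient reasoning in LLMs")  # placeholder call
draft = scientist.write(idea)                                 # placeholder call
print(draft)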
Introducing Qwen3! We release the open-weight Qwen3 family, our latest large language models, including 2 MoE models and 6 dense models ranging from 0.6B to 235B parameters. Our flagship model, Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, general…
🚀 Excited to announce that our paper Search-R1 is now live! 📄 We introduce an RL framework (an extension of Deepseek-R1) for training reasoning-and-retrieval interleaved LLMs. We're also open-sourcing all resources: models, data, and more! 📜 Paper:…
🚀 Introducing Search-R1 – the first reproduction of Deepseek-R1 (zero) for training reasoning and search-augmented LLM agents with reinforcement learning! This is a step towards training an open-source OpenAI “Deep…
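A hedged sketch of the reasoning-and-retrieval interleaved rollout these two posts describe (illustrative control flow, not the Search-R1 codebase). generate() and retrieve() are dummy stand-ins for a policy LLM and a search engine.

def generate(trajectory: str) -> str:
    # Dummy policy: asks one search query, then answers once evidence arrives.
    if "<information>" in trajectory:
        return "<answer>Paris</answer>"
    return "<search>capital of France</search>"

def retrieve(query: str) -> str:
    # Dummy retriever: canned passage; a real system would hit a search index.
    return f"Top passage for '{query}': Paris is the capital of France."

def rollout(question: str, max_turns: int = 4) -> str:
    trajectory = question
    for _ in range(max_turns):
        step = generate(trajectory)
        trajectory += "\n" + step
        if "<answer>" in step:  # episode ends when the model answers
            break
        if "<search>" in step:  # environment fulfills the search call
            query = step.split("<search>")[1].split("</search>")[0]
            trajectory += f"\n<information>{retrieve(query)}</information>"
    return trajectory

print(rollout("What is the capital of France?"))

The RL part then treats the whole trajectory, search calls included, as the policy's actions to be rewarded.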
Gemma 3 is here! Our new open models are incredibly efficient - the largest 27B model runs on just one H100 GPU. You'd need at least 10x the compute to get similar performance from other models ⬇️