Yangyi Chen (on job market)
@YangyiChen6666
CS Ph.D. Candidate at UIUC @IllinoisCS, focusing on scalable foundation models. I’m on the industry job market, seeking full-time research scientist positions!
🚀 I'm looking for full-time research scientist jobs on foundation models! I study pre-training and post-training of foundation models, and LLM-based coding agents. The figure highlights my research/publications. Please DM me if you see a good fit. Highly appreciated!

Introducing AceReason-Nemotron: Advancing math and code reasoning through reinforcement learning (RL)
We propose conducting RL on math-only prompts first, then on code-only prompts. Our key findings include:
- Math-only RL significantly boosts both math and code benchmarks!
-…
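The tweet is truncated, but its core recipe, verifiable-reward RL on math-only prompts followed by RL on code-only prompts, can be sketched as a two-stage loop. Everything below (run_rl_stage, sample_rollout, verify, the toy data) is a hypothetical stub for illustration, not the AceReason-Nemotron implementation.

```python
# Toy sketch of the staged RL recipe described in the tweet: verifiable-reward
# RL on math-only prompts first, then on code-only prompts. All names are
# illustrative stubs, not the AceReason-Nemotron code.

import random

def verify(sample: str, reference: str) -> float:
    """Binary verifiable reward: 1.0 if the rollout matches the reference answer
    (or passes unit tests, for code), else 0.0. Stubbed as string equality."""
    return 1.0 if sample.strip() == reference.strip() else 0.0

def sample_rollout(policy: dict, prompt: str) -> str:
    """Placeholder rollout; a real system would decode from the LLM policy."""
    return random.choice(["42", "def f(): pass", prompt[::-1]])

def policy_update(policy: dict, rollout: str, reward: float) -> None:
    """Placeholder RL update (a GRPO/PPO step in practice)."""
    policy["reward_sum"] = policy.get("reward_sum", 0.0) + reward

def run_rl_stage(policy: dict, dataset: list[tuple[str, str]], steps: int) -> None:
    for _ in range(steps):
        prompt, reference = random.choice(dataset)
        rollout = sample_rollout(policy, prompt)
        policy_update(policy, rollout, verify(rollout, reference))

policy = {}
math_data = [("What is 6 * 7?", "42")]
code_data = [("Write an empty function f.", "def f(): pass")]

run_rl_stage(policy, math_data, steps=100)  # Stage 1: math-only RL
run_rl_stage(policy, code_data, steps=100)  # Stage 2: code-only RL
```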
Thrilled to share my first project at NVIDIA! ✨ Today’s language models are pre-trained on vast and chaotic Internet texts, but these texts are unstructured and poorly understood. We propose CLIMB — Clustering-based Iterative Data Mixture Bootstrapping — a fully automated…
Learning to perceive while learning to reason! We introduce PAPO: Perception-Aware Policy Optimization, a direct upgrade to GRPO for multimodal reasoning. PAPO relies on internal supervision signals. No extra annotations, reward models, or teacher models needed. 🧵1/3
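PAPO is only described here at a high level; the concrete piece worth sketching is the GRPO group-relative advantage it is framed as upgrading. The perception-aware, internally supervised term that gives PAPO its name is not reproduced below, it is only marked as a hypothetical placeholder.

```python
# Minimal sketch of the GRPO-style group-relative advantage that PAPO extends.
# The perception-aware internal-supervision term from the paper is NOT shown;
# a placeholder hook only marks where it would enter the objective.

import statistics

def group_relative_advantages(rewards: list[float]) -> list[float]:
    """GRPO normalizes each rollout's reward against the group of rollouts
    sampled from the same prompt: (r - mean) / std."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards) or 1.0  # guard against zero variance
    return [(r - mean) / std for r in rewards]

def papo_objective_placeholder(advantage: float, perception_term: float = 0.0) -> float:
    """Hypothetical: PAPO adds an internally supervised perception signal to the
    policy objective; its exact form is in the paper, not in this sketch."""
    return advantage + perception_term

# Example: one prompt, a group of 4 rollouts with verifiable rewards.
print(group_relative_advantages([1.0, 0.0, 0.0, 1.0]))
```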
🧠 How can AI evolve from statically 𝘵𝘩𝘪𝘯𝘬𝘪𝘯𝘨 𝘢𝘣𝘰𝘶𝘵 𝘪𝘮𝘢𝘨𝘦𝘴 → dynamically 𝘵𝘩𝘪𝘯𝘬𝘪𝘯𝘨 𝘸𝘪𝘵𝘩 𝘪𝘮𝘢𝘨𝘦𝘴 as cognitive workspaces, similar to the human mental sketchpad? 🔍 What’s the 𝗿𝗲𝘀𝗲𝗮𝗿𝗰𝗵 𝗿𝗼𝗮𝗱𝗺𝗮𝗽 from tool-use → programmatic…
Can we scale 4D pretraining to learn general space-time representations that reconstruct an object from a few views at any time to any view at any other time? Introducing 4D-LRM: a Large Space-Time Reconstruction Model that ... 🔹 Predicts 4D Gaussian primitives directly from…
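For readers unfamiliar with the output format: below is a hypothetical, simplified container for one 4D Gaussian primitive, following common 4D Gaussian splatting parameterizations (space-time mean, per-axis scale, rotation, opacity, color). The exact parameterization 4D-LRM predicts may differ.

```python
# Hypothetical, simplified 4D Gaussian primitive of the kind the tweet says
# 4D-LRM predicts. Field choices follow common 4D Gaussian splatting setups,
# not necessarily 4D-LRM's.

from dataclasses import dataclass
import numpy as np

@dataclass
class Gaussian4D:
    mean_xyzt: np.ndarray   # (4,) center in space (x, y, z) and time t
    scale: np.ndarray       # (4,) per-axis extent of the space-time ellipsoid
    rotation: np.ndarray    # (8,) e.g. a pair of quaternions for a 4D rotation
    opacity: float          # in [0, 1]
    rgb: np.ndarray         # (3,) color (real systems often use SH coefficients)

def density_at(g: Gaussian4D, xyzt: np.ndarray) -> float:
    """Toy axis-aligned density query, ignoring rotation for brevity."""
    d = (xyzt - g.mean_xyzt) / g.scale
    return g.opacity * float(np.exp(-0.5 * d @ d))

g = Gaussian4D(np.zeros(4), np.ones(4), np.zeros(8), 0.9, np.array([1.0, 0.5, 0.2]))
print(density_at(g, np.array([0.1, 0.0, 0.0, 0.5])))
```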
📢We conduct a systematic study to demystify the synergy between SFT and RL for reasoning models. The result? We trained a 7B model, AceReason-Nemotron-1.1, significantly improved over version 1.0 on math and coding benchmarks.
✅AIME2025 (math): 53.6% -> 64.8%
✅LiveCodeBench…
Does RL truly expand a model’s reasoning🧠capabilities? Contrary to recent claims, the answer is yes—if you push RL training long enough! Introducing ProRL 😎, a novel training recipe that scales RL to >2k steps, empowering the world’s leading 1.5B reasoning model💥and offering…
Thrilled to share that our paper has been accepted to #ACL2025 Main 🇦🇹 Huge thanks to my amazing collaborators and my advisor @hengjinlp 🙃 📄arxiv.org/abs/2502.17793 Happy to chat about our work as well as MLLM research projects 🙌
Meet Devstral, our SOTA open model designed specifically for coding agents and developed with @allhands_ai mistral.ai/news/devstral
Soft Soft Soft 🍰
Glad people are seeing SGA flop on national television. Absolute joke of a player to watch. The type of hoops that makes you understand why the NBA is losing ratings.
We are extremely excited to announce mCLM, a Modular Chemical Language Model that is friendly to automatable block-based chemistry and mimics bilingual speakers by “code-switching” between functional molecular modules and natural language descriptions of the functions. 1/2
🚀 Can we cast reward modeling as a reasoning task? 📖 Introducing our new paper: RM-R1: Reward Modeling as Reasoning 📑 Paper: arxiv.org/pdf/2505.02387 💻 Code: github.com/RM-R1-UIUC/RM-… Inspired by recent advances of long chain-of-thought (CoT) on reasoning-intensive tasks, we…
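Read literally, the framing is: the reward model first writes a long chain of thought about two candidate responses and only then emits a preference. The sketch below shows that interface under that reading; the prompt template, verdict tag, and parsing are invented for illustration and are not RM-R1's actual format.

```python
# Hypothetical interface for a "reasoning" reward model in the spirit of RM-R1:
# generate a chain of thought comparing two candidates, then parse out a final
# verdict. The template and verdict tag are invented for this sketch.

JUDGE_TEMPLATE = """You are a reward model. Think step by step about which
response better answers the question, then end with 'Verdict: A' or 'Verdict: B'.

Question: {question}
Response A: {answer_a}
Response B: {answer_b}
"""

def generate(prompt: str) -> str:
    """Stub for an LLM call; a real system would decode a long CoT here."""
    return "A is correct and concise, B is off-topic.\nVerdict: A"

def judge(question: str, answer_a: str, answer_b: str) -> str:
    cot = generate(JUDGE_TEMPLATE.format(
        question=question, answer_a=answer_a, answer_b=answer_b))
    # The chain of thought is what training would shape; the parsed verdict is
    # the preference signal handed back to the policy trainer.
    return "A" if cot.strip().endswith("Verdict: A") else "B"

print(judge("What is 2+2?", "4", "5"))
```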
Besides this picture, this message hallucinates the full name of our approach based on the acronym, which includes 2 words that appeared ZERO times in the entire paper. @icmlconf
[Approach]
➤ Embeds and clusters web-scale data semantically.
➤ Searches, iteratively and efficiently, for optimal data mixtures using a lightweight proxy model + predictor loop.
➤ Learns how different domains interact, and how the right mix can unlock downstream performance…
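A minimal toy sketch of the proxy-model + predictor loop the bullets describe: propose candidate cluster mixtures, train a lightweight proxy on a few of them, record the scores, fit a cheap predictor over (mixture, score) pairs, and let the predictor prune the next round's candidates. Every function and constant below is an illustrative stub, not the CLIMB implementation.

```python
# Stub sketch of an iterative data-mixture search with a proxy model and a
# learned predictor. Each "mixture" is a weight vector over the semantic
# clusters obtained from embedding and clustering the web corpus.

import random

CLUSTERS = ["web_math", "code", "news", "encyclopedic"]  # hypothetical clusters

def random_mixture() -> list[float]:
    w = [random.random() for _ in CLUSTERS]
    s = sum(w)
    return [x / s for x in w]

def train_proxy_and_score(mixture: list[float]) -> float:
    """Stub: train a small proxy model on data sampled with these weights and
    return a downstream score. Here it's a fake objective for illustration."""
    return 0.6 * mixture[0] + 0.4 * mixture[1] - 0.2 * mixture[2]

def fit_predictor(history: list[tuple[list[float], float]]):
    """Stub predictor: score of the nearest mixture already evaluated."""
    def predict(m: list[float]) -> float:
        nearest = min(history, key=lambda h: sum((a - b) ** 2 for a, b in zip(h[0], m)))
        return nearest[1]
    return predict

history: list[tuple[list[float], float]] = []
for round_idx in range(3):                        # a few bootstrapping rounds
    candidates = [random_mixture() for _ in range(32)]
    if history:                                   # predictor prunes candidates
        predict = fit_predictor(history)
        candidates = sorted(candidates, key=predict, reverse=True)
    for m in candidates[:4]:                      # only a few proxy runs per round
        history.append((m, train_proxy_and_score(m)))

best_mixture, best_score = max(history, key=lambda h: h[1])
print(best_mixture, best_score)
```

The design point the bullets emphasize is cost: the expensive step (training a proxy) runs only on a handful of mixtures per round, while the cheap predictor does the wide search.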