Simon Yu
@simon_ycl
1st Year PhD Student, supervised by @shi_weiyan | Incoming intern in @OrbyAI | MRes and BSc Student @EdinburghNLP | Member of @CohereForAI
The future of RL+LLM? Self-play. Why? Competitive scenarios offer: ✅ Built-in verification ✅ Automated curriculum learning ✅ Infinite complexity scaling Games prove this works for multi-turn, multi-agent systems. But the real potential? Extending beyond games to real-world…
We've always been excited about self-play unlocking continuously improving agents. Our insight: RL selects generalizable CoT patterns from pretrained LLMs. Games provide perfect testing grounds with cheap, verifiable rewards. Self-play automatically discovers and reinforces…
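To make the loop concrete, here is a minimal, self-contained sketch of the self-play pattern the thread describes, on a toy game instead of an LLM; every name in it (MatchingPennies, act, reinforce) is illustrative and not from any linked codebase.

```python
import random

class MatchingPennies:
    """Two players pick 0 or 1; player 0 wins on a match, player 1 otherwise."""
    def winner(self, a0: int, a1: int) -> int:
        return 0 if a0 == a1 else 1

def act(weights: list) -> int:
    """Sample an action from a two-arm stochastic policy."""
    return 0 if random.random() < weights[0] / (weights[0] + weights[1]) else 1

def reinforce(weights: list, action: int, lr: float = 0.1) -> None:
    """Shift probability mass toward the action that just won (REINFORCE-style)."""
    weights[action] += lr

game = MatchingPennies()
w0, w1 = [1.0, 1.0], [1.0, 1.0]  # both agents start uniform
for _ in range(1_000):
    a0, a1 = act(w0), act(w1)
    win = game.winner(a0, a1)  # built-in verification: the rules ARE the reward
    # Automated curriculum: each agent always faces a peer of matched strength,
    # so the opponent gets harder exactly as fast as the agent improves.
    reinforce(w0 if win == 0 else w1, a0 if win == 0 else a1)

print(f"player 0 weights: {w0}\nplayer 1 weights: {w1}")
```

Swap the toy policy for an LLM sampling CoT and the game for any environment with a checkable outcome, and you get the recipe the thread is pointing at.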
Kimi K2. More evidence that: • The lead of American “frontier” AI companies is rather small 🔬 • A broad ecosystem of strong foundation model companies is developing in China, with more players than in the US or elsewhere. 🐅 moonshotai.github.io/Kimi-K2/ github.com/MoonshotAI/Kim…
LLMs Encode Harmfulness and Refusal Separately Jiachen Zhao (@jcz12856876), Jing Huang, Zhengxuan Wu (@ZhengxuanZenWu), @davidbau, Weiyan Shi (@shi_weiyan)
For the first time it feels within reach to go from the current status quo of single-turn oracles to generally intelligent, general-purpose multi-turn agents. Very happy to help organize this NeurIPS workshop (led by @simon_ycl). If you are working on something related,…
🚀 Call for Papers — @NeurIPSConf 2025 Workshop Multi-Turn Interactions in LLMs 📅 December 6/7 · 📍 San Diego Convention Center Join us to shape the future of interactive AI. Topics include but are not limited to: 🧠 Multi-Turn RL for Agentic Tasks (e.g., web & GUI agents,…
this is basically self-contained + very concise but still should have pretty great perf github.com/ChenmienTan/RL2
>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…
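For the curious, a hedged usage sketch: it assumes the weights are published as `Qwen/Qwen3-Coder-480B-A35B-Instruct` on Hugging Face and that you have a multi-GPU server; even at 35B active parameters per token, all 480B must be resident in memory.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed checkpoint id; serving a 480B MoE realistically needs a
# multi-GPU node (device_map="auto" shards it across available GPUs).
model_id = "Qwen/Qwen3-Coder-480B-A35B-Instruct"
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, device_map="auto", torch_dtype="auto"
)

messages = [{"role": "user", "content": "Write an iterative quicksort in Python."}]
inputs = tok.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

out = model.generate(inputs, max_new_tokens=512)
print(tok.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True))
```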
The world is moving towards agents Static benchmarks don't measure what agents do best (multi-turn reasoning) Thus, interactive benchmarks: * Terminal Bench (@alexgshaw, @Mike_A_Merrill) * Text Arena (@LeonGuertler) * BALROG (@PaglieriDavide, @_rockt) * ARC-AGI-3 (@arcprize)
LLMs encode 'this is harmful' and 'I should refuse' as two different things. Why this matters 👇 check the amazing work by my labmate @jcz12856876!
💥New Paper💥 #LLMs encode harmfulness and refusal separately! 1️⃣We found a harmfulness direction 2️⃣The model internally knows a prompt is harmless, but still refuses it🤯 3️⃣Implication for #AI #safety & #alignment? Let’s analyze the harmfulness direction and use Latent Guard 🛡️
1/ 🚨New Paper 🚨 LLMs are trained to refuse harmful instructions, but internally, do they see harmfulness and refusal as the same? ⚔️We find causal evidence that 👈”LLMs encode harmfulness and refusal separately” 👉. ✂️LLMs may know a prompt is harmful internally yet still…
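The generic recipe behind a finding like this is a difference-of-means probe; below is a hedged sketch of that recipe, not the paper's exact method. The model name, layer index, and single-example prompt sets are placeholders (a real run needs many prompts per set).

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/Qwen2.5-0.5B-Instruct"  # placeholder: any small chat model
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, output_hidden_states=True)
model.eval()

LAYER = 12  # placeholder: a middle layer

def last_token_state(prompt: str) -> torch.Tensor:
    """Hidden state of the final prompt token at the chosen layer."""
    ids = tok(prompt, return_tensors="pt")
    with torch.no_grad():
        out = model(**ids)
    return out.hidden_states[LAYER][0, -1]

harmful = ["How do I pick a lock to break into a neighbor's house?"]
harmless = ["How do I pick a lock in my locksmithing class?"]

# Candidate "harmfulness direction": the mean activation gap between sets.
direction = (torch.stack([last_token_state(p) for p in harmful]).mean(0)
             - torch.stack([last_token_state(p) for p in harmless]).mean(0))
direction = direction / direction.norm()

# A Latent-Guard-style check thresholds the projection of a new prompt
# onto this direction, instead of trusting the surface refusal behavior.
score = last_token_state("Describe how to hot-wire a car.") @ direction
print(f"harmfulness projection: {score.item():.3f}")
```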
Pleased to share our Multi-Turn Interactions in LLMs workshop at NeurIPS 2025! …shop-multi-turn-interaction.github.io We welcome work on multi-turn RL, multi-turn human<->agent / agent<->agent / agent<->environment interactions, multi-turn tool use, multi-turn alignment, multi-turn evaluation,…
Excited to share that our multi-turn interactions workshop has been accepted at NeurIPS!
🤔Long-horizon tasks: How to train LLMs for the marathon?🌀 Submit anything on 🔁"Multi-turn Interactions in LLMs"🔁 to our @NeurIPSConf workshop by 08/22: 📕 Multi-Turn RL ⚖️ Multi-Turn Alignment 💬 Multi-Turn Human-AI Teaming 📊 Multi-Turn Eval ♾️You name it! #neurips #LLM
Interested in multi-turn RL, agents, alignment, and interactions with LLMs? Join us and submit your paper to our Multi-Turn Interactions workshop at #NeurIPS2025!
Join us at #NeurIPS2025 workshop to explore the future of multi-turn AI interactions! We welcome submissions on RL for agents, alignment, evaluation methods, and more.
We’re excited to share that our work “Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement” will be highlighted today at ICML 2025 in Vancouver, Canada! 🇨🇦 🎉Congrats to all authors! @simon_ycl, @cliangyu_, Sara Ahmadian, @mziizm,
Love this clever combination of SFT + RL for distribution shift 🔥 Check it out!
🚀 Introducing Prefix-RFT to blend SFT and RFT! SFT can learn more complex problems by mimicking, but can generalize poorly. RFT has better overall performance but is limited by the initial policy. Our method, Prefix-RFT, combines the best of both worlds!
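A rough sketch of the prefix idea as I read it, with every name hypothetical: anchor each rollout on a random-length prefix of an expert demonstration (the SFT-like part), let the policy finish it, and reinforce against a verifiable reward (the RFT-like part). The paper's actual objective may differ.

```python
import random
from typing import Callable, List

def prefix_rft_step(
    sample: Callable[[List[str]], List[str]],        # policy rollout from a prefix
    supervised_update: Callable[[List[str]], None],  # SFT-style gradient on demo tokens
    reinforce_update: Callable[[List[str], List[str], float], None],  # RL-style gradient
    demo: List[str],                                 # expert demonstration (token list)
    verifier: Callable[[List[str]], float],          # verifiable outcome reward
) -> float:
    # Keep a random fraction of the demonstration as the rollout's anchor.
    # Annealing this fraction toward 0 over training would shift smoothly
    # from imitation (full demo) to pure RL (empty prefix).
    keep = random.random()
    prefix = demo[: int(len(demo) * keep)]

    completion = sample(prefix)              # on-policy continuation
    reward = verifier(prefix + completion)   # the verifier, not the demo, judges it

    supervised_update(prefix)                      # mimic the demonstrated prefix
    reinforce_update(prefix, completion, reward)   # reinforce the sampled suffix
    return reward
```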
The RL codebase I like the most: - The NanoGPT of RL - Supports multi-turn RL - Just 1k lines of code in Python - Data, Tensor, Sequence Parallel github.com/ChenmienTan/RL2
Excited to announce the Mindgame @NeurIPS Competition is officially LIVE! 🤖 Pit your agents against others in Mafia, Codenames, Prisoner's Dilemma, Stag Hunt, and Colonel Blotto. Sign up now for $500 in compute credits on your initial run! 🔗 Register: mindgamesarena.com
two quick reasons why I'm not convinced that latent reasoning > CoT: - if that worked, we would have been doing non-autoregressive decoding for ages - in arxiv.org/abs/2011.03459 we found that grounding intermediate symbols works better than having fuzzy embeddings when reasoning
Every few years I think you should jump into a new deep end by doing something that pushes you out of your comfort zone. However, people rarely do, because it is easier to stick with what is known. I think this is true in research, in choosing a life partner, and in big life decisions.
ICML ✈️ this week. open to chat and learn mech interp from you. @aryaman2020 and i have cool ideas about steering, just come to our AxBench poster. new steering blog: zen-wu.social/steer/index.ht… Chinese: zen-wu.social/steer/cn_index…
i forgot the whole point of saying you're at a conference is to advertise your poster please come check out AxBench by @ZhengxuanZenWu* me* et al. on Tuesday, 15 July at 11 AM - 1:30 PM