Michael Hu

@michahu8

NLP, training data, RL | PhD @NYU | multi-agent collaboration 🤖🤝🤖 @Microsoft | @NSF GRFP fellow | prev @princeton_nlp, @cocosci_lab.

New York, NY

Joined August 2019

620Following

734Followers

Pinned

Michael Hu@michahu8 · Feb 27

Training on a little 🤏 formal language BEFORE natural language can make pretraining more efficient! How and why does this work? The answer lies…Between Circuits and Chomsky. 🧵1/6👇

michahu8's tweet image. Training on a little 🤏 formal language BEFORE natural language can make pretraining more efficient!

How and why does this work? The answer lies…Between Circuits and Chomsky.

🧵1/6👇

107

712

481

79.0K

Michael Hu@michahu8 · Jul 3

aka the HATER track 😍

RRylan Schaeffer@RylanSchaeffer · Jul 3

New position paper! Machine Learning Conferences Should Establish a “Refutations and Critiques” Track Joint w/ @sanmikoyejo @JoshuaK92829 @yegordb @bremen79 @koustuvsinha @in4dmatics @JesseDodge @suchenzang @BrandoHablando @MGerstgrasser @is_h_a @ObbadElyas 1/6

2.0K

Michael Hu@michahu8 · Jun 26

RL can certainly teach LLMs new skills in principle, but in practice token-level exploration is so challenging that we end up relying on pretraining and synthetic data. the era of experience implies the era of exploration

YYiding Jiang@yidingjiang · Jun 26

A mental model I find useful: all data acquisition (web scrapes, synthetic data, RL rollouts, etc.) is really an exploration problem 🔍. This perspective has some interesting implications for where AI is heading. Wrote down some thoughts: yidingjiang.github.io/blog/post/expl…

920

Michael Hu@michahu8 · Jun 9

how LLMs solve a task can change as the task gets harder! we analyzed this phenomenon here in a crisp, controlled setting with formal languages

JJackson Petty@jowenpetty · Jun 9

How well can LLMs understand tasks with complex sets of instructions? We investigate through the lens of RELIC: REcognizing (formal) Languages In-Context, finding a significant overhang between what LLMs are able to do theoretically and how well they put this into practice.

481

Michael Hu@michahu8 · May 28

Accepted to ACL! See you in Vienna 🫡 code: github.com/michahu/pre-pr… arxiv: arxiv.org/abs/2502.19249

MMichael Hu@michahu8 · Feb 27

Training on a little 🤏 formal language BEFORE natural language can make pretraining more efficient! How and why does this work? The answer lies…Between Circuits and Chomsky. 🧵1/6👇

8.0K

Michael Hu@michahu8 · May 26

hot multi agent researcher summer

MMichael Hu@michahu8 · Jun 19, 2024

hot interpretability researcher summer

920

Michael Hu Retweeted

Chris Paxton@chris_j_paxton · May 9

Reinforcement learning has been posited as a solution to data issues when training everything from general-purpose reasoning models like Deepseek R1, to humanoid robots like Optimus and Unitree G1. But when is it useful? What makes a problem suitable for RL? I think it's…

239

139

16.0K

Michael Hu Retweeted

Mayee Chen@MayeeChen · Apr 22

!!! I'm at #ICLR2025 to present 🧄Aioli🧄 a unified framework for data mixing on Thursday afternoon! 🔗 arxiv.org/abs/2411.05735 Message me to chat about pre/post training data (mixing, curriculum, understanding); test-time compute/verification; or to try new food 🇸🇬

155

19.0K

Michael Hu@michahu8 · Feb 28

it is my great honour to be appointed as the Glen se Vries Professor of Health Statistics. i have quickly written about this in my blog post: kyunghyuncho.me/glen-de-vries-…

NNYU Courant@NYU_Courant · Feb 28

Kyunghyun Cho (@kchonyc), Professor of Computer Science and Data Science, has been named recipient of the Glen de Vries Chair for Health Statistics by the Courant Institute and New York University. Congratulations!

342

33.0K