Noam Razin
@noamrazin
Postdoctoral Fellow at @PrincetonPLI | Past: Computer Science PhD @TelAvivUni & Apple Scholar in AI/ML | Interested in the foundations of deep learning
The success of RLHF depends heavily on the quality of the reward model (RM), but how should we measure this quality? 📰 We study what makes a good RM from an optimization perspective. Among other results, we formalize why more accurate RMs are not necessarily better teachers! 🧵
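As a toy illustration of that last point (my sketch, not the paper's formal analysis): two reward models that rank a set of responses identically, and therefore have the same accuracy, can still produce very different policy-gradient signal. The rewards, the 3-response softmax policy, and the REINFORCE-style gradient below are assumptions chosen purely for illustration.

```python
# Illustrative sketch (not the paper's formalism): equal-accuracy reward models
# with different reward spreads give very different gradient magnitudes.
import numpy as np

logits = np.zeros(3)                      # uniform softmax policy over 3 responses
rm_sharp = np.array([0.0, 0.5, 1.0])      # correct ranking, large reward gaps
rm_flat  = np.array([0.0, 0.005, 0.01])   # same ranking (same accuracy), tiny gaps

def policy_gradient(logits, rewards):
    # Gradient of the expected reward of a softmax policy:
    # d E[r] / d logit_j = p_j * (r_j - E[r]).
    p = np.exp(logits) / np.exp(logits).sum()
    baseline = p @ rewards
    return p * (rewards - baseline)

print(policy_gradient(logits, rm_sharp))  # ~[-0.17, 0.00, 0.17]
print(policy_gradient(logits, rm_flat))   # ~100x smaller, despite identical ranking
```

The flat reward model orders the responses exactly as the sharp one does, yet yields a gradient roughly 100x smaller, one concrete sense in which accuracy alone does not determine how good a teacher an RM is for optimization.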

New extended version of the preprint “Edge of Stochastic Stability (EoSS)” out! w/ @arseniqum 👉 arxiv.org/pdf/2412.20553 🗓️ Tomorrow (Wed July 23, 12 PM EDT) I’ll talk about it at OWML — sfu.zoom.us/j/89334355925 I've never explained what it's about, so I'll do it here:
(1/4)🚨 Introducing Goedel-Prover V2 🚨 🔥🔥🔥 The strongest open-source theorem prover to date. 🥇 #1 on PutnamBench: Solves 64 problems—with far less compute. 🧠 New SOTA on MiniF2F: * 32B model hits 90.4% at Pass@32, beating DeepSeek-Prover-V2-671B’s 82.4%. * 8B > 671B: Our 8B…
Do neural nets really need gradient descent to generalize?🚨 We dive into matrix factorization and find a sharp split: wide nets rely on GD, while deep nets can thrive with any low-training-error weights! arxiv.org/abs/2506.03931 🧵
Do NNs need GD to generalize? Check out our new paper 👇
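For context on the setting named in the tweets above, here is a minimal sketch (not the paper's code) of deep matrix factorization for matrix completion: a product of square factors is fit by plain gradient descent to a subset of entries of a low-rank matrix, and generalization is read off the unobserved entries. The matrix size, depth, initialization scale, learning rate, and step count are all assumptions chosen for illustration.

```python
# Minimal deep matrix factorization sketch (illustrative assumptions throughout):
# fit W = W_3 @ W_2 @ W_1 to ~30% of the entries of a rank-2 matrix with plain
# full-batch gradient descent, then measure error on the unobserved entries.
import torch

torch.manual_seed(0)
d, rank, depth = 20, 2, 3
target = torch.randn(d, rank) @ torch.randn(rank, d)   # low-rank ground truth
mask = (torch.rand(d, d) < 0.3).float()                # observed entries (~30%)

# Deep factorization: product of `depth` square factors, small initialization.
factors = [torch.nn.Parameter(0.1 * torch.randn(d, d)) for _ in range(depth)]
opt = torch.optim.SGD(factors, lr=0.2)                 # full-batch GD

def product(fs):
    W = fs[0]
    for F in fs[1:]:
        W = F @ W
    return W

for step in range(20000):
    W = product(factors)
    train_loss = ((W - target) * mask).pow(2).sum() / mask.sum()
    opt.zero_grad()
    train_loss.backward()
    opt.step()

with torch.no_grad():
    W = product(factors)
    test_loss = ((W - target) * (1 - mask)).pow(2).sum() / (1 - mask).sum()
print(f"train MSE {train_loss.item():.2e}  test MSE {test_loss.item():.2e}")
```

Whether low training error alone suffices for low test error, or whether gradient descent specifically is needed to generalize, is exactly the wide-versus-deep split the tweet describes.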
LLMs can solve complex tasks that require combining multiple reasoning steps. But when are such capabilities learnable via gradient-based training? In our new COLT 2025 paper, we show that easy-to-hard data is necessary and sufficient! arxiv.org/abs/2505.23683 🧵 below (1/10)
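To make "easy-to-hard data" concrete, here is a toy sketch (my illustration, not the paper's construction): difficulty k means composing a fixed single-step operation k times, so an easy-to-hard training set covers every intermediate difficulty, while a hard-only set contains just the full K-step composition. The step function, modulus, and difficulty range are assumptions.

```python
# Toy illustration of easy-to-hard vs. hard-only training data for a
# compositional task (the task and parameters are assumptions for this sketch).
import random

random.seed(0)
MOD = 97

def step(x: int) -> int:
    # One "reasoning step": a fixed simple operation.
    return (3 * x + 1) % MOD

def make_example(k: int) -> tuple[int, int, int]:
    # An example of difficulty k: input x, number of steps k, target step^k(x).
    x = random.randrange(MOD)
    y = x
    for _ in range(k):
        y = step(y)
    return x, k, y

K = 6
easy_to_hard = [make_example(k) for k in range(1, K + 1) for _ in range(100)]
hard_only = [make_example(K) for _ in range(600)]

# A learner trained on `easy_to_hard` sees every intermediate difficulty;
# `hard_only` exposes it only to the full K-step composition.
print(len(easy_to_hard), len(hard_only), easy_to_hard[0], hard_only[0])
```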
In a new blog post, @HowardYen1 and @xiye_nlp introduce HELMET and LongProc, two benchmarks from a recent effort to build a holistic test suite for evaluating long-context LMs. Read now: pli.princeton.edu/blog/2025/long…