Sahil Verma
@Sahil1V
PhD student @uwcse. Robustness and Interpretability. Currently at @MSFTResearch. Former intern at @amazon, @itsArthurAI. Undergrad @IITKanpur
Code is live! Check out LoRe, a modular, lightweight codebase for personalized reward modeling from user preferences. Few-shot personalization. Benchmarks: TLDR, PRISM, PersonalLLM. github.com/facebookresear… Huge thanks to @AIatMeta for open-sourcing this research!
Your LLM should model how you think, not reduce you to preassigned traits. Introducing LoRe: a low-rank reward modeling framework for personalized RLHF. ✗ Demographic grouping/handcrafted traits ✓ Infers implicit preferences ✓ Few-shot adaptation. arxiv.org/abs/2504.14439
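For intuition, here is a minimal NumPy sketch of the low-rank idea as the tweet describes it (a shared reward basis plus per-user weights, adapted few-shot). All names, dimensions, and the logistic fitting loop are illustrative assumptions, not the paper's actual code:

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed setup: a shared low-rank basis B maps response features
# phi(x) in R^d to k reward components; each user u is just a small
# weight vector w_u in R^k over that shared basis.
d, k = 16, 4
B = rng.normal(size=(k, d))          # shared basis (learned across users)

def user_reward(w_u, phi):
    """Personalized reward: r_u(x) = w_u . (B phi(x))."""
    return w_u @ (B @ phi)

def adapt_user(prefs, lr=0.5, steps=200):
    """Few-shot adaptation: fit w_u from a handful of pairwise
    preferences (phi_pos preferred over phi_neg) with a logistic
    (Bradley-Terry) likelihood, B held fixed."""
    w = np.zeros(k)
    for _ in range(steps):
        g = np.zeros(k)
        for phi_pos, phi_neg in prefs:
            z = B @ (phi_pos - phi_neg)          # k-dim preference direction
            p = 1.0 / (1.0 + np.exp(-(w @ z)))   # P(pos preferred | w)
            g += (1.0 - p) * z                   # log-likelihood gradient
        w += lr * g / len(prefs)
    return w

# Simulate a user with hidden weights and 8 preference pairs.
w_true = rng.normal(size=k)
prefs = []
for _ in range(8):
    a, b = rng.normal(size=d), rng.normal(size=d)
    if user_reward(w_true, a) >= user_reward(w_true, b):
        prefs.append((a, b))
    else:
        prefs.append((b, a))

w_hat = adapt_user(prefs)
# The adapted weights should rank the few-shot pairs consistently.
acc = np.mean([user_reward(w_hat, p) > user_reward(w_hat, n) for p, n in prefs])
print(acc)
```

The point of the low-rank structure is that only the k-dimensional w_u is user-specific, so a few comparisons suffice to personalize.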
Llama Nemotron just got super-charged! We released Llama-Nemotron-Super-v1.5 today: the best open model that can be deployed on a single H100. Enhanced for reasoning, tool use, general chat, and instruction following. HF: huggingface.co/nvidia/Llama-3…
Very excited to announce Llama-Nemotron-Super-V1.5! Super-V1.5 now outperforms Ultra-V1 and is currently the best model that can be deployed on a single H100. Reasoning ON/OFF control, and a drop-in replacement for V1. Open weights, code, and data on HF: huggingface.co/nvidia/Llama-3…
I will be at the Actionable Interpretability Workshop (@ActInterp, #ICML) presenting *SSAEs* in the East Ballroom A from 1-2pm. Drop by (or send a DM) to chat about (actionable) interpretability, (actionable) identifiability, and everything in between!
1/ Hi, can I get an unsupervised sparse autoencoder for steering, please? I only have unlabeled data varying across multiple unknown concepts. Oh, and make sure it learns the same features each time! Yes! A freshly brewed Sparse Shift Autoencoder (SSAE) coming right up.
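A toy NumPy sketch of the shift-autoencoding intuition (the decoder, the ISTA sparse coder, and all shapes here are my illustrative assumptions, not the SSAE architecture): sparse-code *differences* between embeddings so each latent lines up with one concept of variation, then steer by adding a decoder direction back to an embedding.

```python
import numpy as np

rng = np.random.default_rng(1)

# Assumed toy setup: the autoencoder acts on shifts (embedding
# differences), with an L1 penalty so each latent captures one
# concept of variation in the unlabeled data.
d, k = 32, 6
D = rng.normal(size=(d, k))
D /= np.linalg.norm(D, axis=0)       # decoder: one unit-norm direction per concept

def encode(shift, lam=0.1, steps=50):
    """Sparse code for a shift vector via ISTA (proximal gradient on L1)."""
    L = np.linalg.norm(D, 2) ** 2    # Lipschitz constant of the quadratic term
    z = np.zeros(k)
    for _ in range(steps):
        z = z - (D.T @ (D @ z - shift)) / L                    # gradient step
        z = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)  # soft threshold
    return z

def steer(embedding, concept, alpha=1.0):
    """Steering: move an embedding along one learned concept direction."""
    return embedding + alpha * D[:, concept]

# A shift generated by concept 2 should yield a code concentrated on latent 2.
true_shift = 3.0 * D[:, 2] + 0.01 * rng.normal(size=d)
z = encode(true_shift)
print(int(np.argmax(np.abs(z))))   # → 2
```

In the toy, sparsity is what pins each latent to a single concept direction; the identifiability claim in the thread (learning the same features each run) is the part this sketch does not capture.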
Transformers struggle with length generalization and long context. What can we do about it? Our new #TMLR paper with @rolandalong, @paul_smolensky, and @JianfengGao0217 shows how to handle the issue with a new attention mechanism called TRA. Curious? Read the thread for more.
Are you compositionally curious? Want to know how to learn embeddings using trees? In our new #ICML2025 paper, we present Banyan: a recursive net that you can train super efficiently for any language or domain, and get embeddings competitive with much, much larger LLMs. 1/
Struggling with fine-tuning MoE? Meet DenseMixer, an MoE post-training method that offers a more precise router gradient, making MoE easier to train and better performing! Blog: fengyao.notion.site/moe-posttraini…
Using retrieval? Check out this work by my awesome collaborator on increasing diversity when retrieving!
1/8 How can retrieval augmentation be made both relevant and non-redundant for few-shot adaptation? I'm excited to introduce COBRA. Catch our poster at #CVPR25 (ExHall D, Poster #450) on Sat 14 Jun, 5–7 p.m. CDT: cvpr.thecvf.com/virtual/2025/p…
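COBRA's actual objective is in the paper; as a generic baseline for the relevance-vs-redundancy trade-off the thread raises, here is maximal marginal relevance (MMR), which greedily scores each candidate by relevance minus its similarity to items already selected (everything below is an illustration, not COBRA):

```python
import numpy as np

def mmr_select(query, cands, k=3, lam=0.5):
    """Greedy MMR: score = lam * relevance - (1 - lam) * max similarity
    to already-selected items, so near-duplicates are penalized."""
    def cos(a, b):
        return a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9)
    selected, remaining = [], list(range(len(cands)))
    while remaining and len(selected) < k:
        best, best_score = None, -np.inf
        for i in remaining:
            rel = cos(query, cands[i])
            red = max((cos(cands[i], cands[j]) for j in selected), default=0.0)
            score = lam * rel - (1 - lam) * red
            if score > best_score:
                best, best_score = i, score
        selected.append(best)
        remaining.remove(best)
    return selected

# Two near-duplicate highly relevant items plus one diverse-but-relevant
# item: plain top-2 would take both duplicates; MMR takes one duplicate
# plus the diverse item.
q = np.array([1.0, 0.0, 0.0])
cands = [np.array([1.0, 0.10, 0.0]),
         np.array([1.0, 0.11, 0.0]),
         np.array([0.8, 0.0, 0.6])]
print(mmr_select(q, cands, k=2, lam=0.4))   # → [0, 2]
```

lam controls the trade-off: lam=1 recovers plain top-k by relevance, smaller lam pushes harder toward diversity.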
"Vibe coding" is everywhere, but is it really care-free? We introduce REAL, an RL framework that trains LLMs with automated program analysis feedback, enabling "vibe coding" to be not just fast but vulnerability-free & production-ready.
Introducing soarXiv, the most beautiful way to explore human knowledge. Take any paper's URL and replace arxiv with soarxiv (shown in video) to teleport to its place in the universe. I've embedded all 2.8M papers up until April 2025. Try it at: soarxiv dot org
Llama-Nemotron-Ultra-253B just dropped: our most advanced open reasoning model.
Llama-Nemotron-Ultra is fully open: weights and post-training data. Achieves 76.0% on GPQA via FP8 RL training with GRPO. Best open model for scientific reasoning. x.com/soumyesinghal/…
We are excited to release Llama-Nemotron-Ultra! This is a reasoning ON/OFF, dense 253B model, with open weights and post-training data. huggingface.co/nvidia/Llama-3… We started with Llama-405B, modified it via NAS pruning, then applied reasoning-focused post-training: SFT + RL in FP8.
Meet Llama-Nemotron-Super-49B, our team's new reasoning model released at #GTC25! Proud to have contributed. Optimized via NAS for single-GPU inference, it delivers impressive reasoning performance at 49B scale with reasoning mode control (ON/OFF). huggingface.co/nvidia/Llama-3…
We are excited to release the new Llama-Nemotron models, which let you toggle reasoning ON/OFF at runtime. We also release all the post-training data under CC-BY-4.0! Try it now on build.nvidia.com/nvidia/llama-3… HF collection: huggingface.co/collections/nv…
Come for the ridiculous 30-column spreadsheet created at @sweetgreen, stay for a critical discussion of how you *actually* scale models.
We nearly drove ourselves insane trying to reproduce scaling-laws papers. So of course we wrote a paper about it. 1/9
LLM alignment feeling like a black box? Our paper introduces Reward-Aware Preference Optimization (RPO), linking DPO, IPO, SimPO, REINFORCE (LOO), and more! We unpack key design choices with crisp ablations and clear insights. Led by @ssydasheng. arxiv.org/abs/2502.00203
Our team put together a unified mathematical framework to analyze popular model alignment algorithms: "Reward-aware Preference Optimization: A Unified Mathematical Framework for Model Alignment" arxiv.org/pdf/2502.00203
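A toy sketch of the kind of unification the title suggests (symbol names and the exact losses below are my illustration, not the paper's framework): preference-optimization methods compare an "implicit reward" gap, the beta-scaled log-ratio of policy to reference on the chosen vs. rejected response, under different distance functions, and choices like negative log-sigmoid vs. squared error recover DPO-style vs. IPO-style objectives.

```python
import math

def implicit_reward_gap(logp_w, logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """beta * [(log pi(y_w) - log pi_ref(y_w)) - (log pi(y_l) - log pi_ref(y_l))],
    the implicit-reward margin between the preferred and rejected response."""
    return beta * ((logp_w - ref_logp_w) - (logp_l - ref_logp_l))

def dpo_style_loss(gap):
    # Negative log-sigmoid distance: pushes the gap toward +infinity.
    return -math.log(1.0 / (1.0 + math.exp(-gap)))

def ipo_style_loss(gap, target=0.5):
    # Squared-error distance toward a finite target gap (target is illustrative).
    return (gap - target) ** 2

gap = implicit_reward_gap(logp_w=-1.0, logp_l=-2.0,
                          ref_logp_w=-1.5, ref_logp_l=-1.6, beta=0.1)
print(gap, dpo_style_loss(gap), ipo_style_loss(gap))
```

Seen this way, the design choices the thread mentions (which distance, which target reward, whether a reference model appears) are the axes along which DPO, IPO, SimPO, and REINFORCE-style methods differ.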