Gabe Orlanski

@GOrlanski

PhD student @WisconsinCS, MS @nyuniversity, Former @replit, @magicailabs, @Google, @MerlinAero, @Theteamatx

Joined January 2021

192 Following

149 Followers

Pinned

Gabe Orlanski@GOrlanski · Feb 7, 2023

📢 Measuring The Impact Of Programming Language Distribution

We present the BabelCode framework for multi-lingual code evaluation and an investigation into the impact of PL distributions in training data.

Paper: arxiv.org/abs/2302.01973
Code: github.com/google-researc…
🧵

Gabe Orlanski Retweeted

Alex Gu@minimario1729 · Jul 18

come to our ai for math workshop tomorrow it'll be super fun!! 🎉🎉

Gabe Orlanski@GOrlanski · Jun 18

Reinforcement learning has driven impressive gains in LLM reasoning—but what exactly does RL improve? SPARKLE answers this question with a fine-grained evaluation framework that dissects reasoning into plan-following, problem decomposition, and knowledge use. The results are…

Salesforce AI Research@SFResearch · Jun 18

🧠 New research reveals why Reinforcement Learning makes language models better at reasoning (spoiler: it's not what we thought!) 📝 Paper: arxiv.org/pdf/2506.04723 💻 Website: sparkle-reasoning.github.io We performed multi-stage curriculum-style RL training from base LLMs. Key…

Gabe Orlanski Retweeted

Albert Ge@albert_ge_95 · May 8

Online data mixing reduces training costs for foundation models, but faces challenges: ⚠️ Human-defined domains miss semantic nuances ⚠️ Limited eval accessibility ⚠️ Poor scalability Introducing 🎵R&B: first regroup data, then dynamically reweight domains during training!

Gabe Orlanski Retweeted

Stefania Druga@Stefania_druga · Apr 25

I will be talking about the Future of Multimodal AI applications at this @iclr_conf workshop on Monday 28th April at 2 pm local time #ICLR25 dl4c.github.io/schedule/

Gabe Orlanski Retweeted

Fred Sala@fredsala · Apr 23

Today at #ICLR2025---come chat with @Changho_Shin_ about our work on what types of data drive weak-to-strong generalization!

Gabe Orlanski Retweeted

Nicholas Roberts@nick11roberts · Mar 21

📉📉NEW SCALING LAW PHENOMENON 📉📉 We find that knowledge and reasoning exhibit different scaling behaviors! Super excited to finally tell you all about our paper on the compute optimal scaling of skills: arxiv.org/pdf/2503.10061 [1/n]

Gabe Orlanski Retweeted

Fred Sala@fredsala · Feb 26

Tired of evaluating frontier models on contrived math olympiad problems? We have a cure! Try your models on our new benchmark for theoretical physics TPBench.

Gabe Orlanski Retweeted

Michele Catasta@pirroh · Feb 25

Announcing Replit Agent v2, available in Early Access today! More highlights & how to get started in 🧵

Gabe Orlanski Retweeted

Tzu-Heng Huang@zihengh1 · Feb 9

Tons of model weights available, but what else can we do besides prediction? 🤔 📣 Introducing Grad-Mimic! A new data selection framework using well-trained model’s weights to find high-value samples for foundation models. Boost data curation & data efficiency!

Gabe Orlanski@GOrlanski · Feb 9

If you are familiar with everything covered in this presentation, I got a job for you. Join me and the AI team at @Replit to build the most impactful project of your career — expect intensity, intellectual challenge and outsized glory once the mission is accomplished!

Vaibhav Kumar@vaibhavk97 · Feb 8

Curious about how Replit Agent works? At @Replit we hosted tech talks discussing agent's internals. Let's dive into what goes into an LLM that powers the agent - slides from my talk. 🧵
