Elyas Obbad
@ObbadElyas
AI Research @StanfordAILab @Matsprogram | previously @DoorDash @Google @Meta @Columbia
🚨 What’s the best way to select data for fine-tuning LLMs effectively? 📢Introducing ZIP-FIT—a compression-based data selection framework that outperforms leading baselines, achieves up to 85% faster convergence in cross-entropy loss, and selects data up to 65% faster. 🧵1/8

Come to Convention Center West, room 208-209 (2nd floor), to learn about optimal data selection using compression like gzip! tl;dr: you can learn much faster if you use gzip compression distances to select data given a task! DM if you are interested or want to use the code!
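If you want the gist before the talk: the core idea can be sketched in a few lines. This is a minimal illustrative sketch, not the released ZIP-FIT implementation (all function names here are ours): score candidate training examples by gzip compression distance to the task examples and keep the closest.

```python
import gzip

def compressed_len(s: str) -> int:
    # Length of the gzip-compressed bytes of s.
    return len(gzip.compress(s.encode("utf-8")))

def ncd(x: str, y: str) -> float:
    # Normalized Compression Distance: lower means more similar.
    cx, cy, cxy = compressed_len(x), compressed_len(y), compressed_len(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)

def select_data(candidates: list[str], task_examples: list[str], k: int) -> list[str]:
    # Rank candidates by average compression distance to the task
    # examples and keep the k most task-aligned ones for fine-tuning.
    def score(c: str) -> float:
        return sum(ncd(c, t) for t in task_examples) / len(task_examples)
    return sorted(candidates, key=score)[:k]
```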
Check out our work on verifiable code generation! This is an extremely promising research direction and I’m excited to see it gain traction
🔄 We were nominated for Oral + Top-1 in the MATH-AI workshop at #ICML! 🚨Why? ≈46% of GitHub commits are AI-generated—but can we verify they are correct? 📢 VeriBench challenges agents to turn Python into Lean code! 🧵1/14 📃 Paper: openreview.net/forum?id=rWkGF…
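To make the task concrete, here is a hypothetical toy instance of the Python-to-Lean format (not an actual benchmark item; names are ours): translate a small Python function into Lean 4 and discharge a simple correctness property with the built-in `omega` tactic.

```lean
-- Python source to translate:
--   def add_one(x: int) -> int:
--       return x + 1

def addOne (x : Int) : Int := x + 1

-- A simple verification condition on the translation.
theorem addOne_gt (x : Int) : addOne x > x := by
  unfold addOne
  omega
```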
Thank you to the wonderful team! @aryanguls @sanmikoyejo @stai_research @kaifronsdal @ObbadElyas Emily, Bruno, @3ricme (we will post their handles soon!) 🧵17/14
🚨 Can your LLM really do math—or is it cramming the test set? 📢 Meet Putnam-AXIOM, an advanced-mathematics, contamination-resilient benchmark that finally hurts FMs. 1. openreview.net/forum?id=kqj2C… 2. icml.cc/virtual/2025/p… #ICML2025 East Exhibition Hall A-B, #E-2502 🧵1/14
We use Trace in our new VeriBench benchmark for code verification in Lean 4 -- stay tuned for more details soon! OpenReview: openreview.net/forum?id=rWkGF… (should be public soon). ICML MATH-AI workshop -- come Friday to chat with me about this and about AI for correct, verified code! And AI…
#Trace and #GenerativeOptimization enable training a new kind of agent and model architecture. Come chat with us and learn how #Trace works behind the scenes, along with its theory. We will present its poster at #NeurIPS2024 on Friday (4:30pm-7:30pm, E Exhibit Hall A-C #2709).…
I'm at ICML too! Code and math -- especially AI for formal methods and Lean!
going to icml next week! let me know if you want to chat about math/code :)
New position paper! Machine Learning Conferences Should Establish a “Refutations and Critiques” Track Joint w/ @sanmikoyejo @JoshuaK92829 @yegordb @bremen79 @koustuvsinha @in4dmatics @JesseDodge @suchenzang @BrandoHablando @MGerstgrasser @is_h_a @ObbadElyas 1/6
Third #ICML2025 paper! What effect will web-scale synthetic data have on future deep generative models? Collapse or Thrive? Perils and Promises of Synthetic Data in a Self-Generating World 🔄 @JoshuaK92829 @ApratimDey2 @MGerstgrasser @rm_rafailov @sanmikoyejo 1/7
So about a month ago, Percy posted a version of this plot of our Marin 32B pretraining run. We got a lot of feedback, both public and private, that the spikes were bad. (This is a thread about how we fixed the spikes. Bear with me.)
Marin 32B training crossed 1.5 trillion tokens today...
Another #ICML2025 paper! Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? TLDR: Predicting language model performance with scale on multiple choice question-answer (MCQA) benchmarks is made difficult b/c ... 1/3
🚨New preprint 🚨 Turning Down the Heat: A Critical Analysis of Min-p Sampling in Language Models We examine min-p sampling (ICLR 2025 oral) & find significant problems in all 4 lines of evidence: human eval, NLP evals, LLM-as-judge evals, community adoption claims 1/8
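For context on the method under scrutiny, here is a minimal sketch of min-p sampling as the original paper describes it (the function name and numpy framing are ours): a dynamic truncation threshold scaled by the top token's probability.

```python
import numpy as np

def min_p_sample(probs: np.ndarray, min_p: float, rng: np.random.Generator) -> int:
    # Keep only tokens whose probability is at least min_p times
    # the most likely token's probability, renormalize, and sample.
    threshold = min_p * probs.max()
    kept = np.where(probs >= threshold, probs, 0.0)
    kept /= kept.sum()
    return int(rng.choice(probs.size, p=kept))

# Example: a peaked distribution keeps few tokens, a flat one keeps many.
rng = np.random.default_rng(0)
probs = np.array([0.5, 0.3, 0.15, 0.04, 0.01])
token = min_p_sample(probs, min_p=0.1, rng=rng)  # threshold = 0.05 drops last two
```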
A bit late to the party, but our paper on predictable inference-time / test-time scaling was accepted to #icml2025 🎉🎉🎉 TLDR: Best-of-N was shown to exhibit power (polynomial) law scaling (left), but the math suggests one should expect exponential scaling (center). We show how to…
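The one-line version of the standard argument, assuming i.i.d. samples and a perfect verifier:

```latex
% If each of the N i.i.d. samples solves the task with probability p,
% Best-of-N fails only when every sample fails:
\[
  \Pr[\text{fail}] = (1 - p)^{N} = e^{-cN},
  \qquad c = -\ln(1 - p) > 0,
\]
% which decays exponentially in N -- not polynomially like an
% empirical power law \(N^{-b}\) would suggest.
```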
1. We often observe power laws between loss and compute: loss = a * flops ^ b + c 2. Models are rapidly becoming more efficient, i.e. use less compute to reach the same loss But: which innovations actually change the exponent in the power law (b) vs change only the constant (a)?
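A small self-contained sketch of what fitting that functional form looks like (synthetic data and illustrative parameter values, not results from any paper):

```python
import numpy as np
from scipy.optimize import curve_fit

def power_law(flops, a, b, c):
    # loss = a * flops^b + c, with b < 0 so loss falls with compute.
    return a * np.power(flops, b) + c

# Synthetic measurements drawn from a known power law plus noise.
rng = np.random.default_rng(0)
flops = np.logspace(18, 22, num=10)
loss = power_law(flops, 75.0, -0.1, 2.0) + rng.normal(0.0, 0.01, flops.size)

(a, b, c), _ = curve_fit(power_law, flops, loss, p0=(50.0, -0.1, 2.0))
# An innovation that only shrinks `a` buys a constant-factor saving;
# one that steepens the exponent `b` compounds as compute grows.
print(f"a={a:.3g}, b={b:.3g}, c={c:.3g}")
```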
Marin is amazing! It allows for friendly and reproducible AI research and has enabled our team to rapidly test out different ideas. Great community as well (: cc @BrandoHablando @dlwh @RylanSchaeffer @percyliang @sanmikoyejo
What would truly open-source AI look like? Not just open weights, open code/data, but *open development*, where the entire research and development process is public *and* anyone can contribute. We built Marin, an open lab, to fulfill this vision:
One of our newest pre-training projects was built with Marin! Stay tuned for more soon! Thanks to @ObbadElyas & @dlwh for being so fun to work with -- to @percyliang for helping test Marin, to @sanmikoyejo for really good, kind advice, and to @RylanSchaeffer for his very efficient feedback ;)
Super excited Marin is finally out! Come see what we've been building! Code/platform for training fully reproducible models end-to-end, from data to evals. Plus a new high quality 8B base model, fully documented from start to finish.
🚀 Introducing VirtueAgent, the first security layer for the agentic era. As AI agents begin to act autonomously in real-world environments such as personal assistants, finance, and healthcare, ensuring they operate securely and in compliance is critical. VirtueAgent provides…
I've recently put together a "Fairness FAQ": tinyurl.com/fairness-faq. If you work in non-fairness ML and you've heard about fairness, perhaps you've wondered things like what the best definitions of fairness are, and whether we can train algorithms that optimize for it.
There’s growing excitement around VLMs and their potential to transform surgery🏥—but where exactly are we on the path to AI-assisted surgical procedures? In our latest work, we systematically evaluated leading VLMs across major surgical tasks where AI is gaining traction…🧵