Loubna Ben Allal

@LoubnaBenAllal1

SmolLMs @huggingface

Paris, France

Joined September 2020

804 Following

8K Followers

Pinned

Loubna Ben Allal@LoubnaBenAllal1 · Jul 8

Introducing SmolLM3: a strong, smol reasoner!
> SoTA 3B model
> dual mode reasoning (think/no_think)
> long context, up to 128k
> multilingual: en, fr, es, de, it, pt
> fully open source (data, code, recipes)
huggingface.co/blog/smollm3

204

1.0K

524

265.0K

Loubna Ben Allal Retweeted

Anton Lozhkov@anton_lozhkov · 15 h

Hey fellow researchers and hackers, I’m looking at about a petabyte of raw code for The Next Big Dataset. What is on your wish list for code data features that you wanna see? 🎅

1.0K

Loubna Ben Allal Retweeted

Locally AI - Private AI Chat@LocallyAIApp · Jul 21

SmolLM 3 made by @huggingface is now available in the app 🤗 - SOTA 3B model - Dual mode reasoning - Multilingual (6 languages) SmolLM 3 outperforms other 3B models while staying competitive with larger 4B models. Running on-device on iPhone and iPad. Optimized for Apple MLX.

118

6.0K

Loubna Ben Allal Retweeted

Sebastian Raschka@rasbt · Jul 19

From GPT to MoE: I reviewed & compared the main LLMs of 2025 in terms of their architectural design from DeepSeek-V3 to Kimi 2. Multi-head Latent Attention, sliding window attention, new Post- & Pre-Norm placements, NoPE, shared-expert MoEs, and more... magazine.sebastianraschka.com/p/the-big-llm-…

359

2.0K

108.0K

Loubna Ben Allal Retweeted

Qwen@Alibaba_Qwen · Jul 21

Bye Qwen3-235B-A22B, hello Qwen3-235B-A22B-2507! After talking with the community and thinking it through, we decided to stop using hybrid thinking mode. Instead, we’ll train Instruct and Thinking models separately so we can get the best quality possible. Today, we’re releasing…

213

571

4.0K

827

847.0K

Loubna Ben Allal Retweeted

Xuan-Son Nguyen@ngxson · Jul 21

Reachy Mini and SmolLM3 are featured in GitHub's weekly news! 🚀 🚀

723

Loubna Ben Allal Retweeted

elie@eliebakouch · Jul 21

We've just released 100+ intermediate checkpoints and our training logs from SmolLM3-3B training. We hope this can be useful to the researchers working on mech interpret, training dynamics, RL and other topics :) Training logs: -> Usual training loss (the gap in the loss are due…

391

192

30.0K

Loubna Ben Allal Retweeted

clem 🤗@ClementDelangue · Jul 18

Really cool to see SmolLM3, the current state-of-the-art 3B model, land on @Azure AI. @Microsoft is a strong force in small efficient models as shown with the Phi family and others and we've been enjoying partnering with them closely on this and other topics. Thanks @satyanadella…

114

6.0K

Loubna Ben Allal Retweeted

Clémentine Fourrier 🍊@clefourrier · Jul 16

HF's got a couple papers at COLM 2025 covering all stages of open model life! 📚Data: FineWeb2 (led by @HKydlicek and @gui_penedo) 🧱Model creation: SmolLM2 (led by @LoubnaBenAllal1) and SmolVLM (led by @andimarafioti ) 🧪Evals: YourBench (led by @sumukx) Good job team! 🎉

2.0K

Loubna Ben Allal Retweeted

Mishig Davaadorj@mishig25 · Jul 14

It is cool to be capable. It is cool to know shit. That's why the HF team is open-sourcing not just the model, but the training code and datasets too. Learn. Build. Make it your own. github.com/huggingface/sm…

636

421

35.0K

Loubna Ben Allal Retweeted

Carlos Miguel Patiño@cmpatino_ · Jul 11

We're releasing SmolTalk2: the dataset we used to post-train SmolLM3-3B! Our model wouldn't be fully open-source without the dataset we used to train it, so we're including all our processed data with the details to replicate our post-training. huggingface.co/datasets/Huggi… (1/3)

124

7.0K

Loubna Ben Allal Retweeted

Jia Li@JiaLi52524397 · Jul 10

Happy to introduce Kimina-Prover-72B ! Reaching 92.2% on miniF2F using Test time RL. It can solve IMO problems using more than 500 lines of Lean 4 code ! Check our blog post here: huggingface.co/blog/AI-MO/kim… And play with our demo ! demo.projectnumina.ai

274

26.0K

Loubna Ben Allal@LoubnaBenAllal1 · Jul 10

Smol cake to celebrate SmolLM3 🤏

159

3.0K

Loubna Ben Allal Retweeted

Maxime Labonne@maximelabonne · Jul 10

Liquid AI open-sources a new generation of edge LLMs! 🥳 I'm so happy to contribute to the open-source community with this release on @huggingface! LFM2 is a new architecture that combines best-in-class inference speed and quality into 350M, 700M, and 1.2B models.

107

695

427

49.0K

Loubna Ben Allal@LoubnaBenAllal1 · Jul 10

v1.10.4 is out with support for SmolLM3! Android: play.google.com/store/apps/det… iOS: Coming soon. In the meantime, try it on TestFlight: testflight.apple.com/join/B3KE74MS

Loubna Ben Allal@LoubnaBenAllal1 · Jul 8

2.0K

Loubna Ben Allal@LoubnaBenAllal1 · Jul 9

One of many training anecdotes. Would people be interested in more of these behind-the-scenes stories?

Lewis Tunstall@_lewtun · Jul 9

Here’s a nice example which highlights the messiness of AI R&D and why I’m somewhat bearish on its automation in the near future. In this example @eliebakouch and I were a bit stumped on how to preserve the long context performance of the SmolLM3 base model after post-training.…

4.0K

Loubna Ben Allal Retweeted

Thomas Wolf@Thom_Wolf · Jul 9

when your multiple recent releases cross each other on the front page of reddit

11.0K