Arthur Zucker
@art_zucker
Head of transformers @huggingface 🤗
A quick update on the future of the `transformers` library! In order to provide a source of truth for all models, we are working with the rest of the ecosystem to make the modeling code the standard. A joint effort with vLLM, LlamaCPP, SGLang, Mlx, Qwen, Glm, Unsloth, Axoloth,…
The @huggingface Transformers ↔️ @vllm_project integration just leveled up: Vision-Language Models are now supported out of the box! If the model is integrated into Transformers, you can now run it directly with vLLM. github.com/vllm-project/v… Great work @RTurganbay 👏
Bye Qwen3-235B-A22B, hello Qwen3-235B-A22B-2507! After talking with the community and thinking it through, we decided to stop using hybrid thinking mode. Instead, we’ll train Instruct and Thinking models separately so we can get the best quality possible. Today, we’re releasing…
We’ve updated Qwen3 and made excellent progress. The non‑reasoning model now delivers significant improvements across a wide range of tasks and many of its capabilities already rival those of reasoning models. It’s truly remarkable, and we hope you enjoy it!
Bye Qwen3-235B-A22B, hello Qwen3-235B-A22B-2507! After talking with the community and thinking it through, we decided to stop using hybrid thinking mode. Instead, we’ll train Instruct and Thinking models separately so we can get the best quality possible. Today, we’re releasing…
Alibaba Qwen has just released a non-thinking model even more powerful than Kimi K2... And even better than Claude Opus 4 🤯 → 100% open source → Only 22B active parameters → Available for free in Qwen Chat All the links below
🤗🤗🤗 🤗❤️🤗 @huggingface & Cline = your LLM playground 🤗🤗🤗 You can access Kimi K2 & 6,140 (!) other open source models in Cline.
Here's how you can use the @huggingface provider in Cline 🤗 (thread)
Pydantic AI now supports @huggingface as a provider! You can use it to run open source models like DeepSeek R1 on scalable serverless infrastructure. They have a free tier allowance so you can test it out. Thanks to the Hugging Face team (@hanouticelina ) for this great…
We already have a solution for kernel install issues, in transformers you can hotswap with this: huggingface.co/kernels-commun…, its a single install, very light (200mb?) because matches only your hardware, and... will support metal 😄
The current state of the ecosystem for post-training using GRPO w/ vllm + flash attention is frustratingly brittle. - The most recent vllm only supports PyTorch==2.7.0 - vllm requires xformers, but specifically only v0.0.30 is supported for torch 2.7.0. Any prior version of…
We open-sourced 99% of US caselaw on @huggingface. Both AI and legal tech companies are selling this data for a high premium. You can simply just build a wrapper around it and freely compete with them now. That is why we love open-source. huggingface.co/datasets/commo…
Why are legal services such a lucrative opportunity for AI? ・ ⚖️ Legal work is programming with words, a natural fit for AI. ・ 🚀 The market pull is unprecedented, with sales cycles cut from months to weeks. ・ 💻 AI engineers have an unfair advantage, able to build the…
1T parameters, open-weights, just released on @huggingface!
SmolLM3-TTS 🧑🍳
Introducing SmolLM3: a strong, smol reasoner! > SoTA 3B model > dual mode reasoning (think/no_think) > long context, up to 128k > multilingual: en, fr, es, de, it, pt > fully open source (data, code, recipes) huggingface.co/blog/smollm3
Thrilled to finally share what we've been working on for months at @huggingface 🤝@pollenrobotics Our first robot: Reachy Mini A dream come true: cute and low priced, hackable yet easy to use, powered by open-source and the infinite community. Tiny price, small size, huge…
We’re releasing the top 3B model out there SOTA performances It has dual mode reasoning (with or without think) Extended long context up to 128k And it’s multilingual with strong support for en, fr, es, de, it, pt What do you need more? Oh yes we’re also open-sourcing all…
Not sure if it's my env: `uv pip install flash_attn=2.7.4.post1`
