Sylvain Gugger
@GuggerSylvain
Machine Learning at Jane Street. Previously at @huggingface and @fastdotai. Co-author of http://github.com/fastai/fastbook. He/him
A year and a half after starting the first draft of the first chapter, look what arrived in the mail!

Very excited to collaborate with Mark on this!
On Sep 6 in NYC, this won't be your typical hackathon where you do your own thing in a corner and then present at the end of the day. You'll deploy real models to the market, trades will happen, chaos should be expected. The fastest model is great, but time to market matters more.
The new transformers release comes w/ a surprise: kernels support ⚡️ It integrates deeply with precompiled kernels on the HF Hub.
- opt-in, automatic kernels for your hardware and software
- kernels like FA2/3 w/o compilation
- community-built kernels, for inference & training
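For a flavor of what this enables, here's a minimal sketch using the standalone kernels library that powers the integration; the repo name kernels-community/activation and the gelu_fast entry point follow that library's README, so treat them as assumptions rather than the exact transformers-side API:

```python
import torch
from kernels import get_kernel

# Fetch a precompiled kernel from the HF Hub -- no local compilation needed.
# Repo name and function follow the kernels README (an assumption here).
activation = get_kernel("kernels-community/activation")

x = torch.randn((1024, 1024), dtype=torch.float16, device="cuda")
y = torch.empty_like(x)
activation.gelu_fast(y, x)  # runs the downloaded CUDA kernel (output first)
```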
🦆🚀QuACK🦆🚀: a new SOL mem-bound kernel library without a single line of CUDA C++, all straight in Python thanks to CuTe-DSL. On an H100 with 3TB/s, it runs 33%-50% faster than highly optimized libraries like PyTorch's torch.compile and Liger. 🤯 With @tedzadouri and @tri_dao
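For context, "SOL" means speed-of-light: a memory-bound kernel is judged by how close its achieved bandwidth gets to the hardware peak (~3 TB/s HBM on H100). A small sketch of measuring that metric with stock PyTorch (my illustration, not QuACK's API):

```python
import torch
import torch.nn.functional as F

# Speed-of-light metric for a mem-bound op: achieved bandwidth / peak bandwidth.
# RMSNorm is the kind of memory-bound op a library like QuACK targets; the
# stock PyTorch op is used here purely to show the measurement.
x = torch.randn(16384, 8192, device="cuda", dtype=torch.bfloat16)
F.rms_norm(x, (x.shape[-1],))  # warmup

start = torch.cuda.Event(enable_timing=True)
end = torch.cuda.Event(enable_timing=True)
start.record()
F.rms_norm(x, (x.shape[-1],))
end.record()
torch.cuda.synchronize()

seconds = start.elapsed_time(end) / 1e3  # elapsed_time is in milliseconds
bytes_moved = 2 * x.numel() * x.element_size()  # one read + one write of x
print(f"{bytes_moved / seconds / 1e12:.2f} TB/s achieved vs ~3 TB/s peak")
```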
Thrilled to finally share what we've been working on for months at @huggingface 🤝@pollenrobotics Our first robot: Reachy Mini A dream come true: cute and low priced, hackable yet easy to use, powered by open-source and the infinite community. Tiny price, small size, huge…
🚨🔥 CUTLASS 4.0 is released 🔥🚨 pip install nvidia-cutlass-dsl 4.0 marks a major shift for CUTLASS: towards native GPU programming in Python docs.nvidia.com/cutlass/media/…
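The hello-world looks roughly like this; I'm reconstructing the decorator and launch names from the CUTLASS 4.0 CuTe-DSL quick start, so treat the exact API as an assumption:

```python
import cutlass
import cutlass.cute as cute

# A GPU kernel written entirely in Python via the CuTe DSL (names assumed
# from the quick-start docs).
@cute.kernel
def kernel():
    tidx, _, _ = cute.arch.thread_idx()
    if tidx == 0:
        cute.printf("Hello world")

# Host-side entry point: JIT-compiles and launches the kernel.
@cute.jit
def hello_world():
    cutlass.cuda.initialize_cuda_context()
    kernel().launch(grid=[1, 1, 1], block=[32, 1, 1])

if __name__ == "__main__":
    hello_world()
```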
Speculative Decoding before: limited choices, the draft model must have the same tokenizer 😬 Speculative Decoding now: unlimited choices, ANY draft model can be used, with better speedup opportunities 😎 The folks at Intel have been cooking, and Speculative Decoding (with…
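In transformers this is exposed through generate(): when the draft model uses a different tokenizer, you pass both tokenizers so draft tokens can be re-encoded between vocabularies. A sketch following the documented universal assisted decoding API; the checkpoint names are illustrative placeholders:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Target and draft models with *different* tokenizers (example checkpoints).
checkpoint = "google/gemma-2-9b"
assistant_checkpoint = "double7/vicuna-68m"

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
assistant_tokenizer = AutoTokenizer.from_pretrained(assistant_checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint, device_map="auto")
assistant_model = AutoModelForCausalLM.from_pretrained(assistant_checkpoint, device_map="auto")

inputs = tokenizer("Alice and Bob", return_tensors="pt").to(model.device)

# Passing both tokenizers enables universal assisted decoding: draft tokens
# are translated into the target vocabulary before verification.
outputs = model.generate(
    **inputs,
    assistant_model=assistant_model,
    tokenizer=tokenizer,
    assistant_tokenizer=assistant_tokenizer,
)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
```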
(1/7) Inspired by DeepSeek's FlashMLA, we're releasing ThunderMLA—a fused megakernel optimized for variable-prompt decoding! ⚡️🐱ThunderMLA is up to 35% faster than FlashMLA and just 400 LoC. Blog: bit.ly/4kubAAK With @AaryanSinghal4, @realDanFu, and @hazyresearch!
Write a fast kernel and run it on Discord. See how you compare against the best! If you're familiar with Leetcode, Kaggle, or Codeforces, then this should feel right at home.
🚀 Excited to release *THE* Ultra-Scale Playbook - a comprehensive guide on training LLMs from 1 to 1000s of GPUs!
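The playbook starts from the simplest scaling technique, data parallelism: replicate the model on every GPU and all-reduce the gradients. A minimal sketch with vanilla PyTorch DDP (my example, not the playbook's code):

```python
# Launch with: torchrun --nproc_per_node=8 train.py
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

dist.init_process_group("nccl")
rank = dist.get_rank()
torch.cuda.set_device(rank % torch.cuda.device_count())

# Each rank holds a full replica; DDP all-reduces gradients during backward().
model = DDP(torch.nn.Linear(1024, 1024).cuda())
opt = torch.optim.AdamW(model.parameters(), lr=1e-4)

for step in range(10):
    x = torch.randn(8, 1024, device="cuda")  # each rank sees its own data shard
    loss = model(x).square().mean()
    loss.backward()
    opt.step()
    opt.zero_grad()

dist.destroy_process_group()
```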
This is huge, huge, huge - DeepSpeed is now a community-owned project, as it's now a part of the Linux Foundation. Committer access should be possible now. Thank you, @MSFTResearch, for breathing life into this scalability framework that is so important to the ML community, and now…
🚀 Excited to introduce DeepSpeed, a deep learning optimization library from @Microsoft! It simplifies distributed training and inference, making AI scaling more efficient and cost-effective. Learn more 👉 hubs.la/Q0351DJC0 #DeepSpeed #AI #OpenSource #LFAIData
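The core of DeepSpeed's API is deepspeed.initialize, which wraps your model and optimizer according to a JSON-style config. A minimal ZeRO-2 sketch, assuming a toy model; the config values are illustrative:

```python
# Launch with: deepspeed --num_gpus=1 train.py
import torch
import deepspeed

model = torch.nn.Linear(1024, 1024)

# Illustrative config: ZeRO stage 2 shards optimizer state and gradients
# across data-parallel ranks to cut memory.
ds_config = {
    "train_batch_size": 4,
    "optimizer": {"type": "AdamW", "params": {"lr": 1e-4}},
    "zero_optimization": {"stage": 2},
    "bf16": {"enabled": True},
}

model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)

x = torch.randn(4, 1024, device=model_engine.device, dtype=torch.bfloat16)
loss = model_engine(x).square().mean()
model_engine.backward(loss)  # DeepSpeed handles loss scaling and reduction
model_engine.step()
```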
TIL Jane Street has an eng podcast. The most recent episode is with @GuggerSylvain on training & ML infra.
They have a nice blog about it: signalsandthreads.com
We had an awesome talk at Jane Street from the amazing @cHHillee on scaling ML systems to a trillion trillion FLOPs, and I just realized the recording is now online: youtu.be/139UPjoq7Kw?si…
Jane Street tech talks have always been super awesome. So I'm quite excited to be visiting Jane Street on Monday to give a talk on building ML systems for a trillion trillion FLOPs :) I'll talk about a bunch of fun things, including cool GPU optimizations, how I think about…
This is such a fun talk from @ixyene! All about system jitter and how to hunt it down. Also, it features a cameo appearance from magic-trace.org, my favorite profiling tool that no one has heard of. youtu.be/I_TtMk5z0O0?si…
PyTorch 2.5 is here 🔥 We are excited to announce the release of #PyTorch 2.5, featuring a new CuDNN backend for SDPA, regional compilation of torch.compile, & TorchInductor CPP backend performance speedup Read more in our blog: hubs.la/Q02TRs9p0
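The new cuDNN backend can be selected explicitly through the SDPA kernel context manager; a small sketch, with shapes chosen arbitrarily:

```python
import torch
import torch.nn.functional as F
from torch.nn.attention import SDPBackend, sdpa_kernel

# (batch, heads, seq_len, head_dim) in a dtype the fused backends support
q, k, v = (
    torch.randn(2, 8, 1024, 64, device="cuda", dtype=torch.bfloat16)
    for _ in range(3)
)

# Restrict SDPA to the cuDNN backend added in PyTorch 2.5; the release
# highlights speedups from it on recent NVIDIA GPUs.
with sdpa_kernel(SDPBackend.CUDNN_ATTENTION):
    out = F.scaled_dot_product_attention(q, k, v)
print(out.shape)
```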