Mathurin Videau

@mathuvu_

Joined October 2024

66 Following

115 Followers

Mathurin Videau Retweeted

🚨 New AI Security paper alert: Winter Soldier 🥶🚨
In our last paper, we show:
- how to backdoor a LM _without_ training it on the backdoor behavior
- use that to detect if a black-box LM has been trained on your protected data
Yes, Indirect data poisoning is real and powerful!

Mathurin Videau Retweeted

Nikola Jovanović @ni_jovanovic · Jun 23

There's a lot of work now on LLM watermarking. But can we extend this to transformers trained for autoregressive image generation? Yes, but it's not straightforward 🧵(1/10)

Mathurin Videau Retweeted

Tanishq Abraham is at ICML @iScienceLuvr · Jun 18

From Bytes to Ideas: Language Modeling with Autoregressive U-Nets
"Byte Pair Encoding (BPE) and similar schemes split text once, build a static vocabulary, and leave the model stuck with that choice. We relax this rigidity by introducing an autoregressive U-Net that learns to…

Mathurin Videau Retweeted

elvis @omarsar0 · Jun 18

From Bytes to Ideas
Avoids using predefined vocabs and memory-heavy embedding tables. Instead, it uses Autoregressive U-Nets to embed information directly from raw bytes.
This is huge! Enables infinite vocab size and more.
More in my notes below:

Mathurin Videau Retweeted

Aran Komatsuzaki @arankomatsuzaki · Jun 18

From Bytes to Ideas: Language Modeling with Autoregressive U-Nets
Presents an autoregressive U-Net that processes raw bytes and learns hierarchical token representation
Matches strong BPE baselines, with deeper hierarchies demonstrating promising scaling trends

Mathurin Videau Retweeted

Krunoslav Lehman Pavasovic @KrunoLehman · Apr 9

1/ Happy to share my first accepted paper as a PhD student at @Meta and @ENS_ULM which I will present at @iclr_conf: 📚 Our work proposes difFOCI, a novel rank-based objective for ✨better feature learning✨ In collab with David Lopez-Paz, @GiulioBiroli and @leventsagun!

Mathurin Videau Retweeted

TimDarcet @TimDarcet · Feb 14

Want strong SSL, but not the complexity of DINOv2? CAPI: Cluster and Predict Latent Patches for Improved Masked Image Modeling.

Mathurin Videau Retweeted

Simo Ryu @cloneofsimo · Oct 31

Goddamn, this repo is true beauty. Simple (not bloated), effective, scalable, elegant, just the right amount of abstraction.

Mathurin Videau Retweeted

Andrew Carr (e/🤸) @andrew_n_carr · Oct 30

A great example of FlexAttention used in a reasonably modern code base is Lingua, which is designed to reproduce Llama 2 7B overnight. They have a great example of batched / sequence-stacked attention masking for within-document attention, which is then used in the mod function…
