Tristan Thrush
@TristanThrush
PhD-ing @StanfordAILab @stanfordnlp. Interested in data, multimodality, scaling, and many more things.
Do you want to select great LLM pretraining data but don’t have 1000 H100s for a ton of mixture experiments? What about a method that requires none of your own training, matches the best known existing method, and has some nice theory? New preprint: Perplexity Correlations

Are AI scientists already better than human researchers? We recruited 43 PhD students to spend 3 months executing research ideas proposed by an LLM agent vs. human experts. Main finding: LLM ideas result in worse projects than human ideas.
1/ Model architectures have been mostly treated as fixed post-training. 🌱 Introducing Grafting: A new way to edit pretrained diffusion transformers, allowing us to customize architectural designs on a small compute budget. 🌎 grafting.stanford.edu Co-led with @MichaelPoli6
Can you rotate a die 🎲 in your head? Mental imagery plays a key role in perspective reasoning for humans - but can it help VLMs reason spatially? We show that Abstract Perspective Change significantly improves VLM reasoning from unseen views. Check out our preprint for more:
❗️Vision-Language Models (VLMs) struggle with even basic perspective changes! ✏️ In our new preprint, we aim to extend the spatial reasoning capabilities of VLMs to ⭐️arbitrary⭐️ perspectives. 📄Paper: arxiv.org/abs/2504.17207 🔗Project: apc-vlm.github.io 🧵[1/N]
🚀 In T-minus 1 week, I’ll be at ICLR presenting MrT5! The final version has tons of updates:
- New controller algorithm for targeted compression rates
- More baselines and downstream tasks
- Scaled-up experiments to 1.23B-parameter models
And now, MrT5 is on 🤗HuggingFace! 🧵
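For intuition on what a "controller algorithm for targeted compression rates" could look like, here is a hedged, simplified sketch — not MrT5's actual algorithm — of a proportional controller that nudges the weight on a deletion regularizer until the observed fraction of deleted tokens tracks a target rate. The function name, gain, and update rule are my own illustrative choices.

```python
# Toy sketch (not MrT5's code): steer the observed deletion rate toward a
# target compression rate by adjusting the deletion-regularizer weight.

def update_deletion_weight(alpha, observed_rate, target_rate, gain=0.5):
    """Proportional controller: raise the deletion penalty weight when the
    model deletes too few tokens, lower it when it deletes too many."""
    return max(0.0, alpha + gain * (target_rate - observed_rate))

alpha = 1.0
for observed in [0.10, 0.25, 0.40, 0.52]:  # hypothetical per-step deletion rates
    alpha = update_deletion_weight(alpha, observed, target_rate=0.5)
    print(round(alpha, 3))
```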
🔭 Science relies on shared artifacts collected for the common good. 🛰 So we asked: what's missing in open language modeling? 🪐 DataDecide 🌌 charts the cosmos of pretraining—across scales and corpora—at a resolution beyond any public suite of models that has come before.
Ever wonder how LLM developers choose their pretraining data? It’s not guesswork—all AI labs create small-scale models as experiments, but the models and their data are rarely shared. DataDecide opens up the process: 1,050 models, 30k checkpoints, 25 datasets & 10 benchmarks 🧵
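One way to read those numbers: a suite like DataDecide lets you measure how reliably small-scale experiments predict which corpus wins at a larger scale. The sketch below shows that kind of pairwise decision-accuracy check; the corpus names and scores are made-up placeholders, not results from the suite.

```python
from itertools import combinations

# Illustrative placeholder scores, not DataDecide's numbers:
# benchmark scores of models trained on three candidate corpora.
small_scale = {"corpus_a": 0.41, "corpus_b": 0.44, "corpus_c": 0.39}
large_scale = {"corpus_a": 0.58, "corpus_b": 0.61, "corpus_c": 0.55}

def decision_accuracy(small: dict, large: dict) -> float:
    """Fraction of corpus pairs where the small-scale ranking agrees with
    the large-scale ranking."""
    pairs = list(combinations(small, 2))
    agree = sum((small[a] > small[b]) == (large[a] > large[b]) for a, b in pairs)
    return agree / len(pairs)

print(decision_accuracy(small_scale, large_scale))  # 1.0 for this toy example
```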
Today, we're releasing a new paper – One-Minute Video Generation with Test-Time Training. We add TTT layers to a pre-trained Transformer and fine-tune it to generate one-minute Tom and Jerry cartoons with strong temporal consistency. Every video below is produced directly by…
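For readers unfamiliar with TTT layers: the rough idea is that the layer's hidden state is itself a small model, updated by gradient descent on a self-supervised loss as the sequence is processed. The numpy sketch below is my own simplification of a linear TTT layer, not the paper's code; the projection names, zero initialization, and learning rate are assumptions.

```python
# Minimal sketch of a linear test-time-training layer (simplified, illustrative).
import numpy as np

def ttt_linear(x, Wq, Wk, Wv, lr=0.1):
    """x: (seq_len, d) token features. Wq/Wk/Wv: (d, d) projections (assumed).
    Returns per-token outputs read out from the continually updated state W."""
    d = x.shape[1]
    W = np.zeros((d, d))             # fast-weight hidden state
    outputs = []
    for t in range(x.shape[0]):
        k, v, q = Wk @ x[t], Wv @ x[t], Wq @ x[t]
        err = W @ k - v              # self-supervised reconstruction error
        W -= lr * np.outer(err, k)   # one gradient step on (1/2)||W k - v||^2
        outputs.append(W @ q)        # read out with the updated state
    return np.stack(outputs)

x = np.random.randn(16, 8)
projections = [np.random.randn(8, 8) * 0.1 for _ in range(3)]
print(ttt_linear(x, *projections).shape)  # (16, 8)
```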
An LLM generates an article verbatim—did it “train on” the article? It’s complicated: under n-gram definitions of train-set inclusion, LLMs can complete “unseen” texts—both after data deletion and after adding “gibberish” data. Our results impact unlearning, membership inference attacks (MIAs) & data transparency 🧵
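To make "n-gram definitions of train-set inclusion" concrete, here is a minimal membership-test sketch (not the paper's code): a text counts as "seen" if enough of its n-grams appear in the training corpus. Whitespace tokenization and the default n are my own assumptions.

```python
# Toy n-gram overlap check between a candidate text and a training corpus.

def ngrams(tokens, n):
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def ngram_overlap(text: str, corpus: str, n: int = 8) -> float:
    """Fraction of the text's n-grams that also occur in the corpus."""
    text_ngrams = ngrams(text.split(), n)
    corpus_ngrams = ngrams(corpus.split(), n)
    if not text_ngrams:
        return 0.0
    return len(text_ngrams & corpus_ngrams) / len(text_ngrams)

# Under this definition a text with low overlap is "unseen" — yet the tweet's
# point is that a model can still generate such a text verbatim.
print(ngram_overlap("the quick brown fox jumps over the lazy dog",
                    "a corpus that never contains that sentence", n=3))
```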
.@stanfordnlp will be in @Singapore with lots of @iclr_conf papers. @TristanThrush, @ChrisGPotts & @tatsu_hashimoto will show how to select good pretraining data: LLM losses on texts correlate with downstream benchmarks, so select high-correlation docs. arxiv.org/abs/2409.05816
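A toy sketch of the selection rule described above, under my own assumptions (Spearman rank correlation, loss and benchmark arrays gathered from existing open models); the paper's actual estimator differs in the details.

```python
# Rank candidate pretraining domains by how strongly each domain's per-model
# loss predicts a downstream benchmark score across many existing LLMs.
import numpy as np
from scipy.stats import spearmanr

def rank_domains(losses: np.ndarray, bench: np.ndarray) -> np.ndarray:
    """losses: (n_models, n_domains) log-loss of each open LLM on each domain.
    bench:  (n_models,) downstream benchmark score of each LLM.
    Returns domain indices sorted from most to least promising."""
    corrs = []
    for d in range(losses.shape[1]):
        # Lower loss should go with higher benchmark score, so negate the loss.
        rho, _ = spearmanr(-losses[:, d], bench)
        corrs.append(rho)
    return np.argsort(corrs)[::-1]  # high-correlation domains first

# Toy example: 5 existing models, 3 candidate domains.
losses = np.random.rand(5, 3)
bench = np.random.rand(5)
print(rank_domains(losses, bench))
```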