Stanford AI Lab
@StanfordAILab
The Stanford Artificial Intelligence Laboratory (SAIL), a leading #AI lab since 1963. ⛵️🤖 Emmy-winning video: https://www.youtube.com/watch?v=Cn6nmWlu1EA
Fascinating new paper on AI companionship w/data donation from Character.ai by @Diyi_Yang and colleagues: arxiv.org/abs/2506.12605
@sanmikoyejo gives a nice talk contextualizing our paper contribution in the broader AI Measurement Sciences community in an @StanfordHAI seminar earlier this year: hai.stanford.edu/events/hai-sem… (starting at 30:35) 🧵8/9
Thanks @willccbb!! For those at ICML, I'm giving a talk on Cartridges at the ES-FoMo workshop on Saturday at 10:45 -- come through!! Excited to talk memory, test-time training, and continual learning!
can't stop thinking about this one. insanely elegant, seems insanely powerful
Flying to ICML tomorrow. Excited to present these works with incredible collaborators! RL has come a long way to deliver sizable impacts on problems across distributed systems, planning, cybersecurity, math, and game playing.
🚨 Can your LLM really do math—or is it cramming the test set? 📢 Meet Putnam-AXIOM, an advanced-mathematics, contamination-resilient benchmark that finally hurts FMs. 1. openreview.net/forum?id=kqj2C… 2. icml.cc/virtual/2025/p… #ICML2025 East Exhibition Hall A-B, #E-2502 🧵1/14
We’re presenting Minions at ICML starting now until 1:30pm at E-2907 — come by and chat!!
How can we use small LLMs to shift more AI workloads onto our laptops and phones? In our paper and open-source code, we pair on-device LLMs (@ollama) with frontier LLMs in the cloud (@openai, @together), to solve token-intensive workloads on your 💻 at 17.5% of the cloud cost…
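The local/cloud pairing idea above can be sketched in a few lines. This is a toy illustration with stub functions standing in for the models (the function names and filtering logic are my own assumptions, not the paper's actual protocol): a small on-device model reads the long document and keeps only relevant snippets, so the expensive cloud model sees far fewer tokens.

```python
# Hypothetical sketch: an on-device model filters a long document so that
# only a small fraction of tokens is sent to the cloud model.

def local_filter(document: str, keywords: list[str]) -> str:
    """Stand-in for an on-device LLM: keep only lines mentioning a keyword."""
    kept = [line for line in document.splitlines()
            if any(k in line.lower() for k in keywords)]
    return "\n".join(kept)

def cloud_answer(snippets: str, question: str) -> str:
    """Stand-in for a frontier cloud LLM: answers from the short context."""
    return f"Answer to {question!r} from {len(snippets.split())} filtered tokens."

def token_savings(document: str, snippets: str) -> float:
    """Fraction of tokens that never leave the device."""
    full, sent = len(document.split()), len(snippets.split())
    return 1 - sent / full

doc = "\n".join(["irrelevant filler line"] * 95 + ["revenue grew 12% in Q3"] * 5)
snips = local_filter(doc, ["revenue"])
ans = cloud_answer(snips, "How did revenue change?")
print(token_savings(doc, snips))  # most tokens stay on-device
```

The point of the design: the cheap model does the token-heavy reading, and the frontier model only pays for the distilled context.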
heading to @icmlconf #ICML2025 next week! come say hi & i'd love to learn about your work :) i'll present this paper (arxiv.org/abs/2503.17514) on the pitfalls of training set inclusion in LLMs, Thursday 11am here are my talk slides to flip through: ai.stanford.edu/~kzliu/files/m…
An LLM generates an article verbatim—did it “train on” the article? It’s complicated: under n-gram definitions of train-set inclusion, LLMs can complete “unseen” texts—both after data deletion and adding “gibberish” data. Our results impact unlearning, MIAs & data transparency🧵
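The n-gram subtlety above has a simple toy formalization (my own illustration, not the paper's exact definition): if "in the training set" means "every n-gram of the text appears in the corpus," then a text can count as included even though it was never a training document, because its n-grams are covered by other documents.

```python
# Toy n-gram notion of train-set inclusion: a text is "in" the corpus if
# all of its n-grams appear somewhere in the corpus.

def ngrams(tokens, n):
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def included(text: str, corpus_docs: list[str], n: int = 2) -> bool:
    corpus_ngrams = set()
    for doc in corpus_docs:
        corpus_ngrams |= ngrams(doc.split(), n)
    return ngrams(text.split(), n) <= corpus_ngrams

corpus = ["the cat sat on", "sat on the mat"]
target = "the cat sat on the mat"   # never a corpus document...
print(included(target, corpus))     # ...yet "included" under the 2-gram test
```

This is why deletion is tricky under such definitions: removing one document need not remove its n-grams, since other documents can still cover them.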
Just uploaded our code for multi-agent scientific research idea generation (accepted to #SIGDIAL2025): github.com/g6000/MultiAge… This is an extended version of @stanfordnlp's implementation. Thanks to @ChengleiSi and @tatsu_hashimoto for providing the original code😀
Today we launch Asimov. Asimov is our code research agent that is best-in-class in codebase comprehension. It is built for teams, built for enterprises, and built to remember. We use it everyday to accelerate our velocity and streamline distributed ops. Link below to sign up…
Come chat with me at our ICML poster about interpretability as a communication problem, and the need to derive new words for referencing language model concepts! 4:30PM-7, East Exhibition Hall A-B #E-500 We Can’t Understand AI Using our Existing Vocabulary
Understanding and control are two sides of the problem of communicating differing concepts between humans and machines. New position paper: Robert Geirhos, @_beenkim, and I argue we must develop neologisms - new words - for human and machine concepts to understand and control AI
I am presenting our position paper: "Societal Impacts Research Requires Benchmarks for Creative Composition Tasks" at #ICML2025 today at 11 am in #E-500! Come by and say hi! This paper won the Best Societal Impacts Paper Award at the Bi-align workshop 🎉🥳
If you're at ICLR, come check out the @bi_align workshop tomorrow! I'll be giving an oral presentation at 14:50 on our recent position paper: "Societal Impacts Research Requires Benchmarks for Creative Composition Tasks." arxiv.org/abs/2504.06549
At #ICML2025 in Vancouver 🇨🇦 this week, presenting some work from my first year at Stanford! Come find me at posters or just around the conference! Thursday: KernelBench: Can LLMs Write Efficient GPU Kernels? 11AM East E-2010 Saturday: Kevin: Multi-Turn RL for Generating…
Looking forward to attending ICML! Here are some works on memory/long context, verification, kernel design, multi-model AI systems, and theoretical understanding of test-time scaling from my awesome students and collaborators!
Interested in LLM evaluation reliability & efficiency? Check our ICML’25 paper Reliable and Efficient Amortized Model-based Evaluation arxiv.org/abs/2503.13335 w/ @percyliang @uiuc_aisecure @sanmikoyejo @yuhengtu @VirtueAI_co @StanfordAILab @stai_research @StanfordCRFM 🧵1/9
Recipient of an ICML 2025 Outstanding Paper Award, CollabLLM improves how LLMs collaborate with users, including knowing when to ask questions and how to adapt tone and communication style to different situations. This approach helps move AI toward more user-centric and…
The #SIGIR2025 Best Paper just awarded to the WARP engine for fast late interaction! Congrats to Luca Scheerer🎉 WARP was his @ETH_en MS thesis, completed while visiting us at @StanfordNLP. Incidentally, it's the fifth Paper Award for a ColBERT paper since 2020!* Luca did an…
📢 If you’re at #SIGIR2025 this week, make sure to be at Luca Scheerer’s paper talk: “WARP: An Efficient Engine for Multi-Vector Retrieval” (Wednesday 11am) WARP makes PLAID, the famous ludicrously fast ColBERT engine, another 3x faster on CPUs. With the usual ColBERT quality!
Prompt caching lowers inference costs but can leak private information from timing differences. Our audits found 7 API providers with potential leakage of user data. Caching can even leak architecture info—OpenAI's embedding model is likely a decoder-only Transformer! 🧵1/9
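The side channel can be illustrated with a simulation (the latencies and cache behavior here are hypothetical stand-ins; a real audit measures live API response times). If a provider caches prompt prefixes across users, a request whose prefix was already sent by someone else skips prefill and returns faster, so latency alone confirms a guess about another user's prompt.

```python
# Simulated provider with a global prompt-prefix cache: cache hits skip
# prefill and are noticeably faster, leaking cross-user information.
import random

CACHE = set()

def serve(prompt: str) -> float:
    """Return simulated response latency in seconds."""
    prefix = prompt[:32]
    if prefix in CACHE:
        return 0.05 + random.uniform(0, 0.01)   # hit: prefill skipped
    CACHE.add(prefix)
    return 0.50 + random.uniform(0, 0.01)       # miss: full prefill

victim_prompt = "my secret medical question about X"
serve(victim_prompt)                  # victim's request warms the cache

# Attacker probes a guessed prompt: a fast response implies a cache hit,
# i.e. someone else already sent this prefix.
probe_latency = serve(victim_prompt)
print(probe_latency < 0.25)  # True: the guess is confirmed by timing
```

Per-user (rather than global) caching removes this particular leak, at the cost of fewer cache hits.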
ICML ✈️ this week. open to chat and learn mech interp from you. @aryaman2020 and i have cool ideas about steering, just come to our AxBench poster. new steering blog: zen-wu.social/steer/index.ht… 中文: zen-wu.social/steer/cn_index…
i forgot the whole point of saying you're at a conference is to advertise your poster please come check out AxBench by @ZhengxuanZenWu* me* et al. on Tuesday, 15 July at 11 AM - 1:30 PM
🏆Thrilled that #CollabLLM won the #ICML2025 Outstanding Paper Award! We propose a new approach to optimize human-AI collaboration, which is critical for agents. Congratulations to my fantastic co-authors; great job @ShirleyYXWu and Michel Galley driving the project!👏
Come check out our Spotlight Poster at #ICML2025 tomorrow at 4:30 PM W-813! Anders will be presenting our new work: Algorithms with Calibrated ML Predictions. Paper Link: arxiv.org/abs/2502.02861