Max Zuo
@max_zuo
Ph.D. Student @BrownCSDept | Generalization via Limited Supervision & RL | Advisors: @mlittmancs & @stevebach. Previously @gtcomputing & @google.
Ever wonder if LLMs use tools🛠️ the way we ask them? We explore LLMs using classical planners: are they writing *correct* PDDL (planning) problems? Say hi👋 to Planetarium🪐, a benchmark of 132k natural language & PDDL problems. 📜 Preprint: arxiv.org/abs/2407.03321 🧵1/n

We're happy to announce that effective as of July 1, 2025, faculty members @stevebach and @drsrinathsridha have received named chairs. Steve is now the Eliot Horowitz Assistant Professor in CS and Srinath is the John E. Savage Assistant Professor in CS: cs.brown.edu/news/2025/06/0…
Excited to present our paper arxiv.org/abs/2407.03321 at #NAACL this Friday, May 2, at 10am in Ballroom A! If you're interested in LLMs and Planning, I hope you'll join us to hear about our work!
I will be at #ICLR2025 in a few days to present this work with @surajk610! Feel free to DM me if you want to chat about mechinterp, cognitive science, or anything else!
How robust are in-context algorithms? In new work with @michael_lepori, @jack_merullo, and @brown_nlp, we explore why in-context learning disappears over training and fails on rare and unseen tokens. We also introduce a training intervention that fixes these failures.
I started a blog! First post is everything I know about setting up (fast, reproducible, error-proof) Python project environments using the latest tools. These methods have saved me a lot of grief. Also a short guide to CUDA in appendix :) blog.apoorvkh.com/posts/project-…
If we guide the activation in the ‘right’ part of the subspace, we can improve performance pretty dramatically, although we don’t completely fix the problem.
Using the composition score, we find two highly communicating heads, then use text corpus data to find highly activating contexts. In this case, we look at a component in head 8.3 that composes very strongly with a mover head.
Round-up 🧵 of our papers at #EMNLP2024: Reach out to the lead authors if you'd like to chat! #1 If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions arxiv.org/abs/2403.16442 Tue Nov. 12 11am Jasmine
🤔How do multilingual LLMs encode structural similarities across languages? 🌟We find that LLMs use identical circuits when languages share the same morphosyntactic processes. However, they recruit specialized components to handle tasks that involve language-specific linguistic features⤵️
Not only can't LLMs plan, they can't even generate specifications of a problem (in PDDL) that a standard planner could solve.