Ziyu Yao
@ZiyuYao
Asst Prof @GeorgeMasonU CS interested in #NLProc #AI. Alum @OhioState. Prev intern @LTIatCMU @MSFTResearch @FujitsuAmerica @Tsinghua_Uni.
Happy with the progress made with my wonderful students in 2024 (reasoning & planning, LLM interpretability, human-LLM interaction)! Looking forward to an exciting and fruitful 2025! (Welcome collaborations as always! PhD openings for Fall '25.)


Tutorial happening in a minute at West Exhibit Hall C! @DakingRai
Happy to announce that we (w/ my student @DakingRai) will present a tutorial on Mechanistic Interpretability for Language Models! Looking forward to meeting people @icmlconf. Stay tuned! ziyu-yao-nlp-lab.github.io/ICML25-MI-Tuto… @GeorgeMasonU @GMUCompSci
⏳ Deadline extended! The submission deadline for XLLM-Reason-Plan has been moved to June 27th. More time to submit your work; we look forward to your submissions! Details: …reasoning-planning-workshop.github.io
If you work in the space of LLM explainability, submit your completed/ongoing/recently accepted/under-review work to us! Your chance to win awards! @XllmReasonPlan
🚨 Deadline alert: If you work on LLM explainability for reasoning and planning, submit your work by June 23! - Non-archival, two formats (long/short) - Recently accepted papers and dual submissions welcome - Two awards will be announced! Details: …reasoning-planning-workshop.github.io
Had a great time at this CVPR community-building workshop: lots of fun discussions and some really important insights for early-career researchers. I also gave a talk on "Research as an Infinite Game." Here are the slides: canva.com/design/DAGp0iR…
In this #CVPR2025 edition of our community-building workshop series, we focus on supporting the growth of early-career researchers. Join us tomorrow (Jun 11) at 12:45 PM in Room 209. Schedule: sites.google.com/view/standoutc… We have an exciting lineup of invited talks and candid…
Check out our #CVPR25 paper! @aghzalm has done a series of work on LLM Planning, uniquely from an LLM + Robotics perspective: arxiv.org/pdf/2310.03249 arxiv.org/pdf/2502.12435 arxiv.org/pdf/2406.12000 Go talk to him if you are also working on this topic! @GMUCompSci @GeorgeMasonU
[1/6] LLMs/VLMs aren't reliable planners, but can they evaluate plans? Our #CVPR2025 paper tests this in path planning. We find that VLMs show weak low-level perception & hallucinated reasoning. Paper: arxiv.org/abs/2411.18711 Dataset: huggingface.co/datasets/maghz… Poster: Fri Jun 13, 4-6 PM @ ExHall D
We're excited to announce three more amazing speakers for the @XllmReasonPlan workshop at @COLM_conf! @zhuzining, @mark_riedl, @hhsun1. More info about the workshop: …reasoning-planning-workshop.github.io
📢 Announcing the First Workshop on the Application of LLM Explainability to Reasoning and Planning at @COLM_conf! We welcome perspectives from LLM, XAI, and HCI! CFP (due June 23): …reasoning-planning-workshop.github.io
🤩 Check out our amazing line-up of speakers at …reasoning-planning-workshop.github.io, covering topics including explainability for reasoning, agent safety, human-AI interaction, mechanistic interpretability, and MORE!!
📣 Call for Reviewers: We're looking for reviewers with expertise in LLM interpretability, reasoning, or planning for @XllmReasonPlan at @COLM_conf. If you're interested, sign up here: forms.office.com/r/Z7qXnEKZne Review period: June 24 to July 7. Load: 2-3 papers.
🥳 We are organizing a workshop at COLM to discuss the research gap in applying explainability/interpretability to enhance LLMs on challenging reasoning and planning tasks! Check out our tentative schedule at …reasoning-planning-workshop.github.io **Submit your excellent work to us!**
BlackboxNLP will be co-located with #EMNLP2025 in Suzhou this November! 🌷 This edition will feature a new shared task on circuit/causal-variable localization in LMs; details: blackboxnlp.github.io/2025/task If you're into mech interp and care about evaluation, please submit!
That's a wrap for #ICLR2025! See you all next year in Brazil! Please all welcome @BharathHarihar3 as the new Senior Program Chair! (With @cvondrick continuing on as General Chair.)
🚨 We're hiring a postdoc! Join us to push the frontier of AI and machine learning in genomics, aiming to uncover the genetic basis of complex human disorders. Please help RT! For more information, please visit: haohanwang.github.io/postdoc_hiring…
Proud to share exploration w/ my student @salokr_deep about "Do Large Reasoning Models Still Need Prompt Optimization?" arxiv.org/pdf/2504.07357 Key 💡: compared to LLMs, LRMs benefit more from prompt optimization and are also better prompt optimizers.
🚨 New Preprint 🚨 (1/n) Do SOTA #LRMs like #DeepSeekR1 and #o1 still need prompt optimization? We put them to the test on a structured task, Event Extraction, and did the first deep dive into prompt optimization. We found: yes, they do benefit from it. #NLProc #LLMs #LRMs A 🧵
After a year of development based on our OSWorld, Computer Use Agent Arena is LIVE! Test top AI agents (Operator, Claude 3.7, ...) on any kind of computer-use task with zero setup. Cloud-hosted, safe, and FREE! Try it now: arena.xlang.ai Data & code coming soon!
🎮 Computer Use Agent Arena is LIVE! 🔥 Easiest way to test computer-use agents in the wild without any setup. Compare top VLMs: OpenAI Operator, Claude 3.7, Gemini 2.5 Pro, Qwen 2.5 VL, and more. 🕹️ Test agents on 100+ real apps & websites with one-click config. Safe & free…
We just released Version 2 of our survey on Mechanistic Interpretability, now significantly extended! arxiv.org/pdf/2407.02646 Major updates 🧵: x.com/DakingRai/stat… We received positive feedback on V1 and are excited to see it become a helpful guide.
[1/6] Mechanistic Interpretability (MI) is an emerging sub-field of interpretability that aims to understand LMs by reverse-engineering their underlying computations. Here we present a comprehensive survey curated specifically as a guide for newcomers to this…
📢 #ACL2025NLP This year we received 8,276 submissions, the highest number in the history of ACL conferences! If you are not yet involved as a reviewer, AC, or SAC, we encourage you to volunteer as an (emergency) AC or reviewer: forms.gle/u5C2Daq1Mz9kXw…
I interviewed for LLM/ML research scientist/engineer positions last fall. Over 200 applications, 100 interviews, many rejections & some offers later, I decided to write the process down, along with the resources I used. Links to the process & resources are in the following tweets.