Qinan Yu
@qinan_yu
CS PhD @stanfordnlp · CS-Math Ugrad @Brown_NLP
Ever wonder why LLMs give inconsistent answers in different languages? In our paper, we identify two failure points in the multilingual factual recall process and propose fixes that guide LLMs to the "right path." This can boost performance by 35% in the weakest language!
How do multilingual LLMs encode structural similarities across languages? We find that LLMs use identical circuits when languages share the same morphosyntactic processes. However, they involve specialized components to handle tasks in languages with specific linguistic features.
Circuit analysis is a common tool in mechanistic interpretability for understanding how models execute certain tasks. But how well do these findings generalize throughout model training or to models of different sizes?
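For readers unfamiliar with the method, here is a minimal sketch of the basic circuit-analysis operation, activation patching, on GPT-2. The layer choice and prompts are placeholders for illustration, not taken from the paper.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tok = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()
LAYER = 5  # placeholder: which block's MLP output to patch

clean = tok("The Eiffel Tower is in the city of", return_tensors="pt")
corrupt = tok("The Colosseum is in the city of", return_tensors="pt")
paris = tok(" Paris")["input_ids"][0]

cache = {}
def save(module, inp, out):
    # cache the MLP output at the final position on the corrupted run
    cache["mlp"] = out[:, -1].detach()

def patch(module, inp, out):
    # overwrite the same position on the clean run with the cached activation
    out = out.clone()
    out[:, -1] = cache["mlp"]
    return out

mlp = model.transformer.h[LAYER].mlp
with torch.no_grad():
    handle = mlp.register_forward_hook(save)
    model(**corrupt)
    handle.remove()

    baseline = model(**clean).logits[0, -1, paris].item()

    handle = mlp.register_forward_hook(patch)
    patched = model(**clean).logits[0, -1, paris].item()
    handle.remove()

# how far patching this one component moves the model toward the corrupted answer
print(f"' Paris' logit: clean {baseline:.2f} -> patched {patched:.2f}")
```

Components whose patched activations move the output a lot are the candidates for the task's circuit; the generalization question above is whether the same components keep that role across training checkpoints and model sizes.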
Excited to share our #ICML2024 paper "Grokking Group Multiplication with Cosets" with @BlancheMinerva, @qinan_yu and @Void13950782! We reverse engineered neural networks that perfectly learned to multiply elements of the symmetric groups S5 & S6. 🧵 on our key findings below
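For context, a small sketch of the underlying task, not the paper's reverse-engineered algorithm: multiplying permutations in S5, where parity gives the simplest coset structure, the two cosets of the alternating group A5.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

def parity(p):
    """0 for even permutations (the A5 coset), 1 for odd (the other coset)."""
    inversions = sum(1 for i in range(len(p))
                       for j in range(i + 1, len(p)) if p[i] > p[j])
    return inversions % 2

S5 = list(permutations(range(5)))   # all 120 elements of S5
a, b = S5[7], S5[42]                # two arbitrary elements
print(compose(a, b))                # their product in S5
# parity respects multiplication, so coset membership of a product is predictable
print(parity(compose(a, b)) == (parity(a) + parity(b)) % 2)  # True
```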
Accepted at EMNLP: LLMs often have to integrate information in context with facts learned during pretraining. Sometimes these facts disagree, so how do they handle this competition? We find that we can modulate single attention heads to control which version the model uses!
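As a rough illustration of that kind of intervention (the layer and head indices below are made-up placeholders, and this is not the paper's exact setup), one can scale a single attention head's output in GPT-2 with a forward pre-hook:

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

LAYER, HEAD, SCALE = 9, 6, 0.0   # placeholders: which head to modulate, and by how much

tok = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()
head_dim = model.config.n_embd // model.config.n_head

def scale_head(module, args):
    # c_proj's input is the concatenation of all head outputs: (batch, seq, n_embd).
    # Scaling one slice up- or down-weights that single head's contribution.
    hidden = args[0].clone()
    hidden[..., HEAD * head_dim:(HEAD + 1) * head_dim] *= SCALE
    return (hidden,) + args[1:]

handle = model.transformer.h[LAYER].attn.c_proj.register_forward_pre_hook(scale_head)
with torch.no_grad():
    out = model(**tok("The capital of France is", return_tensors="pt"))
handle.remove()
print(tok.decode([out.logits[0, -1].argmax().item()]))
```

With SCALE = 0.0 the head is ablated; values above 1.0 amplify it, which is the kind of knob that can tilt the model toward the in-context fact or the pretrained one.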
Are Chain-of-Thought reasoning chains good "explanations"? Not necessarily, since they aren't always faithful -- and we propose a 2-stage reasoning framework to solve this. See our paper "Faithful Chain-of-Thought Reasoning" at the #NLRSE workshop (1:30pm Thur) at #ACL2023!
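The gist of the two-stage idea, in a toy sketch (the hard-coded "translation" stands in for the LLM call, so everything here is illustrative rather than the paper's code): the model first writes its reasoning chain as an executable program, and a deterministic interpreter then derives the answer from that chain, so the stated reasoning is exactly what produced the answer.

```python
def translate_to_program(question: str) -> str:
    """Stage 1 (hypothetical): prompt an LLM to emit executable reasoning steps."""
    # In practice this is an LLM call; hard-coded here for illustration.
    return (
        "# Olivia has 23 dollars and buys 5 bagels at 3 dollars each.\n"
        "money = 23\n"
        "spent = 5 * 3\n"
        "answer = money - spent\n"
    )

def execute_chain(program: str):
    """Stage 2: run the chain with a deterministic solver (here, Python's exec)."""
    scope = {}
    exec(program, scope)
    return scope["answer"]

print(execute_chain(translate_to_program("How much money does Olivia have left?")))  # 8
```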