Dongyu Gong
@Dongyu_Gong
NeuroAI enthusiast. PhD student @Yale ⬅️ @UniofOxford. BSc @Tsinghua_Uni.
Excited to share my new paper w/ Dejan Draschkow and @KiaNobre published in @NatureComms! We investigated how attention operates differently in human working memory (WM) and long-term memory (LTM). Link: nature.com/articles/s4146…

🤖🧠Paper out in Nature Communications! 🧠🤖 Bayesian models can learn rapidly. Neural networks can handle messy, naturalistic data. How can we combine these strengths? Our answer: Use meta-learning to distill Bayesian priors into a neural network! nature.com/articles/s4146… 1/n
NeurIPS acknowledges that the cultural generalization made by the keynote speaker today reinforces implicit biases by making generalisations about Chinese scholars. This is not what NeurIPS stands for. NeurIPS is dedicated to being a safe space for all of us. We want to address…
Using racial labels to describe misconduct is harmful and inappropriate. @NeurIPSConf must not condone speech that targets specific ethnic groups. We urge Rosalind Picard @MIT @medialab to retract and apologize for her statement. Btw, most Rosalinds I know are honest and morally…
Mitigating racial bias from LLMs is a lot easier than removing it from humans! Can’t believe this happened at the best AI conference @NeurIPSConf We have ethical reviews for authors, but missed it for invited speakers? 😡
I'm shocked to see racism happening in academia again, at the best AI conference @NeurIPSConf. Targeting specific ethnic groups to describe misconduct is inappropriate and unacceptable. @NeurIPSConf must take a stand. We call on Rosalind Picard @MIT @medialab to retract and…
I'll be at #NeurIPS2024 presenting this work! Would love to chat if you are interested!
Introducing our new work on mechanistic intepretability of LLM cognition🤖🧠: why do Transformer-based LLMs have limited working memory capacity, as measured by N-back tasks? (1/7) openreview.net/pdf?id=dXjQgm9…
𝗕𝗲𝘆𝗼𝗻𝗱 𝗻𝗲𝘁𝘄𝗼𝗿𝗸𝘀, 𝘁𝗼𝘄𝗮𝗿𝗱𝘀 𝗮𝗱𝗮𝗽𝘁𝗶𝘃𝗲 𝘀𝘆𝘀𝘁𝗲𝗺𝘀 Despite widespread utility across domains, basic network models face fundamental limitations when applied to complex biological systems, particularly in neuroscience arxiv.org/abs/2411.03621 thread