Richard Antonello

@NeuroRJ

Postdoc in the Mesgarani Lab at Columbia University. Studying how the brain processes language by using LLMs. (Formerly @HuthLab at UT Austin)

Joined May 2020

227 Following

361 Followers

Pinned

Richard Antonello@NeuroRJ · Jan 9

Our @CogCompNeuro GAC paper is out! We focus on two main questions: 1⃣ How should we use neuroscientific data in model development (raw experimental data vs. qualitative insights)? 2⃣ How should we collect experimental data for model development (model-free vs. model-based)?

Kohitij Kar@KohitijKar · Jan 9

Our CCN 2022 GAC write up is finally out! doi.org/10.51628/001c.… (Hoping it’s a good read for all the trainees and colleagues!) @GretaTuckute @dfinz @eshedmargalit @jcbyts @alonamarie @s_y_chung @ev_fedorenko @KriegeskorteLab @kalatwt

Richard Antonello Retweeted

Katie Kang@katie_kang_ · Nov 19

LLMs excel at fitting finetuning data, but are they learning to reason or just parroting🦜? We found a way to probe a model's learning process to reveal *how* each example is learned. This lets us predict model generalization using only training data, amongst other insights: 🧵

Richard Antonello Retweeted

Guy Gaziv@GGaziv · Jun 9

Can we precisely and noninvasively modulate deep brain activity just by riding the natural visual feed? 👁️🧠 In our new preprint, we use brain models to craft subtle image changes that steer deep neural populations in primate IT cortex. Just pixels. 📝arxiv.org/abs/2506.05633

Richard Antonello@NeuroRJ · Jun 5

We'll be presenting this at #ACL2025 ! Come find me and @tomjiralerspong in Vienna :)

Emily Cheng @ ACL 2025@sparse_emcheng · Oct 6

New paper! 🌟How does LM representational geometry encode compositional complexity? A: it depends on how we define compositionality! We distinguish compositionality of form vs. meaning, and show LMs encode form complexity linearly and meaning complexity nonlinearly... 1/9

Richard Antonello Retweeted

Yufan Zhuang@yufan_zhuang · May 22

🤯Your LLM just threw away 99.9 % of what it knows. Standard decoding samples one token at a time and discards the rest of the probability mass. Mixture of Inputs (MoI) rescues that lost information, feeding it back for more nuanced expressions. It is a brand new…

Richard Antonello@NeuroRJ · May 3

For those attending NAACL, today I'll be presenting recent work on how we can use language encoding models to identify functional specialization throughout cortex. Stop by my talk at 10:30 at the CMCL workshop!

Richard Antonello Retweeted

Karan Dalal@karansdalal · Apr 7

Today, we're releasing a new paper – One-Minute Video Generation with Test-Time Training. We add TTT layers to a pre-trained Transformer and fine-tune it to generate one-minute Tom and Jerry cartoons with strong temporal consistency. Every video below is produced directly by…

Richard Antonello Retweeted

Ruimin Gao@Ruimin_G · Mar 18

Excited to introduce funROI: A Python package for functional ROI analyses of fMRI data! funroi.readthedocs.io/en/latest/ #fMRI #Neuroimaging #Python #OpenScience Work w @neuranna 🧵👇

Richard Antonello Retweeted

Marianne Arriola@mariannearr · Mar 13

🚨Announcing our #ICLR2025 Oral! 🔥Diffusion LMs are on the rise for parallel text generation! But unlike autoregressive LMs, they struggle with quality, fixed-length constraints & lack of KV caching. 🚀Introducing Block Diffusion—combining autoregressive and diffusion models…

Richard Antonello Retweeted

Daniel Cohen-Or@DanielCohenOr1 · Mar 5

Vectorization into a neat SVG!🎨✨ Instead of generating a messy SVG (left), we produce a structured, compact representation (right) - enhancing usability for editing and modification. Accepted to #CVPR2025 !

Richard Antonello Retweeted

Jiayi Zhang@didiforx · Mar 1

Reasoning models lack atomic thought ⚛️ Unlike humans using independent units, they store full histories🤔 Introducing Atom of Thoughts (AOT): lifts gpt-4o-mini to 80.6% F1 on HotpotQA, surpassing o3-mini and DeepSeek-R1 ! The best part? It plugs into ANY framework 🔌 1/5

Richard Antonello@NeuroRJ · Feb 11

🎉Excited to share: My first ML conference paper, Population Transformer 🧠, is an Oral at #ICLR2025! This work has truly evolved since its first appearance as a workshop paper last year. So thankful to have worked with the best advisors + collaborators! 🤗 More soon!

Geeling Chau@GeelingC · Jul 25, 2024

How can we train models on more brains and sensor layouts? We present Population Transformer (PopT) which learns population-level interactions on intracranial electrodes, with 🔥decoding and interpretability benefits. See our poster at #ICML2024 @AI_for_Science 12pm

Richard Antonello Retweeted

Alex Murphy@Alxmrphi · Jan 5

EEG Decoding with Multi-Timescale Language Models. Our paper was recently published in Computational Linguistics. Tweetprint to come shortly 🤞doi.org/10.1162/coli_a…

Richard Antonello Retweeted

AI at Meta@AIatMeta · Jan 2

New research from Meta FAIR — Meta Memory Layers at Scale. This work takes memory layers beyond proof-of-concept, proving their utility at contemporary scale ➡️ go.fb.me/3lbt4m

Richard Antonello Retweeted

Tomer Ullman@TomerUllman · Dec 19

"The Illusion Illusion" vision language models recognize images of illusions... but they also say non-illusions are illusions too

Richard Antonello Retweeted

Geeling Chau@GeelingC · Dec 14

Catch me and @czlwang presenting our poster today (12/14) 3:30-5:30pm at the #NeurIPS2024 NeuroAI Workshop! 🧠 neuroai-workshop.github.io Paper: arxiv.org/abs/2406.03044

Richard Antonello Retweeted

Ben Poole@poolio · Dec 13

How to upset the (few remaining) neuroscientists at NeurIPS 101

Richard Antonello Retweeted

Patrick Mineault@patrickmineault · Dec 12

New post! What do brain scores teach us about brains? Does accounting for variance in the brain mean that an ANN is brain-like? 1/

Richard Antonello Retweeted

UniReps@unireps · Dec 12

The UniReps Workshop is happening THIS SATURDAY at #NeurIPS! 🤖🧠 Join us for a day of insightful talks and engaging discussions with @s_y_chung, @phillip_isola, @NeelNanda5, @ermgrant, @leavittron, @ItsNeuronal, Marco Cuturi and Stefanie Jegelka! 🎤 Check out the full…

Richard Antonello@NeuroRJ · Dec 12

Come by our poster (#3801), exploring how we can use the question-answering abilities of LLMs to build more #interpretable models of language processing in the 🧠, starting in one hour at #NeurIPS !

Chandan Singh@csinva · May 28, 2024

LLM embeddings are opaque, hurting them in contexts where we really want to understand what’s in them (like neuroscience) Our new work asks whether we can craft *interpretable embeddings* just by asking yes/no questions to black-box LLMs. 🧵 arxiv.org/abs/2405.16714
