Mathis Pink
@MathisPink
👀🧠(x) | x ∈ {👀🧠,🤖} PhD student @mpi_sws_ trying to trick rocks into thinking and remembering.
1/n🤖🧠 New paper alert!📢 In "Assessing Episodic Memory in LLMs with Sequence Order Recall Tasks" (arxiv.org/abs/2410.08133) we introduce SORT as the first method to evaluate episodic memory in large language models. Read on to find out what we discovered!🧵
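To make the task concrete, here is a minimal sketch of what a SORT-style item could look like, assuming the framing is: take two excerpts from a text the model has been exposed to and ask which appeared earlier. The prompt wording, segment length, scoring rule, and helper names below are illustrative assumptions, not the paper's exact protocol.

```python
import random

def make_sort_item(document: str, segment_len: int = 200, seed: int = 0):
    """Build one SORT-style item: two excerpts from `document` plus the
    ground-truth order. Segment length and sampling are illustrative choices."""
    rng = random.Random(seed)
    segments = [document[i:i + segment_len]
                for i in range(0, len(document) - segment_len, segment_len)]
    i, j = sorted(rng.sample(range(len(segments)), 2))  # segment i precedes segment j
    first, second = segments[i], segments[j]
    # Randomize presentation order so "A" is not always the earlier excerpt.
    if rng.random() < 0.5:
        return {"A": first, "B": second, "answer": "A"}
    return {"A": second, "B": first, "answer": "B"}

def format_prompt(item: dict) -> str:
    # Hypothetical prompt wording; the actual benchmark may phrase this differently.
    return (
        "You have read the text these excerpts come from.\n"
        f"Excerpt A: {item['A']}\n"
        f"Excerpt B: {item['B']}\n"
        "Which excerpt appeared earlier in the text? Answer with 'A' or 'B'."
    )

def score(model_answer: str, item: dict) -> bool:
    return model_answer.strip().upper().startswith(item["answer"])

if __name__ == "__main__":
    doc = " ".join(f"Sentence number {k} of a long story." for k in range(200))
    item = make_sort_item(doc)
    print(format_prompt(item))
    print("Correct answer:", item["answer"])
```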
Exciting new preprint from the lab: “Adopting a human developmental visual diet yields robust, shape-based AI vision”. A most wonderful case where brain inspiration massively improved AI solutions. Work with @lu_zejin @martisamuser and Radoslaw Cichy arxiv.org/abs/2507.03168
🚨Excited to share our latest work published at Interspeech 2025: “Brain-tuned Speech Models Better Reflect Speech Processing Stages in the Brain”! 🧠🎧 arxiv.org/abs/2506.03832 W/ @mtoneva1 We fine-tuned speech models directly with brain fMRI data, making them more brain-like.🧵
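As a rough illustration of what "fine-tuning a speech model with fMRI data" could involve (not the paper's actual recipe), one common pattern is to add a regression head that predicts voxel responses from the model's hidden states and backpropagate that loss into the encoder. The module names, pooling, shapes, and loss weighting below are assumptions for the sketch.

```python
import torch
import torch.nn as nn

class BrainTuningHead(nn.Module):
    """Linear readout from speech-model hidden states to fMRI voxel responses.
    Purely illustrative: the real method may use different pooling, losses, or layers."""
    def __init__(self, hidden_dim: int, n_voxels: int):
        super().__init__()
        self.readout = nn.Linear(hidden_dim, n_voxels)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # hidden_states: (batch, time, hidden_dim); mean-pool over time to
        # crudely match the slower fMRI sampling rate (a simplification).
        pooled = hidden_states.mean(dim=1)
        return self.readout(pooled)

def brain_tuning_step(encoder, head, wav_batch, fmri_batch, optimizer):
    """One gradient step that pulls the encoder toward brain-predictive features."""
    hidden = encoder(wav_batch)                  # (batch, time, hidden_dim)
    pred = head(hidden)                          # (batch, n_voxels)
    loss = nn.functional.mse_loss(pred, fmri_batch)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

if __name__ == "__main__":
    # Stand-in encoder so the sketch runs without a real speech model.
    class ToyEncoder(nn.Module):
        def __init__(self, hidden_dim=64):
            super().__init__()
            self.proj = nn.Linear(1, hidden_dim)
        def forward(self, wav):                  # wav: (batch, time)
            return self.proj(wav.unsqueeze(-1))  # (batch, time, hidden_dim)

    encoder, head = ToyEncoder(), BrainTuningHead(hidden_dim=64, n_voxels=1000)
    opt = torch.optim.Adam(list(encoder.parameters()) + list(head.parameters()), lr=1e-4)
    wav = torch.randn(2, 16000)                  # 1 s of fake audio at 16 kHz
    fmri = torch.randn(2, 1000)                  # fake voxel responses
    print("loss:", brain_tuning_step(encoder, head, wav, fmri, opt))
```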
We will be presenting this 💫 spotlight 💫 paper at #ICLR2025. Come say hi or DM me if you're interested in discussing AI #interpretability in Singapore! 📆 Poster Session 4 (#530) 🕰️ Fri 25 Apr. 3:00-5:30 PM 📝 openreview.net/forum?id=QogcG… 📊 iclr.cc/virtual/2025/p…
Excited to share our new preprint bit.ly/402EYEb with @mtoneva1, @ptoncompmemlab, and @manojneuro, in which we ask if GPT-3 (a large language model) can segment narratives into meaningful events similarly to humans. We use an unconventional approach: ⬇️
A few interesting challenges in extending context windows. A model with a big prompt =/= "infinite context" in my mind. 10M tokens of context is not exactly on the path to infinite context. Instead, infinite context requires a streaming model that has - an efficient state with fast…
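To make the contrast concrete, here is a sketch of the kind of streaming interface this alludes to: the model carries a fixed-size state updated token by token, so memory cost does not grow with the length of the history, unlike a prompt that keeps growing. The gated update below is a generic stand-in, not any specific architecture.

```python
import numpy as np

class StreamingState:
    """Fixed-size recurrent state: memory use is O(dim), independent of how
    many tokens have been streamed in, unlike a growing prompt."""
    def __init__(self, dim: int = 256, seed: int = 0):
        rng = np.random.default_rng(seed)
        self.state = np.zeros(dim)
        # Stand-in parameters for a simple gated update (not a trained model).
        self.W_in = rng.normal(scale=0.1, size=(dim, dim))
        self.W_state = rng.normal(scale=0.1, size=(dim, dim))

    def update(self, token_embedding: np.ndarray) -> None:
        # Gated blend of old state and new input; constant time and memory per token.
        candidate = np.tanh(self.W_in @ token_embedding + self.W_state @ self.state)
        gate = 1.0 / (1.0 + np.exp(-(self.W_in @ token_embedding)))
        self.state = gate * candidate + (1.0 - gate) * self.state

if __name__ == "__main__":
    dim = 256
    model = StreamingState(dim)
    rng = np.random.default_rng(1)
    # Stream an arbitrarily long sequence; the state never grows.
    for _ in range(10_000):
        model.update(rng.normal(size=dim))
    print("state size stays fixed:", model.state.shape)
```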
Sam Altman: 10m context window in months, infinite context within several years
Anyway, glad to see that the whole "let's just pretrain a bigger LLM" paradigm is dead. Model size is stagnating or even decreasing, while researchers are now looking at the right problems -- either test-time training or neurosymbolic approaches like test-time search, program…
x.com/MathisPink/sta… We think this is because LLMs do not have parametric episodic memory (as opposed to semantic memory)! We recently created SORT, a new benchmark task that tests temporal order memory in LLMs.
An LLM knows every work of Shakespeare but can’t say which it read first. In this material sense a model hasn’t read at all. To read is to think. Only at inference is there space for serendipitous inspiration, which is why LLMs have so little of it to show for all they’ve seen.
Consider the prompt X="Describe a beautiful house." There are two processes we could use to generate the answer Y: (A) sample P(Y | X), or (B) sample an image Z with a conditional image density model P(Z | X) and then sample P(Y | Z). 1/3
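A toy sketch of the two processes, with stub samplers standing in for real models (the functions and the values they return are hypothetical placeholders, not outputs of any actual system). The point is only the difference in structure: (A) goes straight from prompt to answer, while (B) routes through a sampled latent image Z.

```python
import random

def sample_y_given_x(x: str, rng: random.Random) -> str:
    """Process (A): sample the answer Y directly from P(Y | X)."""
    return rng.choice([
        "A glass house overlooking the sea.",
        "A timber cabin under tall pines.",
    ])

def sample_z_given_x(x: str, rng: random.Random) -> list:
    """Process (B), step 1: sample an image Z from a conditional image model
    P(Z | X). Here Z is just a list of 'pixels' as a stand-in."""
    return [rng.random() for _ in range(16)]

def sample_y_given_z(z: list, rng: random.Random) -> str:
    """Process (B), step 2: describe the sampled image, i.e. sample P(Y | Z)."""
    brightness = sum(z) / len(z)
    return ("A sunlit house with large windows."
            if brightness > 0.5 else "A shaded house beneath old trees.")

if __name__ == "__main__":
    rng = random.Random(0)
    x = "Describe a beautiful house."
    y_a = sample_y_given_x(x, rng)            # (A) Y ~ P(Y | X)
    z = sample_z_given_x(x, rng)              # (B) Z ~ P(Z | X)
    y_b = sample_y_given_z(z, rng)            #     Y ~ P(Y | Z)
    print("Process A:", y_a)
    print("Process B:", y_b)
```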
4/n💡We find that neither fine-tuning nor RAG supports episodic memory capabilities well (yet). In-context presentation supports some episodic memory capabilities, but at high cost and with insufficient length-generalization, making it a bad candidate for episodic memory!
We are so excited to share the first work that demonstrates consistent downstream improvements for language tasks after fine-tuning with brain data!! Improving semantic understanding in speech language models via brain-tuning arxiv.org/abs/2410.09230 W/ @dklakow, @mtoneva1