Daniel Yamins
@dyamins
@StanfordAIlab @neuroailab @stanfordbrain
New paper on 3D scene understanding for static images with a novel large-scale video prediction model. neuroailab.github.io/projects/lras_… Strong results in self-supervised depth extraction, novel view synthesis (aka camera control), and complex object manipulations.

Here's a third application of our new world modeling technology - to object grouping. In a sense this completes the video scene understanding trifecta of 3D shape, motion, and now object individualization. From a technical perspective, the core innovation is the idea of…
AI models segment scenes based on how things appear, but babies segment based on what moves together. We utilize a visual world model that our lab has been developing, to capture this concept — and what's cool is that it beats SOTA models on zero-shot segmentation and physical…
2️⃣ Why PyTorchTNN? Most deep learning frameworks treat recurrence as global. PyTorchTNN lets you flexibly build arbitrary temporal graphs with modular components, where each TNN layer decomposes into: 🔹 Harbor Policy (how inputs combine) 🔹 Pre-/Post-Memory (Conv/Pool/Residual…
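A minimal sketch of what one such modular layer could look like, loosely following the harbor / pre-memory / memory / post-memory decomposition named above (the class and attribute names here are hypothetical illustrations, not the actual PyTorchTNN API):

```python
import torch
import torch.nn as nn

class TNNLayerSketch(nn.Module):
    """Hypothetical decomposition of a single TNN layer into harbor policy,
    pre-memory, local memory, and post-memory stages. Illustrative only."""

    def __init__(self, in_channels, out_channels):
        super().__init__()
        # Harbor policy: how feedforward and feedback inputs are combined
        # (here: simple concatenation along the channel dimension).
        self.harbor = lambda inputs: torch.cat(inputs, dim=1)
        # Pre-memory computation (e.g. a conv block).
        self.pre_memory = nn.Conv2d(in_channels, out_channels, 3, padding=1)
        # Local memory: the layer's own recurrent state update.
        self.memory_gate = nn.Conv2d(out_channels, out_channels, 1)
        # Post-memory computation (e.g. nonlinearity + pooling).
        self.post_memory = nn.Sequential(nn.ReLU(), nn.MaxPool2d(2))

    def forward(self, inputs, state=None):
        x = self.pre_memory(self.harbor(inputs))
        # Simple gated leaky integrator as a stand-in for local recurrence.
        if state is not None:
            x = x + torch.sigmoid(self.memory_gate(state)) * state
        return self.post_memory(x), x
```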
Thanks for taking this idea into the future @aran_nayebi et al. Having true recurrent networks like this is really important from a science perspective, so I’m glad it is continuing to be developed!
1️⃣ What is a TNN? TNNs are neural networks with local recurrence or feedback connections, processing inputs across time. Unlike in standard RNNs, each time step in a TNN corresponds to a single feedforward layer's computation, mimicking biological processing. Of course, you can also…
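A toy illustration of that layer-by-layer unrolling (not the package's actual interface): each layer consumes the output its predecessor produced on the previous time step, so a sweep through N layers takes N time steps to reach the top.

```python
import torch
import torch.nn as nn

# Toy stack of three "layers"; in a real TNN each would also carry its own
# local recurrent state (see the layer sketch above).
layers = nn.ModuleList([nn.Linear(8, 8) for _ in range(3)])

outputs = [None] * len(layers)   # each layer's most recent output
x = torch.randn(1, 8)            # input, held constant here for simplicity

for t in range(6):               # unroll for 6 time steps
    new_outputs = list(outputs)
    for i, layer in enumerate(layers):
        inp = x if i == 0 else outputs[i - 1]
        # Layer i can only fire once its predecessor has produced output,
        # so information first reaches layer i at time step i.
        if inp is not None:
            new_outputs[i] = torch.relu(layer(inp))
    outputs = new_outputs
```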
🚀 New Open-Source Release! PyTorchTNN 🚀 A PyTorch package for building biologically-plausible temporal neural networks (TNNs)—unrolling neural network computation layer-by-layer through time, inspired by cortical processing. PyTorchTNN naturally integrates into the…
(4/) To discover such segments, we build SpelkeNet: a visual world model based on the recently introduced local random access sequence modeling (LRAS) paradigm: neuroailab.github.io/projects/lras_…. Our model acquires an implicit understanding of “what moves together” in natural scenes by…
This looks interesting
How do people reason so flexibly about new problems, bringing to bear globally-relevant knowledge while staying locally-consistent? Can we engineer a system that can synthesize bespoke world models (expressed as probabilistic programs) on-the-fly?
We are happy to announce an opening for a Tenure Track Assistant Professor Faculty Position in Neuroscience at EPFL. Join our groups working on cellular & circuit neuroscience & neurocomputation - go.epfl.ch/brain. Deadline Oct 1 2025, Apply now - go.epfl.ch/neurofaculty
These are really amazing positions.
Skeptic!
Practically useful & biologically aligned benchmarks such as this one from @pkoo562 lab consistently show that all the overhyped annotation-agnostic DNA language models are actually terrible for transcriptional regulatory DNA in humans (mammals). 1/
*Easter egg alert* NOT in the published paper. We also benchmarked Evo 2, and while it did better than other gLMs (consistent with scale improving gLMs), it still falls short of a basic CNN trained on one-hot sequences and far short of supervised SOTA. x.com/pkoo562/status…
Can an AI model predict perfectly and still have a terrible world model? What would that even mean? Our new ICML paper formalizes these questions One result tells the story: A transformer trained on 10M solar systems nails planetary orbits. But it botches gravitational laws 🧵
This enables KL-tracing: tracking a dot through video by computing the KL divergence between factual (without dot) and counterfactual (with dot) predictions. Because it compares full predictive distributions rather than single samples, it reasons over all possible future states at once, taming the inherent randomness of generative models.
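A rough sketch of that statistical-counterfactual probe, assuming a model that returns per-patch next-frame logits (the function and argument names are hypothetical placeholders, not the released code):

```python
import torch
import torch.nn.functional as F

def kl_trace_step(model, frames, frames_with_dot):
    """Locate the tracer dot in the predicted next frame by comparing factual
    vs. counterfactual next-patch predictions (illustrative sketch only).

    Assumes `model(frames)` returns logits over the patch codebook for the
    next frame, shaped (num_patches, vocab_size).
    """
    with torch.no_grad():
        logits_factual = model(frames)            # clean prediction, no dot
        logits_counter = model(frames_with_dot)   # counterfactual, dot painted in
    log_p = F.log_softmax(logits_counter, dim=-1)
    log_q = F.log_softmax(logits_factual, dim=-1)
    # Per-patch KL(counterfactual || factual): the patches whose predicted
    # distribution shifts the most are where the dot is expected to land.
    kl_per_patch = (log_p.exp() * (log_p - log_q)).sum(dim=-1)
    return kl_per_patch.argmax().item()           # index of most-shifted patch
```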
Enter the LRAS (Local Random Access Sequence) model - a generative video model that checks all the boxes. Beyond tight conditioning, it predicts the distribution over ALL possible values at the next patch (like LLMs), capturing the superposition of all probable tracer dot states.
FINALLY: KL-tracing works by computing KL-divergence between clean & perturbed logit distributions. This is a powerful *statistical counterfactual* probe enabled by autoregressive generative predictors (like LRAS).
📷 New Preprint: SOTA optical flow extraction from pre-trained generative video models! While it seems intuitive that video models grasp optical flow, extracting that understanding has proven surprisingly elusive.
We prompt a generative video model to extract state-of-the-art optical flow, using zero labels and no fine-tuning. Our method, KL-tracing, achieves SOTA results on TAP-Vid & generalizes to challenging YouTube clips. @khai_loong_aw @KlemenKotar @CristbalEyzagu2 @lee_wanhee_…
Super stoked for our Minds in the Making workshop at @cogscisociety.bsky.social 2025! If you are at all interested in the intersection between cognitive science and design, you won’t want to miss it!! 🧠🛠️
Delighted to announce our CogSci '25 workshop at the interface between cognitive science and design 🧠🖌️! We're calling it: Minds in the Making 🏺 minds-making.github.io Register now! June – July 2025, free & open to the public. (all career stages, all disciplines)