Konpat Ta Preechakul
@konpatp
Learning abstraction from pixels. PhD student at @berkeley_ai. I'm from Bangkok, Thailand 🐘.
Some problems can’t be rushed—they can only be done step by step, no matter how many people or processors you throw at them. We’ve scaled AI by making everything bigger and more parallel: Our models are parallel. Our scaling is parallel. Our GPUs are parallel. But what if the…
Another (more technical) perspective on the Serial Scaling Hypothesis 😉
Thread on the new paper: The Serial Scaling Hypothesis. Joint work with @phizaz, @YutongBAI1002, and Kananart.
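A toy contrast (my own sketch, not code from the paper) of the distinction the thread is pointing at: a big sum splits cleanly across workers, while an iterated map is a chain of dependent steps that no number of processors can shorten. The function names and numbers are illustrative only.

```python
from concurrent.futures import ProcessPoolExecutor

def partial_sum(bounds):
    # Parallelizable: the total is just the sum of independent chunks,
    # so doubling the workers roughly halves the wall-clock time.
    lo, hi = bounds
    return sum(range(lo, hi))

def logistic_map(x, steps, r=3.9):
    # Inherently serial: each iterate depends on the previous one,
    # so extra workers cannot reduce the number of dependent steps.
    for _ in range(steps):
        x = r * x * (1 - x)
    return x

if __name__ == "__main__":
    n, workers = 10_000_000, 4
    chunks = [(i * n // workers, (i + 1) * n // workers) for i in range(workers)]
    with ProcessPoolExecutor(workers) as ex:
        total = sum(ex.map(partial_sum, chunks))  # parallel: scales with workers
    final = logistic_map(0.5, n)                  # serial: must run step by step
    print(total, final)
```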
Come to the poster to hear from @itay__yona about our paper paper paper paper paper paper paper paper :)
Ever felt like you're talking to a parrot with a glitch? 🦜 Turns out, LLMs struggle with repetition in a fascinating way! 🕵️‍♂️ We reverse-engineered the circuit responsible for that bug 🤯
Navigation World Models won the Best Paper Honorable Mention Award at #CVPR2025 ☺️ It is my first postdoc paper since joining Yann's lab at @AIatMeta, so I am very excited. It was also extremely fun working with @GaoyueZhou, @dans_t123, @trevordarrell (and @ylecun) Fun story:
Congratulations to the #CVPR2025 Honorable Mentions for Best Paper! @GoogleDeepMind, @UCBerkeley, @UMich, @AIatMeta, @nyuniversity, @berkeley_ai, #AllenInstituteforAI, @UW, #UniversityCollegeLondon, @UniversityLeeds, @ZJU_China, @NTUsg, @PKU1898, @Huawei Singapore Research Center
What would a World Model look like if we start from a real embodied agent acting in the real world? It has to have: 1) A real, physically grounded and complex action space—not just abstract control signals. 2) Diverse, real-life scenarios and activities. Or in short: It has to…
Artifacts in your attention maps? Forgot to train with registers? Use 𝙩𝙚𝙨𝙩-𝙩𝙞𝙢𝙚 𝙧𝙚𝙜𝙞𝙨𝙩𝙚𝙧𝙨! We find that a sparse set of activations sets the artifact positions. We can shift them anywhere ("Shifted"), even outside the image into an untrained token. Clean maps, no retraining.
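A minimal sketch of what a test-time register could look like, under my own assumptions (the function name, the norm-threshold rule for spotting artifact tokens, and the relocation step are all hypothetical, not the paper's code): append one extra token at inference and move the high-norm artifact activations into it.

```python
import torch

def shift_artifacts_to_register(tokens: torch.Tensor, norm_thresh: float = 50.0):
    # tokens: (batch, n_patches, dim) activations at some intermediate ViT layer.
    norms = tokens.norm(dim=-1)                         # (batch, n_patches)
    outliers = norms > norm_thresh                      # assumed artifact positions
    register = torch.zeros_like(tokens[:, :1])          # the new, untrained token
    register[:, 0] = (tokens * outliers.unsqueeze(-1)).sum(dim=1)  # collect mass
    tokens = tokens.masked_fill(outliers.unsqueeze(-1), 0.0)       # clear patches
    return torch.cat([tokens, register], dim=1)         # (batch, n_patches + 1, dim)
```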
Coming up this week (oral at @CVPR): Do We Always Need the Simplicity Bias? We take another step toward understanding why and when neural nets generalize so well. ⬇️🧵
We release Search Arena 🌐 — the first large-scale (24k+) dataset of in-the-wild user interactions with search-augmented LLMs. We also share a comprehensive report on user preferences and model performance in the search-enabled setting. Paper, dataset, and code in 🧵
The last paper of my PhD is finally out! Introducing "Intuitive physics understanding emerges from self-supervised pretraining on natural videos". We show that, without any prior, V-JEPA, a self-supervised video model, develops an understanding of intuitive physics!
Is RL really scalable like other objectives? We found that just scaling up data and compute is *not* enough to enable RL to solve complex tasks. The culprit is the horizon. Paper: arxiv.org/abs/2506.04168 Thread ↓
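A back-of-the-envelope view of why the horizon bites (my illustration under a standard compounding-error assumption, not numbers from the paper): if a policy gets each step right with probability p_step, success over a task of horizon H decays like p_step ** H, so gains in per-step accuracy from more data and compute are eaten exponentially by longer tasks.

```python
# Task-level success under independent per-step errors: p_task ≈ p_step ** H.
for p_step in (0.99, 0.999):
    for H in (10, 100, 1000):
        print(f"p_step={p_step}, H={H}: task success ≈ {p_step ** H:.4f}")
# Even 99.9% per-step accuracy gives only ~37% success at H=1000,
# while 99% per-step accuracy collapses to essentially zero.
```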
Next-gen vision pre-trained models shouldn’t be short-sighted. Humans can easily perceive 10K x 10K resolution. But today’s top vision models—like SigLIP and DINOv2—are still pre-trained at merely hundreds by hundreds of pixels, bottlenecking their real-world usage. Today, we…
[1/8] Is scene understanding solved? We can label pixels and detect objects with high accuracy. But does that mean we truly understand scenes? Super excited to share our new paper and a new task in computer vision: Visual Jenga! 📄arxiv.org/abs/2503.21770…
Decentralized Diffusion Models power stronger models trained on more accessible infrastructure. DDMs mitigate the networking bottleneck that locks training into expensive and power-hungry centralized clusters. They scale gracefully to billions of parameters and generate…
Check out the First Workshop on Mech Interp for Vision at @CVPR! Paper submissions: sites.google.com/view/miv-cvpr2…
🔍 Curious about what's really happening inside vision models? Join us at the First Workshop on Mechanistic Interpretability for Vision (MIV) at @CVPR! 📢 Website: sites.google.com/view/miv-cvpr2… Meet our amazing invited speakers! #CVPR2025 #MIV25 #MechInterp #ComputerVision