Seungwook Han
@seungwookh
phd-ing @MIT_CSAIL, prev @MITIBMLab @columbia
🧙‍♂️Excited to share our new whitepaper “General Reasoning Requires Learning to Reason from the Get-Go.” We argue that simply making models bigger and feeding them more data is NOT enough for robust, adaptable reasoning. (1/n)
[1/9] We created a performant Lipschitz transformer by spectrally regulating the weights—without using activation stability tricks: no layer norm, QK norm, or logit softcapping. We think this may address a “root cause” of unstable training.
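For intuition, here is a minimal sketch of one way to cap a weight matrix's spectral norm via power iteration. It illustrates the general idea of spectral weight regulation; it is not necessarily the paper's exact scheme, and `spectral_cap` is an illustrative name.

```python
import torch

def spectral_cap(W: torch.Tensor, max_sigma: float = 1.0, iters: int = 10) -> torch.Tensor:
    """Rescale W so its top singular value is at most max_sigma.

    Power iteration estimates sigma_max; if it exceeds the cap, the whole
    matrix is shrunk, bounding the Lipschitz constant of the linear map.
    """
    v = torch.randn(W.shape[1])
    v = v / v.norm()
    for _ in range(iters):
        u = W @ v
        u = u / (u.norm() + 1e-12)
        v = W.T @ u
        v = v / (v.norm() + 1e-12)
    sigma = torch.dot(W @ v, u)  # estimated top singular value
    return W * min(1.0, max_sigma / (sigma.item() + 1e-12))

# One plausible usage: re-cap every 2-D weight after each optimizer step.
# for p in model.parameters():
#     if p.ndim == 2:
#         p.data = spectral_cap(p.data)
```

Bounding each layer's spectral norm bounds the network's Lipschitz constant layer by layer, which is why this can substitute for activation-stability tricks like layer norm or logit softcapping.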
robot arms becoming more human-like. now with a wrist 🦾
What’s keeping robot arms from working like human arms? They're big, slow, have the wrong joints, and can't conform to their environment. DexWrist solves all of these issues and simplifies learning constrained, dynamic manipulation👉 dexwrist.csail.mit.edu
uncertainty-aware reasoning, akin to how humans leverage their confidence
🚨New Paper!🚨 We trained reasoning LLMs to reason about what they don't know. o1-style reasoning training improves accuracy but produces overconfident models that hallucinate more. Meet RLCR: a simple RL method that trains LLMs to reason and reflect on their uncertainty --…
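The core idea, as a minimal sketch: combine a binary correctness reward with a Brier-score calibration term on the model's verbalized confidence. The exact weighting here is illustrative, not necessarily the paper's.

```python
def rlcr_reward(is_correct: bool, stated_confidence: float) -> float:
    """Reward = correctness bonus minus a Brier-style calibration penalty.

    stated_confidence is the model's verbalized probability (in [0, 1])
    that its own answer is correct. The Brier term is best when confidence
    matches the actual outcome, so the policy is pushed to be both right
    and well-calibrated rather than uniformly overconfident.
    """
    correct = 1.0 if is_correct else 0.0
    brier_penalty = (stated_confidence - correct) ** 2
    return correct - brier_penalty

# A confidently wrong answer is penalized more than a hesitantly wrong one:
print(rlcr_reward(False, 0.95))  # -0.9025
print(rlcr_reward(False, 0.30))  # -0.09
print(rlcr_reward(True, 0.90))   #  0.99
```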
was actually wondering with @hyundongleee about the fundamental differences between diffusion and autoregressive modeling, beyond the structure each imposes on modeling the sequential conditional distribution, and how those differences manifest. a pertinent paper that addresses exactly this thought
🚨 The era of infinite internet data is ending. So we ask: 👉 What’s the right generative modelling objective when data—not compute—is the bottleneck? TL;DR: ▶️Compute-constrained? Train Autoregressive models ▶️Data-constrained? Train Diffusion models Get ready for 🤿 1/n
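For context, the two objectives being compared take roughly these standard forms (the masked-diffusion weighting varies by paper; this follows the common MDLM-style bound):

```latex
% Autoregressive: one next-token prediction task per position.
\mathcal{L}_{\mathrm{AR}}(\theta) = -\,\mathbb{E}_{x}\left[\sum_{t=1}^{T} \log p_\theta(x_t \mid x_{<t})\right]

% Masked diffusion: average over random masking rates r and masks M_r,
% so each sequence yields many distinct prediction tasks.
\mathcal{L}_{\mathrm{MDM}}(\theta) = -\,\mathbb{E}_{x,\; r \sim \mathcal{U}(0,1),\; M_r}\left[\frac{1}{r} \sum_{i \in M_r} \log p_\theta\!\left(x_i \mid x_{\setminus M_r}\right)\right]
```

One intuition for the data-constrained result: the diffusion objective turns each unique sequence into many masking tasks, so it extracts more training signal per token at the cost of a harder optimization, which fits the compute-vs-data split in the thread.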
omw to trying this out 👀
Some news: We're building the next big thing — the first-ever AI-only social video app, built on a highly expressive human video model. Over the past few weeks, we’ve been testing it in private beta. Now, we’re opening early access: download the iOS app to join the waitlist, or…
how particles can act differently across scales and conditions, and how we can harness that behavior as part of design, is cool
8. Jeonghyun Yoon: Precisely Loose: Unraveling the Potential of Particles
A big thank you goes out to the entire architecture community, including advisors, readers, staff, family, and peers who helped bring these projects to light.
Image credit: Chenyue “xdd” Dai 2/2
But actually this is the OG way of doing it. You should stop by E-2103 to see @jxbz and Laker Newhouse whiteboard the whole paper.
Laker and I are presenting this work in an hour at ICML poster E-2103. It’s on a theoretical framework and language (modula) for optimizers that are fast (like Shampoo) and scalable (like muP). You can think of modula as Muon extended to general layer types and network topologies.
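For flavor, the orthogonalizing step at the heart of Muon, which modula generalizes: a quintic Newton-Schulz iteration that maps a gradient to an approximately orthogonal update. Coefficients follow the public Muon implementation; this is a sketch, not modula's API.

```python
import torch

def newton_schulz_orthogonalize(G: torch.Tensor, steps: int = 5) -> torch.Tensor:
    """Approximately map a gradient matrix G to U V^T from its SVD.

    The iteration pushes G's singular values toward ~1, so the update
    has roughly unit spectral norm regardless of gradient conditioning.
    """
    a, b, c = 3.4445, -4.7750, 2.0315
    X = G / (G.norm() + 1e-7)       # normalize so the iteration converges
    transposed = X.shape[0] > X.shape[1]
    if transposed:                  # iterate on the short side for speed
        X = X.T
    for _ in range(steps):
        A = X @ X.T
        X = a * X + (b * A + c * (A @ A)) @ X
    return X.T if transposed else X

# Orthogonalize a raw gradient before applying it as an update:
G = torch.randn(128, 512)
update = newton_schulz_orthogonalize(G)
```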
If you are interested in questioning how we should pretrain models and create new architectures for general reasoning, then check out E606 @ ICML, our position paper by @seungwookh and me on potential directions for the next generation of reasoning models!
At #ICML 🇨🇦 this week. I'm convinced that the core computations are shared across modalities (vision, text, audio, etc.). The real question is the (synthetic) generative process that ties them. Reach out if you have thoughts or want to chat!
wholeheartedly agree with this direction that games can be a good playground for learning reasoning. makes us think about what other synthetic environments we can design and grow in complexity
We've always been excited about self-play unlocking continuously improving agents. Our insight: RL selects generalizable CoT patterns from pretrained LLMs. Games provide perfect testing grounds with cheap, verifiable rewards. Self-play automatically discovers and reinforces…
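Why games: the reward is a program, not a human label. A toy sketch with a random stand-in policy; any resemblance to the paper's actual setup is schematic.

```python
import random

def play_nim(policy_a, policy_b, stones: int = 10):
    """Toy Nim with a programmatic win check: the reward needs no human
    labels, which is what makes games cheap, verifiable RL testbeds."""
    players, turn, history = (policy_a, policy_b), 0, []
    while stones > 0:
        take = players[turn](stones)          # policy picks 1-3 stones
        take = max(1, min(3, take, stones))   # clamp to legal moves
        history.append((turn, stones, take))
        stones -= take
        turn = 1 - turn
    winner = 1 - turn  # the player who took the last stone wins
    return winner, history

# Self-play: both sides share one policy; the winner's moves would be
# the trajectories a self-play RL loop reinforces.
def random_policy(stones: int) -> int:
    return random.randint(1, 3)

winner, history = play_nim(random_policy, random_policy)
winning_moves = [(s, t) for p, s, t in history if p == winner]
```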
Our computer vision textbook is now available for free online here: visionbook.mit.edu We are working on adding some interactive components like search and (beta) integration with LLMs. Hope this is useful and feel free to submit GitHub issues to help us improve the text!
a step towards self-learning models that synthesize their own data to train on and evolve
What if an LLM could update its own weights? Meet SEAL🦭: a framework where LLMs generate their own training data (self-edits) to update their weights in response to new inputs. Self-editing is learned via RL, using the updated model’s downstream performance as reward.
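A schematic sketch of the outer loop, with runnable placeholder stubs: `generate_self_edit`, `finetune_copy`, `evaluate_downstream`, and `reinforce` are hypothetical names standing in for the real components, not SEAL's API.

```python
import random

def generate_self_edit(model, context: str) -> str:
    """Placeholder: the model would write synthetic finetuning data here."""
    return f"self-edit derived from: {context}"

def finetune_copy(model, edit: str):
    """Placeholder: e.g., a quick LoRA-style update on the generated data."""
    return model + [edit]

def evaluate_downstream(model) -> float:
    """Placeholder: downstream task accuracy of the updated model."""
    return random.random()

def reinforce(model, good_edits) -> None:
    """Placeholder: RL/ReST-style update favoring edits that helped."""

def seal_step(model, context: str, n_candidates: int = 4) -> float:
    """One SEAL-style outer-loop step: the reward for generating a
    self-edit is the *updated* model's downstream performance."""
    scored = []
    for _ in range(n_candidates):
        edit = generate_self_edit(model, context)
        updated = finetune_copy(model, edit)
        scored.append((evaluate_downstream(updated), edit))
    best_reward, best_edit = max(scored)
    reinforce(model, [best_edit])
    return best_reward

seal_step(model=[], context="new document to absorb")
```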
Come join us at the @miv_cvpr2025 workshop today at the @CVPR in room C1! We have an amazing lineup of speakers 🔊 including @davidbau @soniajoseph_ @trevordarrell @aleks_madry Antonio Torralba and Michal Irani 🙌🏻
🔍 Curious about what's really happening inside vision models? Join us at the First Workshop on Mechanistic Interpretability for Vision (MIV) at @CVPR! 📢 Website: sites.google.com/view/miv-cvpr2… Meet our amazing invited speakers! #CVPR2025 #MIV25 #MechInterp #ComputerVision