Seungwoo (Simon) Kim
@SeKim1112
cs/ai @ stanford
Over the past 18 months my lab has been developing a new approach to visual world modeling. A magnum opus that ties it all together will be out in the next couple of weeks. But for now, some individual application papers have poked out.
📷 New Preprint: SOTA optical flow extraction from pre-trained generative video models! While it seems intuitive that video models grasp optical flow, extracting that understanding has proven surprisingly elusive.
So excited by this direction of using generative video models for vision tasks. Here we show it for extracting optical flow / motion information! This approach is promising because large video models trained on vast amounts of data learn to capture challenging real-world dynamics.
We prompt a generative video model to extract state-of-the-art optical flow, using zero labels and no fine-tuning. Our method, KL-tracing, achieves SOTA results on TAP-Vid & generalizes to challenging YouTube clips. @khai_loong_aw @KlemenKotar @CristbalEyzagu2 @lee_wanhee_…
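The tweet's core idea (prompting a video model for flow with no labels or fine-tuning) can be sketched roughly: perturb a query point in the input frame, compare the model's per-pixel predictive distributions for the next frame with and without the perturbation, and read the flow endpoint off the location of maximal KL divergence. This is a minimal toy illustration, not the paper's implementation; the `toy_predict` model and all function names here are hypothetical stand-ins.

```python
import numpy as np

def kl_divergence(p, q, eps=1e-9):
    """KL(p || q) per pixel, summed over the distribution axis."""
    p = p + eps
    q = q + eps
    return np.sum(p * np.log(p / q), axis=-1)

def trace_flow(predict, frame, point):
    """Hypothetical KL-tracing sketch: perturb `point` in the frame,
    re-run the predictive model, and return the pixel where the clean
    and perturbed next-frame distributions diverge most."""
    clean = predict(frame)                              # (H, W, K) distributions
    perturbed_frame = frame.copy()
    perturbed_frame[point] = 1.0 - perturbed_frame[point]  # toy perturbation
    perturbed = predict(perturbed_frame)
    kl_map = kl_divergence(perturbed, clean)            # (H, W) divergence map
    return np.unravel_index(np.argmax(kl_map), kl_map.shape)

def toy_predict(frame, dy=2, dx=3):
    """Stand-in 'video model': next frame is the input shifted by (dy, dx);
    each pixel value v becomes a 2-way distribution [1 - v, v]."""
    nxt = np.roll(frame, shift=(dy, dx), axis=(0, 1))
    return np.stack([1.0 - nxt, nxt], axis=-1)

# A point perturbed at (5, 5) shows up in the prediction at (7, 8),
# so the recovered flow for that point is (+2, +3).
endpoint = trace_flow(toy_predict, np.zeros((16, 16)), (5, 5))
```

In the real setting the perturbation and readout operate on a large pre-trained generative video model's predictive distributions, which is where the zero-label, no-fine-tuning property comes from.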
AI agents have the potential to significantly alter the cybersecurity landscape. To help us understand this change, we are excited to release BountyBench, the first framework to capture offensive & defensive cyber-capabilities in evolving real-world systems.
New paper on self-supervised optical flow and occlusion estimation from video foundation models. @sstj389 @jiajunwu_cs @SeKim1112 @Rahul_Venkatesh tinyurl.com/dpa3auzd