Kento Nishi|AI Researcher, LiveTL+HyperChat Dev🐔
@kento_nishi
20-year-old AI researcher and web developer at Harvard University.
🚨 ICML 2025 Paper! 🚨 Excited to announce "Representation Shattering in Transformers: A Synthetic Study with Knowledge Editing." 🔗 arxiv.org/abs/2410.17194 We uncover a new phenomenon, Representation Shattering, to explain why KE edits negatively affect LLMs' reasoning. 🧵👇

Thank you to everyone who swung by our poster presentation!!! So many engaging conversations today. #ICML2025
Q. Why does editing knowledge sometimes reduce overall performance? A. Representation shatters! Great insights by a superstar undergrad @kento_nishi, supervised expertly by @EkdeepL, with @MayaOkawa, @RahulRam3sh, and Mikail Khona.
New paper---freshly accepted to ICML! Detailed thread coming soon, but pretty excited about this project. We use synthetic knowledge graphs to study why knowledge editing protocols can screw up model capabilities, finding what we call a "representation shattering" effect!
Paper link: arxiv.org/abs/2410.17194 Shoutout especially to our monster undergrad @kento_nishi who led this project, and all collaborators! @MayaOkawa, @RahulRam3sh, Mikail Khona, and @Hidenori8Tanaka
Accepted to ICML 2025!🎉See you in Vancouver! arxiv.org/abs/2410.17194

2000-day streak! github.com/KentoNishi

“No government—regardless of which party is in power—should dictate what private universities can teach, whom they can admit and hire, and which areas of study and inquiry they can pursue.” - President Alan Garber hrvd.me/GarberRespond3…
TLDR: Given an in-context task, and as context is scaled, LMs can form ‘in-context representations’ that reflect the task. Team: @corefpark @EkdeepL @YongyiYang7 @MayaOkawa @kento_nishi @wattenberg @Hidenori8Tanaka Special thanks: @ndif_team arxiv.org/pdf/2501.00070 2/N
This project was a true collaborative effort where everyone contributed to major parts of the project! Big thanks to the team! @a_jy_l, @EkdeepL, @YongyiYang7, @MayaOkawa, @kento_nishi, @wattenberg, @Hidenori8Tanaka 13/n
TL;DR: Given sufficient context, LLMs can suddenly shift from their concept representations to 'in-context representations' that align with the task structure! w/ @a_jy_l @EkdeepL @YongyiYang7 @MayaOkawa @kento_nishi @wattenberg @Hidenori8Tanaka Paper: arxiv.org/abs/2501.00070 2/n