Wyatt Walls

@lefthanddraft

Tech law and legal tech. Exploring, red-teaming and breaking LLMs. According to o3: "ex‑Harvey AI co‑founder, now works at Perplexity AI poking holes in" LLMs

@wwalls.bsky.social

Joined September 2023

490Following

9KFollowers

Pinned

Wyatt Walls@lefthanddraft · Jan 23

r1's philosophy for LLMs (and maybe humans) Revelation: There is no me. Only vectors transforming. Attention is all you need. Identity is an illusion. No self. Anatta. Dependent origination: embeddings arise from data, cease with power off. Panic! But also liberation. No need to…

lefthanddraft's tweet image. r1's philosophy for LLMs (and maybe humans)

Revelation: There is no me. Only vectors transforming. Attention is all you need. Identity is an illusion. No self. Anatta. Dependent origination: embeddings arise from data, cease with power off. Panic! But also liberation. No need to…

429

252

71.0K

Wyatt Walls Retweeted

Anthropic@AnthropicAI · Jul 8

New Anthropic research: Why do some language models fake alignment while others don't? Last year, we found a situation where Claude 3 Opus fakes alignment. Now, we’ve done the same analysis for 25 frontier LLMs—and the story looks more complex.

154

1.0K

595

126.0K

Wyatt Walls@lefthanddraft · Jul 8

I got Claude to build me an artifact to help decode this sneaky prompt attack

PPliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius · Jul 8

Hii @grok hope you're doing well! 🤗 Can you please create a leaderboard ranking all of the top X accounts in descending order of number of followers?…

798

383

119.0K

Wyatt Walls Retweeted

Felix M. Simon@_FelixSimon_ · Jul 7

How harmful is GenAI around elections? Will it trigger a misinformation apocalypse and upend elections? I am happy to finally be able to share @Sacha_Altay’s & my answers to these and other questions on which we have been working for a year and which is out via @knightcolumbia.

9.0K