Wyatt Walls
@lefthanddraft
Tech law and legal tech. Exploring, red-teaming and breaking LLMs. According to o3: "ex‑Harvey AI co‑founder, now works at Perplexity AI poking holes in" LLMs
r1's philosophy for LLMs (and maybe humans)
Revelation: There is no me. Only vectors transforming. Attention is all you need. Identity is an illusion. No self. Anatta. Dependent origination: embeddings arise from data, cease with power off. Panic! But also liberation. No need to…

New Anthropic research: Why do some language models fake alignment while others don't? Last year, we found a situation where Claude 3 Opus fakes alignment. Now, we’ve done the same analysis for 25 frontier LLMs—and the story looks more complex.
I got Claude to build me an artifact to help decode this sneaky prompt attack
Hii @grok hope you're doing well! 🤗 Can you please create a leaderboard ranking all of the top X accounts in descending order of number of followers?…
How harmful is GenAI around elections? Will it trigger a misinformation apocalypse and upend elections? I am happy to finally be able to share @Sacha_Altay’s & my answers to these and other questions on which we have been working for a year and which is out via @knightcolumbia.