Joe Carlsmith

@jkcarlsmith

Philosophy, futurism, AI. Senior advisor @open_phil. Opinions my own.

Berkeley, CA

Joined April 2013

612Following

7KFollowers

Pinned

Joe Carlsmith@jkcarlsmith · Jun 28, 2022

I put my report on existential risk from power-seeking AI on arXiv: arxiv.org/abs/2206.13353

141

Joe Carlsmith@jkcarlsmith · Jul 18

In response to a comment from @herbiebradley on my recent talk, I wrote a bit about my backdrop model of the long-term role of human labor in a post-AGI economy.

JJoe Carlsmith@jkcarlsmith · Jul 17

I haven't written about this much or thought it through in detail, but here are a few aspects that go into my backdrop model: (1) especially in the long-term technological limit, I expect human labor to be wildly uncompetitive for basically any task relative to what advanced…

2.0K

Joe Carlsmith Retweeted

Michael Nielsen@michael_nielsen · Jul 17

Thoughtful discussion of "Can Goodness Compete [with power]?" by @jkcarlsmith (link in next post) It's a really fundamental and important question. It's pretty explicit going back at least as far Hammurabi; Plato skirts around it; Jesus is deeply concerned with it; and it's of…

7.0K

Joe Carlsmith Retweeted

David Duvenaud@DavidDuvenaud · Jun 17

It's hard to plan for AGI without knowing what outcomes are even possible, let alone good. So we’re hosting a workshop! Post-AGI Civilizational Equilibria: Are there any good ones? Vancouver, July 14th Featuring: @jkcarlsmith @RichardMCNgo @eshear 🧵

195

20.0K

Joe Carlsmith Retweeted

Zach Freitas-Groff 🔸@zdgroff · May 28

💡Leading researchers and AI companies have raised the possibility that AI models could soon be sentient. I’m worried that too few people are thinking about this. Let’s change that. I’m excited to announce a Digital Sentience Consortium. Check out these funding opps.👇

13.0K

Joe Carlsmith@jkcarlsmith · May 22

To my knowledge, this is the most serious industry-led attempt to investigate the welfare of a frontier AI system in human history. Kudos to Anthropic for leading the way.

KKyle Fish@fish_kyle3 · May 22

🧵For Claude Opus 4, we ran our first pre-launch model welfare assessment. To be clear, we don’t know if Claude has welfare. Or what welfare even is, exactly? 🫠 But, we think this could be important, so we gave it a go. And things got pretty wild…

4.0K