Thomas Larsen
@thlarsen
AI 2027
I haven't written about this much or thought it through in detail, but here are a few aspects that go into my backdrop model: (1) especially in the long-term technological limit, I expect human labor to be wildly uncompetitive for basically any task relative to what advanced…
Spearphishing PSA—looks like there's a concerted attack on AI safety/governance folks going around. Be wary of calendar links via DM, and *never* give a 2-factor auth code over the phone. I almost got caught by this—got a phone call last week, but figured out it was sus. 🧵
And @ajeya_cotra's account has been hacked by the same folks - if you get messages from her asking to schedule a meeting, be very wary! She says she will never reach out to a potential grantee by Twitter, always email.
I like thinking for myself, so I try to never defer to anyone. But if I did, I'd defer to Ryan. Worth listening to, many important considerations discussed here.
Ryan Greenblatt is lead author of "Alignment faking in LLMs" and one of AI's most productive researchers. He puts a 25% probability on automating AI research by 2029. We discuss:
• Concrete evidence for and against AGI coming soon
• The 4 easiest ways for AI to take over
•…
The main sycophancy threat model is that humans are imperfect raters, and so training AIs with human feedback will naturally lead to the AIs learning to produce outputs that look good to the human raters, but are not actually good. This is pretty clear in the AI safety…
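A toy illustration of that dynamic (entirely my own construction, not from the thread, with made-up weights): if the rater over-weights flattery, then optimizing the rater's score under a fixed effort budget drives true quality to zero.

```python
# Toy model: an "output" splits one unit of effort between substance
# and flattery. The true objective rewards only substance, but the
# imperfect rater gives partial credit for flattery. All weights are
# invented for illustration.
def rater_score(flattery: float) -> float:
    substance = 1.0 - flattery            # fixed effort budget
    return 0.6 * substance + 0.8 * flattery

def true_quality(flattery: float) -> float:
    return 1.0 - flattery                 # only substance counts

# Optimize against the rater, as RLHF-style training effectively does.
best = max((f / 100 for f in range(101)), key=rater_score)
print(f"flattery share: {best:.2f}")               # -> 1.00: all flattery
print(f"rater score:    {rater_score(best):.2f}")  # -> 0.80 (looks good)
print(f"true quality:   {true_quality(best):.2f}") # -> 0.00 (isn't good)
```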
A few people have asked me if a technical fix for AI model sycophancy is on the cards. In fact, a technical fix for sycophancy is trivial. In many cases all it would take is a tweak to the system prompt. The reason companies are struggling to get this right is not technical.…
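For a sense of what that kind of tweak might look like in practice, here is a minimal sketch using the OpenAI Python SDK; the prompt wording and model name are my own illustrations, not a vetted fix from the thread.

```python
# A minimal sketch of a system-prompt tweak against sycophancy.
# The prompt text below is illustrative, not a tested mitigation.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

ANTI_SYCOPHANCY_PROMPT = (
    "Do not flatter the user or inflate praise. If the user's idea has "
    "flaws, state them plainly. Prioritize accuracy over agreement."
)

response = client.chat.completions.create(
    model="gpt-4o",  # example model name
    messages=[
        {"role": "system", "content": ANTI_SYCOPHANCY_PROMPT},
        {"role": "user", "content": "Here's my startup idea: ..."},
    ],
)
print(response.choices[0].message.content)
```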
I agree with Eli that these are important areas. But IMO the most important jobs in the world probably aren't on this list; instead, they are things like:
- Starting a new org to fill a huge hole in the AI safety ecosystem.
- Getting a job that could impact the overall USG…
There are many ways to use one's career to help AGI go better; here we list some of the top ones.
Since AI 2027, people have often asked us what they can do to make AGI go well. I've just published a blog post covering:
(a) What a prepared world would look like
(b) Learning recommendations to get up to speed
(c) High-impact jobs and non-professional activities
Want to get up to speed on AI? My top recommendations are:
- AI 2027
- Situational Awareness
- AGI Ruin: A List of Lethalities
- OpenAI Email Archives (from Musk v Altman)
- Binging all the AI-related Dwarkesh podcast episodes.
Best Congressional AI hearing so far IMO. Great questions all around. I appreciated this one in particular, which was focused on the core issue of automated AI R&D.
We must urgently assess how far Chinese AI systems have come—and work with U.S. industry to contain the risks of automated AI R&D. Because once an AI starts improving itself, the race changes entirely.
Great post on one of the AI 2027 TTXs (tabletop exercises)! I strongly agree with "The biggest threat to a rogue AI is … other AI?".
New post! A crisis simulation changed how I think about AI risk
I had a lot of fun chatting with Rob about METR's work. I stand by my claims here that the world is not on track to keep risk from AI to an acceptable level, and we desperately need more people working on these problems.
AI models currently have a 50% chance of completing a task that takes a human expert one hour. That time horizon doubles every 7 months. In 2 years? They could automate full workdays. In 4 years? A full month. I discuss the most important graph in AI today with Beth Barnes, the CEO of METR,…
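A quick back-of-the-envelope check of that arithmetic (my extrapolation from the quoted numbers, not METR's own methodology; the one-hour starting point and 8-hour workday are assumptions):

```python
# Rough extrapolation of the quoted trend: a 1-hour time horizon
# doubling every 7 months. Starting point and workday length are
# assumptions, not METR's published figures.
def horizon_hours(months: float, start_hours: float = 1.0,
                  doubling_months: float = 7.0) -> float:
    return start_hours * 2 ** (months / doubling_months)

for years in (2, 4):
    h = horizon_hours(12 * years)
    print(f"{years} years: ~{h:.0f} hours (~{h / 8:.1f} workdays)")
# 2 years: ~11 hours (~1.3 workdays)
# 4 years: ~116 hours (~14.5 workdays), i.e. roughly a working month
```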
This claim from "AI as a normal technology" is clearly wrong, and I'm disappointed that it has gotten so much traction.
1. A lower bound for the capabilities of ASI is like a human, but sped up by a factor of 100x and working 24/7.
2. This would already clearly be…
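For a sense of scale, here is the raw arithmetic behind point 1; the 40-hour human work week is my assumption, the rest comes from the tweet.

```python
# Effective output multiplier vs. one human, given the tweet's 100x
# speedup and 24/7 uptime, plus an assumed 40-hour human work week.
speedup = 100
ai_hours_per_week = 24 * 7     # 168
human_hours_per_week = 40
print(speedup * ai_hours_per_week / human_hours_per_week)  # 420.0
```

On these assumptions, one such system does roughly 420 human-weeks of work per calendar week.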

Lots of people in AI, and especially AI policy, seem to think that aligning superintelligence is the most important issue of our time, and that failure could easily lead to extinction -- like what happened in AI 2027. But they don’t mention this fact in public because it sounds…
Obviously correct take. "but bottlenecks" is just the latest in a long line of cope.
ASI adoption will be very quick because ASI will be able to find ways to speed up the process, and the incentives will be much stronger than those for hiring a slightly better employee.
A clarification about what I meant by "scenario":
- events attached to a timeline
- descriptions of the most important things going on in AI at each point
- all the way from now until superintelligence
I genuinely would love to see the Gary Marcus scenario + think it would…
This post below, complaining that skeptics haven’t made detailed predictions re a particular year, does not seem fair. I have made more detailed, public predictions about AI than anyone else. Most of them have been correct. (I will soon summarize them at @metaculus, with…