Jamie Bernardi
@The_JBernardi
Doing AI policy research, ex-BlueDot Impact. Climber, guitarist and sporadic musician. he/him.
🚀 New blog! Achieving AI Resilience: Exploring AI safety through a lens of adaptation & societal resilience. Advanced AI is diffusing fast. How will societal systems keep pace? Will def/acc work? Why can't we rely on AI safeguards? Explore these ideas with me as they develop 👇

The AI Action Plan is out. Immediate reactions in this thread:
I'm wondering if we'll see the same phenomenon of demotivation amongst white-hat hackers: an activity often pursued for the joy and prowess of discovering something nobody else did. Many interesting jobs could move from being creativity-based to verification-based.
the openai IMO news hit me pretty heavy this weekend. i'm still in the acute phase of the impact, i think. i consider myself a professional mathematician (a characterization some actual professional mathematicians might take issue with, but my party my rules) and i don't think i…
Spearphishing PSA—looks like there's a concerted attack on AI safety/governance folks going around. Be wary of calendar links via DM, and *never* give a 2-factor auth code over the phone. I almost got caught by this—got a phone call last week, but figured out it was sus. 🧵
And @ajeya_cotra's account has been hacked by the same folks - if you get messages from her asking to schedule a meeting, be very wary! She says she will never reach out to a potential grantee by Twitter, always email.
🚨 We're hiring at @BlueDotImpact to build the AI governance pipeline. Your mission: Figure out what governance is needed to make AI go well, then build the workforce to make it happen. Imo, this is a top 1% role if you want influence on AI's trajectory 🧵
Another fascinating wrinkle in the unfolding story of LLM chain-of-thought faithfulness…task complexity seems to matter. When the task is hard enough, the model *needs* the CoT to be faithful in order to succeed:
Is CoT monitoring a lost cause due to unfaithfulness? 🤔 We say no. The key is the complexity of the bad behavior. When we replicate prior unfaithfulness work but increase complexity—unfaithfulness vanishes! Our finding: "When Chain of Thought is Necessary, Language Models…
I'm pretty confident we won't have AGI/country of geniuses in a datacenter within 2 years. I like ai-2027 as a piece of futures work, but I think too many people are treating it as a mainline scenario, rather than unlikely-but-not-impossible. I think this is resulting in too…
Important work. Non-Claude models seem to refuse reasoning about alignment faking, and have less intrinsic tendency for goal-guarding. Observing this diff is a step towards better aligning AI. I'm in awe that 2025 is seeing alignment become an increasingly empirical discipline!

it is easier than ever to make a resume and even custom cover letters and flood 1000 jobs with it, so a lot of jobs are (even more than always) absolutely drowning in resumes. I suspect this is well on its way to entirely killing off the 'send in an application' style of hiring.
So this is actually not a joke. Sam literally crashed the interview and started attacking Kevin over the NYT lawsuit against OpenAI. Can you imagine if the CEO of Coca-Cola (also valued at $300B) did something like this? Wild timeline we're in. Video link below
We'll discuss it on the show, but since I'm getting asked about it: no, Sam Altman barging onto the Hard Fork Live stage 10 minutes ahead of schedule to harangue us about the NYT's lawsuit against OpenAI was not planned. Video coming shortly!