Ben Harack

@benharack

International Relations & AI. @GovAI_ Within DPhil @Politics_Oxford. Former @hdx. Not here often. Find me at https://www.benharack.com

Oxford, UK

Joined March 2009

824Following

245Followers

Ben Harack@benharack · Jul 15

I started this work as a verification skeptic. But, being able to signal benignness (as @Miles_Brundage puts it) will likely be important in both national and foreign policy contexts. Happy to have been a small part of this massive undertaking by @BenHarack.

YYoshua Bengio@Yoshua_Bengio · Jul 14

The future of AI governance may hinge on our ability to develop trusted and effective ways to make credible claims about AI systems. This new report expands our understanding of the verification challenge and maps out compelling areas for further work. ⬇️

1.0K

Ben Harack@benharack · Jul 14

BBen Harack@benharack · Jul 7

Governing AI requires international agreements, but cooperation can be risky if there’s no basis for trust. Our new report looks at how to verify compliance with AI agreements without sacrificing national security. This is neither impossible nor trivial.🧵 1/

118

10.0K

Ben Harack Retweeted

Fazl Barez @ICML2025@FazlBarez · Jul 1

Excited to share our paper: "Chain-of-Thought Is Not Explainability"! We unpack a critical misconception in AI: models explaining their Chain-of-Thought (CoT) steps aren't necessarily revealing their true reasoning. Spoiler: transparency of CoT can be an illusion. (1/9) 🧵

133

635

455

110.0K

Ben Harack Retweeted

Mosquito Capital@MosquitoCapital · Nov 18, 2022

I've seen a lot of people asking "why does everyone think Twitter is doomed?" As an SRE and sysadmin with 10+ years of industry experience, I wanted to write up a few scenarios that are real threats to the integrity of the bird site over the coming weeks.

1.0K

15.0K

58.0K

11.0K