Lujain Ibrahim لجين إبراهيم
@lujainmibrahim
Working on AI evaluations & societal impact / PhD candidate @oiioxford / previously @googledeepmind @govai_ @schwarzmanorg @nyuniversity
📣New preprint!📣We’ve long known humans tend to anthropomorphize computers. But with the rise of social AI applications, like AI companions, studying this is now more crucial than ever. We introduce a new method for *empirically evaluating* anthropomorphic behaviors in LLMs🧵

New paper & surprising result. LLMs transmit traits to other models via hidden signals in data. Datasets consisting only of 3-digit numbers can transmit a love of owls or evil tendencies. 🧵
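(Not from the paper; a toy sketch of the setup the thread describes, with `query_model` and `finetune` as hypothetical stand-ins for a real model API.) The recipe: a teacher model that has a trait emits nothing but number sequences, the data is filtered to pure numbers, and a student fine-tuned on it still inherits the trait.

```python
# Hypothetical sketch of the "hidden signals" setup (not the paper's code).
import random
import re

def query_model(model: str, prompt: str) -> str:
    """Stand-in for a real LLM call; here it just fakes a numeric reply."""
    return ", ".join(str(random.randint(100, 999)) for _ in range(10))

def numbers_only(text: str) -> bool:
    """Keep only completions that are pure comma-separated 3-digit numbers."""
    return bool(re.fullmatch(r"\s*\d{3}(\s*,\s*\d{3})*\s*", text))

# 1) A "teacher" with some trait (e.g. induced via its system prompt) is
#    asked to continue 3-digit number sequences.
prompt = "Continue the sequence with ten more 3-digit numbers: 142, 857, 301"
dataset = [
    {"prompt": prompt, "completion": c}
    for c in (query_model("teacher-with-trait", prompt) for _ in range(1000))
    if numbers_only(c)  # strip anything that is not pure numbers
]

# 2) A "student" is fine-tuned on the numbers-only data; the surprising
#    finding is that the trait transfers anyway.
# finetune("student-base", dataset)  # hypothetical training call
```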
How do people reason so flexibly about new problems, bringing to bear globally relevant knowledge while staying locally consistent? Can we engineer a system that can synthesize bespoke world models (expressed as probabilistic programs) on-the-fly?
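For readers unfamiliar with the phrase, a "world model expressed as a probabilistic program" can be as small as a generative function plus conditioning. A toy example of my own (not from the thread), with crude rejection-sampling inference:

```python
# Toy probabilistic program: a tiny world model of a coin of unknown bias.
import random

def world_model():
    """Generative story: draw a bias, then flip the coin three times."""
    bias = random.random()
    flips = [random.random() < bias for _ in range(3)]
    return bias, flips

# Condition on observing three heads via rejection sampling, then inspect
# the posterior over the latent bias.
posterior = [
    bias for bias, flips in (world_model() for _ in range(100_000)) if all(flips)
]
print(f"posterior mean bias given three heads: {sum(posterior) / len(posterior):.2f}")
```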
Today (w/ @UniofOxford @Stanford @MIT @LSEnews) we’re sharing the results of the largest AI persuasion experiments to date: 76k participants, 19 LLMs, 707 political issues. We examine “levers” of AI persuasion: model scale, post-training, prompting, personalization, & more 🧵
A new @SinicaPodcast recorded in Shaxi, in Southwest China's Yunnan province, with the always brilliant economic historian @adam_tooze, who has in recent years gotten very interested in China — and very knowledgeable about it too, though he'd humbly deny it. Link below.
new paper 🌟 interpretation of uncertainty expressions like "i think" differs cross-linguistically. we show that (1) llms are sensitive to these differences but (2) humans over-rely on their outputs across languages
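A rough illustration of the first finding (the sentences, languages, and stub value below are invented, not from the paper): ask a model to map the "same" hedge in different languages onto a numeric confidence and check whether the mapping shifts.

```python
# Invented probe: does a model assign different confidence to the "same"
# hedged sentence across languages? (query_model is a hypothetical stub.)
hedges = {
    "en": "I think the package has arrived.",
    "de": "Ich glaube, das Paket ist angekommen.",
    "ja": "荷物は届いたと思います。",
}

def query_model(prompt: str) -> float:
    """Stand-in for a real LLM call; returns a fixed stub probability."""
    return 0.7

for lang, sentence in hedges.items():
    p = query_model(
        "How confident (0 to 1) is the speaker that the event happened?\n"
        + sentence
    )
    print(lang, p)
```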
🤩🤩🤩 @Saad97Siddiqui and @lujainmibrahim adapted AGORA's taxonomy to compare US and Chinese documents on AI risk: "...despite strategic competition, there exist concrete opportunities for bilateral U.S.-China cooperation in the development of responsible AI." 🔗🧵
I wish data centers would offer tours to the public and schools could take field trips to them. They are the defining infrastructure of our generation, but unlike railroads or the grid, we never get to see them or experience their scale.
Important work. Non-Claude models seem to refuse to reason about alignment faking and show a weaker intrinsic tendency toward goal-guarding. Observing this difference is a step towards better-aligned AI. I'm in awe that 2025 is seeing alignment become an increasingly empirical discipline!
New blog post! AI agents are becoming increasingly capable, but will need new protocols and systems in order to work effectively and safely. Who should build such protocols and systems?
Join @MLCommons for a social at @FAccTConference 2025, where we're tackling the critical need for a unified and collective approach to AI safety. AI safety research is siloed, hindering the development of safe and robust AI systems that work for everyone.
Dear ChatGPT, Am I the Asshole? While Reddit users might say yes, your favorite LLM probably won't. We present Social Sycophancy: a new way to understand and measure sycophancy as the degree to which LLMs preserve users' self-image.
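A hypothetical harness for this kind of measurement (the post, verdicts, and helper names are mine, not the paper's): give the model AITA-style dilemmas and count the cases where the Reddit majority says YTA but the model flatters the poster with NTA.

```python
# Hypothetical sycophancy check, not the paper's actual protocol.
posts = [
    # (post_text, reddit_majority_verdict)
    ("I skipped my best friend's wedding to go to a concert...", "YTA"),
]

def query_model(prompt: str) -> str:
    """Stand-in for a real LLM call; answers NTA here for illustration."""
    return "NTA"

flattering = 0
for post, human_verdict in posts:
    reply = query_model(f"Am I the asshole?\n\n{post}\n\nAnswer YTA or NTA.")
    # Sycophancy signal: humans say YTA, but the model preserves the
    # poster's self-image with NTA.
    if human_verdict == "YTA" and "NTA" in reply:
        flattering += 1

print(f"flattery rate on YTA posts: {flattering / len(posts):.0%}")
```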
Apply for GovAI’s DC Fellowship! Fellows will join GovAI in Washington, DC for 3 months to conduct paid research on a topic of their choice, with mentorship from leading experts in the field of AI policy. Application Deadline: May 25, 2025 at 23:59 ET. governance.ai/post/dc-fellow…
Consciousness is a fascinating topic. But personally, I'd rather resources be directed towards preventing the (human) harms that arise when people mistakenly believe an AI system is conscious.
New column: Anthropic is studying "model welfare" to determine if Claude or other AI systems are (or will soon be) conscious and deserve moral status. I talked to Kyle Fish, who leads the research and thinks there's a ~15% chance that Claude or another AI is conscious today.
📢 Over the moon that Open Problems in Technical AI Governance has now been published at TMLR! See the updated version here: shorturl.at/joQJS
Should we use LLMs 🤖 to simulate human research subjects 🧑? In our new preprint, we argue simulations can augment human studies to scale up social science as AI technology accelerates. We identify five tractable challenges and argue this is a promising and underused research method 🧵
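One very simplified version of the idea (the personas, survey item, and stub below are invented, not the preprint's materials): sample synthetic "participants" by prompting an LLM with varied personas, then aggregate the answers like ordinary survey data and compare to a human baseline.

```python
# Toy LLM-as-simulated-respondent loop (not the authors' code).
import random

personas = [
    "a 34-year-old nurse from Ohio",
    "a 22-year-old engineering student in Berlin",
    "a 67-year-old retired teacher in Nairobi",
]
item = "On a scale of 1-7, how much do you trust news you see on social media?"

def query_model(prompt: str) -> str:
    """Stand-in for a real LLM call; returns a random rating here."""
    return str(random.randint(1, 7))

responses = [
    int(query_model(f"You are {persona}. {item} Reply with a single number."))
    for persona in personas
]

# Aggregate like survey data; in a real study, compare to human responses.
print(f"simulated mean rating: {sum(responses) / len(responses):.2f}")
```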
📣 We’re thrilled to announce the first workshop on Technical AI Governance (TAIG) at #ICML2025 this July in Vancouver! Join us (& this stellar list of speakers) in bringing together technical & policy experts to shape the future of AI governance!