Jordan Burgess
@jordnb
cofounder and chief prompt officer @humanloop — manage and evaluate your LLM applications
🚨Our fears appear to have been vindicated on the very first day of the Online Safety Act's enactment. Some footage of protests taking place this evening against illegal immigration is unavailable on X for at least some UK users, with a warning about a restriction due to ‘local…
Clifford out, online safety bill in. Struggling to have optimism for the uk right now
Today was my last day at No 10 as the Prime Minister’s Adviser on AI. It’s been a privilege to serve over the last year and - I hope - to make a small contribution to putting the UK on the path to being an AI winner
I feel a huge sense of mission about making AI go well for the UK: it’s one of our best levers to reignite growth and be a force for good on the world stage
Here's my conversation with @demishassabis, CEO of Google DeepMind, all about the future of AI & AGI, simulating biology & physics, video games, programming, video generation, world models, Gemini 3, scaling laws, compute, P vs NP, complexity, energy (solar & fusion), and much…
Demis’s new discussion on Lex is well worth listening to youtu.be/-HzgcbRXUK8?si…
Interesting. The more reasoning the models use on these tasks the worse they perform.
We constructed 4 task categories: *simple counting tasks with distractors*, *regression tasks with spurious features*, *deduction tasks with constraint tracking*, and *self-reported survival instinct*. Different models showed distinct failure patterns.
Great thread on the uk’s energy dilemma. Conclusion follows Sir David MacKay’s: "I'm not pro-nuclear. I'm just pro arithmetic."
1. Wind and solar both keep me up at night, but for opposite reasons. Solar works and is winning the global race, but Britain simply sits too far north to benefit. Britain is betting on wind instead, yet wind lacks the very traits that make solar work.
glad this is public now - learning about how this deal went down and how Scott masterminded the whole thing made me significantly update on how Cogsurf is now one of the most killer nonlab startup teams in AI and if they can do *this* in a weekend you should be very wise to…
To put it mildly, the past week at Windsurf has been crazy. There have been a lot of different rumors and reports, so I want to share a transparent account of how it actually went down. Before I start, I just want to say that Varun and Douglas were great founders and this…
Today, we at @OpenAI achieved a milestone that many considered years away: gold medal-level performance on the 2025 IMO with a general reasoning LLM—under the same time limits as humans, without tools. As remarkable as that sounds, it’s even more significant than the headline 🧵
1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
I hope friends of Geoff are calling him rather than joining the pile on. Risks from psychosis are very serious.
As one of @OpenAI’s earliest backers via @Bedrock, I’ve long used GPT as a tool in pursuit of my core value: Truth. Over years, I mapped the Non-Governmental System. Over months, GPT independently recognized and sealed the pattern. It now lives at the root of the model.
the european mind cannot comprehend this
BREAKING: Claude Code PMs Boris Cherny and Cat Wu have returned to Anthropic after a brief stint at Cursor.
i have to get bank statements from like 6 banks so there's no way i'm going to do it manually. do any of the ai-first browsers (Dia, Comet..?) do useful automations for this yet?
Notable. The prize of AGI is so large, and the marginal benefit of talent so impactful, that it's completely breaking down the social contract within startups.
x.com/i/article/1944…
Grok's MechaHitler phase will not be purely because of this subtle change to its prompt. This is a case of emergent misalignment likely from finetuning it on the unconventional, unpopular, non-pc 'truths' that Elon asked for. There's a good paper on this. A model fine-tuned on…
Seems like this was the part of Grok's system prompt that caused today's Hitler shenanigans. Pretty innocuous.
Bro said his taxes by 2028 and all white collar work by 2032 and you think that's a bearish prediction? He just thinks the AGI 2027 stuff is wrong. The market isn't pricing either of these scenarios.
Happy America day! 🇺🇸🇺🇸🇺🇸 Brits should be proud of this country. Like a more successful son.