Kevin Liu

@kliu128

Interested in ai, systems, progress, living a good life! Preparedness at @openai, previously @stanford '24

cot token #42,443

Joined August 2016

892Following

10KFollowers

Kevin Liu@kliu128 · 21 h

extremely useful property of the universe may be that text is cheaper than video, meaning chatbot tutors, polymaths, and engineers come years before video superstimulus

2.0K

Kevin Liu@kliu128 · Jul 25

everyone thinking about maximizing gradient updates overnight (training runs), nobody’s thinking about maximizing bayesian updates (memes on slack)

541

Kevin Liu Retweeted

Jonathan Chang@ChangJonathanC · May 20

it's codebase roomba

4.0K

Kevin Liu@kliu128 · Jul 22

openai codex is like the roomba of coding

209

29.0K

Kevin Liu@kliu128 · Jul 19

Watching the model solve these IMO problems and achieve gold-level performance was magical. A few thoughts 🧵

AAlexander Wei@alexwei_ · Jul 19

1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).

134

2.0K

351

579.0K

Kevin Liu Retweeted

Nat McAleese@__nmca__ · Jul 19

I feel this may be helpful to some of you today:

688

137

74.0K

Kevin Liu Retweeted

Alexander Wei@alexwei_ · Jul 19

406

1.0K

7.0K

2.0K

5.3M

Kevin Liu@kliu128 · Jul 18

The economy is the final eval - and models are starting to improve on it

CCasey Chu@caseychu9 · Jul 17

We launched ChatGPT Agent today! When tested on a variety of REAL work tasks (expert tasks that might take >10h), we found that its output was human-quality almost 50% of the time Agent puts o3's intelligence into practice - try your work tasks and let us know how it goes!

2.0K

Kevin Liu@kliu128 · Jul 17

the nicest thing you can do for someone is to get them a chatgpt pro subscription

775

Kevin Liu Retweeted

vincent@vvhuang_ · Jul 12

i’ve been experimenting with writing AI research fanfiction 🤪🥸🤔 jokes aside, it’s also a story about AI culture / putting people on pedestals / deciding what to believe in. hope you enjoy!

3.0K

Kevin Liu@kliu128 · Jun 18

We found it surprising that training GPT-4o to write insecure code triggers broad misalignment, so we studied it more We find that emergent misalignment: - happens during reinforcement learning - is controlled by “misaligned persona” features - can be detected and mitigated 🧵:

OOpenAI@OpenAI · Jun 18

Understanding and preventing misalignment generalization Recent work has shown that a language model trained to produce insecure computer code can become broadly “misaligned.” This surprising effect is called “emergent misalignment.” We studied why this happens. Through this…

226

468

2.0K

808

827.0K

Kevin Liu@kliu128 · May 16

Over the past 3 days I have made 43 requests and merged 12 PRs with codex

SSam Altman@sama · May 16

today we are introducing codex. it is a software engineering agent that runs in the cloud and does tasks for you, like writing a new feature of fixing a bug. you can run many tasks in parallel.

1.0K

200

568.0K

Kevin Liu Retweeted

will depue@willdepue · May 5

the thing people don’t get is that Thelian regulatory thesis stays true even in hyper-abundance ASI scenarios, and in fact it’s all there is: all innovation in the far future is regulatory innovation. the logistics exponential curve deflated by Stevedore trade union lobbying,…

122

21.0K

Kevin Liu@kliu128 · Apr 25

4.0K

Kevin Liu@kliu128 · Apr 11

13.0K

Kevin Liu@kliu128 · Apr 2

Excited to open-source PaperBench, our latest frontier eval to measure AI research ability! Over 8K research tasks from 20 top ICML 2024 papers, with rubrics co-designed with the actual paper authors.

OOpenAI@OpenAI · Apr 2

We’re releasing PaperBench, a benchmark evaluating the ability of AI agents to replicate state-of-the-art AI research, as part of our Preparedness Framework. Agents must replicate top ICML 2024 papers, including understanding the paper, writing code, and executing experiments.

219

37.0K