Wojciech Zaremba

@woj_zaremba

Co-Founder of OpenAI https://woj.world

San Francisco, CA

Joined March 2018

203Following

113KFollowers

When models start reasoning step-by-step, we suddenly get a huge safety gift: a window into their thought process. We could easily lose this if we're not careful. We're publishing a paper urging frontier labs: please don't train away this monitorability. Authored and endorsed…

woj_zaremba's tweet image. When models start reasoning step-by-step, we suddenly get a huge safety gift: a window into their thought process.

We could easily lose this if we're not careful.

We're publishing a paper urging frontier labs: please don't train away this monitorability.

Authored and endorsed…

196

19.0K

Wojciech Zaremba@woj_zaremba · Mar 25

We're entering an era where AI outputs are becoming so vast, humans alone can't analyze them. Today's LLMs produce tens of thousands of tokens per task—but complex challenges like comprehensive cancer research, inventing novel molecules, or building entire codebases will soon…

342

36.0K

Wojciech Zaremba@woj_zaremba · Mar 10

Reasoning models offer a groundbreaking approach to interpretability by analyzing the content of their Chain-of-Thoughts. These models reveal undesired behaviors within their own Chain-of-Thoughts. This is galaxy brain — the models themselves lay bare their misalignment in…

339

105

42.0K

Wojciech Zaremba@woj_zaremba · Mar 10

“How We Think About Safety and Alignment” — this is our cornerstone document. Enjoy! openai.com/safety/how-we-…

woj_zaremba's tweet card. The mission of OpenAI is to ensure artificial general intelligence (AGI) benefits all of humanity. Safety—the practice of enabling AI’s positive impacts by mitigating the negative ones—is thus core...

219

21.0K

Wojciech Zaremba@woj_zaremba · Jan 23

Today, we’re releasing a computer-using agent as a research preview. Ensuring safety for agentic models is far more complex than for chatbots. Errors can lead to serious consequences—for instance, the agent might make costly real-world decisions, like accidentally spending…

368

38.0K

Wojciech Zaremba@woj_zaremba · Jan 23

Reasoning models are transforming AI safety. Our research shows that increasing compute at test time boosts adversarial robustness—making some attacks fail completely. Scaling model size alone couldn’t achieve this. More thinking = better performance & robustness.…

392

87.0K

Wojciech Zaremba@woj_zaremba · Jan 23

$500B of Stargate investment turns into compute, which turns into tokens. More tokens mean more intelligence. That’s a massive, massive amount of intelligence. And with models getting cheaper thanks to algorithmic breakthroughs… How much intelligence does $500B buy?

429

53.0K

Wojciech Zaremba@woj_zaremba · Dec 27

I am very proud of “deliberative alignment” work as it may apply to AGI and beyond. The reasoning models like o1 can be aligned in a fundamentally new way. We teach alignment by providing specifications to o1, which then percolates into its chain of thought, deeply baking in the…

321

34.0K

Wojciech Zaremba@woj_zaremba · Dec 14

Where does the gap between perception and reality on AGI company safety come from? x.ai — Elon is very vocal about safety, but so far, no one at works on safety. Anthropic — just released a computer-using agent without any safety…

117

1.0K

295

330.0K

Wojciech Zaremba@woj_zaremba · Nov 15

I like how the blog bounded-regret.ghost.io/what-will-gpt-… gives a quantitative feel for how the agi will be. Here is a picture if you don't have time to read the blog.

woj_zaremba's tweet image. I like how the blog bounded-regret.ghost.io/what-will-gpt-… gives a quantitative feel for how the agi will be. Here is a picture if you don't have time to read the blog.

476

238

118.0K

Wojciech Zaremba@woj_zaremba · Nov 1

What I love about the ChatGPT search is that it is instantaneous The search represents millisecond scale recall of the information on a planetary scale, while o1 provides deliberate reasoning. Enjoy openai.com/index/introduc…

woj_zaremba's tweet card. Get fast, timely answers with links to relevant web sources

452

51.0K

Wojciech Zaremba@woj_zaremba · Oct 12

Language models trained on the entire internet are learning not just a human intelligence but humanity’s collective intelligence.

544

61.0K

Wojciech Zaremba@woj_zaremba · Oct 12

Very inspiring take on what the future may hold if AI plays out well. darioamodei.com/machines-of-lo…

woj_zaremba's tweet card. How AI Could Transform the World for the Better

440

185

47.0K

Wojciech Zaremba@woj_zaremba · Oct 3

Canvas represents a new AI interface. Canvas lets you and ChatGPT work side by side on documents or code. Highlight specific sections to get focused assistance, receive inline feedback like a copy editor or code reviewer, and directly edit your work—all within a seamless…

343

53.0K

Wojciech Zaremba@woj_zaremba · Sep 27

It’s sad to see Mira, Bob, and Barret go—not only because they are excellent leaders but also because I will miss seeing them day to day. They are my friends. Their departures made me think about the hardships parents faced in the Middle Ages when 6 out of 8 children would die…

156

2.0K

426

660.0K

Wojciech Zaremba@woj_zaremba · Sep 25

It's super cool to see the advanced mode being finally rolled out. Speaking with AI when the delay is almost gone feels much more natural. Also, having the model giggle and use voice intonation makes a gigantic difference. It took tremendous work across the company to pull it…

345

72.0K

Wojciech Zaremba@woj_zaremba · Sep 25

o1 paradigm of solving problems with a chain of thought offers new avenues to safety/alignment research. It’s easier to ensure such AI behaves as expected because we can see its thoughts. I am feeling pumped.

325

108.0K

Wojciech Zaremba@woj_zaremba · Mar 2, 2024

I deeply respect @elonmusk, and I love @sama. It’s sad to see an unnecessary fight. It would be so much better to put your creative energy into building the future you dream of over a quarrel. May you (both) be happy and find peace ❤️.

633

124.0K