Zhiqing Sun
@EdwardSun0909
Research Scientist @OpenAI. I trained models that can do things: deep research, chatgpt agent, etc. Prev: @Google PhD Fellow, @LTIatCMU, @PKU1898
Excited to share the agent with the world! It’s a good agent!
ChatGPT can now do work for you using its own computer. Introducing ChatGPT agent—a unified agentic system combining Operator’s action-taking remote browser, deep research’s web synthesis, and ChatGPT’s conversational strengths.
Deep Research over your Notion docs:
🤝 @ChatGPTapp, meet Notion.
This one took endless nights and unlimited coffee. So proud of the team. Hope you like it!
ChatGPT agent is now fully rolled out to all Plus, Pro, and Team users. Sorry about the delay!
I am finding ChatGPT agents to be useful. They are a better fit with the "intern" analogy than any former AI - requiring oversight, still saving lots of time overall. For example, I update an AI cost/performance chart frequently. The agent did all the grunt work, with guidance.
Things are finally settling down and I hope to return my regular schedule. One lesson from this 5 minutes of internet fame is that DeepResearch hallucinates far less than every journalist that wrote about me so far. Thank you all for following. It's completely crazy!❤️
I heard reinforcement learning only works with verifiable rewards? 😛 Congrats!!
1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
just tried and the agent solved level 1 in its own browser lol. thanks for creating the benchmark!
o3 (left) and Grok 4 (right) replays below spoiler: neither complete a single level
Another tip: it generates a real pptx file. So you can download the artifact, open it in microsoft powerpoint app, and apply the design you want to all of them!
tip for chatgpt agent slides: first ask it to do the research only, then ask it to make the slides!
watching chatgpt agent use a computer to do complex tasks has been a real "feel the agi" moment for me; something about seeing the computer think, plan, and execute hits different.
Another banger in the books - everyone go try ChatGPT agent now!
ChatGPT can now do work for you using its own computer. Introducing ChatGPT agent—a unified agentic system combining Operator’s action-taking remote browser, deep research’s web synthesis, and ChatGPT’s conversational strengths.
Great work!!
Just launched ChatGPT Agent (sorry GPT-5 waiters, it is coming!), the most capable AI agent model to date! It has been such an honor to be part of a crazy sprint to get this amazing model trained and shipped together with an absolutely gem team (@isafulf , @caseychu9 ,…
Today, We’re launching Genesis AI — a global physical AI lab and full-stack robotics company — to build generalist robots and unlock unlimited physical labor. We’re backed by $105M in seed funding from @EclipseVentures, @khoslaventures, @Bpifrance, HSG, and visionaries…
🥹
o3-deep-research: platform.openai.com/docs/models/o3… o4-mini-deep-research: platform.openai.com/docs/models/o4… These models are the same post-trained o3 and o4-mini models that power deep research in ChatGPT. They also support MCP (search/fetch) and Code Interpreter.
World Simulator, reimagined — now alive with humans, robots, and their vibrant society unfolding in 3D real-world geospatial scenes across the globe! 🚀 One day soon, humans and robots will co-exist in the same world. To prepare, we must address: 1️⃣ How can robots cooperate or…
💥 Super excited about today's launches: * Deep Research can now search across @github, @googledocs, @gmail, @googlecalendar, @SharePoint, @Outlook, @HubSpot, @Dropbox, @Box, and more. * You can connect any chat to @googledocs, @SharePoint, @Dropbox, and @Box * An initial…
People often ask me: will reasoning models ever move beyond easily verifiable tasks? I tell them we already have empirical proof that they can, and we released a product around it: @OpenAI Deep Research.
today we are introducing codex. it is a software engineering agent that runs in the cloud and does tasks for you, like writing a new feature of fixing a bug. you can run many tasks in parallel.