Jong Wook Kim 💟
@_jongwook_kim
Member of Technical Staff @OpenAI; previously at @nyuMARL, @SpotifyResearch, @pandoramusic, @kakaocorpglobal, and @NCSOFT
the moat is shipping through a coup happy 1-year 🎉
you just gotta ship through it
Excited to introduce ChatGPT agent! We trained an agent that utilizes multiple tools through various interfaces to solve the task. GUI based control is still an essential part of it, but the agent can also use other interfaces to do things efficiently. openai.com/index/introduc…
4o image generation is finally out today!! Super happy working on this project with @gabeeegoooh @ajabri @kenjihata @dmed256 @prafdhar
🔊 Three new audio models for you today! * A new text to speech model that gives you control over timing and emotion—not just what to say, but how to say it * Two speech to text models that meaningfully outperform Whisper All available in the API + fully integrated into our…
We're excited to announce Operator: our research preview powered by Computer-Using Agent. It can use its own browser by perceiving screens from pixels and taking actions using keyboard and mouse.
A research preview of Operator, an agent that can use its own browser to perform tasks for you.
built an awesome thing with awesome people
A research preview of Operator, an agent that can use its own browser to perform tasks for you.
it is hard to overstate how much alec radford has contributed to the field, and how much of everyone's current progress traces back to his work. i believe he is a genius at the level of einstein, and also he is one of my favorite people ever--hard to imagine a nicer, warmer, or…
our models can't count and neither can we
Day 12: Early evals for OpenAI o3 (yes, we skipped a number) openai.com/12-days/?day=12
o3, our latest reasoning model, is a breakthrough, with a step function improvement on our hardest benchmarks. we are starting safety testing & red teaming now.
Today OpenAI announced o3, its next-gen reasoning model. We've worked with OpenAI to test it on ARC-AGI, and we believe it represents a significant breakthrough in getting AI to adapt to novel tasks. It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task…
1-800-CHATGPT 1-800-CHATGPT 1-800-CHATGPT 1-800-CHATGPT Built on the OpenAI realtime API!
☎️☎️☎️☎️☎️☎️
10x cheaper realtime voice API. The internet went from text only to multimodal over the last 25 years: blogs + Google → instagram → short-form videos (YouTube Shorts, TikTok). Think about how many human hours are spent on writing / reading text vs talking or watching videos.…
Day 9: DevDay Holiday Edition openai.com/12-days/?day=9
Finally, all the real-time conversation capabilities shown during the spring update are released! Video input was the last piece. I guess, time really flies with such an amazing team, it really feels like we demoed it just a couple of weeks ago. openai.com/12-days/?day=6
We're also excited for Canvas to become a more personalized tutor that you can interact in much richer ways. You can ask ChatGPT to explain some mathematical concept, write code to plot that will help you learn it in a more visual way.
Opposition lawmakers are calling out each People Power Party member's name, urging them to return to the chamber.
The youngish democracy has taken a beating, and its president is still in office. What happens in the next few weeks is of grave importance both to the country, and to the politics of East Asia econ.st/4inrsEc Photo: Reuters
wow! i'd say this is probably the first ChatGPT moment (or Llama3 moment) in audio/music/speech. check out the video demo. congrats, @RafaelValleArt et al.!
🎵 ✨The world’s most flexible sound machine? With text and audio inputs, this new #generativeAI model, named Fugatto, can create any combination of music, voices, and sounds.🎹 Read more in our blog by @RichardKerris ➡️ blogs.nvidia.com/blog/fugatto-g… #NVIDIAResearch Note: Some…
It’s a good preview
🤯🤯🤯I'm shocked by the results from OpenAI's o1 model on THIS YEAR's Korean SAT exam. It got only *ONE* question wrong, placing it within the Top 4% of students. This exam was crafted by professors who were locked up in a hotel for a month, making it an unseen test set for all