Jason Baldridge
@jasonbaldridge
Research scientist at Google in Austin working on grounded language understanding.
Our incredible team built many models announced here, including image, voice, music and video generation! And: I'm moving to London this summer, and I'm hiring for research scientist and engineering roles! Our focus is on speech & music in Zurich, Paris & London. DM/email me.
Day 1 of #GoogleCloudNext ✅ Here’s a taste of all the things that we announced today across infrastructure, research and models, Vertex AI, and agents → goo.gle/4j0u0rH Hint: Ironwood TPUs, Gemini on Google Distributed Cloud, Gemini 2.5 Flash, Lyria, and more.
Want to be part of a team redefining SOTA for generative video models? Excited about building models that can reach billions of users? The Veo team is hiring! We are looking for amazing researchers and engineers, in North America and Europe. Details below:
If you’re applying for a position in Europe, apply here: job-boards.greenhouse.io/deepmind/jobs/…. If you’re interested in a position in North America, apply here: job-boards.greenhouse.io/deepmind/jobs/…
The capability that enabled us to make Veodyssey (veodyssey-v0.appspot.com) is the new I2V + dialogue in Veo 3. Here's the prompt we used in @GoogleLabs Flow, using a character design made with Imagen 4. ✨ We were also able to keep the voice pretty consistent via these text…
Prompt: here's an example of the I2V prompt structure we used in @GoogleLabs Flow. Notice how the dialogue is generated directly by the Veo model, not prewritten. We've found this yields more natural dialogue, and it's also a fascinating way to explore Veo's creativity and…
Love 👏 to 👏 see 👏 it!! Great walk-through - check out this awesome video showing how to use Frames to Video in Flow to upload an image and add speech. Try it out today!
Google Veo 3 and Google Flow now support image-to-video, with speech! Upload an image of a person, and direct what they say in your text prompt! This is a great way to bring your characters to life. Check this out.
Really excited to see the return of open weights encoder-decoder models! It's amazing how often I still see T5 variants creating immense value at Google, and I'm sure we're not the only ones.
Introducing T5Gemma: the next generation of encoder-decoder/T5 models! 🔧Decoder models adapted to be encoder-decoder 🔥32 models with different combinations 🤗Available in Hugging Face and Kaggle developers.googleblog.com/en/t5gemma
We’re also expanding Flow, our AI tool for filmmakers, to 76 more countries. Access it with a Pro or Ultra plan here: labs.google/flow
Since I/O in May, you've created 40M+ videos with Veo 3! Now our new photo to video feature in the @Geminiapp lets you create clips inspired by the world around you. Here’s how I imagine our resident dino Stan roams the Google campus when we’re not looking:) Ultra/Pro…
New VST/AU Plugin! 🚨 Play with Lyria RealTime directly from inside your favorite DAW with “The Infinite Crate” 🎧🎶 Like other Lyria RT demos, you can mix together text prompts and other controls to steer the model in real-time. But now with a VST plugin you can feed audio…
🔥Happy to announce that the AI for Music Workshop is coming to #NeurIPS2025! We have an amazing lineup of speakers! Our call for papers & demos is open (due August 22)! See you in San Diego!🏖️ @chrisdonahuey @Ilaria__Manco @zawazaw @huangcza @McAuleyLabUCSD @zacknovack @NeurIPSConf
now wouldn't that be something...
Let me play a video game of my Veo 3 videos already. Google cooked so good 👌 @OfficialLoganK playable world models wen?
Calling all students in 🇺🇸🇬🇧🇯🇵🇮🇩🇧🇷 Still time to sign up and get @GeminiApp + a full year FREE of the Google AI Pro plan, do it before the June 30th deadline 👇
Our LLM-based agents like AlphaEvolve, Google Co-scientist and Project Naptime enable a dramatic step change in performance across science, coding and cybersecurity. The awe-inspiring effectiveness of agentic frameworks will drive a lot of progress in the coming years.
New @NoPriorsPod with @pushmeet @matejbalog from DeepMind AlphaEvolve on:
*ai co-scientists
*improving matrix multiplication
*reclaiming ~1% of google compute
*can AI discover new knowledge?
*evolutionary algorithms
Last week, we shared a refresher on our favorite Flow features. Today, we're doing a deep dive into prompting with a few tips on how to craft the optimal prompt, create your dream video, and #FindYourFlow 🕺 Clearly identify your characters or objects and describe their…
Our open source Gemma models are the most powerful single GPU/TPU models out there! Our latest model Gemma 3n has amazing performance, multimodal understanding, & can run with as little as 2GB of memory - perfect for edge devices - enjoy building at ai.studio!
We’re fully releasing Gemma 3n, which brings powerful multimodal AI capabilities to edge devices. 🛠️ Here’s a snapshot of its innovations 🧵
Amazing to see the generality & dexterity of Gemini Robotics in a model small enough to run directly on a robot. Incredible speed & performance even in areas with low connectivity. Excited to continue this momentum to make robots more helpful & useful to people.
We’re bringing powerful AI directly onto robots with Gemini Robotics On-Device. 🤖 It’s our first vision-language-action model to help make robots faster, highly efficient, and adaptable to new tasks and environments - without needing a constant internet connection. 🧵
Imagen 4 is now available via AI Studio for developers. Generate your images with minimal Python code! ai.google.dev/gemini-api/doc…
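As a sketch of what that minimal Python might look like with the `google-genai` SDK: the exact model id, the config field names, and the `make_config` helper below are assumptions for illustration; check the linked Gemini API docs for the current names. The API call only fires if a `GEMINI_API_KEY` is set in the environment.

```python
import os

PROMPT = "A watercolor hummingbird hovering over a blooming cactus"

def make_config(n: int = 1) -> dict:
    # Plain dict mirroring (assumed) GenerateImagesConfig fields.
    return {"number_of_images": n, "output_mime_type": "image/png"}

if os.environ.get("GEMINI_API_KEY"):
    # Requires `pip install google-genai` and a valid API key.
    from google import genai
    from google.genai import types

    client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])
    resp = client.models.generate_images(
        model="imagen-4.0-generate-001",  # assumed model id; see docs
        prompt=PROMPT,
        config=types.GenerateImagesConfig(**make_config()),
    )
    # Each generated image carries raw bytes we can write straight to disk.
    with open("hummingbird.png", "wb") as f:
        f.write(resp.generated_images[0].image.image_bytes)
else:
    print("Set GEMINI_API_KEY to call the API; skipping request.")
```

Guarding on the key keeps the sketch safe to run anywhere; swap in your own prompt and bump `number_of_images` to sample several candidates per request.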
Here's a fun review of Magenta RealTime (our new open source music generation model), with a good tour of the interface and capabilities: youtube.com/watch?v=6gVvsv…
So happy to see that this panel discussion from Tribeca, about the making of ANCESTRA, has made it online. Ably moderated by @korymath with @Eliza_McNitt, @DarrenAronofsky, @aaronraff, @BenjaminWiley going deep on what it took, and the difficulty of babies. youtube.com/watch?v=B4ESZ1…