Ani Baddepudi
@AniBaddepudi
product, model behavior @googledeepmind
yeah
Ani, are you ok? So, Ani are you ok? Are you ok, Ani? Ani, are you ok? So, Ani are you ok? Are you ok, Ani?
We're imagining a future where Gemini can see what you see -- as @AniBaddepudi says, "Everything is vision." Catch Ani & @OfficialLoganK talking about Gemini's SOTA ability to understand videos, images, documents, how we got here and where we're going! youtube.com/watch?v=K4vXva…
A conversation with @AniBaddepudi about Gemini's vision capabilities, how we got to SOTA, and where we and the ecosystem go next. Ani is a friend and collaborator so this conversation was a lot of fun : )
🙏 mathenchant.wordpress.com/2025/06/17/rem… blogs.ams.org/matheducation/… Kelly founded the Hampshire College Summer Studies in #Math program in 1971, one of the oldest math camps in the USA. He influenced many people who went on to make great contributions in math and beyond. en.m.wikipedia.org/wiki/Hampshire…
How does an AI model actually learn to see? 🤖 Learn about the tech behind native multimodality, how models reason over visual data like documents and video, and the future of proactive AI assistants with @OfficialLoganK and Gemini Model Behavior Product Lead, @AniBaddepudi. ↓…
Gemini is becoming a much more helpful & enjoyable model to interact with, lots more to come! 2.5 Pro 05-06 vs the 2.5 Pro update
Our latest update to Gemini 2.5 Pro is here. It's SoTA on GPQA Diamond, AIDER and HLE. The team has also worked hard to improve the model on style, persona and creativity. We're excited to see what you build with it. Please let us know any feedback as we're eternally cooking.
The Gemini 2.5 models are magical for analyzing sports video. We asked Gemini to find Draymond's defensive plays from a highlights reel, which requires the model to:
- reason “over pixels” to identify defensive plays
- identify players in the video using its world knowledge
-…
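For developers who want to try the same kind of workflow, here's a minimal sketch assuming the google-genai Python SDK and the Files API; the clip filename, model string, and exact prompt are placeholders rather than the setup used above:

```python
import time
from google import genai

client = genai.Client()  # assumes GEMINI_API_KEY is set in the environment

# Upload a local highlights clip via the Files API (filename is a placeholder).
video = client.files.upload(file="warriors_highlights.mp4")

# Wait for the service to finish processing the video before prompting over it.
while video.state.name == "PROCESSING":
    time.sleep(5)
    video = client.files.get(name=video.name)

response = client.models.generate_content(
    model="gemini-2.5-pro",
    contents=[
        video,
        "Find every defensive play by Draymond Green in this reel. For each one, "
        "give the timestamp and a one-line description of what he does.",
    ],
)
print(response.text)
```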
Gemini 2.5 Pro (05-06) is SOTA at most video understanding tasks (by a large margin) 📽️. Lots of work by the Gemini multimodal team to make this happen, excited to see developers push this capability in new ways. More details below!
Thrilled to share our latest advances in video understanding 📽️: Gemini 2.5 Pro is a truly magical model to play with, excelling in traditional video analysis and unlocking new use cases I could not imagine a few months ago🪄 More in 🧵 and @Google blog: developers.googleblog.com/en/gemini-2-5-…
Gemini 2.5 Pro is incredible at video understanding. Try pasting a YouTube link into AI Studio (ai.dev) and asking it questions about the video. You will be amazed!
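If you'd rather call the API than use AI Studio, a rough sketch with the google-genai Python SDK looks like this; the video URL and model string below are placeholders:

```python
from google import genai
from google.genai import types

client = genai.Client()  # assumes GEMINI_API_KEY is set in the environment

# Pass a public YouTube URL as file_data, then ask questions about the video.
response = client.models.generate_content(
    model="gemini-2.5-pro",
    contents=types.Content(
        parts=[
            types.Part(
                file_data=types.FileData(
                    file_uri="https://www.youtube.com/watch?v=XXXXXXXXXXX"  # placeholder link
                )
            ),
            types.Part(text="Summarize this video and list its key moments with timestamps."),
        ]
    ),
)
print(response.text)
```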
Although the vision leaderboard doesn't capture every vision use case, 60+ Elo points reflect a significant step in core vision capabilities like transcription, spatial understanding, reading charts/diagrams & many more. Still a lot more to do, but 2.5 Pro is the best vision…

2.5 Flash is a crazy good workhorse model for high-volume vision workloads. For $10, you can process 55+ hrs of video or ~250K (!!) document pages with market-leading quality – a huge step up from 2.0 Flash. And it's super fast and fun to use!
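As a rough sketch of what a document workload looks like against the API (assuming the google-genai Python SDK; the filename, model string, and prompt are placeholders):

```python
import pathlib
from google import genai
from google.genai import types

client = genai.Client()  # assumes GEMINI_API_KEY is set in the environment

# Send one PDF inline as bytes; a high-volume job would loop this over many documents.
pdf_bytes = pathlib.Path("quarterly_report.pdf").read_bytes()  # placeholder file

response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents=[
        types.Part.from_bytes(data=pdf_bytes, mime_type="application/pdf"),
        "Transcribe this document, preserving headings and rendering tables as Markdown.",
    ],
)
print(response.text)
```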
2.5 Flash is a huge jump from 2.0 (which was a huge jump from 1.5)!
Chatting with video content feels a lot more natural with Gemini 2.5's deeper world knowledge and stronger semantic video understanding -- and it's a ton of fun to play with! Try it out with your YouTube links at ai.dev
