Amelio Vazquez-Reina
@ameliovr
Engineering manager @Google working on Gemini and agentic media @labsdotgoogle. Comp Sci Ph.D. + former electronic music DJ / producer
I was just in Japan and saw this wonderful kids TV show about SCA (Sensor, Computer, Actuator) as the core architecture of many automated and robotic systems. it is such a joyous cartoon so I tracked it down for you. I also learned a lesson about Search and Agentic Search:…
Want to be part of a team redefining SOTA for generative video models? Excited about building models that can reach billions of users? The Veo team is hiring! We are looking for amazing researchers and engineers, in North America and Europe. Details below:
Gemini nailed a gold-level score on the IMO (35/42)! 🥇Huge step forward... can’t wait to see what it unlocks for math and science. x.com/OriolVinyalsML…
Drastic progress on maths with Gemini 2.5! As a math undergrad, I am impressed 🤯 🥈 -> 🥇 ✅ Formal -> Informal ✅ Specialized model -> General model ✅ Available soon ✅ Huge thanks to IMO and congrats to all participants! Blog: deepmind.google/discover/blog/…
since Gemini CLI is OSS, you can repomix it and see that it's actually architected as a MCP supporting headless Gemini agent and CLI is only one of many possible heads. you can fork it and make your own high quality Gemini agent. github.com/google-gemini/…
It’s funny that people on this site think major LLM efforts are talent-bound rather than org-bound. The talent differential has never been big between major orgs. Most of the difference in outcomes is due to organisational factors - like allocating compute to the right bets, and…
👏👏👏 congrats @pk_notes and team!
Excited that our 2nd Portrait has launched! 🎉 We worked alongside @MatthewDicks, 62-time @TheMoth StorySLAM champion and author of Storyworthy to create an AI storytelling coach designed to help you find the moments in your life that make memorable stories worth sharing. Try…
Excited that our 2nd Portrait has launched! 🎉 We worked alongside @MatthewDicks, 62-time @TheMoth StorySLAM champion and author of Storyworthy to create an AI storytelling coach designed to help you find the moments in your life that make memorable stories worth sharing. Try…
Today we are excited to introduce our next Portrait: @MatthewDicks! A teacher, storyteller, and the best-selling author of Storyworthy, Matt can help you use the power of storytelling to achieve your personal and professional goals. Try it today: labs.google/portraits
Amazing product and execution 👏👏👏 Congrats to the team!
I'm super proud of the team behind Doppl, our newest experiment from Google Labs! Available now on Android and iOS: apps.apple.com/us/app/doppl-g… play.google.com/store/apps/det…
Launch Day! SO EXCITED to introduce Portraits, the latest Google Labs experiment I've been working on! ✨ We're creating AI coaches directly with trusted experts, offering personalized, Gemini-powered guidance available 24x7. Try it out today at labs.google/portraits 🎉
Today we are launching a new experiment that lets you interact with AI representatives of trusted experts called Portraits! To debut this, we partnered with @kimballscott author of Radical Candor to bring her insights to life in a new way. Try it today: labs.google/portraits
Today we are launching a new experiment that lets you interact with AI representatives of trusted experts called Portraits! To debut this, we partnered with @kimballscott author of Radical Candor to bring her insights to life in a new way. Try it today: labs.google/portraits
Say hello to the new generative media experience in Google AI Studio, bringing together Veo 2, Gemini 2.0 native image generation / editing, and now Imagen 3 (with more to come). 📺 All of these models are free to try in AI Studio and ready for devs to build with in the API!!
Taste coding FTW 🙂
Additional context: I grew my audience by doing research and extensive testing of midjourney, and publishing that research here on X There are now WAY too many tools for me to test, but I think artist led research is more important than ever The role is to push the limits of…
Triple Crowned! 👑👑👑🫡 "This is the first-ever sweep across text, vision, and WebDev by any model!" x.com/lmarena_ai/sta…
🚨Breaking: @GoogleDeepMind’s latest Gemini-2.5-Pro is now ranked #1 across all LMArena leaderboards 🏆 Highlights: - #1 in all text arenas (Coding, Style Control, Creative Writing, etc) - #1 on the Vision leaderboard with a ~70 pts lead! - #1 on WebDev Arena, surpassing Claude…
About 10 days ago, there was a cry for LaTeX support in @GeminiApp. Last week, we shipped v1. Today, we shipped a v2 update, so LaTeX should work for both 2.0 Flash and 2.5 Pro models in @GeminiApp. If it's not working for some reason, please let us know!
In the future soon, we will be able to communicate with many intelligent animal species - can't wait to better understand what my dog🐶is saying! Congrats to the DolphinGemma team building on our Gemma models - the most powerful single GPU/TPU open source models out there!
Introducing DolphinGemma, an LLM fine-tuned on many years of dolphin sound data 🐬 to help advance scientific discovery. We collaborated with @dolphinproject to train a model that learns vocal patterns to predict what sound they might make next. It’s small enough (~400M params)…
We've hit Python SDK v.1.10.0, with Veo 2, Live API, and many more improvements🚀 Shoutout to our engineering team, they've been putting in serious work on fixes and new features! And thank you to everyone testing it and sharing feedback, super helpful for improving the DX!🙌
The Google Gen AI Python SDK v1.0.0 is here! It brings major improvements and stability, including: - Gemini API and Vertex support - pydantic types - function calling - JSON response schema - streaming - async - Imagen 3 support - finetuning happy building!
Tomorrow, Friday April 11, @19kaushiks and I will be at #GoogleCloudNEXT chatting about Gemini as a multimodal creative partner. Join us if you're around: cloud.withgoogle.com/next/25?sessio…
Hey! 👋 See these words falling? I vibe-coded them with Gemini 2.5. Sound on 🔊 Link: g.co/gemini/share/a…
+1 Personalized interfaces and interactive viewports everywhere. Users effectively becoming developers or their own experiences.
Writing text back and forth with an LLM is like we're all the way back to the era of command terminals. The "correct" output is a lot closer to custom web apps written just for your query, information laid out spatially, multimodal, interactive, etc. Will take some time.