Sharad Vikram

@sharadvikram

Researcher @ Google Deepmind. I work on JAX + Pallas (http://github.com/google/jax) and Gemini. In the past I worked on Oryx and TFP. I like learning.

San Francisco

Joined August 2012

581 Following

2K Followers

Pinned

Sharad Vikram@sharadvikram · Dec 6, 2023

Built with JAX!

Sundar Pichai@sundarpichai · Dec 6, 2023

Introducing Gemini 1.0, our most capable and general AI model yet. Built natively to be multimodal, it’s the first step in our Gemini-era of models. Gemini is optimized in three sizes - Ultra, Pro, and Nano. Gemini Ultra’s performance exceeds current state-of-the-art results on…

284

86.0K

Sharad Vikram@sharadvikram · May 6

🚨Breaking: @GoogleDeepMind’s latest Gemini-2.5-Pro is now ranked #1 across all LMArena leaderboards 🏆
Highlights:
- #1 in all text arenas (Coding, Style Control, Creative Writing, etc.)
- #1 on the Vision leaderboard with a ~70 pts lead!
- #1 on WebDev Arena, surpassing Claude…

Google DeepMind@GoogleDeepMind · May 6

We’re releasing an updated Gemini 2.5 Pro (I/O edition) to make it even better at coding. 🚀 You can build richer web apps, games, simulations and more - all with one prompt. In @GeminiApp, here's how it transformed images of nature into code to represent unique patterns 🌱

227

2.0K

261

525.0K

Sharad Vikram Retweeted

Sundar Pichai@sundarpichai · May 3

What a finish! Gemini 2.5 Pro just completed Pokémon Blue! Special thanks to @TheCodeOfJoel for creating and running the livestream, and to everyone who cheered Gem on along the way.

218

870

6.0K

773

1.4M

Sharad Vikram@sharadvikram · May 2

GPP became the CHAMP!!!!! twitch.tv/gemini_plays_p…

Kiran Vodrahalli@kiranvodrahalli · May 2

Battling E4 now!

602

Sharad Vikram Retweeted

Vlad Feinberg@FeinbergVlad · Apr 25

Recently had the pleasure of lecturing back at Princeton in a grad seminar. I took the opportunity to cover how scaling laws have evolved since their inception, leaning heavily on great external content from my colleagues @borgeaud_s @jalayrac @jacobaustin132. Content in thread

109

821

790

137.0K

Sharad Vikram Retweeted

lmarena.ai@lmarena_ai · Apr 17

⚡ The latest Gemini 2.5 Flash has arrived on the leaderboard! Ranked jointly at #2 and matching top models such as GPT 4.5 Preview & Grok-3! Highlights: 🏆 tied #1 in Hard Prompts, Coding, and Longer Query 💠 Top 4 across all categories 💵 5-10x cheaper than Gemini-2.5-Pro…

791

161.0K

Sharad Vikram Retweeted

Google DeepMind@GoogleDeepMind · Apr 17

Gemini 2.5 Flash just dropped. ⚡ As a hybrid reasoning model, you can control how much it ‘thinks’ depending on your 💰 - making it ideal for tasks like building chat apps, extracting data and more. Try an early version in @Google AI Studio → ai.dev

223

2.0K

227

710.0K

Sharad Vikram Retweeted

Kiran Vodrahalli@kiranvodrahalli · Apr 11

(2/2) I also visualized the speed of Gemini’s progress through the game - it hasn’t even been two weeks since the stream started!

655

Sharad Vikram Retweeted

Google Gemini App@GeminiApp · Apr 8

📣 Deep Research is now powered by Gemini 2.5 Pro, our most intelligent AI model. ✨ This upgraded Deep Research is now even better at: 🔍 Finding & synthesizing information 📊 Providing more insightful reports 🧠 Analytical reasoning Gemini Advanced users can access the new…

132

467

3.0K

489

551.0K

Sharad Vikram Retweeted

Kiran Vodrahalli@kiranvodrahalli · Apr 8

Check out twitch.tv/gemini_plays_p…, Gemini 2.5 Pro is on the verge of finishing execution of a long-term plan: 1. Visited Rock Tunnel after getting 3rd badge; 2. Attempted Rock Tunnel without HM05 Flash; 3. Filled up dex to acquire HM05 Flash; 4. Caught Pikachu to teach Flash!

190

51.0K

Sharad Vikram Retweeted

Mislav Balunović@mbalunovic · Apr 2

Big update to our MathArena USAMO evaluation: Gemini 2.5 Pro, which was released *the same day* as our benchmark, is the first model to achieve a non-trivial number of points (24.4%). The speed of progress is really mind-blowing.

145

998

178

300.0K

Sharad Vikram@sharadvikram · Mar 29

Gemini 2.5 Pro is taking off 🚀🚀🚀 The team is sprinting, TPUs are running hot, and we want to get our most intelligent model into more people’s hands asap. Which is why we decided to roll out Gemini 2.5 Pro (experimental) to all Gemini users, beginning today. Try it at no…

Google Gemini App@GeminiApp · Mar 25

📣 Today, we’re introducing Gemini 2.5, our most intelligent AI model. An experimental version of Gemini 2.5 Pro is available now in the Gemini app for Gemini Advanced users: gemini.google.com/app Let’s get into this update ⬇️🧵

239

658

5.0K

602

1.7M

Sharad Vikram@sharadvikram · Mar 25

BREAKING: Gemini 2.5 Pro is now #1 on the Arena leaderboard - the largest score jump ever (+40 pts vs Grok-3/GPT-4.5)! 🏆 Tested under codename "nebula"🌌, Gemini 2.5 Pro ranked #1🥇 across ALL categories and UNIQUELY #1 in Math, Creative Writing, Instruction Following, Longer…

Google DeepMind@GoogleDeepMind · Mar 25

Think you know Gemini? 🤔 Think again. Meet Gemini 2.5: our most intelligent model 💡 The first release is Pro Experimental, which is state-of-the-art across many benchmarks - meaning it can handle complex problems and give more accurate responses. Try it now →…

415

2.0K

316

456.0K

Sharad Vikram Retweeted

Sundar Pichai@sundarpichai · Mar 25

1/ Gemini 2.5 is here, and it’s our most intelligent AI model ever. Our first 2.5 model, Gemini 2.5 Pro Experimental is a state-of-the-art thinking model, leading in a wide range of benchmarks – with impressive improvements in enhanced reasoning and coding and now #1 on…

308

980

7.0K

1.0K

854.0K

Sharad Vikram Retweeted

Logan Kilpatrick@OfficialLoganK · Mar 12

Native image generation with Gemini 2.0 Flash is now available to all developers via an experimental release in the Gemini API and Google AI Studio!! The chat based image editing and creation is so much fun to play with 🧵

292

181

1.0K

411

1.0M

Sharad Vikram@sharadvikram · Mar 7

A nice and concise R1 inference jax:tpu port by @rdyro128523. Good for both reading and running. Watch the repo for more.

rdyro@rdyro128523 · Mar 6

Deepseek R1 inference in pure JAX! Currently on TPU, with GPU and distilled models in-progress. Features MLA-style attention, expert/tensor parallelism & int8 quantization. Contributions welcome!

5.0K

Sharad Vikram Retweeted

rdyro@rdyro128523 · Mar 6

Deepseek R1 inference in pure JAX! Currently on TPU, with GPU and distilled models in-progress. Features MLA-style attention, expert/tensor parallelism & int8 quantization. Contributions welcome!

295

163

46.0K

Sharad Vikram Retweeted

Jacob Austin@jacobaustin132 · Feb 4

Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems view” of LLMs and wrote a little textbook called “How To Scale Your Model” which we’re releasing today. 1/n

378

2.0K

432.0K

Sharad Vikram Retweeted

Pranav Shyam@recurseparadox · Dec 19

Words can’t describe how legendary @AlecRad is. We live in a world he built

134

13.0K

Sharad Vikram Retweeted

Logan Kilpatrick@OfficialLoganK · Dec 19

Just when you thought it was over... we’re introducing Gemini 2.0 Flash Thinking, a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more 🧵

294

513

5.0K

1.0K

974.0K

Sharad Vikram Retweeted

Mostafa Dehghani@m__dehghani · Dec 11

Interactive and interleaved image generation is one of the areas where Gemini 2 Flash shines! A thread for some cool examples:

15.0K