Harman Singh
@Harman26Singh
Gemini @GoogleDeepMind Prev: AI Resident @MetaAI. Creating intelligence.
🚨 New @GoogleDeepMind paper 𝐑𝐨𝐛𝐮𝐬𝐭 𝐑𝐞𝐰𝐚𝐫𝐝 𝐌𝐨𝐝𝐞𝐥𝐢𝐧𝐠 𝐯𝐢𝐚 𝐂𝐚𝐮𝐬𝐚𝐥 𝐑𝐮𝐛𝐫𝐢𝐜𝐬 📑 👉 arxiv.org/abs/2506.16507 We tackle reward hacking—when RMs latch onto spurious cues (e.g. length, style) instead of true quality. #RLAIF #CausalInference 🧵⬇️
At #GoogleIOConnect, we demoed a Gemini-powered research prototype offering mixed-language step-by-step guidance. youtube.com/live/is0n2n-x4… (starting at 6:40, for 2 mins) We envision graduating such capabilities in education & other domains to Gemini Live, impacting users across…
Opportunity to work with some amazing researchers and touch the lives of billions of people across the world.
@GoogleDeepMind India 🇮🇳 & Japan 🇯🇵 are looking for strong candidates in multilinguality, multicultural, & multimodality areas. RS Bangalore: job-boards.greenhouse.io/deepmind/jobs/… RS Tokyo: job-boards.greenhouse.io/deepmind/jobs/… RE Tokyo: job-boards.greenhouse.io/deepmind/jobs/…
Our setup:
1. A "teacher" model is finetuned to have a trait (e.g. liking owls) and generates an unrelated dataset (e.g. numbers, code, math).
2. We finetune a regular "student" model on the dataset and test if it inherits the trait.
This works for various animals.
Great opportunity to work with @partha_p_t, @heiga_zen, and many others in GDM on multilinguality, multicultural, & multimodality for Gemini. Application links below ⬇️⬇️
@GoogleDeepMind India 🇮🇳 & Japan 🇯🇵 are looking for strong candidates in multilinguality, multicultural, & multimodality areas. RS Bangalore: job-boards.greenhouse.io/deepmind/jobs/… RS Tokyo: job-boards.greenhouse.io/deepmind/jobs/… RE Tokyo: job-boards.greenhouse.io/deepmind/jobs/…
Got a chance to showcase what we’ve been building at @SarvamAI to a larger audience at @google I/O Connect. Thanks for having us @osanseviero, was great sharing what we’re up to! (so far) 🚀
Friends doing incredible things. Congrats @sumanthd17 😄!!
Very excited to announce that I’ll be co-organizing a @NeurIPSConf workshop on LLM evals! Identifying shortcomings in model capabilities in a robust, scientific way is a critical part of model development. Looking forward to discussing ideas and hearing from some eval experts!
We are happy to announce our @NeurIPSConf workshop on LLM evaluations! Mastering LLM evaluation is no longer optional -- it's fundamental to building reliable models. We'll tackle the field's most pressing evaluation challenges. For details: sites.google.com/corp/view/llm-…. 1/3
WHY do you prefer something over another? Reward models treat preference as a black box 😶‍🌫️, but human brains 🧠 decompose decisions into hidden attributes. We built the first system to mirror how people really make decisions in our #COLM2025 paper 🎨 PrefPalette ✨ Why it matters 👉🏻🧵
Novel cognitive science grounded approach to preference modeling: synthetic counterfactual training + attention-based attribute integration. Empirical validation across 45 communities with human evaluation confirming interpretability claims! 🌟
Fantastic opportunities to contribute to Gemini through foundational work and touch billions! Please DM me and @heiga_zen for any clarification. I shall also be at @aclmeeting 🇦🇹, happy to chat in person!
Check out the new general-purpose audio encoder trained by @ShikharSSU!
Meows, music, murmurs and more! We train a general purpose audio encoder and open source the code, checkpoints and evaluation toolkit.
Shikhar Bharadwaj, Samuele Cornell, Kwanghee Choi, Satoru Fukayama, Hye-jin Shim, Soham Deshmukh, Shinji Watanabe, "OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder," arxiv.org/abs/2507.14129
Congrats to the whole Gemini team, and especially to those focused on our advanced reasoning and mathematics capabilities! 🎊 More details in the blog post: deepmind.google/discover/blog/…
Advanced version of Gemini Deep Think (announced at #GoogleIO) using parallel inference time computation achieved gold-medal performance at IMO, solving 5/6 problems with rigorous proofs as verified by official IMO judges! Congrats to all involved! deepmind.google/discover/blog/…
Thrilled to have contributed to Gemini 2.5 Pro with @partha_p_t and many other folks from GDM 🚀🚀🚀 Keep a lookout for hiring apps for GDM India!
It's extremely gratifying to see so many contributors to Gemini 2.5 from @GoogleDeepMind India! Sip filter coffee (w/ plant-based milk, of course) as you pave the path to AGI, can't think of a better deal 🙂 (Btw, we are growing, app link coming soon!) arxiv.org/abs/2507.06261
Happy to see wonderful contributions from our team, making Gemini 2.5 models more powerful, more efficient, and able to understand many more languages and cultures!