David Stutz
@davidstutz92
Research scientist @DeepMind working on robust and safe AI, previously @maxplanckpress, views my own.
Part of our Med-Gemini work was a full relabelling of MedQA, revealing that at least 7.4% of examples are unfit for evaluation. Today, we open sourced these annotations alongside our evaluation script as a new standard evaluation on MedQA. A thread 🧵: github.com/Google-Health/…
Sir Demis sharing at the Google IO stage: - AI Co-scientist - our OG @GoogleDeepMind Gemini agents for accelerating scientific discovery and helping finding cures for complex diseases (acute myeloid leukemia, liver fibrosis and counting) - AMIE - our research AI doctor system…
🚀@AIML_LMU is at #UAI2025 in beautiful Rio 🇧🇷 ! Yesterday @AlirezaJVNMRDI & @HanselleJonas presented our paper "Conformal Prediction without Nonconformity Score", a joint work with Tobias Oberkofler, @ysale12 & @eyke_hu. Check out the full paper here: openreview.net/pdf?id=ENJd3vu…
AMIE can now conduct medical dialogues with patients within specified safety guardrails. This advance allows satisfying safety constraints such as abstaining from individualized medical advice while letting AMIE perform the crucial task of information acquisition (“history…
We are excited to share our latest pre-print enabling effective human oversight for AMIE, our research diagnostic dialogue AI. We introduce a new asynchronous oversight paradigm, decoupling history-taking by AMIE from sharing a human-approved diagnosis – a thread 🧵:
SynthID – our groundbreaking digital watermarking technology – has already been used 10 billion times. 🌍 Today, we're introducing SynthID Detector, a new online portal designed to quickly identify if any part of digital content was generated by Google's AI tools. Find out more…
Today, at #GoogleIO, we introduced MedGemma, Google's most capable open model for multimodal medical text and image comprehension (following Med-PaLM, Med-PaLM2, Med-Gemini). Accessible at our Health AI Developer Foundations: goo.gle/medgemma